How does the future of coding handles missing mixed data lik Future of Coding #thinking-together

Join Slack

How does the future of coding handles missing/mixe...

# thinking-together

Mariano Guerra

08/04/2020, 11:08 AM

How does the future of coding handles missing/mixed data like undefined, null or values that change type over time?

Orion Reed

08/04/2020, 11:12 AM

Do you mean how future-oriented programming languages move away from those constructs?

Mariano Guerra

08/04/2020, 11:14 AM

or how they don't move away 🙂

Mariano Guerra

08/04/2020, 11:14 AM

or sideways 😛

Orion Reed

08/04/2020, 11:19 AM

Dependent types, typed holes, pure functional languages, all have interesting ideas. Then there’s state, I highly recommend Rich Hickey’s talk

‘The Value of Values’▾

👍 1

Mariano Guerra

08/04/2020, 11:23 AM

data exists before entering the system and its shape not necessarily matches what a type system would like to handle

👍 1

💯 1

Mariano Guerra

08/04/2020, 11:23 AM

https://xkcd.com/1838/

😎 1

Duncan Cragg

08/04/2020, 11:23 AM

That's two questions! I distinguish "null" from "undefined"; there's a difference between "nothing known (yet)" - undefined - and "known (to be) nothing" - null.

Mariano Guerra

08/04/2020, 11:24 AM

at least two questions

☝️ 1

Duncan Cragg

08/04/2020, 11:24 AM

Or between Not Applicable and Not Available

Mariano Guerra

08/04/2020, 11:25 AM

I once implemented integration for a system that sent numbers when there were numbers, empty strings some times (I guess it meant no data) the string "N.A." some other times

Duncan Cragg

08/04/2020, 11:27 AM

The other question, about types: My types are basically syntactic: strings that can be matched by parsers. So any property can change type if it wants.

Chris Knott

08/04/2020, 12:14 PM

I see "null" as a very bad implementation of "missing". In general it's an attractive pattern to be able to build up an object gradually over separate steps. The way C++ or Java are designed, the easiest way to do this is using null. (It can also be done by using lots of separate interfaces, but this is more work). The reason null is a billion dollar mistake is because it allows you to 1. null everything, 2. retrieve a null value. If the null pointer exception was thrown at the moment you called x = obj->value (if value is null, not obj) it wouldn't be able to permeate so much. JS actually does the concept of 'missing' correctly but then throws in null as a kind of turd in the punch bowl. The other point has been addressed by modern languages such as Kotlin where properties must be explicitly marked as nullable (I would make it 'optional'), and then reference with ?> instead of ->

Paul W Homer

08/04/2020, 1:08 PM

I tend to see null as ‘optional’, and then handle ‘partial data’ as a different ‘entity’: http://theprogrammersparadox.blogspot.com/2015/11/containers-collections-and-null.html

Andrew F

08/04/2020, 5:47 PM

The ideas I'm kicking around for values that change type over time look vaguely like dependent types, specifically types that depend on a time or generation/version parameter. I don't think you can avoid ending up with something morally equivalent to dynamic typing: you're going to branch on (or use as a lookup key, whatever) the current type somehow, even if you manage to stash the branch in the runtime. I more often think of this in terms of migrating between different serialization formats or database schemata. Maybe another (equivalent?) perspective is as a discriminated union type where you keep adding variants over time. (Have you ever noticed that an instance of a sum type looks a lot like a dependent pair, with the payload dependent on the value of the discriminant? Is that an artifact of my shaky understanding or does everyone know that already?)

Garth Goldwater

08/04/2020, 6:11 PM

here’s a very pragmatic approach to null in the (unfortunately dormant i think) tulip programming language:

https://youtu.be/lvclTCDeIsY▾

😎 1

Garth Goldwater

08/04/2020, 6:13 PM

and @Andrew F here’s one of Tulip’s designers talking specifically on variants:

https://youtu.be/ZQkIWWTygio▾

Andrew F

08/04/2020, 7:38 PM

@Garth Goldwater that looks neat. Tagwords are a great idea. The little ascii faces are great too. :)

shalabh

08/05/2020, 7:48 AM

I second Hickey's talk - interesting questions in there. I have more than a few things to say about this topic. My position is that the generic nullability is bad. Also, empty strings are bad. Yes. Think about this: we've had the concept of 'zero' for 1000s of years but only very recently added the concept of 'empty string' - why? Could it be mainly for modeling in computers? There is no zero-like symbol you can write in a paper form where text is expected. Yes you could write

N/A

but that conflates null and "". Sometimes you might be asked to write why it is

N/A

. Consider that there are valid questions for which the meaningful answer is zero: "How many toilet paper rolls are there in the store?", "What is the temperature of the snow?" etc. There are no questions where the 'empty string' answer has a clear-cut meaning: "What is your spouse's name?", "What city do you live in?" etc. What does an answer of

""

mean? It could mean "I don't know" or "I dont want to tell you" or "There isn't one". Can these always be mapped to null or ""? I don't think so. In type systems that have both, the empty string and null, you end up with the question of 'what does null mean' vs 'what does empty string mean'. Sometimes when you have two missing values in the world that you want to model - you end up mapping these to '' and null. But really this has nothing to do with a generic solution - it's often just a reality twisted to fit the types. If you want to model three missing values e.g. 'unknown', 'non-existent', 'secret/intentionally-blank' - how do you map these to "" and null? Maybe the best solution here is to model the value as an enum?

unknown, non_existent, intentionally_blank, value(real_string_here)

. AFAIK no systems, languages have a great solution. However I feel better solutions lie in exploring the 'information modeling' space - RDF etc (perhaps with some kind of versioned schemas to describe what is possible and necessary.) Basically we don't want multiple data representations to correspond to a single reality (which is what happens in null vs "", or sometimes with nested

option<option<option<t>>>

Orion Reed

08/05/2020, 7:58 AM

@shalabh to add to your points on how reality and implementation play into our constructs of strings. If we take away implementation and ways to model them, strings are really just symbols in a total order. In that context, spaces don’t make much sense, neither do tabs, empty strings too. In my humble opinion we need to build a construct for orders and symbols (not just text, but any symbol you can imagine) and then modern string types can be considered special cases where we are talking about a total order of ‘text’ symbols.

Duncan Cragg

08/05/2020, 8:19 AM

In Onex I have no empty string concept and also no control characters (space is a control character in that statement)

Orion Reed

08/05/2020, 8:22 AM

@Duncan Cragg sounds interesting! Do you have a link?

Duncan Cragg

08/05/2020, 8:22 AM

I have unknown (perhaps like undefined) and nothing (maybe that's null) as distinct concepts or special symbols

Duncan Cragg

08/05/2020, 8:23 AM

Oh hai Orion, um, documentation is a little thin and/or dated 😧

Duncan Cragg

08/05/2020, 8:28 AM

Other shocking things about Onex: a property can be a single symbol but you can add more symbols, at which point it becomes a list. Indexed from 1. 😊

Duncan Cragg

08/05/2020, 8:30 AM

Maybe I should write an up to date description of this..

👍 1

shalabh

08/05/2020, 8:36 AM

@Orion Reed I assume you mean a total order on words? Yes that could be one way to model strings. 'Text' however is a widespread concept outside computers and often mixed in with presentation as well (are paragraph boundaries important? Are underlines important?) So in some sense having a flexible 'container' of media provided by the user is reasonable too. Kinda like a 'bitmap drawing' or even 'rich text'. I don't think strings are particularly fundamental - they model some aspects of text (e.g. paragraphs) but not others (color, underline). In any case, the system won't look inside this media object to make decisions - it will just pass this object around and the meaning is entirely interpreted by another person at some other time. This is where I think "" and null become interesting: often we write

if text is none: ...

. So the system does look inside this shape.

shalabh

08/05/2020, 8:42 AM

The 'auto-list' model is interesting, and probably right. "I have one apple" should only be represented one way, so

apple

[apple]

looks suspicious.

Mariano Guerra

08/05/2020, 8:46 AM

The thing I find interesting is that if a single value is not a special (and default case) then you don't need null or similar

Mariano Guerra

08/05/2020, 8:47 AM

if everything is a sequence (of potentially zero or one items), then an empty sequence would be "null", the cool thing is that if operations on values internally translate to map then handling the empty case comes for free. This sounds a little bit like "nil punning" in clojure, which is a pain when you find a nil and don't know where it became nil since everyone is passing it around happily

Duncan Cragg

08/05/2020, 9:40 AM

In Onex, empty sequence=empty symbol=nothing =the whole property isn't there. In an object property, if you empty the value or clear the list, the property itself is deleted; if you want a placeholder you just use unknown

Duncan Cragg

08/05/2020, 9:41 AM

conflating nothing with unknown is a source of uncountable glitches in the history of software

amiga tick 1

Duncan Cragg

08/05/2020, 9:42 AM

Note that this is all to help non-techies feel at home. Yes, non-techies would wonder why

apple

isn't the same as

[apple]

, or even why have those brackets

Duncan Cragg

08/05/2020, 9:43 AM

so I have

fruit: apple

then

fruit: apple pear

and no brackets, no "list type"

Duncan Cragg

08/05/2020, 9:44 AM

and so

fruit:[1]

will always be

apple

☝️ 2

Dan Cook

08/06/2020, 7:56 AM

If undefined means "not known yet" and null means "known to be nothing", then what about the situation when a value is known to be the value

undefined

Duncan Cragg

08/06/2020, 10:58 PM

(just recovering from the neuronal warp that question induced)

Duncan Cragg

08/06/2020, 10:59 PM

maybe "not known yet" can also encompass "not known, and who knows why or when"

Duncan Cragg

08/06/2020, 11:00 PM

got any examples?

Garth Goldwater

08/07/2020, 12:14 AM

i’m dealing with that trying to do something somewhere between (un)typed holes and a structural editor—if a user has a key but its property isn’t defined yet, and then they skip ahead to elsewhere in the syntax tree, i’d say that i know they want a value for the property later but it’s not strictly defined yet

👍 2

Garth Goldwater

08/07/2020, 12:14 AM

pretty sure i can just like... ignore that situation though lol

Garth Goldwater

08/07/2020, 12:16 AM

i haven’t stopped thinking about your no lists/everything is a list model @Duncan Cragg... i think you might be spiritually correct, not making afforsances for user convenience

Garth Goldwater

08/07/2020, 12:17 AM

reminds me of how a lot of stuff “just works” in APL as a result of rank polymorphism, but also seems a bit further than that

shalabh

08/07/2020, 12:47 AM

In Python, there is no char type. They are just strings of length 1. So

a[0] == a

is true for strings of length 1. You can also loop over them

for char in a

will work for zero, one or longer length strings. Things don't come crashing down.

shalabh

08/07/2020, 12:50 AM

BTW, one angle to think about this is that 'nothing' is a property of the field itself (e.g. fields are boxes, and one box is empty). While 'unknown' is a special kind of object that I can put in a box. Interesting.. can I put two unknowns in a box? Or an

unknown

and an

apple

Dan Cook

08/07/2020, 3:07 AM

I think that falls apart if you can get back 'nothing' as a value. But if you can't, then what happens when you try to get it? The only other options I know are throwing an error, or some kind of Haskell-like "Maybe" construct

Duncan Cragg

08/07/2020, 1:01 PM

@shalabh to avoid that problematic subtlety, I don't have empty boxes, if you empty it, it disappears! (i.e. the object property is deleted) If you try to get it, you get nothing back.

Duncan Cragg

08/07/2020, 1:01 PM

You can put

fruit: *unknown* apple *unknown* banana

, yes

Duncan Cragg

08/07/2020, 1:02 PM

But

fruit: *nothing* apple *nothing* mango

collapses to just

fruit: apple mango

Duncan Cragg

08/07/2020, 1:03 PM

and of course

fruit: *nothing* banana

is just

fruit: banana

(no list any more)

Duncan Cragg

08/07/2020, 1:05 PM

@Dan Cook what falls apart if you can get back nothing ?

Dan Cook

08/07/2020, 3:01 PM

If nothing is not a value and just a property or state, then what is the result of getting nothing? If you can get back nothing, then nothing has to be a value. ... But the solution you just described (i.e. nothing just disappears from containers) is interesting! So nothing is somewhat like a value, but it's a disappearing value.

Duncan Cragg

08/07/2020, 3:22 PM

I suppose it depends if you're on the "left hand side" (matching) or "right hand side" (setting) of a rewrite rule. If you're matching, you have to be able to match with a symbol, so can use nothing to say you really want that to be absent. If you're setting, and use nothing, that's when the disappearing trick happens.

Duncan Cragg

08/07/2020, 3:24 PM

Obviously, with homoiconicity (ooo), you can't let the rewrite rule itself reduce because the matching side will disappear!

2 Views

Open in Slack

Previous Next