One paper <@UCQ4BBFQ9> references in the blog post...
# thinking-together
s
One paper @jdjakub references in the blog post is Infra: Structure all the way down (https://sites.cs.ucsb.edu/~holl/pubs/Hall-2017-Onward.pdf). @Joe Trellick has recommended that one to me a while ago also. So I finally read it. And then I read Christopher Hall’s thesis (http://www.christopherkhall.com/Dissertation.pdf), which is basically a not even that much longer version of the paper. I find it quite interesting. The storage format is really well thought out and has some interesting affordances for use cases I didn’t expect to be served by a storage format directly, but now after reading make a lot of sense to me. Not much has happened since the release of the paper it seems. Google Scholar only found one referencing paper, unsurprisingly one that’s also mentioned in @jdjakub’s blog post, and which is a critique to a paper @Bosmon has his name on. We’re all here in this together it seems. :-) The website talks about open source (still pending university’s approval) and standardization. But given the status of the site it could equally likely be that the author has moved on. Has anybody done anything with Infra, or knows more about what has happened or is happening with it? And what are your thoughts about Infra?
💯 1
❤️ 1
j
I invited @Chris Hall to the slack a while back, so maybe he can comment? I really like how detailed he gets with the format, in specifying the byte structure and such (random thought: what if it was a superset of UTF-8?). I find it an interesting idea to take text and elevate it from this ground-level-only representation (a string of characters) to something recursive and more powerful. Infra doesn't bottom out like text does, but can have nesting of various containers, references, and an interesting concept to apply changes to references called a patch. My one thought is that maybe by starting with text and elevating it makes Infra limited in some ways. Text is linear and one dimensional. Infra reflects that in that it has byte-length prefixes for all elements. I think if the goal is to have this be used as a base level for implementing a programming environments or other application, there would still be a lot of work linearizing the parts of the system so it could be encoded in Infra. The thought about using Infra even in memory on the heap (so you don't need serializers/de-serializers) is interesting, but I'm not sure many systems would be able to do that with the linearity constraint.
e
Infra is both brilliant and ignorant at the same time. It is kind of neat to consider a text form that has meta-data above it, that you can access. that keeps the detail from overwhelming you. that's intriguing and deserves more study - is reminiscent of Xanadu in that sense. But it is ignorant because their serialization is basically a very rudimentary version of Adobe Flash's AMF (action message format), which is up to version 3 and handles very tricky things like serialization of circular rings, and has dozens of tricky data formats like fonts, bitmaps, mp3 sounds, etc. AMF is fully in the public domain and I believe is used by Hack as well. A good binary serialization system is found in many high quality environments, and the mark of quality in a serialization format is the range of datatypes that can be encoded. Using XML and JSON, two of the absolute weakest-extant encoding formats, as your starting point was their mistake. Many of the latest languages have serialization built in, and it is a huge amount of code to do this well, one of the monster issues confronting anyone trying to develop a new system. I would wager not 100 people on the planet earth understand the truetype language, which underlies all font rendering. Most people think that a font is a bunch of pictures of a letter, but in reality it is a bizarre language custom made for describing letter shapes, and it is executed by an interpreter to generate letters, and since it is code it can have bugs, and Microsoft in one of their releases of Office shipped fonts that had crash bugs in them, and wreaked havoc with one of my publishing products that produced thumbnails in the background of every font. Ever wonder why PDF won the battle for document format, even though it is not a file format but a glob of postscript code? Primarily because Adobe was one of the only companies that knows Truetype (and their own font format, which they called Type 1).