I'm stewing on the idea of a "literate codec" | Future of Coding #thinking-together

Walker Griggs

10/21/2024, 4:14 PM

I'm stewing on the idea of a "literate codec" -- the "quite OK" ecosystem feels like a good place to start. Can anyone recommend modern alternatives to CWEB?

Kartik Agaram

10/21/2024, 5:09 PM

Two comments on opposite extremes:
1. I quite like the blog post about QOI. Perhaps it doesn't matter that it's not literate?
2. After starting out enamored with LP, lately I think the most important thing is that a program is easy to run. If the iteration loop is right it's enough to just throw small programs at someone. They can use it if they care about the domain. If it doesn't seem like people read it, it's because descriptions like this are akin to rooms of requirement. People will come to it when they're ready, months or years later. Just make sure it runs then. The how to run instructions need to be rock solid. Everything else is gravy and nice to have -- if you meet the preconditions of receptive reader and easy to run. So I think classic dead-fish LP emphasizes the wrong things.

Kartik Agaram

10/21/2024, 5:28 PM

Are you familiar with the QOI eco-system? This is the first I'm hearing about it, and I'm immediately suspicious of the airy "20% better compression" claim in the repo. Have other people validated this claim, do you know? It feels more believable if they claimed "20x faster encode/decode, 20x shorter implementation, 20% worse compression."

Walker Griggs

10/21/2024, 10:08 PM

No one really uses qoi or qoa in production, from what I can tell

Walker Griggs

10/21/2024, 10:10 PM

I think the project stems from the desire to make as simple of a codec / format as possible while still being somewhat performant. I do also have a hard time believing the "20x blah blah" .. I don't think that was evaluated with much rigor. Or at least, it's cherry picking favorable stats

Walker Griggs

10/21/2024, 10:11 PM

as far as the LP part of the Q, I'm actually less concerned with it being easy to run and more concerned with it being easy to read. This all stemmed from a number of conversations I had at a conference this week about "how do we teach people how codecs work" etc.

Walker Griggs

10/21/2024, 10:11 PM

My favorite example of this is: https://www.amazon.com/Introduction-Video-Compression-Fore-June/dp/1451522274?ref_=ast_author_dp

Kartik Agaram

10/21/2024, 10:35 PM

I'm actually less concerned with it being easy to run and more concerned with it being easy to read.

Yeah, my claim is that this is a false dichotomy. Reading and running are both contributors to helping build a mental model of a program in someone's head. Reading without running runs into all the Bret Victor criticisms we know and love here.

There are a few different LP systems out there. I've built one myself and know of several more by just people in my circle. This page is one list. But they haven't caught on much, and I think it's because conflating code with books pulls in considerations from the publishing industry that don't actually help build mental models in people's heads. Literate programs look like blog posts, and reading them doesn't actually get people to engage actively with them.

If you separate them from the publishing angle with its irrelevant constraints, other form factors seem more promising:
• Textbooks in the context of some formal class, with exercises. (Your link above seems to be in this category.)
• Documentation in the context of a specific program.

These kinds of circumstances are why in the past year I've started to care more about running first before I even start reading. If you can run it, the reading experience can be more fault tolerant, and it can be more economic to provide.

I get the sense I might be talking past you, so definitely let me know if I'm misunderstanding your question.

Kartik Agaram

10/21/2024, 10:38 PM

I've been trying not to plug my own stuff, but a couple of links might help triangulate where I'm coming from:
• My literate programming approach. I used this for several years.
• A post on why people don't read programs.

Konrad Hinsen

10/22/2024, 6:25 AM

I very much agree that it takes both reading and running to engage with code, and that's indeed a major issue with traditional LP. Also with Open Source, btw, which suggests that having full access to the code is enough to understand and modify code, even if it's a huge mess and impossible to build. That said, integrating code with a narrative becomes very relevant when you also add data (via visualizations). It's not code you engage with then, but data, computational models, etc. This is the reason why notebooks were so much more successful than traditional code-centric LP. One major weakness of notebooks is the single narrative. What I would like to have is a graph of narratives, code, and data, everything being interactive. I am aware of two real-life systems that enable this: Glamorous Toolkit and Webstrates.

Jack Rusher

10/22/2024, 8:02 AM

+1 on read and run, preferably in a system that allows per form evaluation to aid in codebase exploration

Alex McLean

10/23/2024, 10:46 AM

Computer programs are complex systems so it is impossible to understand them just by reading them.

Alex McLean

10/23/2024, 10:50 AM

For example

(x ^ y) % 9 == 0

is easy to understand as code but when you run it you get something in a completely different domain with effects and relationships that you couldn't have predicted
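For anyone who wants to run the expression above rather than only read it, here is a minimal sketch. Python is an assumption (the thread doesn't say what language the original was in), and the `render` helper, the grid size, and the `#`/`.` characters are all illustrative choices, not anything from the referenced post.

```python
def render(width: int = 64, height: int = 32) -> str:
    """Draw a text grid: '#' where (x ^ y) % 9 == 0, '.' elsewhere.

    `^` is bitwise XOR here, as in the expression quoted above.
    The function name and defaults are hypothetical, chosen for illustration.
    """
    rows = []
    for y in range(height):
        row = "".join("#" if (x ^ y) % 9 == 0 else "." for x in range(width))
        rows.append(row)
    return "\n".join(rows)


if __name__ == "__main__":
    # Print the grid to the terminal; try other moduli or sizes to explore.
    print(render())
```

Changing the `9` or swapping `^` for `&` or `|` and rerunning is probably the quickest way to get a feel for how far the output sits from the one-line source.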

Alex McLean

10/23/2024, 10:52 AM

image.png

Alex McLean

10/23/2024, 10:52 AM

ref https://x.com/aemkei/status/1378106734871461890

Alex McLean

10/23/2024, 10:53 AM

that's where liveness comes in - connecting code, domain and programmer

Kartik Agaram

10/24/2024, 10:08 AM

Self-referentially, Alex's comment just took us from abstract generalizations about code directly to the domain of code.

Kartik Agaram

10/24/2024, 10:14 AM

But now this thread connects up for me with a recent discussion on Mastodon about what 'understanding' means, and where 'understanding' lies.

Computer programs are complex systems so it is impossible to understand them just by reading them.

It is arguably also impossible to understand most programs today just by running them.

Kartik Agaram

10/24/2024, 10:28 AM

Which now connects up for me with the podcast episode on Programming as Theory-building: a lot of "understanding" a program comes from figuring out which inputs to pass to it. And I don't mean just some static list of inputs captured in a unit test. You learn the broad categories of phenomena you can expect from a domain, akin to the kinds of orbits people have discovered so far for the 3-body problem. I've learned knacks like this from working with others in the past. (Apologies if all this seems too much of a tangent to the original thread. I can start a fresh one if so.)

Beni Cherniavsky-Paskin

10/27/2024, 4:53 PM

I recall Xiph doing some well visualized and explained posts on their research on experimental video codec: https://xiph.org/daala/

Walker Griggs

10/28/2024, 7:59 PM

I have not abandoned this thread... just gotten busy. I'll swing back shortly to read, digest, and respond.

Walker Griggs

10/28/2024, 8:00 PM

and also, yes, I've loved Monty's documentation and videos -- shame they take so much time to produce
