To clarify terms, by pixel you mean three arrays, one each for RGB, and a fourth containing X/Y co-ordinates on a 2D grid (which could be a screen, or presumably could be abstracted?) Or other things?
I’m not sure how other primitive things necessary to make this work in the fashion described, like neighbour calculations, would be crafted in pixel-terms, or if the “instead of bits” interpretation is a bit too limiting.
If you’re interested in computing on a spatial grid, in which cells can contain color information, though, you might find agent-based modeling frameworks an interesting place to start.
A lot of our basic example sims in HASH are around things like color passing:
https://hash.ai/models?query=color&sort=relevance&page=1
Some docs at
https://hash.ai/docs/simulation if you want to have a play. It’s all free, and mostly open