I completed a toy setup of GEPA with some fricken good results! I am very happy to have a methodology that removes some of the guesswork from prompt design. It's a really simple algorithm that orchestrates a genetic evolution where the mutation operator is asking an LLM to improve the prompt based on the diagnostic trace from your evals. So simple! I also enabled web search during reflection so it can actually do its own research when improving the prompt. This means it can adapt the web documentation to suit the prompt format automatically. Very good. I have been looking for something like this and it did not disappoint.
https://observablehq.com/@tomlarkworthy/gepa
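
To make the loop concrete, here is a rough Python sketch of the idea, not the code from the notebook. It assumes a greedy "mutate the best candidate" selection rather than GEPA's Pareto-based selection, and every name in it (evolve, Candidate, toy_evaluate, toy_reflect) is made up for illustration. The evaluator and the reflection step are deterministic stand-ins; in a real setup evaluate would run your evals and reflect would call an LLM (possibly with web search enabled).

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class Candidate:
    prompt: str
    score: float
    trace: str  # diagnostic text produced by the evals


# Hypothetical signatures, not a real library API.
Evaluate = Callable[[str], Tuple[float, str]]  # prompt -> (score, trace)
Reflect = Callable[[str, str], str]            # (prompt, trace) -> new prompt


def evolve(seed: str, evaluate: Evaluate, reflect: Reflect,
           generations: int = 5, pool_size: int = 4) -> Candidate:
    """Greedy simplification: always mutate the current best candidate."""
    score, trace = evaluate(seed)
    pool = [Candidate(seed, score, trace)]
    for _ in range(generations):
        parent = max(pool, key=lambda c: c.score)
        # Mutation operator: rewrite the prompt using the diagnostic trace.
        child_prompt = reflect(parent.prompt, parent.trace)
        child_score, child_trace = evaluate(child_prompt)
        pool.append(Candidate(child_prompt, child_score, child_trace))
        # Keep only the top-scoring candidates.
        pool = sorted(pool, key=lambda c: c.score, reverse=True)[:pool_size]
    return max(pool, key=lambda c: c.score)


# Toy stand-ins so the sketch runs without an LLM.
REQUIRED = ["reply in json", "cite sources", "be concise"]


def toy_evaluate(prompt: str) -> Tuple[float, str]:
    """Score = fraction of required phrases present; trace lists the gaps."""
    missing = [r for r in REQUIRED if r not in prompt.lower()]
    score = 1 - len(missing) / len(REQUIRED)
    trace = "missing: " + "; ".join(missing) if missing else "all checks passed"
    return score, trace


def toy_reflect(prompt: str, trace: str) -> str:
    """Pretend LLM: patch the first problem named in the trace."""
    if not trace.startswith("missing: "):
        return prompt
    first = trace[len("missing: "):].split("; ")[0]
    return f"{prompt} Remember to {first}."


if __name__ == "__main__":
    best = evolve("Answer the question.", toy_evaluate, toy_reflect)
    print(best.score, best.prompt)
```

The point of the sketch is that the only "smart" part is reflect: swap the toy version for an LLM call that sees the prompt plus the eval trace, and the rest of the loop stays the same.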