https://futureofcoding.org/ logo
#of-ai
Title
k

Kartik Agaram

08/10/2023, 10:15 PM
Grokking
In 2021, researchers made a striking discovery while training a series of tiny models on toy tasks . They found a set of models that suddenly flipped from memorizing their training data to correctly generalizing on unseen inputs after training for much longer. This phenomenon – where generalization seems to happen abruptly and long after fitting the training data – is called grokking and has sparked a flurry of interest .
https://pair.withgoogle.com/explorables/grokking
a

alltom

09/04/2023, 9:16 PM
Oh wow, this is a better overview than the first paper I read on this. Thanks!