How Do Machines Grok Data?
Overtrained neural networks discover novel solutions
https://www.quantamagazine.org/how-do-machines-grok-data-20240412
https://news.ycombinator.com/item?id=40020702
* machine learning: neural network (linear algebra) over data
* train on training data to minimize error to expected result ("memorization")
* test on test data
* overfitting: overtrained on training data, error increases on test data
* h/e massively overtrained LLM discard "memorized" solution, acquire "generalization"capabilities💡
