openai / grok
- вторник, 19 марта 2024 г. в 00:00:01
This is the code for the paper Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets by Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, and Vedant Misra
pip install -e .
./scripts/train.py