| ▲ | ofsen 7 hours ago | |
This looks like exact copy of this video of andrej karpathy ( https://youtu.be/kCc8FmEb1nY ) but in a writing format, am i wrong ? | ||
| ▲ | mellosouls 22 minutes ago | parent | next [-] | |
The page describes its relationship to nanogpt. ...nanoGPT targets reproducing GPT-2 (124M params) and covers a lot of ground. This project strips it down to the essentials and scales it to a ~10M param model that trains on a laptop in under an hour... | ||
| ▲ | drcongo 4 hours ago | parent | prev [-] | |
Yes, you are. | ||