rsanek 5 days ago
Anyone else read the book that the author mentions, Build a Large Language Model (from Scratch) [0]? After watching Karpathy's video [1] I've been looking for a good source to do a deeper dive.
[0] https://www.manning.com/books/build-a-large-language-model-f...
tanelpoder 5 days ago
Yes, can confirm, the book is great. I was also happy to see that the author correctly (in my mind) used the term “embedding vectors” rather than the “vector embeddings” that most others seem to use… Some more context about my pet peeve: https://tanelpoder.com/posts/embedding-vectors-vs-vector-emb...
malshe 5 days ago
Here is the code used in the book - https://github.com/rasbt/LLMs-from-scratch
gchadwick 5 days ago
I thought it was a great book: it dives into all the details and lays them out step by step with some nice examples. Obviously it's a pretty basic architecture and very simplistic training, but I found it gave me the grounding to then understand more complex architectures.
kamranjon 5 days ago
It’s good - I’m working through it right now
horizion2025 5 days ago
Is there a non-video equivalent? I always prefer reading/digesting at my own pace compared to following a video.
tra3 5 days ago
Is [1] worth a watch if I want to get a high-level, basic understanding of how LLMs work?
ForceBru 5 days ago
Yes, it's really good