rsanek 5 days ago

Anyone else read the book that the author mentions, Build a Large Language Model (from Scratch) [0]? After watching Karpathy's video [1] I've been looking for a good source to do a deeper dive.

[0] https://www.manning.com/books/build-a-large-language-model-f...

[1] https://www.youtube.com/watch?v=7xTGNNLPyMI

tanelpoder 5 days ago | parent | next [-]

Yes, can confirm, the book is great. I was also happy to see that the author correctly (in my mind) used the term “embedding vectors” vs. “vector embeddings” that most others seem to use… Some more context about my pet peeve: https://tanelpoder.com/posts/embedding-vectors-vs-vector-emb...

malshe 5 days ago | parent | prev | next [-]

Here is the code used in the book - https://github.com/rasbt/LLMs-from-scratch

gchadwick 5 days ago | parent | prev | next [-]

I thought it was a great book. It dives into all the details and lays them out step by step with some nice examples. Obviously it's a pretty basic architecture and very simplistic training, but I found it gave me the grounding to then understand more complex architectures.

kamranjon 5 days ago | parent | prev | next [-]

It’s good - I’m working through it right now

horizion2025 5 days ago | parent | prev | next [-]

Is there a non-video equivalent? I always prefer reading and digesting at my own pace to following a video.

gpjt 5 days ago | parent [-]

Check the first link in the parent comment; it links to the book.

tra3 5 days ago | parent | prev | next [-]

Is [1] worth a watch if I want to get a basic, high-level understanding of how LLMs work?

rsanek 5 days ago | parent [-]

Yeah, it's very well done

ForceBru 5 days ago | parent | prev [-]

Yes, it's really good