DeepDive in everything of Llama3: revealing detailed insights and implementation (github.com)
189 points · therealoliver · 22 hours ago
I like the use of the functional API here. I learned through a similar route and it was very helpful for me compared to trying to understand `torch.nn.Module`. Here's a gist of my learning path if it's helpful to anyone: https://gist.github.com/kevmo314/294001659324429bae6749062a9...
I hadn't realized OpenAI's tiktoken Python library could work with other models outside of the OpenAI family, that's really useful: https://github.com/therealoliver/Deepdive-llama3-from-scratc...
kevmo314 ·21 hours ago
simonw ·21 hours ago
aghilmort ·21 hours ago
curtisszmania ·21 hours ago
FreebasingLLMs ·17 hours ago