Bootstrap your knowledge of LLMs as quickly as possible.
Start with these neural-network resources before moving on to transformers.
- https://www.youtube.com/watch?v=aircAruvnKk&list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi
- https://www.3blue1brown.com/topics/neural-networks
- http://neuralnetworksanddeeplearning.com/
- https://distill.pub/
- Andrej Karpathy The spelled-out intro to language modeling: building makemore: a basic bigram name-generator model, built first by counting and then with a neural network, using PyTorch.
- Andrej Karpathy Building makemore Part 2: MLP
- Andrej Karpathy Building makemore Part 3: Activations & Gradients, BatchNorm
- Andrej Karpathy Building makemore Part 4: Becoming a Backprop Ninja
- Hedu AI Visual Guide to Transformer Neural Networks - (Episode 1) Position Embeddings: Tokens are embedded into a semantic space; sine/cosine positional encoding is explained very well.
- Hedu AI Visual Guide to Transformer Neural Networks - (Episode 2) Multi-Head & Self-Attention: Clear overview of multi-head attention.
- Hedu AI Visual Guide to Transformer Neural Networks - (Episode 3) Decoder’s Masked Attention: Further details on the transformer architecture.
- Andrej Karpathy Let's build GPT: from scratch, in code, spelled out: builds a Shakespeare GPT-2-like model from scratch, starting with a bigram model and adding features one by one. PyTorch.
- Chris Olah CS25 I Stanford Seminar - Transformer Circuits, Induction Heads, In-Context Learning: Interpretability. A deep look into the mechanics of induction heads. Companion article.
- Jay Alammar The Illustrated Word2vec - A Gentle Intro to Word Embeddings in Machine Learning
- Jay Alammar How GPT3 Works - Easily Explained with Animations: an extremely high-level overview.
- Jay Alammar The Narrated Transformer Language Model: a much deeper look at the architecture; goes into detail. Companion article.
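The count-based bigram model from the first makemore video can be sketched in a few lines of PyTorch. This is a minimal illustration, not Karpathy's exact code; the tiny `words` list is a hypothetical stand-in for his `names.txt` dataset.

```python
import torch

# Hypothetical toy corpus standing in for Karpathy's names.txt.
words = ["emma", "olivia", "ava", "isabella", "sophia"]

chars = sorted(set("".join(words)))
stoi = {c: i + 1 for i, c in enumerate(chars)}
stoi["."] = 0  # '.' marks both the start and the end of a name
itos = {i: c for c, i in stoi.items()}

# Count how often each character follows each other character.
N = torch.zeros((len(stoi), len(stoi)), dtype=torch.int32)
for w in words:
    cs = ["."] + list(w) + ["."]
    for c1, c2 in zip(cs, cs[1:]):
        N[stoi[c1], stoi[c2]] += 1

# Row-normalize counts into next-character probabilities (add-one smoothing).
P = (N + 1).float()
P /= P.sum(dim=1, keepdim=True)

# Sample a new name by walking the bigram chain until '.' recurs.
g = torch.Generator().manual_seed(42)
ix = 0
out = []
while True:
    ix = torch.multinomial(P[ix], num_samples=1, generator=g).item()
    if ix == 0:
        break
    out.append(itos[ix])
print("".join(out))
```

The video then replaces the count table with a one-layer neural network trained by gradient descent, which converges to the same probabilities.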
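The sine/cosine positional encoding covered in the Hedu AI episode on position embeddings can be sketched as follows; this is a straightforward PyTorch rendering of the standard formula, with the sequence length and model width chosen arbitrarily for illustration.

```python
import torch

def sinusoidal_positions(seq_len: int, d_model: int) -> torch.Tensor:
    """Standard sinusoidal positional encodings:
    PE[pos, 2i]   = sin(pos / 10000^(2i / d_model))
    PE[pos, 2i+1] = cos(pos / 10000^(2i / d_model))
    """
    pos = torch.arange(seq_len, dtype=torch.float32).unsqueeze(1)   # (seq_len, 1)
    i = torch.arange(0, d_model, 2, dtype=torch.float32)            # even dims
    angles = pos / (10000 ** (i / d_model))                         # (seq_len, d_model/2)
    pe = torch.zeros(seq_len, d_model)
    pe[:, 0::2] = torch.sin(angles)  # even columns get sine
    pe[:, 1::2] = torch.cos(angles)  # odd columns get cosine
    return pe

pe = sinusoidal_positions(seq_len=16, d_model=8)
```

These vectors are simply added to the token embeddings, giving each position a unique, smoothly varying signature across frequencies.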
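The masked (causal) self-attention described in Hedu AI's decoder episode and built step by step in Karpathy's GPT video reduces to a few tensor operations. A minimal single-head sketch, with shapes and random weights chosen only for illustration:

```python
import math
import torch

def causal_self_attention(x, Wq, Wk, Wv):
    """Single-head scaled dot-product attention with a causal mask.
    x: (T, d) token embeddings; Wq/Wk/Wv: (d, d_head) projections."""
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    T = x.shape[0]
    scores = (q @ k.T) / math.sqrt(k.shape[1])  # scaled dot products
    # Mask strictly-upper-triangular entries so position i cannot see j > i.
    mask = torch.triu(torch.ones(T, T, dtype=torch.bool), diagonal=1)
    scores = scores.masked_fill(mask, float("-inf"))
    weights = torch.softmax(scores, dim=-1)     # rows sum to 1
    return weights @ v, weights

torch.manual_seed(0)
d, d_head, T = 8, 4, 5
x = torch.randn(T, d)
out, w = causal_self_attention(
    x, torch.randn(d, d_head), torch.randn(d, d_head), torch.randn(d, d_head)
)
```

Multi-head attention runs several of these in parallel with independent projections and concatenates the results; the mask is what makes decoder-style (GPT) training autoregressive.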