Week 9 — Neural Networks from Scratch (Karpathy)
Karpathy "Neural Networks: Zero to Hero" — Episode 1: micrograd Part 1 · Episode 1 continued — code along · Finish micrograd, internalize backprop fully · Karpathy Episode 2: makemore (bigram model) · …
7 daily tasks
Week 10 — Deep Learning Theory Foundation
Activation functions: ReLU, Sigmoid, Tanh, GELU, SiLU · Forward + backward propagation deep dive · Loss functions: MSE, Cross-entropy, BCE · Optimizers: SGD, Momentum, Adam, AdamW · …
7 daily tasks
Week 11 — PyTorch Mastery
Tensors, devices, basic ops · Autograd: requires_grad, .backward(), .no_grad() · nn.Module, custom layers · Dataset and DataLoader · …
7 daily tasks
Week 12 — Computer Vision Foundations (CNNs)
What is convolution? Filters, stride, padding · Pooling, receptive field · CNN architectures: LeNet, AlexNet, VGG · ResNet + residual connections (critical) · …
7 daily tasks
Week 13 — NLP Fundamentals + Word Embeddings
Tokenization basics: characters, words, subwords · BPE (Byte-Pair Encoding) — how GPT tokenizes · Karpathy tokenizer Part 2 + finish · Word2Vec, GloVe (legacy but interview-relevant) · …
7 daily tasks
Week 14 — The Transformer Architecture
"Attention Is All You Need" paper — first pass, skim · 3Blue1Brown "But what is a GPT?" + "Attention in transformers, visually explained" · Self-attention math: Q, K, V matrices · Multi-head attention, positional encodings (sinusoidal, RoPE, ALiBi) · …
7 daily tasks
Week 15 — Build GPT from Scratch with Karpathy
Karpathy "Let's build GPT: from scratch" — minutes 0-30 · Continue — minutes 30-60 · Continue — minutes 60-90 · Continue — minutes 90-120 · …
7 daily tasks
Week 16 — HuggingFace Ecosystem + Fine-Tuning Basics
HF Transformers: pipeline, AutoModel, AutoTokenizer · Loading + using pretrained models (BERT, GPT-2) · Tokenizers in HF, padding, attention masks · Datasets library: load, map, filter · …
7 daily tasks