Course Schedule
Weekwise Schedule
Tentative and subject to change
| Theme | Week | Topics | 
|---|---|---|
| LLM Foundations I | 1.1 | Orientation, Transformer architecture | 
| 1.2 | Transformer Architecture - GPT 1 and 2 | |
| 2.1 | Tokenization, Pretraining objectives | |
| 2.2 | Mixture of Experts | |
| LLM Foundation II | 3.1 | Case studies: State-of-the-art open-source LLM architectures | 
| 3.2 | Scaling Laws, Emergent properties | |
| GPU Basics | 4.1 | GPU architecture deep dive | 
| 4.2 | Parallelism: Multi GPU, Multi Node | |
| 5.1 | On-Prem Hardware Stack Deep Dive | |
| Inference | 5.2 | Inference Strategies | 
| 6.1 | Inference Math and Bottlenecks | |
| 6.2 | Efficient Attention & KV Caching | |
| Efficient Inference & Quantization | 7.1 | Quantization Fundamentals | 
| 7.2 | Inference Engines and Multi GPU | |
| Fine-Tuning Fundamentals | 8.1 | Full Fine-Tuning vs. PEFT — When to Use Each | 
| 8.2 | Instruction Tuning | |
| 9.1 | Alignment (RLHF, DPO etc) | |
| 9.2 | More RL | |
| Reasoning | 10.1 | Reasoning & Chain-of-Thought | 
| 10.2 | CoT, Tree-of-Thought, Self-Consistency — Prompt Engineering as Code | |
| RAG | 11.1 | RAG Fundamentals - Context-engineering, embeddings, search and rerankers | 
| 11.2 | Evaluating RAG | |
| Agents | 12.1 | ReAct Framework: Thought → Action → Observation | 
| Tool Use & Function Calling | 12.2 | MCP introduction | 
| 12.3 | Agentic RAG, Multi Agent Orchestration, Multimodal Agents | |
| Agent Finetuning | 13.1 | Fine Tuning for Tool calling | 
| 13.2 | Agent Evaluation & Safety | |
| Evaluation | 14.1 | Evaluation | 
| 14.2 | Observability & Monitoring | |
| Multimodal Models | 15.1 | Multi Modal Architecture: Image, Audio and Video models, Running Locally | 
| 15.2 | Fine tuning multimodal models | |
| LLMs on the Edge | 16.1 | Edge-Optimized LLM Architectures, case studies | 
| 16.2 | Edge Optimization techniques | |
| Security & Privacy Engineering | 17.1 | Threat Model: Prompt Injection, Jailbreaking, Data Leakage | 
| Frontiers | 17.2 | Emerging Topics: Mamba, Qwen Next, Hybrid architectures | 
| Presentations | 18.1 | Student Presentations I | 
| 18.2 | Student Presentations II | 
Materials
- Lecture slides and notes will be shared here as class progresses.