Large Language Models : A Hands-on Approach
LLMs have changed how we build intelligent applications.
This course aims to bridge the gap between:
Using LLMs via APIs -> Engineering LLM systems
| Option | Cost | Notes |
|---|---|---|
| Google Colab | Free | Limited GPU, good for experiments |
| GCP Free Credits | Free | $300 for new accounts |
| Colab Pro+ | Paid | More GPU time, better GPUs |
| RunPod | Paid | Flexible, longer runs |
| Lightning AI | Paid | Easy setup, good UX |
Free options suffice for most of our course needs
| Component | Weight | Notes |
|---|---|---|
| Quizzes | 20% | 5 short in-class quizzes |
| Assignments | 40% | 2 assignments (May and June) |
| Final Project/Presentation | 40% | June 3rd week onwards |
–
Individual or group project (max 2).
Build an end-to-end LLM application:
Deliverables: Documentation + Demo / Presentation
| Module | Topic |
|---|---|
| 1 | LLM Foundations - Transformers, GPT-2, Modern Architectures, MoE, OSS Models |
| 2 | GPUs - Architecture and Programming, Multi-GPU Parallelism, |
| 3 | Inference - Sampling, KV Caching, Quantization, Speculative Decoding, Model Serving |
| 4 | RAG & Agents - RAG Fundamentals, ReAct, Tools, Protocols, Agents, Finetuning |
| 5 | Evaluation - Moderl Evaluations, Benchmarks, LLM-as-a-Judge |
| Module | Topic |
|---|---|
| 6 | Fine-Tuning - SFT, PEFT (LoRA, QLoRA etc),RLHF (DPO, GRPO etc), Distillation |
| 7 | Reasoning - Chain-of-Thought (CoT), Test Time Scaling, Finetuning |
| 8 | Multimodal - Multimodal Architectures, Finetuning |
| 9 | Edge Deployment - Edge architectures, Optimization and Deployment |