Note: Since this is the first time the class is being taught, the schedule may adjust if we need more or less time on certain topics.
| Transformers & Pretraining Scaling |
| 1/21 | Introduction & Transformers [ slides ] | | |
| 1/26 | No classes (canceled due to inclement weather) |
| 1/28 | Transformers (cont'd) [ slides ] | | |
| 2/2 | Transformers (cont'd) [ slides ] | | |
| 2/4 | Pretraining scaling [ slides ] | | |
| 2/9 | Decoding and inference [ slides ] | | |
| 2/11 | Multimodal models [ slides ] | | |
| Efficient training & inference |
| 2/16 | Efficient training [ slides ] | | |
| 2/18 | Efficient training (cont'd) & inference [ slides ] | | |
| Post-training & Reinforcement Learning |
| 2/23 | Post-training [ slides ] | | |
| 2/25 | Post-training (cont'd) [ slides ] | | |
| 3/2 | Policy gradient algorithms [ slides ] | | |
| Policy gradient algorithms (cont'd) + Large Reasoning Models |
| 3/4 | Policy gradient algorithms (cont'd) + Test-time scaling [ slides ] | | |
| Spring break |
| 3/9 | No classes |
| 3/11 | No classes |
| Agents & Compound AI systems |
| 3/16 | Test-time scaling (cont'd) + Compound AI systems [ slides ] | | |
| 3/18 | Agents [ slides ] | | |
| Student presentations & discussions |
| 3/23 | (Kiymet & Enoch): LLM-as-a-Judge & LLM Evaluation [ slides ] | | |
| 3/25 | (Neelesh & Sriram): Context Engineering & Agentic Memory [ slides ] | | |
| 3/30 | (Mokshitha & Ishtiaque & Muhammad): Long-context processing [ slides ] | | |
| 4/1 | (Alexa & Simon & Hajra): AI for scientific research [ slides ] | | |
| 4/6 | (Caleb & Sangwook): Test-time discovery and optimization [ slides ] | | |
| 4/8 | (Rituraj & Jing): Deep Think [ slides ] | | |
| 4/13 | (Yu-Min & Yeana & Pin-Jie): Model interpretability [ slides ] | | |
| 4/15 | (Farhana & Bikash & Heajun): Diffusion models & Diffusion LLMs [ slides ] | | |
| 4/20 | (Cameron & Briana): AI security & privacy [ slides ] | | |
| 4/22 | (Umid & Jafar): AI bias [ slides ] | | |
| 4/27 | (Sneha & Aravinda): Test-time scaling for evaluation [ slides ] | | |
| 4/29 | (Hani & Najibul): Chain-of-Thought monitorability and controllability [ slides ] | | |
| Final exam |
| 5/4 | No classes |
| 5/6 | Exam (in-class) [ slides ] | | |