schedule

Note: Since this is the first time the class is being taught, the schedule may adjust if we need more or less time on certain topics.

Date Lecture Readings Logistics
Transformers & Pretraining Scaling
1/21 Introduction & Transformers [ slides ]
  • No associated readings

1/26 No classes (canceled due to inclement weather)
1/28 Transformers (cont'd) [ slides ]

2/2 Transformers (cont'd) [ slides ]

2/4 Pretraining scaling [ slides ]

2/9 Decoding and inference [ slides ]

2/11 Multimodal models [ slides ]

Efficient training & inference
2/16 Efficient training [ slides ]

2/18 Efficient training (cont'd) & inference [ slides ]

Post-training & Reinforcement Learning
2/23 Post-training [ slides ]

2/25 Post-training (cont'd) [ slides ]

3/2 Policy gradient algorithms [ slides ]

Policy gradient algorithms (cont'd) + Large Reasoning Models
3/4 Policy gradient algorithms (cont'd) + Test-time scaling [ slides ]

Spring break
3/9 No classes
3/11 No classes
Agents & Compound AI systems
3/16 Test-time scaling (cont'd) + Compound AI systems [ slides ]

3/18 Agents [ slides ]

Student presentations & discussions
3/23 (Kiymet & Enoch): LLM-as-a-Judge & LLM Evaluation [ slides ]

3/25 (Neelesh & Sriram): Context Engineering & Agentic Memory [ slides ]

3/30 (Mokshitha & Ishtiaque & Muhammad): Long-context processing [ slides ]

4/1 (Alexa & Simon & Hajra): AI for scientific research [ slides ]

4/6 (Caleb & Sangwook): Test-time discovery and optimization [ slides ]

4/8 (Rituraj & Jing): Deep Think [ slides ]

4/13 (Yu-Min & Yeana & Pin-Jie): Model interpretability [ slides ]

4/15 (Farhana & Bikash & Heajun): Diffusion models & Diffusion LLMs [ slides ]

4/20 (Cameron & Briana): AI security & privacy [ slides ]

4/22 (Umid & Jafar): AI bias [ slides ]

4/27 (Sneha & Aravinda): Test-time scaling for evaluation [ slides ]

4/29 (Hani & Najibul): Chain-of-Thought monitorability and controllability [ slides ]

Final exam
5/4 No classes
5/6 Exam (in-class) [ slides ]