Note: Since this is the first time the class is being taught, the schedule may adjust if we need more or less time on certain topics.
| 1/21 | Introduction [ slides ] | | |
| Language Modeling |
| 1/23 | Language modeling [ slides ] | | Homework 0 released on Piazza (due 2/7) |
| 1/28 | Neural language models [ slides ] | | |
| 1/30 | Backpropagation [ slides ] | | Quiz 0 released on Piazza (due 2/7) |
| 2/4 | Class canceled because Tu was sick |
| 2/6 | Word Embeddings [ slides ] | | |
| Transformers and the Evolution of LLMs |
| 2/11 | Class canceled due to inclement weather |
| 2/13 | Transformers [ slides ] | | |
| 2/18 | The Era of BERT [ slides ] | | |
| 2/20 | Scaling LLM Pretraining [ slides ] | | |
| LLM Capabilities and Evaluation |
| 2/25 | LLM Prompting [ slides ] | | |
| 2/27 | LLM Decoding [ slides ] | | |
| 3/4 | Instruction tuning [ slides ] | | |
| 3/6 | LLM Alignment [ slides ] | | |
| 3/11 | No classes (Spring break) |
| 3/13 | No classes (Spring break) |
| 3/18 | LLM Evaluation [ slides ] | | |
| Improving LLM Efficiency and Adaptability |
| 3/20 | Parameter-efficient fine-tuning [ slides ] | | |
| 3/25 | Mixture of Experts [ slides ] | | |
| 3/27 | Model Merging [ slides ] | | |
| 4/1 | Distillation, quantization, and pruning [ slides ] | | |
| 4/3 | Long-context LLMs [ slides ] | | |
| Advanced LLMs and Compound AI Systems |
| 4/8 | Advanced reasoning & Test-time scaling [ slides ] | | |
| 4/10 | Advanced reasoning & Test-time scaling (cont'd) [ slides ] | | |
| 4/15 | Retrieval-augmented generation (RAG) & Tool-use LLMs [ slides ] | | |
| 4/17 | LLM Agents [ slides ] | | |
| Other topics |
| 4/22 | Multimodal LLMs [ slides ] | | |
| 4/24 | LLM Safety and Security [ slides ] | | |
| 4/29 | No classes |
| 5/1 | No classes |
| 5/6 | Project presentations [ slides ] | | |
| 5/8 | Project presentations [ slides ] | | |