Note: Since this is the first time the class is being taught, the schedule may adjust if we need more or less time on certain topics.
1/21 | Introduction [ slides ] | | |
Language Modeling |
1/23 | Language modeling [ slides ] | | Homework 0 released on Piazza (due 2/7) |
1/28 | Neural language models [ slides ] | | |
1/30 | Backpropagation [ slides ] | | Quiz 0 released on Piazza (due 2/7) |
2/4 | Class canceled because Tu was sick |
2/6 | Word Embeddings [ slides ] | | |
Transformers and the Evolution of LLMs |
2/11 | Class canceled due to inclement weather |
2/13 | Transformers [ slides ] | | |
2/18 | The Era of BERT [ slides ] | | |
2/20 | Scaling LLM Pretraining [ slides ] | | |
LLM Capabilities and Evaluation |
2/25 | LLM Prompting [ slides ] | | |
2/27 | LLM Decoding [ slides ] | | |
3/4 | Instruction tuning [ slides ] | | |
3/6 | LLM Alignment [ slides ] | | |
3/11 | No classes (Spring break) |
3/13 | No classes (Spring break) |
3/18 | LLM Evaluation [ slides ] | | |
Improving LLM Efficiency and Adaptability |
3/20 | Parameter-efficient fine-tuning [ slides ] | | |
3/25 | Mixture of Experts [ slides ] | | |
3/27 | Model Merging [ slides ] | | |
4/1 | Distillation, quantization, and pruning [ slides ] | | |
4/3 | Long-context LLMs [ slides ] | | |
Advanced LLMs and Compound AI Systems |
4/8 | Thinking LLMs [ slides ] | | |
4/10 | Scaling test-time compute [ slides ] | | |
4/15 | Retrieval-augmented generation (RAG) & Tool-use LLMs [ slides ] | | |
4/17 | LLM Agents [ slides ] | | |
Other topics |
4/22 | Multimodal LLMs & Multilingual LLMs [ slides ] | | |
4/24 | Code and Math LLMs [ slides ] | | |
4/29 | Token-free LLMs [ slides ] | | |
5/1 | LLM Safety and Security [ slides ] | | |
5/6 | State Space Models (SSM) [ slides ] | | |
5/14 | Project presentations (Time & Location: TBD) [ slides ] | | |