schedule

Note: Since this is the first time the class is being taught, the schedule may adjust if we need more or less time on certain topics.

Date Lecture Readings Logistics
1/21 Introduction [ slides ]
  • No associated readings

Language Modeling
1/23 Language modeling [ slides ]

Homework 0 released on Piazza (due 2/7)

1/28 Neural language models [ slides ]

1/30 Backpropagation [ slides ]

Quiz 0 released on Piazza (due 2/7)

2/4 Class canceled because Tu was sick
2/6 Word Embeddings [ slides ]

Transformers and the Evolution of LLMs
2/11 Class canceled due to inclement weather
2/13 Transformers [ slides ]

2/18 The Era of BERT [ slides ]

2/20 Scaling LLM Pretraining [ slides ]

LLM Capabilities and Evaluation
2/25 LLM Prompting [ slides ]

2/27 LLM Decoding [ slides ]

3/4 Instruction tuning [ slides ]

3/6 LLM Alignment [ slides ]

3/11 No classes (Spring break)
3/13 No classes (Spring break)
3/18 LLM Evaluation [ slides ]

Improving LLM Efficiency and Adaptability
3/20 Parameter-efficient fine-tuning [ slides ]

3/25 Mixture of Experts [ slides ]

3/27 Model Merging [ slides ]

4/1 Distillation, quantization, and pruning [ slides ]

4/3 Long-context LLMs [ slides ]

Advanced LLMs and Compound AI Systems
4/8 Thinking LLMs [ slides ]

4/10 Scaling test-time compute [ slides ]

4/15 Retrieval-augmented generation (RAG) & Tool-use LLMs [ slides ]

4/17 LLM Agents [ slides ]

Other topics
4/22 Multimodal LLMs & Multilingual LLMs [ slides ]

4/24 Code and Math LLMs [ slides ]

4/29 Token-free LLMs [ slides ]

5/1 LLM Safety and Security [ slides ]

5/6 State Space Models (SSM) [ slides ]

5/14 Project presentations (Time & Location: TBD) [ slides ]