** Exact topics and schedule subject to change, based on student interests and course discussions. **
| 2/3 |
Week 1.1 Course introduction [slides] [video]
-
Multimodal core challenges
-
Course syllabus
|
|
| 2/5 |
Week 1.2 Multimodal datasets [slides] [video]
-
Research tasks and datasets
-
Intro to AI research
|
|
| 2/10 |
Week 2.1 AI tutorial [slides] [video]
-
Data processing and visualization
-
Pytorch and AI modeling
|
|
| 2/12 |
Week 2.2 Unimodal representations [slides] [video]
-
Dimensions of heterogeneity
-
Common model architectures
|
|
| 2/17 |
Week 3.1 No class, shifted President's day
|
|
| 2/19 |
Week 3.2 Multimodal fusion [slides] [video]
-
Early and late fusion
-
Explainable fusion
|
|
| 2/24 |
Week 4.1 More fusion [slides] [video]
-
Higher-order interactions
-
Multimodal fusion models
|
|
| 2/26 |
Week 4.2 Multimodal alignment [slides] [video]
-
Multimodal grounding
-
Aligned representations
|
|
| 3/3 |
Week 5.1 Large multimodal models [slides] [video]
-
Multimodal transformers
-
Pre-training & fine-tuning
|
|
| 3/5 |
Week 5.2 Multimodal LLMs tutorial [slides] [video]
-
Fine-tuning
-
Instruction tuning
|
|
| 3/10 |
Week 6.1 Multimodal generation [slides] [video]
-
Translation, summarization, creation
-
Model evaluation and ethics
|
|
| 3/12 |
Week 6.2 Modern generative AI [slides] [video]
-
VAEs, diffusion, flow models
-
Controllable generation
|
|
| 3/17 |
Week 7.1 Midterm review [slides] [video]
-
Multimodal fusion & alignment
-
Multimodal foundation models
|
|
| 3/19 |
Week 7.2 In-class midterm
|
|
| 3/24 |
Week 8.1 Spring Break – No lectures
|
| 3/26 |
Week 8.2 Spring Break – No lectures
|
| 3/31 |
Week 9.1 Multimodal reasoning [slides] [video]
-
Reinforcement learning
-
Multi-step reasoning
|
|
| 4/2 |
Week 9.2 Explainable reasoning [slides] [video]
-
Explainable AI
-
LLMs for explainability
|
|
| 4/7 |
Week 10.1 Multimodal interaction [slides] [video]
-
Interactive agents
-
Interactive reasoning
|
|
| 4/9 |
Week 10.2 Cross-modal transfer [slides] [video]
-
Modality transfer and co-learning
-
Self-training and multitask learning
|
|
| 4/14 |
Week 11.1 Multimodal & manufacturing [slides] [video]
-
Multisensor fusion
-
Applications in manufacturing
|
|
| 4/16 |
Week 11.2 Multimodal & design [slides] [video]
-
Multimodal agents
-
Applications in design
|
|
| 4/21 |
Week 12.1 Prescriptive modeling [slides] [video]
-
Optimization & prediction
-
Causal & counterfactual
|
|
| 4/23 |
Week 12.2 Agents tutorial [slides] [video]
-
Multimodal agent pipelines
-
Agent evaluation
|
|
| 4/28 |
Week 13.1 Multimodal & cities [slides] [video]
-
Spatial and temporal fusion
-
Applications in cities
|
|
| 4/30 |
Week 13.2 Multimodal & transportation [slides] [video]
-
Autonomous vehicles
-
Applications in transportation
|
|
| 5/5 |
Week 14.1 Self-evolving AI [slides] [video]
-
Continual learning
-
Self-evolution and optimization
|
|
| 5/7 |
Week 14.2 AI for new senses [slides] [video]
-
Touch, smell, taste
-
Human-AI interaction
|
|
| 5/12 |
Week 15.1 Project presentations
|
|