** Exact topics and schedule subject to change, based on student interests and course discussions. **

Date Topics Readings
2/3 Week 1.1 Course introduction [slides] [video]
  • Multimodal core challenges
  • Course syllabus
2/5 Week 1.2 Multimodal datasets [slides] [video]
  • Research tasks and datasets
  • Intro to AI research
2/10 Week 2.1 AI tutorial [slides] [video]
  • Data processing and visualization
  • Pytorch and AI modeling
2/12 Week 2.2 Unimodal representations [slides] [video]
  • Dimensions of heterogeneity
  • Common model architectures
2/17 Week 3.1 No class, shifted President's day
2/19 Week 3.2 Multimodal fusion [slides] [video]
  • Early and late fusion
  • Explainable fusion
2/24 Week 4.1 More fusion [slides] [video]
  • Higher-order interactions
  • Multimodal fusion models
2/26 Week 4.2 Multimodal alignment [slides] [video]
  • Multimodal grounding
  • Aligned representations
3/3 Week 5.1 Large multimodal models [slides] [video]
  • Multimodal transformers
  • Pre-training & fine-tuning
3/5 Week 5.2 Multimodal LLMs tutorial [slides] [video]
  • Fine-tuning
  • Instruction tuning
3/10 Week 6.1 Multimodal generation [slides] [video]
  • Translation, summarization, creation
  • Model evaluation and ethics
3/12 Week 6.2 Modern generative AI [slides] [video]
  • VAEs, diffusion, flow models
  • Controllable generation
3/17 Week 7.1 Midterm review [slides] [video]
  • Multimodal fusion & alignment
  • Multimodal foundation models
3/19 Week 7.2 In-class midterm
3/24 Week 8.1 Spring Break – No lectures
3/26 Week 8.2 Spring Break – No lectures
3/31 Week 9.1 Multimodal reasoning [slides] [video]
  • Reinforcement learning
  • Multi-step reasoning
4/2 Week 9.2 Explainable reasoning [slides] [video]
  • Explainable AI
  • LLMs for explainability
4/7 Week 10.1 Multimodal interaction [slides] [video]
  • Interactive agents
  • Interactive reasoning
4/9 Week 10.2 Cross-modal transfer [slides] [video]
  • Modality transfer and co-learning
  • Self-training and multitask learning
4/14 Week 11.1 Multimodal & manufacturing [slides] [video]
  • Multisensor fusion
  • Applications in manufacturing
4/16 Week 11.2 Multimodal & design [slides] [video]
  • Multimodal agents
  • Applications in design
4/21 Week 12.1 Prescriptive modeling [slides] [video]
  • Optimization & prediction
  • Causal & counterfactual
4/23 Week 12.2 Agents tutorial [slides] [video]
  • Multimodal agent pipelines
  • Agent evaluation
4/28 Week 13.1 Multimodal & cities [slides] [video]
  • Spatial and temporal fusion
  • Applications in cities
4/30 Week 13.2 Multimodal & transportation [slides] [video]
  • Autonomous vehicles
  • Applications in transportation
5/5 Week 14.1 Self-evolving AI [slides] [video]
  • Continual learning
  • Self-evolution and optimization
5/7 Week 14.2 AI for new senses [slides] [video]
  • Touch, smell, taste
  • Human-AI interaction
5/12 Week 15.1 Project presentations