OpenTouch: Bringing Full-Hand Touch to Real-World Interaction
Ray Song, Jinzhou Li, Rao Fu, Devin Murphy, Kaichen Zhou, Rishi Shiv, Yaqi Li, Haoyu Xiong, Crystal Owens, Yilun Du, Yiyue Luo, Xianyi Cheng, Antonio Torralba, Wojciech Matusik, Paul Pu Liang
arXiv preprint
Dec 2025
QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training
David Dai, Peilin Chen, Chanakya Ekbote, Paul Pu Liang
NeurIPS 2025
(oral)
Dec 2025
What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains
Chanakya Ekbote, Marco Bondaschi, Nived Rajaraman, Jason Lee, Michael Gastpar, Ashok Vardhan Makkuva, Paul Pu Liang
NeurIPS 2025
(spotlight)
Dec 2025
Multimodal Fusion Balancing Through Game-Theoretic Regularization
Konstantinos Kontras, Thomas Strypsteen, Christos Chatzichristos, Paul Pu Liang, Matthew Blaschko, Maarten De Vos
NeurIPS 2025
(spotlight)
Dec 2025
MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models
Hengzhi Li, Megan Tjandrasuwita, Yi R Fung, Armando Solar-Lezama, Paul Pu Liang
NeurIPS 2025
Dec 2025
Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions
Wenyuan Zhao, Adithya Balachandran, Chao Tian, Paul Pu Liang
NeurIPS 2025
Dec 2025
REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing
Weihan Xu, Yimeng Ma, Jingyue Huang, Yang Li, Wenye Ma, Taylor Berg-Kirkpatrick, Julian McAuley, Paul Pu Liang, Hao-Wen Dong
NeurIPS 2025
Dec 2025
Position: Simulating Society Requires Simulating Thought
Chance Jiajie Li, Jiayi Wu, Zhenze Mo, Ao Qu, Yuhan Tang, Kaiya Ivy Zhao, Yulu Gan, Jie Fan, Jiangbo Yu, Jinhua Zhao, Paul Pu Liang, Luis Alonso, Kent Larson
NeurIPS 2025
Dec 2025
FairGRPO: Towards Fair Reasoning Foundation Models for Clinical Diagnosis
Shiqi Dai, David Dai, Jiaee Cheong, Paul Pu Liang
arXiv preprint
Dec 2025
Page-4D: Disentangled Pose and Geometry Estimation for 4D Perception
Kaichen Zhou, Yuhan Wang, Grace Chen, Xinhai Chang, Gaspard Beaudouin, Fangneng Zhan, Paul Pu Liang, Mengyu Wang
arXiv preprint
Oct 2025
Human Behavior Atlas: Benchmarking Unified Psychological and Social Behavior Understanding
Keane Ong, David Dai, Carol Li, Dewei Feng, Hengzhi Li, Jingyao Wu, Jiaee Cheong, Rui Mao, Gianmarco Mengaldo, Erik Cambria, Paul Pu Liang
arXiv preprint
Oct 2025
Dialogues with AI Reduce Beliefs in Misinformation but Build No Lasting Discernment Skills
Anku Rani, Valdemar Danry, Paul Pu Liang, Andrew B Lippman, Pattie Maes
arXiv preprint
Oct 2025
RADAR: A Reasoning-Guided Attribution Framework for Explainable Visual Data Analysis
Anku Rani, Aparna Garimella, Apoorv Saxena, Balaji Vasan Srinivasan, Paul Pu Liang
arXiv preprint
Aug 2025
Learn Globally, Speak Locally: Bridging the Gaps in Multilingual Reasoning
Jaedong Hwang, Kumar Tanmay, Seok-Jin Lee, Ayush Agrawal, Hamid Palangi, Kumar Ayush, Ila Fiete, Paul Pu Liang
arXiv preprint
Jul 2025
CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models
David Dai, Peilin Chen, Malinda Lu, Daniel Li, Haowen Wei, Hejie Cui, Paul Pu Liang
ICML 2025
Jul 2025
Understanding the Emergence of Multimodal Representation Alignment
Megan Tjandrasuwita, Chanakya Ekbote, Liu Ziyin, Paul Pu Liang
ICML 2025
Jul 2025
MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents
Zijian Zhou, Ao Qu, Zhaoxuan Wu, Sunghwan Kim, Alok Prakash, Daniela Rus, Jinhua Zhao, Bryan Kian Hsiang Low, Paul Pu Liang
arXiv preprint
Jun 2025
PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts
Hengzhi Li, Brendon Jiang, Alexander Naehu, Regan Song, Justin Zhang, Megan Tjandrasuwita, Chanakya Ekbote, Steven-Shine Chen, Adithya Balachandran, David Dai, Rebecca Chang, Paul Pu Liang
arXiv preprint
Jun 2025
SmellNet: A Large-scale Dataset for Real-world Smell Recognition
Dewei Feng, Carol Li, David Dai, Paul Pu Liang
arXiv preprint
May 2025
Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving
Steven-Shine Chen, Jimin Lee, Paul Pu Liang
CHI 2025 Late-Breaking Work
Apr 2025
Fits like a Flex-Glove: Automatic Design of Personalized FPCB-Based Tactile Sensing Gloves
Devin Murphy, Yichen Li, Crystal Owens, Layla Stanton, Young Joong Lee, Paul Pu Liang, Yiyue Luo, Antonio Torralba, Wojciech Matusik
CHI 2025 Late-Breaking Work
Apr 2025
Progressive Compositionality in Text-to-Image Generative Models
Evans Han, Linghao Jin, Xiaofeng Liu, Paul Pu Liang
ICLR 2025
(spotlight)
Apr 2025
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Zhiyong Wu, Zhenyu Wu, Fangzhi Xu, Yian Wang, Qiushi Sun, Chengyou Jia, Kanzhi Cheng, Zichen Ding, Liheng Chen, Paul Pu Liang, Yu Qiao
ICLR 2025
(spotlight)
Apr 2025
VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks
Lawrence Jang, Yinheng Li, Charles Ding, Justin Lin, Paul Pu Liang, Dan Zhao, Rogerio Bonatti, Kazuhito Koishida
ICLR 2025
Apr 2025
TeaserGen: Generating Teasers for Long Documentaries
Weihan Xu, Paul Pu Liang, Haven Kim, Julian McAuley, Taylor Berg-Kirkpatrick, Hao-Wen Dong
ICLR 2025
Apr 2025
Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models
442 authors including Paul Pu Liang
ICLR 2025, TMLR 2023
(finalist for outstanding certification)
Apr 2025