Logo, Multisensory Intelligence
OpenTouch: Bringing Full-Hand Touch to Real-World Interaction

OpenTouch: Bringing Full-Hand Touch to Real-World Interaction

Ray Song, Jinzhou Li, Rao Fu, Devin Murphy, Kaichen Zhou, Rishi Shiv, Yaqi Li, Haoyu Xiong, Crystal Owens, Yilun Du, Yiyue Luo, Xianyi Cheng, Antonio Torralba, Wojciech Matusik, Paul Pu Liang

arXiv preprint

Dec 2025

QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training

QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training

David Dai, Peilin Chen, Chanakya Ekbote, Paul Pu Liang

NeurIPS 2025

(oral)

Dec 2025

What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

Chanakya Ekbote, Marco Bondaschi, Nived Rajaraman, Jason Lee, Michael Gastpar, Ashok Vardhan Makkuva, Paul Pu Liang

NeurIPS 2025

(spotlight)

Dec 2025

Multimodal Fusion Balancing Through Game-Theoretic Regularization

Multimodal Fusion Balancing Through Game-Theoretic Regularization

Konstantinos Kontras, Thomas Strypsteen, Christos Chatzichristos, Paul Pu Liang, Matthew Blaschko, Maarten De Vos

NeurIPS 2025

(spotlight)

Dec 2025

MimeQA: Towards Socially-intelligent Nonverbal Foundation Models

MimeQA: Towards Socially-intelligent Nonverbal Foundation Models

Hengzhi Li, Megan Tjandrasuwita, Yi R Fung, Armando Solar-Lezama, Paul Pu Liang

NeurIPS 2025

Dec 2025

Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions

Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions

Wenyuan Zhao, Adithya Balachandran, Chao Tian, Paul Pu Liang

NeurIPS 2025

Dec 2025

REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing

REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing

Weihan Xu, Yimeng Ma, Jingyue Huang, Yang Li, Wenye Ma, Taylor Berg-Kirkpatrick, Julian McAuley, Paul Pu Liang, Hao-Wen Dong

NeurIPS 2025

Dec 2025

Position: Simulating Society Requires Simulating Thought

Position: Simulating Society Requires Simulating Thought

Chance Jiajie Li, Jiayi Wu, Zhenze Mo, Ao Qu, Yuhan Tang, Kaiya Ivy Zhao, Yulu Gan, Jie Fan, Jiangbo Yu, Jinhua Zhao, Paul Pu Liang, Luis Alonso, Kent Larson

NeurIPS 2025

Dec 2025

FairGRPO: Towards Fair Reasoning Foundation Models for Clinical Diagnosis

FairGRPO: Towards Fair Reasoning Foundation Models for Clinical Diagnosis

Shiqi Dai, David Dai, Jiaee Cheong, Paul Pu Liang

arXiv preprint

Dec 2025

Page-4D: Disentangled Pose and Geometry Estimation for 4D Perception

Page-4D: Disentangled Pose and Geometry Estimation for 4D Perception

Kaichen Zhou, Yuhan Wang, Grace Chen, Xinhai Chang, Gaspard Beaudouin, Fangneng Zhan, Paul Pu Liang, Mengyu Wang

arXiv preprint

Oct 2025

Human Behavior Atlas: Benchmarking Unified Psychological and Social Behavior Understanding

Human Behavior Atlas: Benchmarking Unified Psychological and Social Behavior Understanding

Keane Ong, David Dai, Carol Li, Dewei Feng, Hengzhi Li, Jingyao Wu, Jiaee Cheong, Rui Mao, Gianmarco Mengaldo, Erik Cambria, Paul Pu Liang

arXiv preprint

Oct 2025

Dialogues with AI Reduce Beliefs in Misinformation but Build No Lasting Discernment Skills

Dialogues with AI Reduce Beliefs in Misinformation but Build No Lasting Discernment Skills

Anku Rani, Valdemar Danry, Paul Pu Liang, Andrew B Lippman, Pattie Maes

arXiv preprint

Oct 2025

RADAR: A Reasoning-Guided Attribution Framework for Explainable Visual Data Analysis

RADAR: A Reasoning-Guided Attribution Framework for Explainable Visual Data Analysis

Anku Rani, Aparna Garimella, Apoorv Saxena, Balaji Vasan Srinivasan, Paul Pu Liang

arXiv preprint

Aug 2025

Learn Globally, Speak Locally: Bridging the Gaps in Multilingual Reasoning

Learn Globally, Speak Locally: Bridging the Gaps in Multilingual Reasoning

Jaedong Hwang, Kumar Tanmay, Seok-Jin Lee, Ayush Agrawal, Hamid Palangi, Kumar Ayush, Ila Fiete, Paul Pu Liang

arXiv preprint

Jul 2025

CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models

CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models

David Dai, Peilin Chen, Malinda Lu, Daniel Li, Haowen Wei, Hejie Cui, Paul Pu Liang

ICML 2025

Jul 2025

Understanding the Emergence of Multimodal Representation Alignment

Understanding the Emergence of Multimodal Representation Alignment

Megan Tjandrasuwita, Chanakya Ekbote, Liu Ziyin, Paul Pu Liang

ICML 2025

Jul 2025

MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents

MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents

Zijian Zhou, Ao Qu, Zhaoxuan Wu, Sunghwan Kim, Alok Prakash, Daniela Rus, Jinhua Zhao, Bryan Kian Hsiang Low, Paul Pu Liang

arXiv preprint

Jun 2025

PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts

PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts

Hengzhi Li, Brendon Jiang, Alexander Naehu, Regan Song, Justin Zhang, Megan Tjandrasuwita, Chanakya Ekbote, Steven-Shine Chen, Adithya Balachandran, David Dai, Rebecca Chang, Paul Pu Liang

arXiv preprint

Jun 2025

SmellNet: A Large-scale Dataset for Real-world Smell Recognition

SmellNet: A Large-scale Dataset for Real-world Smell Recognition

Dewei Feng, Carol Li, David Dai, Paul Pu Liang

arXiv preprint

May 2025

Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving

Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving

Steven-Shine Chen, Jimin Lee, Paul Pu Liang

CHI 2025 Late-Breaking Work

Apr 2025

Fits like a Flex-Glove: Automatic Design of Personalized FPCB-Based Tactile Sensing Gloves

Fits like a Flex-Glove: Automatic Design of Personalized FPCB-Based Tactile Sensing Gloves

Devin Murphy, Yichen Li, Crystal Owens, Layla Stanton, Young Joong Lee, Paul Pu Liang, Yiyue Luo, Antonio Torralba, Wojciech Matusik

CHI 2025 Late-Breaking Work

Apr 2025

Progressive Compositionality In Text-to-Image Generative Models

Progressive Compositionality In Text-to-Image Generative Models

Evans Han, Linghao Jin, Xiaofeng Liu, Paul Pu Liang

ICLR 2025

(spotlight)

Apr 2025

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Zhiyong Wu, Zhenyu Wu, Fangzhi Xu, Yian Wang, Qiushi Sun, Chengyou Jia, Kanzhi Cheng, Zichen Ding, Liheng Chen, Paul Pu Liang, Yu Qiao

ICLR 2025

(spotlight)

Apr 2025

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

Lawrence Jang, Yinheng Li, Charles Ding, Justin Lin, Paul Pu Liang, Dan Zhao, Rogerio Bonatti, Kazuhito Koishida

ICLR 2025

Apr 2025

TeaserGen: Generating Teasers for Long Documentaries

TeaserGen: Generating Teasers for Long Documentaries

Weihan Xu, Paul Pu Liang, Haven Kim, Julian McAuley, Taylor Berg-Kirkpatrick, Hao-Wen Dong

ICLR 2025

Apr 2025

Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models

Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models

442 authors including Paul Pu Liang

ICLR 2025, TMLR 2023

(finalist for outstanding certification)

Apr 2025