Paul Pu Liang

18
Papers
54
Total Citations

Papers (18)

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

ICLR 2025
25
citations

CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models

ICML 2025
12
citations

Progressive Compositionality in Text-to-Image Generative Models

ICLR 2025
9
citations

OS-ATLAS: Foundation Action Model for Generalist GUI Agents

ICLR 2025
8
citations

Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence

CVPR 2019
0
citations

Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning

CVPR 2022arXiv
0
citations

Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos

ICCV 2023
0
citations

Diverse and Admissible Trajectory Prediction through Multimodal Context Understanding

ECCV 2020
0
citations

PACS: A Dataset for Physical Audiovisual Commonsense Reasoning

ECCV 2022
0
citations

Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions

NeurIPS 2025
0
citations

What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

NeurIPS 2025arXiv
0
citations

FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning

CVPR 2024
0
citations

Modeling Dense Multimodal Interactions Between Biological Pathways and Histology for Survival Prediction

CVPR 2024
0
citations

Deep Gamblers: Learning to Abstain with Portfolio Theory

NeurIPS 2019
0
citations

Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals

NeurIPS 2023
0
citations

Localized Symbolic Knowledge Distillation for Visual Commonsense Models

NeurIPS 2023
0
citations

Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework

NeurIPS 2023
0
citations

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy

NeurIPS 2023
0
citations