Hao Chen

57
Papers
380
Total Citations
1
Affiliations

Affiliations

CMU

Papers (57)

VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis

CVPR 2024
70
citations

ImageFolder: Autoregressive Image Generation with Folded Tokens

ICLR 2025
63
citations

What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?

ICLR 2025arXiv
56
citations

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

CVPR 2025
32
citations

360+x: A Panoptic Multi-modal Scene Understanding Dataset

CVPR 2024
24
citations

OSV: One Step is Enough for High-Quality Image to Video Generation

CVPR 2025
22
citations

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

ECCV 2024
12
citations

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

NeurIPS 2025
12
citations

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

CVPR 2025
11
citations

WeatherGFM: Learning a Weather Generalist Foundation Model via In-context Learning

ICLR 2025
9
citations

Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks

CVPR 2025
9
citations

DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation

AAAI 2025
9
citations

TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings

AAAI 2025
8
citations

FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification

CVPR 2025
7
citations

Fast Encoding and Decoding for Implicit Video Representation

ECCV 2024
7
citations

PEACE: Empowering Geologic Map Holistic Understanding with MLLMs

CVPR 2025arXiv
6
citations

SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset

NeurIPS 2025
5
citations

Improving Multimodal Learning Balance and Sufficiency through Data Remixing

ICML 2025
4
citations

Distilled Prompt Learning for Incomplete Multimodal Survival Prediction

CVPR 2025
4
citations

SDP-CROWN: Efficient Bound Propagation for Neural Network Verification with Tightness of Semidefinite Programming

ICML 2025
3
citations

VA-MoE: Variables-Adaptive Mixture of Experts for Incremental Weather Forecasting

ICCV 2025
2
citations

Point Cloud Upsampling Using Conditional Diffusion Module with Adaptive Noise Suppression

CVPR 2025
2
citations

Rethinking the Bias of Foundation Model under Long-tailed Distribution

ICML 2025
1
citations

Evaluating Program Semantics Reasoning with Type Inference in System $F$

NeurIPS 2025
1
citations

Revisiting Open-Set Panoptic Segmentation

AAAI 2024
1
citations

A General Framework for Learning from Weak Supervision

ICML 2024
0
citations

Completing Visual Objects via Bridging Generation and Segmentation

ICML 2024
0
citations

Floating Anchor Diffusion Model for Multi-motif Scaffolding

ICML 2024
0
citations

Post-hoc Part-Prototype Networks

ICML 2024
0
citations

CompeteAI: Understanding the Competition Dynamics of Large Language Model-based Agents

ICML 2024
0
citations

Generative Active Learning for Long-tailed Instance Segmentation

ICML 2024
0
citations

EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights

CVPR 2025
0
citations

Towards a Self-contained Data-driven Global Weather Forecasting Framework

ICML 2024
0
citations

Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation

CVPR 2025
0
citations

Monocular and Generalizable Gaussian Talking Head Animation

CVPR 2025
0
citations

Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution

CVPR 2025
0
citations

POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction

ICCV 2025
0
citations

Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data

ICCV 2025
0
citations

FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization

ICCV 2025
0
citations

UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI

ICCV 2025
0
citations

Separation for Better Integration: Disentangling Edge and Motion in Event-based Deblurring

ICCV 2025
0
citations

Conditional Visual Autoregressive Modeling for Pathological Image Restoration

ICCV 2025
0
citations

SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting

ICCV 2025
0
citations

Unified Open-World Segmentation with Multi-Modal Prompts

ICCV 2025
0
citations

Learning Concept Prerequisite Relation via Global Knowledge Relation Optimization

AAAI 2025
0
citations

Know Where You Are From: Event-Based Segmentation via Spatio-Temporal Propagation

AAAI 2025
0
citations

MM-Tracker: Motion Mamba for UAV-platform Multiple Object Tracking

AAAI 2025
0
citations

ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance

AAAI 2025
0
citations

Time Series Supplier Allocation via Deep Black-Litterman Model

AAAI 2025
0
citations

Towards Loss-Resilient Image Coding for Unstable Satellite Networks

AAAI 2025
0
citations

PromptMRG: Diagnosis-Driven Prompts for Medical Report Generation

AAAI 2024
0
citations

Retrieval-Augmented Primitive Representations for Compositional Zero-Shot Learning

AAAI 2024
0
citations

A Dynamic GCN with Cross-Representation Distillation for Event-Based Learning

AAAI 2024
0
citations

MICA: Towards Explainable Skin Lesion Diagnosis via Multi

AAAI 2024
0
citations

DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data

CVPR 2024
0
citations

Video Frame Interpolation via Direct Synthesis with the Event-based Reference

CVPR 2024
0
citations

FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition

CVPR 2024
0
citations