Hao Chen
57
Papers
380
Total Citations
1
Affiliations
Affiliations
CMU
Papers (57)
VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis
CVPR 2024
70
citations
ImageFolder: Autoregressive Image Generation with Folded Tokens
ICLR 2025
63
citations
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?
ICLR 2025arXiv
56
citations
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
CVPR 2025
32
citations
360+x: A Panoptic Multi-modal Scene Understanding Dataset
CVPR 2024
24
citations
OSV: One Step is Enough for High-Quality Image to Video Generation
CVPR 2025
22
citations
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
ECCV 2024
12
citations
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
NeurIPS 2025
12
citations
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
CVPR 2025
11
citations
WeatherGFM: Learning a Weather Generalist Foundation Model via In-context Learning
ICLR 2025
9
citations
Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks
CVPR 2025
9
citations
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation
AAAI 2025
9
citations
TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings
AAAI 2025
8
citations
FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification
CVPR 2025
7
citations
Fast Encoding and Decoding for Implicit Video Representation
ECCV 2024
7
citations
PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
CVPR 2025arXiv
6
citations
SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset
NeurIPS 2025
5
citations
Improving Multimodal Learning Balance and Sufficiency through Data Remixing
ICML 2025
4
citations
Distilled Prompt Learning for Incomplete Multimodal Survival Prediction
CVPR 2025
4
citations
SDP-CROWN: Efficient Bound Propagation for Neural Network Verification with Tightness of Semidefinite Programming
ICML 2025
3
citations
VA-MoE: Variables-Adaptive Mixture of Experts for Incremental Weather Forecasting
ICCV 2025
2
citations
Point Cloud Upsampling Using Conditional Diffusion Module with Adaptive Noise Suppression
CVPR 2025
2
citations
Rethinking the Bias of Foundation Model under Long-tailed Distribution
ICML 2025
1
citations
Evaluating Program Semantics Reasoning with Type Inference in System $F$
NeurIPS 2025
1
citations
Revisiting Open-Set Panoptic Segmentation
AAAI 2024
1
citations
A General Framework for Learning from Weak Supervision
ICML 2024
0
citations
Completing Visual Objects via Bridging Generation and Segmentation
ICML 2024
0
citations
Floating Anchor Diffusion Model for Multi-motif Scaffolding
ICML 2024
0
citations
Post-hoc Part-Prototype Networks
ICML 2024
0
citations
CompeteAI: Understanding the Competition Dynamics of Large Language Model-based Agents
ICML 2024
0
citations
Generative Active Learning for Long-tailed Instance Segmentation
ICML 2024
0
citations
EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights
CVPR 2025
0
citations
Towards a Self-contained Data-driven Global Weather Forecasting Framework
ICML 2024
0
citations
Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation
CVPR 2025
0
citations
Monocular and Generalizable Gaussian Talking Head Animation
CVPR 2025
0
citations
Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution
CVPR 2025
0
citations
POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction
ICCV 2025
0
citations
Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data
ICCV 2025
0
citations
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
ICCV 2025
0
citations
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
ICCV 2025
0
citations
Separation for Better Integration: Disentangling Edge and Motion in Event-based Deblurring
ICCV 2025
0
citations
Conditional Visual Autoregressive Modeling for Pathological Image Restoration
ICCV 2025
0
citations
SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting
ICCV 2025
0
citations
Unified Open-World Segmentation with Multi-Modal Prompts
ICCV 2025
0
citations
Learning Concept Prerequisite Relation via Global Knowledge Relation Optimization
AAAI 2025
0
citations
Know Where You Are From: Event-Based Segmentation via Spatio-Temporal Propagation
AAAI 2025
0
citations
MM-Tracker: Motion Mamba for UAV-platform Multiple Object Tracking
AAAI 2025
0
citations
ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance
AAAI 2025
0
citations
Time Series Supplier Allocation via Deep Black-Litterman Model
AAAI 2025
0
citations
Towards Loss-Resilient Image Coding for Unstable Satellite Networks
AAAI 2025
0
citations
PromptMRG: Diagnosis-Driven Prompts for Medical Report Generation
AAAI 2024
0
citations
Retrieval-Augmented Primitive Representations for Compositional Zero-Shot Learning
AAAI 2024
0
citations
A Dynamic GCN with Cross-Representation Distillation for Event-Based Learning
AAAI 2024
0
citations
MICA: Towards Explainable Skin Lesion Diagnosis via Multi
AAAI 2024
0
citations
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
CVPR 2024
0
citations
Video Frame Interpolation via Direct Synthesis with the Event-based Reference
CVPR 2024
0
citations
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
CVPR 2024
0
citations