Haochen Wang
7
Papers
97
Total Citations
Papers (7)
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
CVPR 2024
47
citations
Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
ICCV 2025
28
citations
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
ICCV 2025
20
citations
MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
NeurIPS 2025arXiv
2
citations
Holistic Tokenizer for Autoregressive Image Generation
ICCV 2025
0
citations
Object-centric Video Question Answering with Visual Grounding and Referring
ICCV 2025
0
citations
Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields
CVPR 2024
0
citations