Haodong Duan
14
Papers
67
Total Citations
Papers (14)
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
CVPR 2025
37
citations
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
ICLR 2025
18
citations
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLMs
ICCV 2025 arXiv
12
citations
MM-IFEngine: Towards Multimodal Instruction Following
ICCV 2025
0
citations
Visual-RFT: Visual Reinforcement Fine-Tuning
ICCV 2025
0
citations
OCSampler: Compressing Videos to One Clip With Single-Step Sampling
CVPR 2022 arXiv
0
citations
Revisiting Skeleton-Based Action Recognition
CVPR 2022 arXiv
0
citations
TRB: A Novel Triplet Representation for Understanding 2D Human Body
ICCV 2019
0
citations
SkeleTR: Towards Skeleton-based Action Recognition in the Wild
ICCV 2023
0
citations
Omni-sourced Webly-supervised Learning for Video Recognition
ECCV 2020
0
citations
TransRank: Self-Supervised Video Representation Learning via Ranking-Based Transformation Recognition
CVPR 2022 arXiv
0
citations
Image Quality Assessment: From Human to Machine Preference
CVPR 2025
0
citations
Information Density Principle for MLLM Benchmarks
ICCV 2025
0
citations
JourneyDB: A Benchmark for Generative Image Understanding
NeurIPS 2023
0
citations