Shiyu Huang
7 Papers
1,535 Total Citations
Papers (7)
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025
1,318
citations
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
208
citations
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
AAAI 2024, arXiv
9
citations
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models
CVPR 2025
0
citations
ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
CVPR 2025
0
citations
Expecting the Unexpected: Training Detectors for Unusual Pedestrians With Adversarial Imposters
CVPR 2017, arXiv
0
citations
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
NeurIPS 2023
0
citations