Lu Hou
8
Papers
534
Total Citations
Papers (8)
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
CVPR 2024
356
citations
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
ECCV 2024
49
citations
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025arXiv
44
citations
ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance
ICCV 2025
43
citations
FlatQuant: Flatness Matters for LLM Quantization
ICML 2025
29
citations
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models
CVPR 2025
13
citations
MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric
CVPR 2024
0
citations
OAC: Output-adaptive Calibration for Accurate Post-training Quantization
AAAI 2025
0
citations