Lu Hou
13
Papers
534
Total Citations
Papers (13)
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
CVPR 2024
356
citations
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
ECCV 2024
49
citations
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
44
citations
ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance
ICCV 2025
43
citations
FlatQuant: Flatness Matters for LLM Quantization
ICML 2025
29
citations
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models
CVPR 2025
13
citations
OAC: Output-adaptive Calibration for Accurate Post-training Quantization
AAAI 2025
0
citations
MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric
CVPR 2024
0
citations
FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
NeurIPS 2023
0
citations
Normalization Helps Training of Quantized LSTM
NeurIPS 2019
0
citations
DynaBERT: Dynamic BERT with Adaptive Width and Depth
NeurIPS 2020
0
citations
Towards Efficient Post-training Quantization of Pre-trained Language Models
NeurIPS 2022
0
citations
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark
NeurIPS 2022
0
citations