Lu Hou

8

Papers

534

Total Citations

Papers (8)

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance

FlatQuant: Flatness Matters for LLM Quantization

HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models

MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric

OAC: Output-adaptive Calibration for Accurate Post-training Quantization