Liming Zhao
5 Papers
35 Total Citations
Papers (5)
Improved Video VAE for Latent Video Diffusion Model
CVPR 2025, arXiv
19 citations
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
CVPR 2025, arXiv
14 citations
ContextHOI: Spatial Context Learning for Human-Object Interaction Detection
AAAI 2025
2 citations
FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors
ECCV 2024
0 citations
Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection
AAAI 2025
0 citations