Hang Li
8
Papers
559
Total Citations
Papers (8)
Vision-Language Foundation Models as Effective Robot Imitators
ICLR 2024
310
citations
Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation
ICLR 2024
236
citations
FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models
CVPR 2025arXiv
13
citations
Make Pixels Dance: High-Dynamic Video Generation
CVPR 2024
0
citations
MIMO: A Medical Vision Language Model with Visual Referring Multimodal Input and Pixel Grounding Multimodal Output
CVPR 2025
0
citations
Boximator: Generating Rich and Controllable Motions for Video Synthesis
ICML 2024
0
citations
Learning Flow Fields in Attention for Controllable Person Image Generation
CVPR 2025
0
citations
Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation
CVPR 2024
0
citations