Biao Gong

17
Papers
307
Total Citations

Papers (17)

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following

CVPR 2024
78
citations

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

ICLR 2025
59
citations

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

CVPR 2024
53
citations

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation

CVPR 2024
23
citations

StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models

ECCV 2024
22
citations

ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance

AAAI 2025
21
citations

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

CVPR 2025
18
citations

Mimir: Improving Video Diffusion Models for Precise Text Understanding

CVPR 2025
16
citations

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

CVPR 2025
13
citations

Learning Visual Generative Priors without Text

CVPR 2025
4
citations

Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation

CVPR 2024
0
citations

Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning

CVPR 2024
0
citations

DreamRelation: Relation-Centric Video Customization

ICCV 2025
0
citations

VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval

CVPR 2023arXiv
0
citations

ViM: Vision Middleware for Unified Downstream Transferring

ICCV 2023arXiv
0
citations

ObjectRelator: Enabling Cross-View Object Relation Understanding Across Ego-Centric and Exo-Centric Perspectives

ICCV 2025
0
citations

Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos

ICCV 2023arXiv
0
citations