Xiaohu Qie
18
Papers
1,423
Total Citations
Papers (18)
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion
AAAI 2024 | arXiv
1,423
citations
Bridging Video-Text Retrieval With Multiple Choice Questions
CVPR 2022 | arXiv
0
citations
Object-Aware Video-Language Pre-Training for Retrieval
CVPR 2022 | arXiv
0
citations
BTS: A Bi-Lingual Benchmark for Text Segmentation in the Wild
CVPR 2022
0
citations
UMT: Unified Multi-Modal Transformers for Joint Video Moment Retrieval and Highlight Detection
CVPR 2022 | arXiv
0
citations
Accelerating Vision-Language Pretraining With Free Language Modeling
CVPR 2023 | arXiv
0
citations
All in One: Exploring Unified Video-Language Pre-Training
CVPR 2023 | arXiv
0
citations
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval
CVPR 2023
0
citations
RILS: Masked Visual Reconstruction in Language Semantic Space
CVPR 2023 | arXiv
0
citations
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
CVPR 2023 | arXiv
0
citations
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
ICCV 2023
0
citations
Order-Prompted Tag Sequence Generation for Video Tagging
ICCV 2023
0
citations
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
ICCV 2023 | arXiv
0
citations
HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
ICCV 2023 | arXiv
0
citations
OmniZoomer: Learning to Move and Zoom in on Sphere at High-Resolution
ICCV 2023 | arXiv
0
citations
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval
ECCV 2022
0
citations
Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems
NeurIPS 2022
0
citations
DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes
NeurIPS 2022
0
citations