Xiaohu Qie

18 Papers
1,423 Total Citations

Papers (18)

T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion

AAAI 2024 | arXiv
1,423
citations

Bridging Video-Text Retrieval With Multiple Choice Questions

CVPR 2022 | arXiv
0
citations

Object-Aware Video-Language Pre-Training for Retrieval

CVPR 2022 | arXiv
0
citations

BTS: A Bi-Lingual Benchmark for Text Segmentation in the Wild

CVPR 2022
0
citations

UMT: Unified Multi-Modal Transformers for Joint Video Moment Retrieval and Highlight Detection

CVPR 2022 | arXiv
0
citations

Accelerating Vision-Language Pretraining With Free Language Modeling

CVPR 2023 | arXiv
0
citations

All in One: Exploring Unified Video-Language Pre-Training

CVPR 2023 | arXiv
0
citations

ViLEM: Visual-Language Error Modeling for Image-Text Retrieval

CVPR 2023
0
citations

RILS: Masked Visual Reconstruction in Language Semantic Space

CVPR 2023 | arXiv
0
citations

Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models

CVPR 2023 | arXiv
0
citations

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

ICCV 2023
0
citations

Order-Prompted Tag Sequence Generation for Video Tagging

ICCV 2023
0
citations

MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing

ICCV 2023 | arXiv
0
citations

HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video

ICCV 2023 | arXiv
0
citations

OmniZoomer: Learning to Move and Zoom in on Sphere at High-Resolution

ICCV 2023 | arXiv
0
citations

MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval

ECCV 2022
0
citations

Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems

NeurIPS 2022
0
citations

DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes

NeurIPS 2022
0
citations