Baining Guo
25
Papers
97
Total Citations
Papers (25)
CCEdit: Creative and Controllable Video Editing via Diffusion Models
CVPR 2024
77
citations
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
CVPR 2025
20
citations
Improved Noise Schedule for Diffusion Training
ICCV 2025
0
citations
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis
ICCV 2025
0
citations
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
CVPR 2024
0
citations
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
CVPR 2024
0
citations
Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
CVPR 2019
0
citations
Face X-Ray for More General Face Forgery Detection
CVPR 2020
0
citations
Learning Texture Transformer Network for Image Super-Resolution
CVPR 2020arXiv
0
citations
StyleSwin: Transformer-Based GAN for High-Resolution Image Generation
CVPR 2022arXiv
0
citations
Swin Transformer V2: Scaling Up Capacity and Resolution
CVPR 2022arXiv
0
citations
CSWin Transformer: A General Vision Transformer Backbone With Cross-Shaped Windows
CVPR 2022arXiv
0
citations
Vector Quantized Diffusion Model for Text-to-Image Synthesis
CVPR 2022arXiv
0
citations
Protecting Celebrities From DeepFake With Identity Consistency Transformer
CVPR 2022arXiv
0
citations
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
CVPR 2023
0
citations
RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
CVPR 2023arXiv
0
citations
iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-Training for Visual Recognition
CVPR 2023
0
citations
Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-Encoders
ICCV 2015
0
citations
Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows
ICCV 2021arXiv
0
citations
Efficient Diffusion Training via Min-SNR Weighting Strategy
ICCV 2023arXiv
0
citations
Adaptive Frequency Filters As Efficient Global Token Mixers
ICCV 2023arXiv
0
citations
Improving CLIP Fine-tuning Performance
ICCV 2023
0
citations
Advancing High-Resolution Video-Language Representation With Large-Scale Video Transcriptions
CVPR 2022arXiv
0
citations
UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping
CVPR 2025
0
citations
Compressing Neural Networks using the Variational Information Bottleneck
ICML 2018
0
citations