Baining Guo

25
Papers
97
Total Citations

Papers (25)

CCEdit: Creative and Controllable Video Editing via Diffusion Models

CVPR 2024
77
citations

ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

CVPR 2025
20
citations

Improved Noise Schedule for Diffusion Training

ICCV 2025
0
citations

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

ICCV 2025
0
citations

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

CVPR 2024
0
citations

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

CVPR 2024
0
citations

Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting

CVPR 2019
0
citations

Face X-Ray for More General Face Forgery Detection

CVPR 2020
0
citations

Learning Texture Transformer Network for Image Super-Resolution

CVPR 2020arXiv
0
citations

StyleSwin: Transformer-Based GAN for High-Resolution Image Generation

CVPR 2022arXiv
0
citations

Swin Transformer V2: Scaling Up Capacity and Resolution

CVPR 2022arXiv
0
citations

CSWin Transformer: A General Vision Transformer Backbone With Cross-Shaped Windows

CVPR 2022arXiv
0
citations

Vector Quantized Diffusion Model for Text-to-Image Synthesis

CVPR 2022arXiv
0
citations

Protecting Celebrities From DeepFake With Identity Consistency Transformer

CVPR 2022arXiv
0
citations

MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

CVPR 2023
0
citations

RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion

CVPR 2023arXiv
0
citations

iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-Training for Visual Recognition

CVPR 2023
0
citations

Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-Encoders

ICCV 2015
0
citations

Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows

ICCV 2021arXiv
0
citations

Efficient Diffusion Training via Min-SNR Weighting Strategy

ICCV 2023arXiv
0
citations

Adaptive Frequency Filters As Efficient Global Token Mixers

ICCV 2023arXiv
0
citations

Improving CLIP Fine-tuning Performance

ICCV 2023
0
citations

Advancing High-Resolution Video-Language Representation With Large-Scale Video Transcriptions

CVPR 2022arXiv
0
citations

UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping

CVPR 2025
0
citations

Compressing Neural Networks using the Variational Information Bottleneck

ICML 2018
0
citations