Aliaksandr Siarohin

29
Papers
663
Total Citations

Papers (29)

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

CVPR 2024
341
citations

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control

ICLR 2025arXiv
114
citations

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

CVPR 2025
78
citations

Multi-subject Open-set Personalization in Video Generation

CVPR 2025arXiv
40
citations

Improving the Diffusability of Autoencoders

ICML 2025
34
citations

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion

CVPR 2025
18
citations

Video Motion Transfer with Diffusion Transformers

CVPR 2025
18
citations

DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models

NeurIPS 2025
11
citations

GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement

ICLR 2025
9
citations

Hierarchical Patch Diffusion Models for High-Resolution Video Generation

CVPR 2024
0
citations

Mind the Time: Temporally-Controlled Multi-Event Video Generation

CVPR 2025
0
citations

Deformable GANs for Pose-Based Human Image Generation

CVPR 2018arXiv
0
citations

Animating Arbitrary Objects via Deep Motion Transfer

CVPR 2019
0
citations

Unsupervised Domain Adaptation Using Feature-Whitening and Consensus Loss

CVPR 2019
0
citations

Motion Representations for Articulated Animation

CVPR 2021arXiv
0
citations

Playable Video Generation

CVPR 2021arXiv
0
citations

Playable Environments: Video Manipulation in Space and Time

CVPR 2022
0
citations

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis

CVPR 2023arXiv
0
citations

Invertible Neural Skinning

CVPR 2023arXiv
0
citations

3DAvatarGAN: Bridging Domains for Personalized Editable Avatars

CVPR 2023arXiv
0
citations

Unsupervised Volumetric Animation

CVPR 2023arXiv
0
citations

InfiniCity: Infinite-Scale City Synthesis

ICCV 2023arXiv
0
citations

3D-Aware Semantic-Guided Generative Model for Human Synthesis

ECCV 2022
0
citations

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

ICCV 2025
0
citations

SPAD: Spatially Aware Multi-View Diffusers

CVPR 2024
0
citations

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

CVPR 2024
0
citations

Towards Text-guided 3D Scene Composition

CVPR 2024
0
citations

First Order Motion Model for Image Animation

NeurIPS 2019
0
citations

Autodecoding Latent 3D Diffusion Models

NeurIPS 2023
0
citations