Aliaksandr Siarohin
29 Papers
663 Total Citations
Papers (29)
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
CVPR 2024
341
citations
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
ICLR 2025 (arXiv)
114
citations
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
CVPR 2025
78
citations
Multi-subject Open-set Personalization in Video Generation
CVPR 2025 (arXiv)
40
citations
Improving the Diffusability of Autoencoders
ICML 2025
34
citations
4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
CVPR 2025
18
citations
Video Motion Transfer with Diffusion Transformers
CVPR 2025
18
citations
DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models
NeurIPS 2025
11
citations
GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement
ICLR 2025
9
citations
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
CVPR 2024
0
citations
Mind the Time: Temporally-Controlled Multi-Event Video Generation
CVPR 2025
0
citations
Deformable GANs for Pose-Based Human Image Generation
CVPR 2018 (arXiv)
0
citations
Animating Arbitrary Objects via Deep Motion Transfer
CVPR 2019
0
citations
Unsupervised Domain Adaptation Using Feature-Whitening and Consensus Loss
CVPR 2019
0
citations
Motion Representations for Articulated Animation
CVPR 2021 (arXiv)
0
citations
Playable Video Generation
CVPR 2021 (arXiv)
0
citations
Playable Environments: Video Manipulation in Space and Time
CVPR 2022
0
citations
DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis
CVPR 2023 (arXiv)
0
citations
Invertible Neural Skinning
CVPR 2023 (arXiv)
0
citations
3DAvatarGAN: Bridging Domains for Personalized Editable Avatars
CVPR 2023 (arXiv)
0
citations
Unsupervised Volumetric Animation
CVPR 2023 (arXiv)
0
citations
InfiniCity: Infinite-Scale City Synthesis
ICCV 2023 (arXiv)
0
citations
3D-Aware Semantic-Guided Generative Model for Human Synthesis
ECCV 2022
0
citations
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
ICCV 2025
0
citations
SPAD: Spatially Aware Multi-View Diffusers
CVPR 2024
0
citations
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
CVPR 2024
0
citations
Towards Text-guided 3D Scene Composition
CVPR 2024
0
citations
First Order Motion Model for Image Animation
NeurIPS 2019
0
citations
Autodecoding Latent 3D Diffusion Models
NeurIPS 2023
0
citations