Poster "diffusion transformer" Papers
11 papers found
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
Zhen Han, Zeyinzi Jiang, Yulin Pan et al.
ICLR 2025 | poster | arXiv:2410.00086
43 citations
Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration
Junyuan Deng, Xinyi Wu, Yongxing Yang et al.
CVPR 2025 | poster | arXiv:2504.15159
3 citations
Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation
Youwei Zheng, Yuxi Ren, Xin Xia et al.
ICCV 2025 | poster | arXiv:2510.09094
4 citations
DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion
Maksim Siniukov, Di Chang, Minh Tran et al.
ICCV 2025 | poster | arXiv:2504.04010
3 citations
IRASim: A Fine-Grained World Model for Robot Manipulation
Fangqi Zhu, Hongtao Wu, Song Guo et al.
ICCV 2025 | poster | arXiv:2406.14540
21 citations
Language-Guided Image Tokenization for Generation
Kaiwen Zha, Lijun Yu, Alireza Fathi et al.
CVPR 2025 | poster | arXiv:2412.05796
23 citations
Multi-subject Open-set Personalization in Video Generation
Tsai-Shien Chen, Aliaksandr Siarohin, Willi Menapace et al.
CVPR 2025 | poster | arXiv:2501.06187
40 citations
TokMan: Tokenize Manhattan Mask Optimization for Inverse Lithography
Yiwen Wu, Yuyang Chen, Ye Xia et al.
NeurIPS 2025 | poster
VACE: All-in-One Video Creation and Editing
Zeyinzi Jiang, Zhen Han, Chaojie Mao et al.
ICCV 2025 | poster | arXiv:2503.07598
169 citations
VideoVLA: Video Generators Can Be Generalizable Robot Manipulators
Yichao Shen, Fangyun Wei, Zhiying Du et al.
NeurIPS 2025 | poster | arXiv:2512.06963
3 citations
CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling
Junchao Gong, Lei Bai, Peng Ye et al.
ICML 2024 | poster