Jiashi Feng
19
Papers
571
Total Citations
Papers (19)
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
CVPR 2024
318
citations
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders
CVPR 2025
45
citations
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation
CVPR 2025
44
citations
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
CVPR 2025
38
citations
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
CVPR 2025
28
citations
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation
ICCV 2025
22
citations
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
ICCV 2025
20
citations
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
ICCV 2025
17
citations
MagicArticulate: Make Your 3D Models Articulation-Ready
CVPR 2025
16
citations
AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models
ICLR 2024
12
citations
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams
ICCV 2025
11
citations
QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing
ICCV 2025
0
citations
Parallelized Autoregressive Visual Generation
CVPR 2025
0
citations
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
CVPR 2025
0
citations
MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval
CVPR 2024
0
citations
PixelLM: Pixel Reasoning with Large Multimodal Model
CVPR 2024
0
citations
Video Recognition in Portrait Mode
CVPR 2024
0
citations
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
CVPR 2024
0
citations
VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens
CVPR 2024
0
citations