Jiashi Feng

19 Papers
571 Total Citations

Papers (19)

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

CVPR 2024
318 citations

Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

CVPR 2025
45 citations

Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation

CVPR 2025
44 citations

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

CVPR 2025
38 citations

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

CVPR 2025
28 citations

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

ICCV 2025
22 citations

The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer

ICCV 2025
20 citations

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

ICCV 2025
17 citations

MagicArticulate: Make Your 3D Models Articulation-Ready

CVPR 2025
16 citations

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

ICLR 2024
12 citations

Flash-VStream: Efficient Real-Time Understanding for Long Video Streams

ICCV 2025
11 citations

QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing

ICCV 2025
0 citations

Parallelized Autoregressive Visual Generation

CVPR 2025
0 citations

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

CVPR 2025
0 citations

MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval

CVPR 2024
0 citations

PixelLM: Pixel Reasoning with Large Multimodal Model

CVPR 2024
0 citations

Video Recognition in Portrait Mode

CVPR 2024
0 citations

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

CVPR 2024
0 citations

VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens

CVPR 2024
0 citations