Jiashi Feng

127
Papers
1,954
Total Citations

Papers (127)

Dual Path Networks

NeurIPS 2017arXiv
883
citations

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

CVPR 2024
318
citations

Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis

NeurIPS 2017
166
citations

Tree-Structured Reinforcement Learning for Sequential Object Localization

NeurIPS 2016arXiv
129
citations

Predicting Scene Parsing and Motion Dynamics in the Future

NeurIPS 2017arXiv
78
citations

Adversarial Self-Supervised Learning for Semi-Supervised 3D Action Recognition

ECCV 2020
76
citations

Multimodal Learning and Reasoning for Visual Question Answering

NeurIPS 2017
51
citations

Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

CVPR 2025
45
citations

Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation

CVPR 2025
44
citations

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

CVPR 2025
38
citations

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

CVPR 2025
28
citations

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

ICCV 2025
22
citations

The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer

ICCV 2025
20
citations

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

ICCV 2025
17
citations

MagicArticulate: Make Your 3D Models Articulation-Ready

CVPR 2025
16
citations

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

ICLR 2024
12
citations

Flash-VStream: Efficient Real-Time Understanding for Long Video Streams

ICCV 2025
11
citations

Deep Joint Rain Detection and Removal From a Single Image

CVPR 2017arXiv
0
citations

Deep Self-Taught Learning for Weakly Supervised Object Localization

CVPR 2017arXiv
0
citations

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

CVPR 2025
0
citations

Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach

CVPR 2017arXiv
0
citations

Outlier-Robust Tensor PCA

CVPR 2017
0
citations

Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks

CVPR 2017
0
citations

Learning Detection With Diverse Proposals

CVPR 2017arXiv
0
citations

MoNet: Deep Motion Exploitation for Video Object Segmentation

CVPR 2018
0
citations

Adversarial Complementary Learning for Weakly Supervised Object Localization

CVPR 2018arXiv
0
citations

Deep Adversarial Subspace Clustering

CVPR 2018
0
citations

Human Pose Estimation With Parsing Induced Learner

CVPR 2018
0
citations

Towards Pose Invariant Face Recognition in the Wild

CVPR 2018
0
citations

Left-Right Comparative Recurrent Model for Stereo Matching

CVPR 2018arXiv
0
citations

Zigzag Learning for Weakly Supervised Object Detection

CVPR 2018arXiv
0
citations

Weakly Supervised Phrase Localization With Multi-Scale Anchored Transformer Network

CVPR 2018
0
citations

Learning Markov Clustering Networks for Scene Text Detection

CVPR 2018arXiv
0
citations

Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation

CVPR 2018arXiv
0
citations

Graph-Based Global Reasoning Networks

CVPR 2019
0
citations

Frame-Consistent Recurrent Video Deraining With Dual-Level Flow

CVPR 2019
0
citations

A Simple Pooling-Based Design for Real-Time Salient Object Detection

CVPR 2019
0
citations

Distilling Object Detectors With Fine-Grained Feature Imitation

CVPR 2019
0
citations

Few-Shot Adaptive Faster R-CNN

CVPR 2019
0
citations

Partial Order Pruning: For Best Speed/Accuracy Trade-Off in Neural Architecture Search

CVPR 2019
0
citations

PPDM: Parallel Point Detection and Matching for Real-Time Human-Object Interaction Detection

CVPR 2020arXiv
0
citations

Central Similarity Quantization for Efficient Image and Video Retrieval

CVPR 2020arXiv
0
citations

Revisiting Knowledge Distillation via Label Smoothing Regularization

CVPR 2020
0
citations

Strip Pooling: Rethinking Spatial Pooling for Scene Parsing

CVPR 2020arXiv
0
citations

PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer

CVPR 2020arXiv
0
citations

Overcoming Classifier Imbalance for Long-Tail Object Detection With Balanced Group Softmax

CVPR 2020arXiv
0
citations

Boosting Few-Shot Learning With Adaptive Margin Loss

CVPR 2020arXiv
0
citations

Improving Convolutional Networks With Self-Calibrated Convolutions

CVPR 2020
0
citations

Body Meshes as Points

CVPR 2021arXiv
0
citations

Coordinate Attention for Efficient Mobile Network Design

CVPR 2021arXiv
0
citations

Domain Adaptation With Auxiliary Target Domain-Oriented Classifier

CVPR 2021arXiv
0
citations

PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation

CVPR 2021arXiv
0
citations

Continual Learning via Bit-Level Information Preserving

CVPR 2021arXiv
0
citations

DINE: Domain Adaptation From Single and Multiple Black-Box Predictors

CVPR 2022arXiv
0
citations

Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning

CVPR 2022arXiv
0
citations

MetaFormer Is Actually What You Need for Vision

CVPR 2022arXiv
0
citations

Shunted Self-Attention via Multi-Scale Token Aggregation

CVPR 2022arXiv
0
citations

PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision

CVPR 2022arXiv
0
citations

Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring

CVPR 2023arXiv
0
citations

TAPS3D: Text-Guided 3D Textured Shape Generation From Pseudo Supervision

CVPR 2023arXiv
0
citations

Diffusion Probabilistic Model Made Slim

CVPR 2023arXiv
0
citations

OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis

CVPR 2023arXiv
0
citations

Clover: Towards a Unified Video-Language Alignment and Fusion Model

CVPR 2023arXiv
0
citations

Learning The Structure of Deep Convolutional Networks

ICCV 2015
0
citations

Neural Person Search Machines

ICCV 2017arXiv
0
citations

FoveaNet: Perspective-Aware Urban Scene Parsing

ICCV 2017arXiv
0
citations

Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection

ICCV 2017
0
citations

Regional Interactive Image Segmentation Networks

ICCV 2017
0
citations

Video Scene Parsing With Predictive Feature Learning

ICCV 2017arXiv
0
citations

MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input

ICCV 2019
0
citations

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution

ICCV 2019
0
citations

Dynamic Kernel Distillation for Efficient Pose Estimation in Videos

ICCV 2019
0
citations

Single-Stage Multi-Person Pose Machines

ICCV 2019
0
citations

Few-Shot Object Detection via Feature Reweighting

ICCV 2019
0
citations

Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification

ICCV 2019
0
citations

PANet: Few-Shot Image Semantic Segmentation With Prototype Alignment

ICCV 2019
0
citations

PnP-DETR: Towards Efficient Visual Analysis With Transformers

ICCV 2021
0
citations

Voxel Transformer for 3D Object Detection

ICCV 2021arXiv
0
citations

Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet

ICCV 2021arXiv
0
citations

AutoSpace: Neural Architecture Search With Less Human Interference

ICCV 2021arXiv
0
citations

Global Knowledge Calibration for Fast Open-Vocabulary Segmentation

ICCV 2023arXiv
0
citations

Dataset Quantization

ICCV 2023arXiv
0
citations

GETAvatar: Generative Textured Meshes for Animatable Human Avatars

ICCV 2023
0
citations

Rethinking Bottleneck Structure for Efficient Mobile Network Design

ECCV 2020
0
citations

A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation

ECCV 2020
0
citations

The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation

ECCV 2020
0
citations

Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering

ECCV 2022
0
citations

Slim Scissors: Segmenting Thin Object from Synthetic Background

ECCV 2022
0
citations

Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search

CVPR 2017
0
citations

Parallelized Autoregressive Visual Generation

CVPR 2025
0
citations

QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing

ICCV 2025
0
citations

MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval

CVPR 2024
0
citations

PixelLM: Pixel Reasoning with Large Multimodal Model

CVPR 2024
0
citations

Video Recognition in Portrait Mode

CVPR 2024
0
citations

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

CVPR 2024
0
citations

VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens

CVPR 2024
0
citations

Reversible Recursive Instance-Level Object Segmentation

CVPR 2016
0
citations

Recurrently Target-Attending Tracking

CVPR 2016
0
citations

Recurrent Face Aging

CVPR 2016
0
citations

Highway Vehicle Counting in Compressed Domain

CVPR 2016
0
citations

Semantic Object Parsing With Local-Global Long Short-Term Memory

CVPR 2016
0
citations

Natural Language Object Retrieval

CVPR 2016
0
citations

Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization

CVPR 2016
0
citations

Interpretable Structure-Evolving LSTM

CVPR 2017arXiv
0
citations

Perceptual Generative Adversarial Networks for Small Object Detection

CVPR 2017arXiv
0
citations

New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity

NeurIPS 2018
0
citations

A^2-Nets: Double Attention Networks

NeurIPS 2018
0
citations

Efficient Stochastic Gradient Hard Thresholding

NeurIPS 2018
0
citations

Efficient Meta Learning via Minibatch Proximal Update

NeurIPS 2019
0
citations

Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation

NeurIPS 2020
0
citations

Improving Generalization in Reinforcement Learning with Mixture Regularization

NeurIPS 2020
0
citations

Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts

NeurIPS 2020
0
citations

ConvBERT: Improving BERT with Span-based Dynamic Convolution

NeurIPS 2020
0
citations

Towards Theoretically Understanding Why Sgd Generalizes Better Than Adam in Deep Learning

NeurIPS 2020
0
citations

No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data

NeurIPS 2021
0
citations

Direct Multi-view Multi-person 3D Pose Estimation

NeurIPS 2021
0
citations

All Tokens Matter: Token Labeling for Training Better Vision Transformers

NeurIPS 2021
0
citations

Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond

NeurIPS 2021
0
citations

Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning

NeurIPS 2021
0
citations

Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning

NeurIPS 2022
0
citations

Sharpness-Aware Training for Free

NeurIPS 2022
0
citations

Self-Supervised Aggregation of Diverse Experts for Test-Agnostic Long-Tailed Recognition

NeurIPS 2022
0
citations

XAGen: 3D Expressive Human Avatars Generation

NeurIPS 2023
0
citations

Expanding Small-Scale Datasets with Guided Imagination

NeurIPS 2023
0
citations

WSNet: Compact and Efficient Networks Through Weight Sampling

ICML 2018
0
citations

Policy Optimization with Demonstrations

ICML 2018
0
citations

Understanding Generalization and Optimization Performance of Deep CNNs

ICML 2018
0
citations