Jiashi Feng
127
Papers
1,954
Total Citations
Papers (127)
Dual Path Networks
NeurIPS 2017arXiv
883
citations
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
CVPR 2024
318
citations
Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis
NeurIPS 2017
166
citations
Tree-Structured Reinforcement Learning for Sequential Object Localization
NeurIPS 2016arXiv
129
citations
Predicting Scene Parsing and Motion Dynamics in the Future
NeurIPS 2017arXiv
78
citations
Adversarial Self-Supervised Learning for Semi-Supervised 3D Action Recognition
ECCV 2020
76
citations
Multimodal Learning and Reasoning for Visual Question Answering
NeurIPS 2017
51
citations
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders
CVPR 2025
45
citations
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation
CVPR 2025
44
citations
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
CVPR 2025
38
citations
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
CVPR 2025
28
citations
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation
ICCV 2025
22
citations
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
ICCV 2025
20
citations
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
ICCV 2025
17
citations
MagicArticulate: Make Your 3D Models Articulation-Ready
CVPR 2025
16
citations
AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models
ICLR 2024
12
citations
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams
ICCV 2025
11
citations
Deep Joint Rain Detection and Removal From a Single Image
CVPR 2017arXiv
0
citations
Deep Self-Taught Learning for Weakly Supervised Object Localization
CVPR 2017arXiv
0
citations
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
CVPR 2025
0
citations
Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach
CVPR 2017arXiv
0
citations
Outlier-Robust Tensor PCA
CVPR 2017
0
citations
Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks
CVPR 2017
0
citations
Learning Detection With Diverse Proposals
CVPR 2017arXiv
0
citations
MoNet: Deep Motion Exploitation for Video Object Segmentation
CVPR 2018
0
citations
Adversarial Complementary Learning for Weakly Supervised Object Localization
CVPR 2018arXiv
0
citations
Deep Adversarial Subspace Clustering
CVPR 2018
0
citations
Human Pose Estimation With Parsing Induced Learner
CVPR 2018
0
citations
Towards Pose Invariant Face Recognition in the Wild
CVPR 2018
0
citations
Left-Right Comparative Recurrent Model for Stereo Matching
CVPR 2018arXiv
0
citations
Zigzag Learning for Weakly Supervised Object Detection
CVPR 2018arXiv
0
citations
Weakly Supervised Phrase Localization With Multi-Scale Anchored Transformer Network
CVPR 2018
0
citations
Learning Markov Clustering Networks for Scene Text Detection
CVPR 2018arXiv
0
citations
Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation
CVPR 2018arXiv
0
citations
Graph-Based Global Reasoning Networks
CVPR 2019
0
citations
Frame-Consistent Recurrent Video Deraining With Dual-Level Flow
CVPR 2019
0
citations
A Simple Pooling-Based Design for Real-Time Salient Object Detection
CVPR 2019
0
citations
Distilling Object Detectors With Fine-Grained Feature Imitation
CVPR 2019
0
citations
Few-Shot Adaptive Faster R-CNN
CVPR 2019
0
citations
Partial Order Pruning: For Best Speed/Accuracy Trade-Off in Neural Architecture Search
CVPR 2019
0
citations
PPDM: Parallel Point Detection and Matching for Real-Time Human-Object Interaction Detection
CVPR 2020arXiv
0
citations
Central Similarity Quantization for Efficient Image and Video Retrieval
CVPR 2020arXiv
0
citations
Revisiting Knowledge Distillation via Label Smoothing Regularization
CVPR 2020
0
citations
Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
CVPR 2020arXiv
0
citations
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer
CVPR 2020arXiv
0
citations
Overcoming Classifier Imbalance for Long-Tail Object Detection With Balanced Group Softmax
CVPR 2020arXiv
0
citations
Boosting Few-Shot Learning With Adaptive Margin Loss
CVPR 2020arXiv
0
citations
Improving Convolutional Networks With Self-Calibrated Convolutions
CVPR 2020
0
citations
Body Meshes as Points
CVPR 2021arXiv
0
citations
Coordinate Attention for Efficient Mobile Network Design
CVPR 2021arXiv
0
citations
Domain Adaptation With Auxiliary Target Domain-Oriented Classifier
CVPR 2021arXiv
0
citations
PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation
CVPR 2021arXiv
0
citations
Continual Learning via Bit-Level Information Preserving
CVPR 2021arXiv
0
citations
DINE: Domain Adaptation From Single and Multiple Black-Box Predictors
CVPR 2022arXiv
0
citations
Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning
CVPR 2022arXiv
0
citations
MetaFormer Is Actually What You Need for Vision
CVPR 2022arXiv
0
citations
Shunted Self-Attention via Multi-Scale Token Aggregation
CVPR 2022arXiv
0
citations
PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision
CVPR 2022arXiv
0
citations
Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring
CVPR 2023arXiv
0
citations
TAPS3D: Text-Guided 3D Textured Shape Generation From Pseudo Supervision
CVPR 2023arXiv
0
citations
Diffusion Probabilistic Model Made Slim
CVPR 2023arXiv
0
citations
OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis
CVPR 2023arXiv
0
citations
Clover: Towards a Unified Video-Language Alignment and Fusion Model
CVPR 2023arXiv
0
citations
Learning The Structure of Deep Convolutional Networks
ICCV 2015
0
citations
Neural Person Search Machines
ICCV 2017arXiv
0
citations
FoveaNet: Perspective-Aware Urban Scene Parsing
ICCV 2017arXiv
0
citations
Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection
ICCV 2017
0
citations
Regional Interactive Image Segmentation Networks
ICCV 2017
0
citations
Video Scene Parsing With Predictive Feature Learning
ICCV 2017arXiv
0
citations
MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input
ICCV 2019
0
citations
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution
ICCV 2019
0
citations
Dynamic Kernel Distillation for Efficient Pose Estimation in Videos
ICCV 2019
0
citations
Single-Stage Multi-Person Pose Machines
ICCV 2019
0
citations
Few-Shot Object Detection via Feature Reweighting
ICCV 2019
0
citations
Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification
ICCV 2019
0
citations
PANet: Few-Shot Image Semantic Segmentation With Prototype Alignment
ICCV 2019
0
citations
PnP-DETR: Towards Efficient Visual Analysis With Transformers
ICCV 2021
0
citations
Voxel Transformer for 3D Object Detection
ICCV 2021arXiv
0
citations
Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet
ICCV 2021arXiv
0
citations
AutoSpace: Neural Architecture Search With Less Human Interference
ICCV 2021arXiv
0
citations
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation
ICCV 2023arXiv
0
citations
Dataset Quantization
ICCV 2023arXiv
0
citations
GETAvatar: Generative Textured Meshes for Animatable Human Avatars
ICCV 2023
0
citations
Rethinking Bottleneck Structure for Efficient Mobile Network Design
ECCV 2020
0
citations
A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation
ECCV 2020
0
citations
The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation
ECCV 2020
0
citations
Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering
ECCV 2022
0
citations
Slim Scissors: Segmenting Thin Object from Synthetic Background
ECCV 2022
0
citations
Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search
CVPR 2017
0
citations
Parallelized Autoregressive Visual Generation
CVPR 2025
0
citations
QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing
ICCV 2025
0
citations
MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval
CVPR 2024
0
citations
PixelLM: Pixel Reasoning with Large Multimodal Model
CVPR 2024
0
citations
Video Recognition in Portrait Mode
CVPR 2024
0
citations
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
CVPR 2024
0
citations
VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens
CVPR 2024
0
citations
Reversible Recursive Instance-Level Object Segmentation
CVPR 2016
0
citations
Recurrently Target-Attending Tracking
CVPR 2016
0
citations
Recurrent Face Aging
CVPR 2016
0
citations
Highway Vehicle Counting in Compressed Domain
CVPR 2016
0
citations
Semantic Object Parsing With Local-Global Long Short-Term Memory
CVPR 2016
0
citations
Natural Language Object Retrieval
CVPR 2016
0
citations
Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization
CVPR 2016
0
citations
Interpretable Structure-Evolving LSTM
CVPR 2017arXiv
0
citations
Perceptual Generative Adversarial Networks for Small Object Detection
CVPR 2017arXiv
0
citations
New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity
NeurIPS 2018
0
citations
A^2-Nets: Double Attention Networks
NeurIPS 2018
0
citations
Efficient Stochastic Gradient Hard Thresholding
NeurIPS 2018
0
citations
Efficient Meta Learning via Minibatch Proximal Update
NeurIPS 2019
0
citations
Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation
NeurIPS 2020
0
citations
Improving Generalization in Reinforcement Learning with Mixture Regularization
NeurIPS 2020
0
citations
Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts
NeurIPS 2020
0
citations
ConvBERT: Improving BERT with Span-based Dynamic Convolution
NeurIPS 2020
0
citations
Towards Theoretically Understanding Why Sgd Generalizes Better Than Adam in Deep Learning
NeurIPS 2020
0
citations
No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data
NeurIPS 2021
0
citations
Direct Multi-view Multi-person 3D Pose Estimation
NeurIPS 2021
0
citations
All Tokens Matter: Token Labeling for Training Better Vision Transformers
NeurIPS 2021
0
citations
Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond
NeurIPS 2021
0
citations
Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning
NeurIPS 2021
0
citations
Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning
NeurIPS 2022
0
citations
Sharpness-Aware Training for Free
NeurIPS 2022
0
citations
Self-Supervised Aggregation of Diverse Experts for Test-Agnostic Long-Tailed Recognition
NeurIPS 2022
0
citations
XAGen: 3D Expressive Human Avatars Generation
NeurIPS 2023
0
citations
Expanding Small-Scale Datasets with Guided Imagination
NeurIPS 2023
0
citations
WSNet: Compact and Efficient Networks Through Weight Sampling
ICML 2018
0
citations
Policy Optimization with Demonstrations
ICML 2018
0
citations
Understanding Generalization and Optimization Performance of Deep CNNs
ICML 2018
0
citations