Shuicheng Yan

84
Papers
1,914
Total Citations

Papers (84)

Dual Path Networks

NeurIPS 2017arXiv
883
citations

Highly Efficient Salient Object Detection with 100K Parameters

ECCV 2020
198
citations

Matching-CNN Meets KNN: Quasi-Parametric Human Parsing

CVPR 2015
168
citations

Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis

NeurIPS 2017
166
citations

Tree-Structured Reinforcement Learning for Sequential Object Localization

NeurIPS 2016arXiv
129
citations

Point Cloud Mamba: Point Cloud Learning via State Space Model

AAAI 2025
81
citations

Predicting Scene Parsing and Motion Dynamics in the Future

NeurIPS 2017arXiv
78
citations

Towards Semantic Equivalence of Tokenization in Multimodal LLM

ICLR 2025
57
citations

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

ICLR 2025
43
citations

MoH: Multi-Head Attention as Mixture-of-Head Attention

ICML 2025
37
citations

MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

ICLR 2025
31
citations

Improving Video Segmentation via Dynamic Anchor Queries

ECCV 2024
19
citations

Explore In-Context Segmentation via Latent Diffusion Models

AAAI 2025
14
citations

JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

NeurIPS 2025arXiv
9
citations

PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model

AAAI 2025
1
citations

Perceptual Generative Adversarial Networks for Small Object Detection

CVPR 2017arXiv
0
citations

Deep Joint Rain Detection and Removal From a Single Image

CVPR 2017arXiv
0
citations

Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search

CVPR 2017
0
citations

Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach

CVPR 2017arXiv
0
citations

Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF

CVPR 2017
0
citations

More Is Less: A More Complicated Network With Less Inference Complexity

CVPR 2017arXiv
0
citations

Human Pose Estimation With Parsing Induced Learner

CVPR 2018
0
citations

Towards Pose Invariant Face Recognition in the Wild

CVPR 2018
0
citations

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

CVPR 2018arXiv
0
citations

Neural Style Transfer via Meta Networks

CVPR 2018
0
citations

AdversarialNAS: Adversarial Neural Architecture Search for GANs

CVPR 2020arXiv
0
citations

PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer

CVPR 2020arXiv
0
citations

MetaFormer Is Actually What You Need for Vision

CVPR 2022arXiv
0
citations

Deep Color Consistent Network for Low-Light Image Enhancement

CVPR 2022
0
citations

Position-Guided Text Prompt for Vision-Language Pre-Training

CVPR 2023arXiv
0
citations

Exploring Incompatible Knowledge Transfer in Few-Shot Image Generation

CVPR 2023arXiv
0
citations

Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection

ICCV 2015
0
citations

Cross-Domain Image Retrieval With a Dual Attribute-Aware Ranking Network

ICCV 2015
0
citations

Task-Driven Feature Pooling for Image Classification

ICCV 2015
0
citations

Human Parsing With Contextualized Convolutional Neural Network

ICCV 2015
0
citations

Additive Nearest Neighbor Feature Maps

ICCV 2015
0
citations

Conditional Convolutional Neural Network for Modality-Aware Face Recognition

ICCV 2015
0
citations

Personalized Age Progression With Aging Dictionary

ICCV 2015
0
citations

Neural Person Search Machines

ICCV 2017arXiv
0
citations

FoveaNet: Perspective-Aware Urban Scene Parsing

ICCV 2017arXiv
0
citations

Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection

ICCV 2017
0
citations

Scale-Adaptive Convolutions for Scene Parsing

ICCV 2017
0
citations

Video Scene Parsing With Predictive Feature Learning

ICCV 2017arXiv
0
citations

Single-Stage Multi-Person Pose Machines

ICCV 2019
0
citations

Very Long Natural Scenery Image Prediction by Outpainting

ICCV 2019
0
citations

PnP-DETR: Towards Efficient Visual Analysis With Transformers

ICCV 2021
0
citations

Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet

ICCV 2021arXiv
0
citations

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition

ICCV 2023arXiv
0
citations

Masked Diffusion Transformer is a Strong Image Synthesizer

ICCV 2023arXiv
0
citations

Rethinking Bottleneck Structure for Efficient Mobile Network Design

ECCV 2020
0
citations

Self-Promoted Supervision for Few-Shot Transformer

ECCV 2022
0
citations

Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering

ECCV 2022
0
citations

Improving Vision Transformers by Revisiting High-Frequency Components

ECCV 2022
0
citations

DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition

ECCV 2022
0
citations

Video Graph Transformer for Video Question Answering

ECCV 2022
0
citations

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution

ICCV 2019
0
citations

Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning

AAAI 2025
0
citations

InceptionNeXt: When Inception Meets ConvNeXt

CVPR 2024
0
citations

Structural Sparse Tracking

CVPR 2015
0
citations

Shape Driven Kernel Adaptation in Convolutional Neural Network for Robust Facial Traits Recognition

CVPR 2015
0
citations

Simultaneous Feature Learning and Hash Coding With Deep Neural Networks

CVPR 2015
0
citations

Motion Part Regularization: Improving Action Recognition via Trajectory Selection

CVPR 2015
0
citations

Deep Domain Adaptation for Describing People Based on Fine-Grained Clothing Attributes

CVPR 2015
0
citations

SOLD: Sub-Optimal Low-rank Decomposition for Efficient Video Segmentation

CVPR 2015
0
citations

Reversible Recursive Instance-Level Object Segmentation

CVPR 2016
0
citations

Recurrently Target-Attending Tracking

CVPR 2016
0
citations

Recurrent Face Aging

CVPR 2016
0
citations

Semantic Object Parsing With Local-Global Long Short-Term Memory

CVPR 2016
0
citations

Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization

CVPR 2016
0
citations

Interpretable Structure-Evolving LSTM

CVPR 2017arXiv
0
citations

A^2-Nets: Double Attention Networks

NeurIPS 2018
0
citations

Efficient Meta Learning via Minibatch Proximal Update

NeurIPS 2019
0
citations

ConvBERT: Improving BERT with Span-based Dynamic Convolution

NeurIPS 2020
0
citations

How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial Robustness?

NeurIPS 2021
0
citations

Direct Multi-view Multi-person 3D Pose Estimation

NeurIPS 2021
0
citations

Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond

NeurIPS 2021
0
citations

EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine

NeurIPS 2022
0
citations

Inception Transformer

NeurIPS 2022arXiv
0
citations

Mutual Information Regularized Offline Reinforcement Learning

NeurIPS 2023
0
citations

Gaussian Mixture Solvers for Diffusion Models

NeurIPS 2023
0
citations

On Calibrating Diffusion Probabilistic Models

NeurIPS 2023
0
citations

Efficient Diffusion Policies For Offline Reinforcement Learning

NeurIPS 2023
0
citations

ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection

NeurIPS 2023
0
citations

WSNet: Compact and Efficient Networks Through Weight Sampling

ICML 2018
0
citations