Ming-Ming Cheng

86
Papers
633
Total Citations

Papers (86)

Deep Hough Transform for Semantic Line Detection

ECCV 2020
218
citations

Highly Efficient Salient Object Detection with 100K Parameters

ECCV 2020
198
citations

DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation

ICLR 2024
96
citations

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

ICML 2024
20
citations

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

ICCV 2025
20
citations

TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes

CVPR 2024
15
citations

Fine-Grained Knowledge Selection and Restoration for Non-exemplar Class Incremental Learning

AAAI 2024arXiv
13
citations

From Words to Worth: Newborn Article Impact Prediction with LLM

AAAI 2025
11
citations

Task-Adaptive Saliency Guidance for Exemplar-free Class Incremental Learning

CVPR 2024
8
citations

DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data

NeurIPS 2025
8
citations

TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs

NeurIPS 2025
6
citations

Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction

ICCV 2025
6
citations

KAC: Kolmogorov-Arnold Classifier for Continual Learning

CVPR 2025
5
citations

Towards RAW Object Detection in Diverse Conditions

CVPR 2025
5
citations

Re-Aligning Language to Visual Objects with an Agentic Workflow

ICLR 2025
3
citations

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing

ICCV 2025
1
citations

Deeply Supervised Salient Object Detection With Short Connections

CVPR 2017arXiv
0
citations

GMS: Grid-based Motion Statistics for Fast, Ultra-Robust Feature Correspondence

CVPR 2017
0
citations

Revisiting Video Saliency: A Large-Scale Benchmark and a New Model

CVPR 2018arXiv
0
citations

Crowd Counting With Deep Negative Correlation Learning

CVPR 2018
0
citations

RegularFace: Deep Face Recognition via Exclusive Regularization

CVPR 2019
0
citations

Multi-Level Context Ultra-Aggregation for Stereo Matching

CVPR 2019
0
citations

A Simple Pooling-Based Design for Real-Time Salient Object Detection

CVPR 2019
0
citations

Contrast Prior and Fluid Pyramid Integration for RGBD Salient Object Detection

CVPR 2019
0
citations

An Iterative and Cooperative Top-Down and Bottom-Up Inference Network for Salient Object Detection

CVPR 2019
0
citations

S4Net: Single Stage Salient-Instance Segmentation

CVPR 2019
0
citations

Shifting More Attention to Video Salient Object Detection

CVPR 2019
0
citations

IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition

CVPR 2019
0
citations

Camouflaged Object Detection

CVPR 2020
0
citations

Taking a Deeper Look at Co-Salient Object Detection

CVPR 2020
0
citations

Rethinking Computer-Aided Tuberculosis Diagnosis

CVPR 2020
0
citations

Strip Pooling: Rethinking Spatial Pooling for Scene Parsing

CVPR 2020arXiv
0
citations

VecRoad: Point-Based Iterative Graph Exploration for Road Graphs Extraction

CVPR 2020
0
citations

Interactive Image Segmentation With First Click Attention

CVPR 2020
0
citations

Improving Convolutional Networks With Self-Calibrated Convolutions

CVPR 2020
0
citations

DOTS: Decoupling Operation and Topology in Differentiable Architecture Search

CVPR 2021arXiv
0
citations

Temporal Modulation Network for Controllable Space-Time Video Super-Resolution

CVPR 2021arXiv
0
citations

Representative Batch Normalization With Feature Calibration

CVPR 2021
0
citations

Global2Local: Efficient Structure Search for Video Action Segmentation

CVPR 2021arXiv
0
citations

FocusCut: Diving Into a Focus View in Interactive Segmentation

CVPR 2022
0
citations

Representation Compensation Networks for Continual Semantic Segmentation

CVPR 2022arXiv
0
citations

Towards an End-to-End Framework for Flow-Guided Video Inpainting

CVPR 2022arXiv
0
citations

Localization Distillation for Dense Object Detection

CVPR 2022arXiv
0
citations

Multi-Space Neural Radiance Fields

CVPR 2023arXiv
0
citations

Endpoints Weight Fusion for Class Incremental Semantic Segmentation

CVPR 2023
0
citations

Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections

CVPR 2023arXiv
0
citations

AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation

CVPR 2023arXiv
0
citations

Structure-Measure: A New Way to Evaluate Foreground Maps

ICCV 2017
0
citations

Zero-Shot Emotion Recognition via Affective Structural Embedding

ICCV 2019
0
citations

Integral Object Mining via Online Attention Accumulation

ICCV 2019
0
citations

Scoot: A Perceptual Metric for Facial Sketches

ICCV 2019
0
citations

DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation

CVPR 2025
0
citations

Optimizing the F-Measure for Threshold-Free Salient Object Detection

ICCV 2019
0
citations

Image Inpainting With Learnable Bidirectional Attention Maps

ICCV 2019
0
citations

Joint Acne Image Grading and Counting via Label Distribution Learning

ICCV 2019
0
citations

Personalized Image Semantic Segmentation

ICCV 2021arXiv
0
citations

iNAS: Integral NAS for Device-Aware Salient Object Detection

ICCV 2021
0
citations

Masked Autoencoders are Efficient Class Incremental Learners

ICCV 2023arXiv
0
citations

Masked Diffusion Transformer is a Strong Image Synthesizer

ICCV 2023arXiv
0
citations

SLAN: Self-Locator Aided Network for Vision-Language Understanding

ICCV 2023
0
citations

Large Selective Kernel Network for Remote Sensing Object Detection

ICCV 2023arXiv
0
citations

SRFormer: Permuted Self-Attention for Single Image Super-Resolution

ICCV 2023arXiv
0
citations

Gradient-Induced Co-Saliency Detection

ECCV 2020
0
citations

VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

ECCV 2022
0
citations

"Restore Globally, Refine Locally: A Mask-Guided Scheme to Accelerate Super-Resolution Networks"

ECCV 2022
0
citations

Long-Tailed Class Incremental Learning

ECCV 2022
0
citations

EGNet: Edge Guidance Network for Salient Object Detection

ICCV 2019
0
citations

GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery

CVPR 2025
0
citations

RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark

CVPR 2025
0
citations

TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction

ICCV 2025
0
citations

Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing

ICCV 2025
0
citations

Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment

ICCV 2025
0
citations

Advancing Textual Prompt Learning with Anchored Attributes

ICCV 2025
0
citations

AR-1-to-3: Single Image to Consistent 3D Object via Next-View Prediction

ICCV 2025
0
citations

Knowledge Graph Enhanced Generative Multi-modal Models for Class-Incremental Learning

NeurIPS 2025
0
citations

CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation

CVPR 2024
0
citations

CrossKD: Cross-Head Knowledge Distillation for Object Detection

CVPR 2024
0
citations

Traffic Scene Parsing through the TSP6K Dataset

CVPR 2024
0
citations

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

CVPR 2024
0
citations

Generative Multi-modal Models are Good Class Incremental Learners

CVPR 2024
0
citations

Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach

CVPR 2017arXiv
0
citations

Richer Convolutional Features for Edge Detection

CVPR 2017arXiv
0
citations

Self-Erasing Network for Integral Object Attention

NeurIPS 2018
0
citations

Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video

NeurIPS 2019
0
citations

ICNet: Intra-saliency Correlation Network for Co-Saliency Detection

NeurIPS 2020
0
citations

SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation

NeurIPS 2022
0
citations