🧬Vision Recognition

Semantic Segmentation

Pixel-wise classification of images

100 papers2,541 total citations
Compare with other topics
Feb '24 Jan '26437 papers
Also includes: semantic segmentation, scene segmentation, pixel classification

Top Papers

#1

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Yunyang Xiong, Balakrishnan Varadarajan, Lemeng Wu et al.

CVPR 2024
241
citations
#2

SCTNet: Single Branch CNN with Transformer Semantic Information for Real-Time Segmentation

Authors: Zhengze Xu, Dongyue Wu, Changqian Yu et al.

AAAI 2024arXiv:2312.17071
real-time segmentationsemantic segmentationsingle branch cnntransformer semantic information+3
126
citations
#3

SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference

Feng Wang, Jieru Mei, Alan Yuille

ECCV 2024
120
citations
#4

Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation

ZHIXIANG WEI, Lin Chen, Xiaoxiao Ma et al.

CVPR 2024
100
citations
#5

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation

Bin Xie, Jiale Cao, Jin Xie et al.

CVPR 2024
90
citations
#6

Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation

Sihan liu, Yiwei Ma, Xiaoqing Zhang et al.

CVPR 2024
89
citations
#7

GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

Yichi Zhang, Ziqiao Ma, Xiaofeng Gao et al.

CVPR 2024
75
citations
#8

ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference

Mengcheng Lan, Chaofeng Chen, Yiping Ke et al.

ECCV 2024
68
citations
#9

OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning

Haiyang Ying, Yixuan Yin, Jinzhi Zhang et al.

CVPR 2024
66
citations
#10

Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation

Shuting He, Henghui Ding

CVPR 2024
64
citations
#11

Towards Semantic Equivalence of Tokenization in Multimodal LLM

Shengqiong Wu, Hao Fei, Xiangtai Li et al.

ICLR 2025
57
citations
#12

MuSc: Zero-Shot Industrial Anomaly Classification and Segmentation with Mutual Scoring of the Unlabeled Images

Xurui Li, Ziming Huang, Feng Xue et al.

ICLR 2024
55
citations
#13

A Decade's Battle on Dataset Bias: Are We There Yet?

Zhuang Liu, Kaiming He

ICLR 2025
52
citations
#14

FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally

Qiuhong Shen, Xingyi Yang, Xinchao Wang

ECCV 2024
45
citations
#15

Better Call SAL: Towards Learning to Segment Anything in Lidar

Aljoša Ošep, Tim Meinhardt, Francesco Ferroni et al.

ECCV 2024
38
citations
#16

Distilling Semantic Priors from SAM to Efficient Image Restoration Models

Quan Zhang, Xiaoyu Liu, Wei Li et al.

CVPR 2024
36
citations
#17

RobustSAM: Segment Anything Robustly on Degraded Images

Wei-Ting Chen, Yu Jiet Vong, Sy-Yen Kuo et al.

CVPR 2024
35
citations
#18

Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction

Hao Li, Ying Chen, Yifei Chen et al.

CVPR 2024
32
citations
#19

CFR-ICL: Cascade-Forward Refinement with Iterative Click Loss for Interactive Image Segmentation

Shoukun Sun, Min Xian, Fei Xu et al.

AAAI 2024arXiv:2303.05620
interactive image segmentationclick-based segmentationcascade-forward refinementiterative click loss+4
32
citations
#20

Text4Seg: Reimagining Image Segmentation as Text Generation

Mengcheng Lan, Chaofeng Chen, Yue Zhou et al.

ICLR 2025
32
citations
#21

Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures

Yannick Kirchhoff, Maximilian Rokuss, Saikat Roy et al.

ECCV 2024
31
citations
#22

EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation

Chanyoung Kim, Woojung Han, Dayun Ju et al.

CVPR 2024
30
citations
#23

Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning

Woo-Jin Ahn, Geun-Yeong Yang, Hyunduck Choi et al.

CVPR 2024
30
citations
#24

Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation

Xiaoyi Bao, Jie Qin, Siyang Sun et al.

AAAI 2024arXiv:2312.06474
few-shot semantic segmentationintrinsic feature enhancementmulti-level prototype generationsemantic ambiguity+4
30
citations
#25

Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation

feilong tang, Zhongxing Xu, Zhaojun QU et al.

CVPR 2024
29
citations
#26

Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation

Bingfeng Zhang, Siyue Yu, Yunchao Wei et al.

CVPR 2024
29
citations
#27

RUN: Reversible Unfolding Network for Concealed Object Segmentation

Chunming He, Rihan Zhang, Fengyang Xiao et al.

ICML 2025
28
citations
#28

Open-Vocabulary Semantic Segmentation with Image Embedding Balancing

Xiangheng Shan, Dongyue Wu, Guilin Zhu et al.

CVPR 2024
28
citations
#29

Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation

Qiyuan Dai, Sibei Yang

CVPR 2024
25
citations
#30

Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation

Yuan Wang, Rui Sun, Naisong Luo et al.

CVPR 2024
25
citations
#31

Tyche: Stochastic In-Context Learning for Medical Image Segmentation

Marianne Rakic, Hallee Wong, Jose Javier Gonzalez Ortiz et al.

CVPR 2024
24
citations
#32

Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline

Junlong Cheng, Bin Fu, Jin Ye et al.

CVPR 2025
24
citations
#33

Reliability in Semantic Segmentation: Can We Use Synthetic Data?

Thibaut Loiseau, Tuan Hung Vu, Mickael Chen et al.

ECCV 2024
22
citations
#34

Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement

Jing Wang, Jiangyun Li, Chen Chen et al.

AAAI 2024arXiv:2312.15731
few-shot segmentationprototype enhancementadapter mechanismmeta-learning+3
21
citations
#35

Region-Adaptive Transform with Segmentation Prior for Image Compression

Yuxi Liu, Wenhan Yang, Huihui Bai et al.

ECCV 2024arXiv:2403.00628
learned image compressionregion-adaptive transformsegmentation prioradaptive convolutions+3
21
citations
#36

When Semantic Segmentation Meets Frequency Aliasing

Linwei Chen, Lin Gu, Ying Fu

ICLR 2024
21
citations
#37

Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach

Mir Rayat Imtiaz Hossain, Mennatullah Siam, Leonid Sigal et al.

CVPR 2024
21
citations
#38

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Yunheng Li, Zhong-Yu Li, Quan-Sheng Zeng et al.

ICML 2024
vision-language modelszero-shot semantic segmentationmulti-level visual featuresembedding alignment+2
20
citations
#39

Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation

Jiaqi Huang, Zunnan Xu, Ting Liu et al.

AAAI 2025
20
citations
#40

The Devil is in Temporal Token: High Quality Video Reasoning Segmentation

Sitong Gong, Yunzhi Zhuge, Lu Zhang et al.

CVPR 2025
19
citations
#41

Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation

Ji-Jia Wu, Andy Chia-Hao Chang, Chieh-Yu Chuang et al.

CVPR 2024
19
citations
#42

SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds

Yanbo Wang, Wentao Zhao, Cao Chuan et al.

ECCV 2024
17
citations
#43

Weakly Supervised Semantic Segmentation for Driving Scenes

Dongseob Kim, Seungho Lee, Junsuk Choe et al.

AAAI 2024arXiv:2312.13646
weakly supervised semantic segmentationdriving scene datasetscontrastive language-image pre-trainingsmall object detection+4
17
citations
#44

Scalable Image Tokenization with Index Backpropagation Quantization

Fengyuan Shi, Zhuoyan Luo, Yixiao Ge et al.

ICCV 2025
16
citations
#45

PartSTAD: 2D-to-3D Part Segmentation Task Adaptation

Hyunjin Kim, Minhyuk Sung

ECCV 2024
16
citations
#46

Learning Hierarchical Image Segmentation For Recognition and By Recognition

Tsung-Wei Ke, Sangwoo Mo, Stella Yu

ICLR 2024
16
citations
#47

Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms

Joren Brunekreef, Eric Marcus, Ray Sheombarsing et al.

CVPR 2024
15
citations
#48

MeshSegmenter: Zero-Shot Mesh Segmentation via Texture Synthesis

ziming zhong, Yanyu Xu, Jing Li et al.

ECCV 2024
15
citations
#49

Versatile Medical Image Segmentation Learned from Multi-Source Datasets via Model Self-Disambiguation

Xiaoyang Chen, Hao Zheng, Yuemeng LI et al.

CVPR 2024
15
citations
#50

OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

Kwanyoung Kim, Yujin Oh, Jong Chul Ye

ECCV 2024
13
citations
#51

MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes

Bor Shiun Wang, Chien-Yi Wang, Wei-Chen Chiu

CVPR 2024
13
citations
#52

Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation

Zhenxin Lei, Man Yao, Jiakui Hu et al.

AAAI 2025
13
citations
#53

PH-Net: Semi-Supervised Breast Lesion Segmentation via Patch-wise Hardness

Siyao Jiang, Huisi Wu, Junyang Chen et al.

CVPR 2024
13
citations
#54

USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

Xiaoqi Wang, Wenbin He, Xiwei Xuan et al.

CVPR 2024
13
citations
#55

Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation

Xinliang Zhang, Lei Zhu, Hangzhou He et al.

AAAI 2024arXiv:2402.17555
weakly-supervised semantic segmentationscribble annotationpseudo-label generationlocalization rectification module+3
13
citations
#56

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

Seonghoon Yu, Paul Hongsuck Seo, Jeany Son

ECCV 2024
12
citations
#57

Discover and Mitigate Multiple Biased Subgroups in Image Classifiers

Zeliang Zhang, Mingqian Feng, Zhiheng Li et al.

CVPR 2024
12
citations
#58

Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation

Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao et al.

ECCV 2024
12
citations
#59

InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models

Cong Wei, Yujie Zhong, yingsen zeng et al.

ICCV 2025
12
citations
#60

SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression

Qingwen Bu, Sungrae Park, Minsoo Khang et al.

AAAI 2024arXiv:2308.10531
text detectionsegmentation-based methodsregression-based methodsdetr-based model+3
12
citations
#61

Prototype-Based Image Prompting for Weakly Supervised Histopathological Image Segmentation

Qingchen Tang, Lei Fan, Maurice Pagnucco et al.

CVPR 2025
12
citations
#62

Improving Bird's Eye View Semantic Segmentation by Task Decomposition

Tianhao Zhao, Yongcan Chen, Yu Wu et al.

CVPR 2024
12
citations
#63

Audio-Visual Instance Segmentation

Ruohao Guo, Xianghua Ying, Yaru Chen et al.

CVPR 2025
11
citations
#64

BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion

Zhaochen Liu, Zhixuan Li, Tingting Jiang

AAAI 2024arXiv:2401.01642
amodal segmentationbox-level supervisiondirected expansionoccluded objects+3
11
citations
#65

Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball

Simon Weber, Barış Zöngür, Nikita Araslanov et al.

CVPR 2024
11
citations
#66

Semantic and Sequential Alignment for Referring Video Object Segmentation

Feiyu Pan, Hao Fang, Fangkai Li et al.

CVPR 2025
11
citations
#67

Stable Segment Anything Model

Qi Fan, Xin Tao, Lei Ke et al.

ICLR 2025
11
citations
#68

Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation

Yuheng Shi, Minjing Dong, Chang Xu

ICCV 2025
11
citations
#69

PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation

Ning Gao, Sanping Zhou, Le Wang et al.

ECCV 2024
10
citations
#70

ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention

Jiawei Wang, Changjian Li

CVPR 2024
10
citations
#71

HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model

Tao Wang, Changxu Cheng, Lingfeng Wang et al.

ICCV 2025
9
citations
#72

RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything

Shilin Xu, Haobo Yuan, Qingyu Shi et al.

ICLR 2025
9
citations
#73

ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation

Shiqi Huang, Shuting He, Bihan Wen

AAAI 2025
9
citations
#74

DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation

Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.

ECCV 2024
8
citations
#75

Unsupervised Audio-Visual Segmentation with Modality Alignment

Swapnil Bhosale, Haosen Yang, Diptesh Kanojia et al.

AAAI 2025
8
citations
#76

Topograph: An Efficient Graph-Based Framework for Strictly Topology Preserving Image Segmentation

Laurin Lux, Alexander H Berger, Alexander Weers et al.

ICLR 2025
8
citations
#77

SemReg: Semantics Constrained Point Cloud Registration

Sheldon Fung, Xuequan Lu, Dasith de Silva Edirimuni et al.

ECCV 2024
point cloud registrationcross-modal frameworksemantic feature fusioncross-attention mechanism+3
7
citations
#78

Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model

Xu Yuan, Li Zhou, Zenghui Sun et al.

AAAI 2025
7
citations
#79

M3amba: Memory Mamba is All You Need for Whole Slide Image Classification

Tingting Zheng, Kui Jiang, Yi Xiao et al.

CVPR 2025
7
citations
#80

Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation

Anqi Zhang, Guangyu Gao

ECCV 2024
7
citations
#81

FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification

Zhengrui Guo, Conghao Xiong, Jiabo MA et al.

CVPR 2025
7
citations
#82

Knowledge-Enhanced Historical Document Segmentation and Recognition

En-Hao Gao, Yu-Xuan Huang, Wen-Chao Hu et al.

AAAI 2024
7
citations
#83

Extreme Point Supervised Instance Segmentation

Hyeonjun Lee, Sehyun Hwang, Suha Kwak

CVPR 2024
7
citations
#84

SMITE: Segment Me In TimE

Amirhossein Alimohammadi, Sauradip Nag, Saeid Asgari et al.

ICLR 2025
7
citations
#85

ML-SemReg: Boosting Point Cloud Registration with Multi-level Semantic Consistency

Shaocheng Yan, Pengcheng Shi, Jiayuan Li

ECCV 2024
6
citations
#86

Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification

Yanghao Wang, Long Chen

CVPR 2025
6
citations
#87

Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation

Reza Qorbani, Gianluca Villani, Theodoros Panagiotakopoulos et al.

CVPR 2025
6
citations
#88

Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation

Aishik Konwer, Zhijian Yang, Erhan Bas et al.

CVPR 2025
6
citations
#89

DyCON: Dynamic Uncertainty-aware Consistency and Contrastive Learning for Semi-supervised Medical Image Segmentation

Maregu Assefa, Muzammal Naseer, IYYAKUTTI IYAPPAN GANAPATHI et al.

CVPR 2025arXiv:2504.04566
semi-supervised learningmedical image segmentationconsistency learningcontrastive learning+4
6
citations
#90

SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images

josh myers-dean, Jarek T Reynolds, Brian Price et al.

ECCV 2024
6
citations
#91

ProtoArgNet: Interpretable Image Classification with Super-Prototypes and Argumentation

Hamed Ayoobi, Nico Potyka, Francesca Toni

AAAI 2025
6
citations
#92

EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation

Hongwei Niu, Jie Hu, Jianghang Lin et al.

AAAI 2025
6
citations
#93

Exploring Simple Open-Vocabulary Semantic Segmentation

Zihang Lai

CVPR 2025
6
citations
#94

Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation

Xinyu Yang, Hossein Rahmani, Sue Black et al.

ECCV 2024arXiv:2402.17891
weakly supervised semantic segmentationclass activation mapspseudo-label generationonline cam optimization+3
5
citations
#95

Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation

Hao Zhu, Yan Zhu, Jiayu Xiao et al.

CVPR 2025
5
citations
#96

POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation

Jian Wang, Tianhong Dai, Bingfeng Zhang et al.

CVPR 2025
5
citations
#97

AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation

Jiannan Ge, Lingxi Xie, Hongtao Xie et al.

ECCV 2024
5
citations
#98

Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation

Chang Liu, Giulia Rizzoli, Pietro Zanuttigh et al.

ECCV 2024arXiv:2407.13363
semantic segmentationweakly-supervised learningincremental learningweb image mining+3
5
citations
#99

Seg2Reg: Differentiable 2D Segmentation to 1D Regression Rendering for 360 Room Layout Reconstruction

Cheng Sun, Wei-En Tai, Yu-Lin Shih et al.

CVPR 2024
5
citations
#100

Interpretable Image Classification via Non-parametric Part Prototype Learning

Zhijie Zhu, Lei Fan, Maurice Pagnucco et al.

CVPR 2025
5
citations