Most Cited 2025 "learning rate warmup" Papers

22,274 papers found • Page 83 of 112

#16401

PBFG: A New Physically-Based Dataset and Removal of Lens Flares and Glares

Jie Zhu, Sungkil Lee

ICCV 2025
#16402

Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild

Haoran Wang, Zekun Li, Jian Zhang et al.

ICCV 2025arXiv:2508.07759
#16403

An Information-Theoretic Regularizer for Lossy Neural Image Compression

ZHANG YINGWEN, Meng Wang, Xihua Sheng et al.

ICCV 2025arXiv:2411.16727
#16404

Knowledge-Guided Part Segmentation

Xuejian Gou, Fang Liu, Licheng Jiao et al.

ICCV 2025
#16405

Controllable Feature Whitening for Hyperparameter-Free Bias Mitigation

Yooshin Cho, Hanbyel Cho, Janghyeon Lee et al.

ICCV 2025arXiv:2507.20284
#16406

InfoBridge: Balanced Multimodal Integration through Conditional Dependency Modeling

Chenxin Li, Yifan Liu, Panwang Pan et al.

ICCV 2025
#16407

FusionPhys: A Flexible Framework for Fusing Complementary Sensing Modalities in Remote Physiological Measurement

Chenhang Ying, Huiyu Yang, Jieyi Ge et al.

ICCV 2025
#16408

LLM-assisted Entropy-based Adaptive Distillation for Unsupervised Fine-grained Visual Representation Learning

Jianfeng Dong, Danfeng Luo, Daizong Liu et al.

ICCV 2025
#16409

Don’t Let It Fade: Preserving Edits in Diffusion Language Models via Token Timestep Allocation

Woojin Kim, Jaeyoung Do

NEURIPS 2025
#16410

Power of Cooperative Supervision: Multiple Teachers Framework for Advanced 3D Semi-Supervised Object Detection

Jin-Hee Lee, Jae-keun Lee, Jeseok Kim et al.

ICCV 2025
#16411

ASGS: Single-Domain Generalizable Open-Set Object Detection via Adaptive Subgraph Searching

Yuxuan Yuan, Luyao Tang, Chaoqi Chen et al.

ICCV 2025
#16412

DADet: Safeguarding Image Conditional Diffusion Models against Adversarial and Backdoor Attacks via Diffusion Anomaly Detection

Hongwei Yu, Xinlong Ding, Jiawei Li et al.

ICCV 2025highlight
#16413

LEGO-Maker: A Semantic-Driven Algorithm for Text-to-3D Generation

Yifei Zhang, Lei Chen

ICCV 2025
#16414

COVTrack: Continuous Open-Vocabulary Tracking via Adaptive Multi-Cue Fusion

Zekun Qian, Ruize Han, Zhixiang Wang et al.

ICCV 2025
#16415

MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning

Mattia Segu, Marta Tintore Gazulla, Yongqin Xian et al.

ICCV 2025arXiv:2510.15026
#16416

monoVLN: Bridging the Observation Gap between Monocular and Panoramic Vision and Language Navigation

Ren-Jie Lu, Yu Zhou, hao cheng et al.

ICCV 2025
#16417

CIARD: Cyclic Iterative Adversarial Robustness Distillation

Liming Lu, Shuchao Pang, Xu Zheng et al.

ICCV 2025arXiv:2509.12633
#16418

Multi-head Temporal Latent Attention

Keqi Deng, Phil Woodland

NEURIPS 2025oralarXiv:2505.13544
#16419

Performing Defocus Deblurring by Modeling its Formation Process

Zhengbo Zhang, Lin Geng Foo, Hossein Rahmani et al.

ICCV 2025
#16420

CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance

Peiqi Chen, Lei Yu, Yi Wan et al.

ICCV 2025highlightarXiv:2507.17312
#16421

Supervised Exploratory Learning for Long-Tailed Visual Recognition

Zhongquan Jian, Yanhao Chen, Wangyancheng Wangyancheng et al.

ICCV 2025
#16422

Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts

Chiao-An Yang, Kuan-Chuan Peng, Raymond A. Yeh

ICCV 2025arXiv:2507.16946
#16423

More Reliable Pseudo-labels, Better Performance: A Generalized Approach to Single Positive Multi-label Learning

Luong Tran, Thieu Vo, Anh Nguyen et al.

ICCV 2025arXiv:2508.20381
#16424

DCHM: Depth-Consistent Human Modeling for Multiview Detection

Jiahao Ma, Tianyu Wang, Miaomiao Liu et al.

ICCV 2025arXiv:2507.14505
#16425

Adversarial Robustness of Discriminative Self-Supervised Learning in Vision

Ömer Veysel Çağatan, Ömer TAL, M. Emre Gursoy

ICCV 2025arXiv:2503.06361
#16426

Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments

Liang Qin, Min Wang, Peiwei Li et al.

ICCV 2025
#16427

UNIS: A Unified Framework for Achieving Unbiased Neural Implicit Surfaces in Volume Rendering

Junkai Deng, Hanting Niu, Jiaze Li et al.

ICCV 2025
#16428

Partial Forward Blocking: A Novel Data Pruning Paradigm for Lossless Training Acceleration

Dongyue Wu, Zilin Guo, Jialong Zuo et al.

ICCV 2025arXiv:2506.23674
#16429

IntrinsicControlNet: Cross-distribution Image Generation with Real and Unreal

Jiayuan Lu, Rengan Xie, Zixuan Xie et al.

ICCV 2025
#16430

Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection

Subhajit Maity, Ayan Bhunia, Subhadeep Koley et al.

ICCV 2025arXiv:2507.07994
#16431

RODS: Robust Optimization Inspired Diffusion Sampling for Detecting and Reducing Hallucination in Generative Models

Yiqi Tian, Pengfei Jin, Mingze Yuan et al.

NEURIPS 2025arXiv:2507.12201
#16432

Loss Functions for Predictor-based Neural Architecture Search

Han Ji, Yuqi Feng, Jiahao Fan et al.

ICCV 2025arXiv:2506.05869
#16433

Advancing Text-to-3D Generation with Linearized Lookahead Variational Score Distillation

Yu Lei, Bingde Liu, Qingsong Xie et al.

ICCV 2025arXiv:2507.09748
#16434

StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data

Yixu Wang, Yan Teng, Yingchun Wang et al.

ICCV 2025highlightarXiv:2509.23594
#16435

Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds

Pei He, Lingling Li, Licheng Jiao et al.

ICCV 2025arXiv:2508.11265
#16436

GaussianReg: Rapid 2D/3D Registration for Emergency Surgery via Explicit 3D Modeling with Gaussian Primitives

Weihao Yu, Xiaoqing Guo, Xinyu Liu et al.

ICCV 2025
#16437

ArgoTweak: Towards Self-Updating HD Maps through Structured Priors

Lena Wild, Rafael Valencia, Patric Jensfelt

ICCV 2025arXiv:2509.08764
#16438

Event-aided Dense and Continuous Point Tracking: Everywhere and Anytime

Zhexiong Wan, Jianqin Luo, Yuchao Dai et al.

ICCV 2025
#16439

Context-Aware Academic Emotion Dataset and Benchmark

Luming Zhao, Jingwen Xuan, Jiamin Lou et al.

ICCV 2025arXiv:2507.00586
#16440

TPG-INR: Target Prior-Guided Implicit 3D CT Reconstruction for Enhanced Sparse-view Imaging

QingleiCao QingleiCao, Ziyao Tang, Xiaoqin Tang

ICCV 2025highlight
#16441

TITAN: Query-Token based Domain Adaptive Adversarial Learning

Tajamul Ashraf, Janibul Bashir

ICCV 2025arXiv:2506.21484
#16442

Deciphering Cross-Modal Alignment in Large Vision-Language Models via Modality Integration Rate

Qidong Huang, Xiaoyi Dong, Pan Zhang et al.

ICCV 2025
#16443

Efficient Visual Place Recognition Through Multimodal Semantic Knowledge Integration

Sitao Zhang, Hongda Mao, Qingshuang Chen et al.

ICCV 2025
#16444

COME: Dual Structure-Semantic Learning with Collaborative MoE for Universal Lesion Detection Across Heterogeneous Ultrasound Datasets

Lingyu Chen, Yawen Zeng, Yue Wang et al.

ICCV 2025arXiv:2508.09886
#16445

NATRA: Noise-Agnostic Framework for Trajectory Prediction with Noisy Observations

Rongqing Li, Changsheng Li, Ruilin Lv et al.

ICCV 2025
#16446

MS3D: High-Quality 3D Generation via Multi-Scale Representation Modeling

Guan Luo, Jianfeng Zhang

ICCV 2025
#16447

UniDxMD: Towards Unified Representation for Cross-Modal Unsupervised Domain Adaptation in 3D Semantic Segmentation

Zhengyin Liang, Hui Yin, Min Liang et al.

ICCV 2025highlight
#16448

Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics

Keming Wu, Junwen Chen, Zhanhao Liang et al.

ICCV 2025
#16449

PLAN: Proactive Low-Rank Allocation for Continual Learning

XIEQUN WANG, Zhan Zhuang, Yu Zhang

ICCV 2025arXiv:2510.21188
#16450

Leveraging Spatial Invariance to Boost Adversarial Transferability

Zihan Zhou, LI LI, Yanli Ren et al.

ICCV 2025
#16451

One Encoder to Rule them All: Representation Learning for Model-free Visual Reinforcement Learning using Fourier Neural Operators

Parag Dutta, Mohd Ayyoob, Shalabh Bhatnagar et al.

ICCV 2025
#16452

FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning

Maximilian Hoefler, Karsten Mueller, Wojciech Samek

ICCV 2025
#16453

Visual Textualization for Image Prompted Object Detection

Yongjian Wu, Yang Zhou, Jiya Saiyin et al.

ICCV 2025arXiv:2506.23785
#16454

Test-Time Prompt Tuning for Zero-Shot Depth Completion

Chanhwi Jeong, Inhwan Bae, Jin-Hwi Park et al.

ICCV 2025highlight
#16455

LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs

Haoran Lou, Chunxiao Fan, Ziyan Liu et al.

ICCV 2025arXiv:2507.00505
#16456

Transformer-based Tooth Alignment Prediction with Occlusion and Collision Constraints

DongZhenXing DongZhenXing, Jiazhou Chen

ICCV 2025arXiv:2410.20806
#16457

SD2Actor: Continuous State Decomposition via Diffusion Embeddings for Robotic Manipulation

lijiayi jiayi

ICCV 2025
#16458

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis

Xinyu Hou, Zongsheng Yue, Xiaoming Li et al.

ICCV 2025arXiv:2411.17769
#16459

Learning Counterfactually Decoupled Attention for Open-World Model Attribution

Yu Zheng, Boyang Gong, Fanye Kong et al.

ICCV 2025arXiv:2506.23074
#16460

Scene Graph Guided Generation: Enable Accurate Relations Generation in Text-to-Image Models via Textural Rectification

Guibao SHEN, Luozhou Wang, Jiantao Lin et al.

ICCV 2025
#16461

ReMP-AD: Retrieval-enhanced Multi-modal Prompt Fusion for Few-Shot Industrial Visual Anomaly Detection

Hongchi Ma, Guanglei Yang, Debin Zhao et al.

ICCV 2025
#16462

GMMamba: Group Masking Mamba for Whole Slide Image Classification

Tingting Zheng, Hongxun Yao, Kui Jiang et al.

ICCV 2025
#16463

RareCLIP: Rarity-aware Online Zero-shot Industrial Anomaly Detection

Jianfang He, Min Cao, Silong Peng et al.

ICCV 2025
#16464

Temporal Rate Reduction Clustering for Human Motion Segmentation

Xianghan Meng, Zhengyu Tong, Zhiyuan Huang et al.

ICCV 2025arXiv:2506.21249
#16465

Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction

Hongyang Sun, Qinglin Yang, Jiawei Wang et al.

ICCV 2025
#16466

Backdoor Mitigation by Distance-Driven Detoxification

Shaokui Wei, Jiayin Liu, Hongyuan Zha

ICCV 2025highlightarXiv:2411.09585
#16467

Democratizing High-Fidelity Co-Speech Gesture Video Generation

Xu Yang, Shaoli Huang, Shenbo Xie et al.

ICCV 2025arXiv:2507.06812
#16468

HFD-Teacher: High-Frequency Depth Distillation from Depth Foundation Models for Enhanced Depth Completion

Zhiyuan Yang, Anqi Cheng, Haiyue Zhu et al.

ICCV 2025
#16469

Failure Cases Are Better Learned But Boundary Says Sorry: Facilitating Smooth Perception Change for Accuracy-Robustness Trade-Off in Adversarial Training

Yanyun Wang, Li Liu

ICCV 2025arXiv:2508.02186
#16470

Separation for Better Integration: Disentangling Edge and Motion in Event-based Deblurring

Yufei Zhu, Hao Chen, Yongjian Deng et al.

ICCV 2025
#16471

CMAD: Correlation-Aware and Modalities-Aware Distillation for Multimodal Sentiment Analysis with Missing Modalities

Yan Zhuang, Minhao Liu, Wei Bai et al.

ICCV 2025
#16472

FedWSQ: Efficient Federated Learning with Weight Standardization and Distribution-Aware Non-Uniform Quantization

Seung-Wook Kim, Seongyeol Kim, Jiah Kim et al.

ICCV 2025arXiv:2506.23516
#16473

Diversity-Enhanced Distribution Alignment for Dataset Distillation

Hongcheng Li, Yucan Zhou, Xiaoyan Gu et al.

ICCV 2025
#16474

Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection

Hanshi Wang, Jin Gao, Weiming Hu et al.

ICCV 2025highlightarXiv:2507.04369
#16475

SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking

Sixian Chan, Zedong Li, Xiaoqin Zhang et al.

ICCV 2025highlight
#16476

Two Losses, One Goal: Balancing Conflict Gradients for Semi-supervised Semantic Segmentation

Rui Sun, Huayu Mai, Wangkai Li et al.

ICCV 2025highlight
#16477

CMB-ML: A Cosmic Microwave Background Dataset for the Oldest Possible Computer Vision Task

James Amato, Yunan Xie, Leonel Medina-Varela et al.

ICCV 2025
#16478

Adapt Foundational Segmentation Models with Heterogeneous Searching Space

Li Yi, Jie Hu, Songan Zhang et al.

ICCV 2025
#16479

Think Twice: Test-Time Reasoning for Robust CLIP Zero-Shot Classification

Shenyu Lu, Zhaoying Pan, Xiaoqian Wang

ICCV 2025
#16480

Adversarial Purification via Super-Resolution and Diffusion

Mincheol Park, Cheonjun Park, Seungseop Lim et al.

ICCV 2025
#16481

FEVER-OOD: Free Energy Vulnerability Elimination for Robust Out-of-Distribution Detection

Brian Isaac-Medina, Mauricio Che, Yona Falinie A. Gaus et al.

ICCV 2025arXiv:2412.01596
#16482

EditCLIP: Representation Learning for Image Editing

Qian Wang, Aleksandar Cvejic, Abdelrahman Eldesokey et al.

ICCV 2025arXiv:2503.20318
#16483

Allowing Oscillation Quantization: Overcoming Solution Space Limitation in Low Bit-Width Quantization

Weiying Xie, Zihan Meng, Jitao Ma et al.

ICCV 2025
#16484

SDFormer: Vision-based 3D Semantic Scene Completion via SAM-assisted Dual-channel Voxel Transformer

Yujie Xue, Huilong Pi, Jiapeng Zhang et al.

ICCV 2025
#16485

TopoTTA: Topology-Enhanced Test-Time Adaptation for Tubular Structure Segmentation

Jiale Zhou, Wenhan Wang, Shikun Li et al.

ICCV 2025arXiv:2508.00442
#16486

Multimodal Large Language Model-Guided ISP Hyperparameter Optimization with Dynamic Preference Learning

Xinyu Sun, Zhikun Zhao, congyan lang et al.

ICCV 2025
#16487

DeFSS: Image-to-Mask Denoising Learning for Few-shot Segmentation

Zishu Qin, Junhao Xu, Weifeng Ge

ICCV 2025
#16488

A Generalized Label Shift Perspective for Cross-Domain Gaze Estimation

Hao-Ran Yang, Xiaohui Chen, Chuan-Xian Ren

NEURIPS 2025arXiv:2505.13043
#16489

TAD-E2E: A Large-scale End-to-end Autonomous Driving Dataset

Chang Liu, mingxuzhu mingxuzhu, Zheyuan Zhang et al.

ICCV 2025
#16490

Photolithography Overlay Map Generation with Implicit Knowledge Distillation Diffusion Transformer

YuanFu Yang, Hsiu-Hui Hsiao

ICCV 2025
#16491

What's Making That Sound Right Now? Video-centric Audio-Visual Localization

hahyeon choi, Junhoo Lee, Nojun Kwak

ICCV 2025arXiv:2507.04667
#16492

VehicleMAE: View-asymmetry Mutual Learning for Vehicle Re-identification Pre-training via Masked AutoEncoders

Qi Wang, Zeyu Zhang, Dong Wang et al.

ICCV 2025
#16493

ReTracker: Exploring Image Matching for Robust Online Any Point Tracking

Dongli Tan, Xingyi He, Sida Peng et al.

ICCV 2025highlight
#16494

MagicCity: Geometry-Aware 3D City Generation from Satellite Imagery with Multi-View Consistency

Xingbo YAO, xuanmin Wang, Hao WU et al.

ICCV 2025
#16495

Multi-scenario Overlapping Text Segmentation with Depth Awareness

Yang Liu, Xudong Xie, Yuliang Liu et al.

ICCV 2025
#16496

FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention

Xuan Ju, Weicai Ye, Quande Liu et al.

ICCV 2025
#16497

SC-Lane: Slope-aware and Consistent Road Height Estimation Framework for 3D Lane Detection

Chaesong Park, Eunbin Seo, JihyeonHwang JihyeonHwang et al.

ICCV 2025arXiv:2508.10411
#16498

Long-Tailed Classification with Multi-Granularity Semantics

Yuting Liu, Liu Yang, Yu Wang

ICCV 2025
#16499

ConceptSplit: Decoupled Multi-Concept Personalization of Diffusion Models via Token-wise Adaptation and Attention Disentanglement

Habin Lim, Youngseob Won, Juwon Seo et al.

ICCV 2025arXiv:2510.04668
#16500

Backdoor Defense via Enhanced Splitting and Trap Isolation

Hongrui Yu, Lu Qi, Wanyu Lin et al.

ICCV 2025
#16501

Learning Hierarchical Line Buffer for Image Processing

Jiacheng Li, Feiran Li, Daisuke Iso

ICCV 2025
#16502

Attention to the Burtiness in Visual Prompt Tuning!

Yuzhu Wang, Manni Duan, Shu Kong

ICCV 2025
#16503

Humans as Checkerboards: Calibrating Camera Motion Scale for World-Coordinate Human Mesh Recovery

Fengyuan Yang, Kerui Gu, Ha Linh Nguyen et al.

ICCV 2025arXiv:2407.00574
#16504

Overcoming Dual Drift for Continual Long-Tailed Visual Question Answering

Feifei Zhang, Zhihao Wang, Xi Zhang et al.

ICCV 2025
#16505

Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy

Yunchuan Guan, Yu Liu, Ke Zhou et al.

ICCV 2025arXiv:2509.13185
#16506

GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion

Gwanghyun Kim, Xueting Li, Ye Yuan et al.

ICCV 2025arXiv:2505.23085
#16507

χ: Symmetry Understanding of 3D Shapes via Chirality Disentanglement

Weikang Wang, Tobias Weißberg, Nafie El Amrani et al.

ICCV 2025
#16508

Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration

Baoyou Chen, Ce Liu, Weihao Yuan et al.

ICCV 2025highlightarXiv:2506.13355
#16509

COSTARR: Consolidated Open Set Technique with Attenuation for Robust Recognition

Ryan Rabinowitz, Steve Cruz, Walter Scheirer et al.

ICCV 2025arXiv:2508.01087
#16510

Prototype-based Contrastive Learning with Stage-wise Progressive Augmentation for Self-Supervised Fine-Grained Learning

BaoFeng Tan, Xiu-Shen Wei, Lin Zhao

ICCV 2025
#16511

Neural Architecture Search Driven by Locally Guided Diffusion for Personalized Federated Learning

PENG LIAO, Xilu Wang, Yaochu Jin et al.

ICCV 2025
#16512

Hierarchical 3D Scene Graphs Construction Outdoors

Jon Nyffeler, Federico Tombari, Daniel Barath

ICCV 2025
#16513

Cycle-Consistent Learning for Joint Layout-to-Image Generation and Object Detection

Xinhao Cai, Qiuxia Lai, Gensheng Pei et al.

ICCV 2025
#16514

Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba for End-to-end Whole Slide Image Analysis

Zhongwei Qiu, Hanqing Chao, Tiancheng Lin et al.

ICCV 2025
#16515

Neuroverse3D: Developing In-Context Learning Universal Model for Neuroimaging in 3D

Jiesi Hu, Hanyang Peng, Yanwu Yang et al.

ICCV 2025arXiv:2503.02410
#16516

Incremental Few-Shot Semantic Segmentation via Multi-Level Switchable Visual Prompts

Maoxian Wan, Kaige Li, Qichuan Geng et al.

ICCV 2025
#16517

Noise-Modeled Diffusion Models for Low-Light Spike Image Restoration

Ruonan Liu, Lin Zhu, Xijie Xiang et al.

ICCV 2025highlight
#16518

StyleSRN: Scene Text Image Super-Resolution with Text Style Embedding

Shengrong Yuan, Runmin Wang, Ke Hao et al.

ICCV 2025
#16519

Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation

Zheng Gao, Jifei Song, Zhensong Zhang et al.

ICCV 2025
#16520

Personalized Federated Learning under Local Supervision

Qiqi Liu, Jiaqiang Li, Yuchen Liu et al.

ICCV 2025
#16521

Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition

Wenhan Wu, Zhishuai Guo, Chen Chen et al.

ICCV 2025arXiv:2506.22179
#16522

How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game

Ziyue Wang, Yurui Dong, Fuwen Luo et al.

ICCV 2025
#16523

Towards Human-like Virtual Beings: Simulating Human Behavior in 3D Scenes

CHEN LIANG, Wenguan Wang, Yi Yang

ICCV 2025
#16524

Cross-Category Subjectivity Generalization for Style-Adaptive Sketch Re-ID

Zechao Hu, Zhengwei Yang, Hao Li et al.

ICCV 2025
#16525

S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction

Guangting Zheng, Jiajun Deng, Xiaomeng Chu et al.

ICCV 2025arXiv:2503.08217
#16526

The Source Image is the Best Attention for Infrared and Visible Image Fusion

Song Wang, Xie Han, Liqun Kuang et al.

ICCV 2025
#16527

Uncalibrated Structure from Motion on a Sphere

Jonathan Ventura, Viktor Larsson, Fredrik Kahl

ICCV 2025
#16528

To Label or Not to Label: PALM – A Predictive Model for Evaluating Sample Efficiency in Active Learning Models

Julia Machnio, Mads Nielsen, Mostafa Mehdipour Ghazi

ICCV 2025arXiv:2507.15381
#16529

Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and Beyond

Xin Qiao, Matteo Poggi, Xing Wei et al.

ICCV 2025arXiv:2511.01704
#16530

Wave-MambaAD: Wavelet-driven State Space Model for Multi-class Unsupervised Anomaly Detection

Qiao Zhang, Mingwen Shao, Xinyuan Chen et al.

ICCV 2025
#16531

3D Test-time Adaptation via Graph Spectral Driven Point Shift

Xin Wei, Qin Yang, Yijie Fang et al.

ICCV 2025arXiv:2507.18225
#16532

Task-Decoupled Bézier Surface Constraint for Uneven Low-Light Image Enhancement

Xingxiang Zhou, Xiangdong Su, Haoran Zhang et al.

ICCV 2025
#16533

EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation

Zengyu Wan, Wei Zhai, Yang Cao et al.

ICCV 2025arXiv:2503.11371
#16534

ZIUM: Zero-Shot Intent-Aware Adversarial Attack on Unlearned Models

Hyun Jun Yook, Ga San Jhun, Cho Hyun et al.

ICCV 2025arXiv:2507.21985
#16535

KDA: Knowledge Diffusion Alignment with Enhanced Context for Video Temporal Grounding

Ran Ran, Jiwei Wei, Shiyuan He et al.

ICCV 2025
#16536

STEP-DETR: Advancing DETR-based Semi-Supervised Object Detection with Super Teacher and Pseudo-Label Guided Text Queries

Tahira Shehzadi, Khurram Azeem Hashmi, Shalini Sarode et al.

ICCV 2025
#16537

Text-to-Any-Skeleton Motion Generation Without Retargeting

Qingyuan Liu, Ke Lv, Kun Dong et al.

ICCV 2025
#16538

Completing 3D Partial Assemblies with View-Consistent 2D-3D Correspondence

Weihao Wang, Yu Lan, Mingyu You et al.

ICCV 2025
#16539

Aligning Global Semantics and Local Textures in Generative Video Enhancement

Zhikai Chen, Fuchen Long, Zhaofan Qiu et al.

ICCV 2025
#16540

Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation

Fengchen He, Dayang Zhao, Hao Xu et al.

ICCV 2025arXiv:2503.11213
#16541

Mind the Gap: Preserving and Compensating for the Modality Gap in CLIP-Based Continual Learning

Linlan Huang, Xusheng Cao, Haori Lu et al.

ICCV 2025highlightarXiv:2507.09118
#16542

Registration beyond Points: General Affine Subspace Alignment via Geodesic Distance on Grassmann Manifold

Jaeho Shin, Hyeonjae Gil, Junwoo Jang et al.

ICCV 2025highlightarXiv:2507.17998
#16543

Category-Specific Selective Feature Enhancement for Long-Tailed Multi-Label Image Classification

Ruiqi Du, Xu Tang, Xiangrong Zhang et al.

ICCV 2025
#16544

A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields

Aoxiang Fan, Corentin Dumery, Nicolas Talabot et al.

ICCV 2025arXiv:2507.04408
#16545

FedDifRC: Unlocking the Potential of Text-to-Image Diffusion Models in Heterogeneous Federated Learning

Huan Wang, Haoran Li, Huaming Chen et al.

ICCV 2025arXiv:2507.06482
#16546

Lark: Low-Rank Updates After Knowledge Localization for Few-shot Class-Incremental Learning

Jinxin Shi, Jiabao Zhao, Yifan Yang et al.

ICCV 2025
#16547

A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds

Jizong Peng, Tze Ho Elden Tse, Kai Xu et al.

ICCV 2025highlightarXiv:2504.09129
#16548

Conditional Visual Autoregressive Modeling for Pathological Image Restoration

Ziyi Liu, Zhe Xu, Jiabo MA et al.

ICCV 2025
#16549

EYE3:Turn Anything into Naked-eye 3D

Yingde Song, Zongyuan Yang, Baolin Liu et al.

ICCV 2025
#16550

C2MIL: Synchronizing Semantic and Topological Causalities in Multiple Instance Learning for Robust and Interpretable Survival Analysis

Min Cen, Zhenfeng Zhuang, Yuzhe Zhang et al.

ICCV 2025
#16551

High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation

Runyang Feng, Hyung Jin Chang, Tze Ho Elden Tse et al.

ICCV 2025arXiv:2510.11017
#16552

Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Observation Delays

Songchen Fu, Siang Chen, Shaojing Zhao et al.

NEURIPS 2025
#16553

Hierarchical Divide-and-Conquer Grouping for Classification Adaptation of Pre-Trained Models

Ziqian Lu, Yunlong Yu, Qinyue Tong et al.

ICCV 2025
#16554

Revolutionizing Graph Aggregation: From Suppression to Amplification via BoostGCN

Jiaxin Wu, Chenglong Pang, Guangxiong Chen et al.

NEURIPS 2025
#16555

TryOn-Refiner: Conditional Rectified-flow-based TryOn Refiner for More Accurate Detail Reconstruction

Wen Qian

ICCV 2025
#16556

Mitigating Catastrophic Overfitting in Fast Adversarial Training via Label Information Elimination

Chao Pan, Ke Tang, Li Qing et al.

ICCV 2025
#16557

Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos

Yuang Feng, Shuyong Gao, Fuzhen Yan et al.

ICCV 2025arXiv:2503.17050
#16558

On the Complexity-Faithfulness Trade-off of Gradient-Based Explanations

Amir Mehrpanah, Matteo Gamba, Kevin Smith et al.

ICCV 2025arXiv:2508.10490
#16559

LA-MOTR: End-to-End Multi-Object Tracking by Learnable Association

Peng Wang, Yongcai Wang, Hualong Cao et al.

ICCV 2025
#16560

TransiT: Transient Transformer for Non-line-of-sight Videography

Ruiqian Li, Siyuan Shen, Suan Xia et al.

ICCV 2025arXiv:2503.11328
#16561

Spectral Sensitivity Estimation with an Uncalibrated Diffraction Grating

Lilika Makabe, Hiroaki Santo, Fumio Okura et al.

ICCV 2025arXiv:2508.00330
#16562

Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories

Jingqiao Xiu, Yicong Li, Na Zhao et al.

ICCV 2025
#16563

Motion-2-to-3: Leveraging 2D Motion Data for 3D Motion Generations

Ruoxi Guo, Huaijin Pi, Zehong Shen et al.

ICCV 2025
#16564

FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation

Wenbin Teng, Gonglin Chen, Haiwei Chen et al.

ICCV 2025arXiv:2508.06392
#16565

PVMamba: Parallelizing Vision Mamba via Dynamic State Aggregation

Fei Xie, Zhongdao Wang, Weijia Zhang et al.

ICCV 2025
#16566

CoralSRT: Revisiting Coral Reef Semantic Segmentation by Feature Rectifying via Self-supervised Guidance

Zheng Ziqiang, Wong Kwan, Binh-Son Hua et al.

ICCV 2025
#16567

Diagnosing Pretrained Models for Out-of-distribution Detection

Haipeng Xiong, Kai Xu, Angela Yao

ICCV 2025
#16568

CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers

Jiaqi Han, Haotian Ye, Puheng Li et al.

ICCV 2025arXiv:2507.15260
#16569

Adversarial Training for Probabilistic Robustness

YI ZHANG, Yuhang Chen, Zhen Chen et al.

ICCV 2025
#16570

Learning to See Inside Opaque Liquid Containers using Speckle Vibrometry

Matan Kichler, Shai Bagon, Mark Sheinin

ICCV 2025arXiv:2507.20757
#16571

Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities

Yiyuan Zhang, Handong Li, Jing Liu et al.

ICCV 2025
#16572

LightBSR: Towards Lightweight Blind Super-Resolution via Discriminative Implicit Degradation Representation Learning

Jiang Yuan, ji ma, Bo Wang et al.

ICCV 2025arXiv:2506.22710
#16573

When Pixel Difference Patterns Meet ViT: PiDiViT for Few-Shot Object Detection

Hongliang Zhou, Yongxiang Liu, Canyu Mo et al.

ICCV 2025
#16574

Integrating Biological Knowledge for Robust Microscopy Image Profiling on De Novo Cell Lines

Jiayuan Chen, Thai-Hoang Pham, Yuanlong Wang et al.

ICCV 2025highlightarXiv:2507.10737
#16575

Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering

Qing Li, Huifang Feng, Xun Gong et al.

ICCV 2025arXiv:2507.03394
#16576

Keep Your Friends Close, and Your Enemies Farther: Distance-aware Voxel-wise Contrastive Learning for Semi-supervised Multi-organ Segmentation

Haochen Zhao, Jianwei Niu, Xuefeng Liu et al.

ICCV 2025
#16577

Bayesian-Inspired Space-Time Superpixels

Kent Gauen, Stanley Chan

ICCV 2025
#16578

Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures

Xinlong Ding, Hongwei Yu, Jiawei Li et al.

ICCV 2025highlightarXiv:2507.10265
#16579

INSTINCT: Instance-Level Interaction Architecture for Query-Based Collaborative Perception

yunjiang xu, Yupeng Ouyang, Lingzhi Li et al.

ICCV 2025arXiv:2509.23700
#16580

SPD: Shallow Backdoor Protecting Deep Backdoor Against Backdoor Detection

Shunjie Yuan, Xinghua Li, Xuelin Cao et al.

ICCV 2025
#16581

Rethinking DPO-style Diffusion Aligning Frameworks

XUN WU, Shaohan Huang, Lingjie Jiang et al.

ICCV 2025highlight
#16582

Debiased Curriculum Adaptation for Safe Transfer Learning in Chest X-ray Classification

Mingyang Liu, Xinyang Chen, Yang Shu et al.

ICCV 2025
#16583

End-to-End Entity-Predicate Association Reasoning for Dynamic Scene Graph Generation

LiWei Wang, YanDuo Zhang, Tao Lu et al.

ICCV 2025
#16584

Ensemble Foreground Management for Unsupervised Object Discovery

Ziling Wu, Armaghan Moemeni, Praminda Caleb-Solly

ICCV 2025highlightarXiv:2507.20860
#16585

Forensic-MoE: Exploring Comprehensive Synthetic Image Detection Traces with Mixture of Experts

Mingqi Fang, Ziguang Li, Lingyun Yu et al.

ICCV 2025
#16586

Adaptive Learning of High-Value Regions for Semi-Supervised Medical Image Segmentation

Tao Lei, Ziyao Yang, Xingwu wang et al.

ICCV 2025
#16587

Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment

Renye Yan, Jikang Cheng, Yaozhong Gan et al.

ICCV 2025
#16588

MA-CIR: A Multimodal Arithmetic Benchmark for Composed Image Retrieval

Jaeseok Byun, Young Kyun Jang, Seokhyeon Jeong et al.

ICCV 2025
#16589

Leveraging Panoptic Scene Graph for Evaluating Fine-Grained Text-to-Image Generation

Xueqing Deng, Linjie Yang, Qihang Yu et al.

ICCV 2025
#16590

Physical Degradation Model-Guided Interferometric Hyperspectral Reconstruction with Unfolding Transformer

Yuansheng Li, Yunhao Zou, Linwei Chen et al.

ICCV 2025arXiv:2506.21880
#16591

VPR-Cloak: A First Look at Privacy Cloak Against Visual Place Recognition

Shuting Dong, Mingzhi Chen, Feng Lu et al.

ICCV 2025
#16592

Hierarchical Variational Test-Time Prompt Generation for Zero-Shot Generalization

Zhaoyang Wu, Fang Liu, Licheng Jiao et al.

ICCV 2025
#16593

CO2-Net: A Physics-Informed Spatio-Temporal Model for Global Surface CO2 Reconstruction

Hao Zheng, Yuting Zheng, Hanbo Huang et al.

ICCV 2025
#16594

HOMO-Feature: Cross-Arbitrary-Modal Image Matching with Homomorphism of Organized Major Orientation

Chenzhong Gao, Wei Li, Desheng Weng

ICCV 2025
#16595

OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS

Han Ling, Yinghui Sun, Xian Xu et al.

ICCV 2025arXiv:2508.01239
#16596

GSOT3D: Towards Generic 3D Single Object Tracking in the Wild

Yifan Jiao, Yunhao Li, Junhua Ding et al.

ICCV 2025arXiv:2412.02129
#16597

Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers

Yunshan Zhong, Yuyao Zhou, Yuxin Zhang et al.

ICCV 2025arXiv:2412.16553
#16598

Guiding Diffusion Models with Adaptive Negative Sampling Without External Resources

Alakh Desai, Nuno Vasconcelos

ICCV 2025
#16599

WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image

Jiwoo Park, Tae Choi, Youngjun Jun et al.

ICCV 2025arXiv:2506.23518
#16600

PersonaCraft: Personalized and Controllable Full-Body Multi-Human Scene Generation Using Occlusion-Aware 3D-Conditioned Diffusion

Gwanghyun Kim, Suh Jeon Jeon, Seunggyu Lee et al.

ICCV 2025arXiv:2411.18068