Most Cited 2025 Poster Papers

22,274 papers found • Page 84 of 112

#16601

CaliMatch: Adaptive Calibration for Improving Safe Semi-supervised Learning

Jinsoo Bae, Seoung Bum Kim, Hyungrok Do

ICCV 2025arXiv:2508.00922
#16602

Harnessing Input-Adaptive Inference for Efficient VLN

Dongwoo Kang, Akhil Perincherry, Zachary Coalson et al.

ICCV 2025arXiv:2508.09262
#16603

Rethinking the Upsampling Process in Light Field Super-Resolution with Spatial-Epipolar Implicit Image Function

Ruixuan Cong, Yu Wang, Mingyuan Zhao et al.

ICCV 2025
#16604

Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables

Wontae Kim, Keuntek Lee, Nam Ik Cho

ICCV 2025arXiv:2508.16121
#16605

EventUPS: Uncalibrated Photometric Stereo Using an Event Camera

Jinxiu Liang, Bohan Yu, Siqi Yang et al.

ICCV 2025highlight
#16606

Neural Multi-View Self-Calibrated Photometric Stereo without Photometric Stereo Cues

Xu Cao, Takafumi Taketomi

ICCV 2025arXiv:2507.23162
#16607

RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation

Junwen Huang, Shishir Reddy Vutukur, Peter Yu et al.

ICCV 2025arXiv:2510.18521
#16608

Tensor-aggregated LoRA in Federated Fine-tuning

Zhixuan Li, Binqian Xu, Xiangbo Shu et al.

ICCV 2025
#16609

Less is More: Empowering GUI Agent with Context-Aware Simplification

Gongwei Chen, Xurui Zhou, Rui Shao et al.

ICCV 2025highlightarXiv:2507.03730
#16610

Generalized Deep Multi-view Clustering via Causal Learning with Partially Aligned Cross-view Correspondence

Xihong Yang, Siwei Wang, Jiaqi Jin et al.

ICCV 2025arXiv:2509.16022
#16611

Language-Driven Multi-Label Zero-Shot Learning with Semantic Granularity

Shouwen Wang, Qian Wan, Junbin Gao et al.

ICCV 2025
#16612

Backdooring Self-Supervised Contrastive Learning by Noisy Alignment

Tuo Chen, Jie Gui, Minjing Dong et al.

ICCV 2025arXiv:2508.14015
#16613

CounterPC: Counterfactual Feature Realignment for Unsupervised Domain Adaptation on Point Clouds

Feng Yang, Yichao Cao, Xiu Su et al.

ICCV 2025highlight
#16614

Robust Dataset Condensation using Supervised Contrastive Learning

Nicole Kim, Hwanjun Song

ICCV 2025
#16615

Liberated-GS: 3D Gaussian Splatting Independent from SfM Point Clouds

Weihong Pan, Xiaoyu Zhang, Hongjia Zhai et al.

ICCV 2025
#16616

Unlocking the Potential of Diffusion Priors in Blind Face Restoration

Yunqi Miao, Zhiyu Qu, Mingqi Gao et al.

ICCV 2025arXiv:2508.08556
#16617

Instruction-Grounded Visual Projectors for Continual Learning of Generative Vision-Language Models

Hyundong Jin, Hyung Jin Chang, Eunwoo Kim

ICCV 2025arXiv:2508.00260
#16618

AccidentalGS: 3D Gaussian Splatting from Accidental Camera Motion

Mao Mao, Xujie Shen, Guyuan Chen et al.

ICCV 2025
#16619

Event-boosted Deformable 3D Gaussians for Dynamic Scene Reconstruction

Wenhao Xu, Wenming Weng, Yueyi Zhang et al.

ICCV 2025arXiv:2411.16180
#16620

Learning Separable Fine-Grained Representation via Dendrogram Construction from Coarse Labels for Fine-grained Visual Recognition

Guanghui Shi, Xuefeng liang, Wenjie Li et al.

ICCV 2025
#16621

Hyper-Depth: Hypergraph-based Multi-Scale Representation Fusion for Monocular Depth Estimation

Lin Bie, Siqi Li, Yifan Feng et al.

ICCV 2025
#16622

STaR: Seamless Spatial-Temporal Aware Motion Retargeting with Penetration and Consistency Constraints

Xiaohang Yang, Qing Wang, Jiahao Yang et al.

ICCV 2025arXiv:2504.06504
#16623

Unknown Text Learning for CLIP-based Few-Shot Open-set Recognition

Rui Ma, Qilong Wang, Bing Cao et al.

ICCV 2025
#16624

MRGen: Segmentation Data Engine For Underrepresented MRI Modalities

Haoning Wu, Ziheng Zhao, Ya Zhang et al.

ICCV 2025arXiv:2412.04106
#16625

One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models

Jiale Zhao, XINYANG JIANG, Junyao Gao et al.

ICCV 2025arXiv:2507.07709
#16626

MoFRR: Mixture of Diffusion Models for Face Retouching Restoration

Jiaxin Liu, Qichao Ying, Zhenxing Qian et al.

ICCV 2025arXiv:2507.19770
#16627

Adversarial Reconstruction Feedback for Robust Fine-grained Generalization

Shijie Wang, Jian Shi, Haojie Li

ICCV 2025arXiv:2507.21742
#16628

Unified Adversarial Augmentation for Improving Palmprint Recognition

Jianlong Jin, Chenglong Zhao, Ruixin Zhang et al.

ICCV 2025
#16629

Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios

Deng Li, Aming WU, Yang Li et al.

ICCV 2025arXiv:2506.24063
#16630

Uncover Treasures in DCT: Advancing JPEG Quality Enhancement by Exploiting Latent Correlations

jing Yang, Qunliang Xing, Mai Xu et al.

ICCV 2025arXiv:2506.21171
#16631

Unified Multi-Agent Trajectory Modeling with Masked Trajectory Diffusion

songru Yang, Zhenwei Shi, Zhengxia Zou

ICCV 2025
#16632

Enhancing Transferability of Targeted Adversarial Examples via Inverse Target Gradient Competition and Spatial Distance Stretching

Zhankai Li, Weiping Wang, jie li et al.

ICCV 2025
#16633

LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild

Jiaying Ying, Heming Du, Kaihao Zhang et al.

ICCV 2025
#16634

Images as Noisy Labels: Unleashing the Potential of the Diffusion Model for Open-Vocabulary Semantic Segmentation

Fan Li, Xuanbin Wang, Xuan Wang et al.

ICCV 2025highlight
#16635

ContextFace: Generating Facial Expressions from Emotional Contexts

minjung kim, Minsang Kim, Seung Jun Baek

ICCV 2025
#16636

SMP-Attack: Boosting the Transferability of Feature Importance-based Adversarial Attack with Semantics-aware Multi-granularity Patchout

Wen Yang, Guodong Liu, Di Ming

ICCV 2025
#16637

Spatial-Temporal Forgery Trace based Forgery Image Identification

Yilin Wang, Zunlei Feng, Jiachi Wang et al.

ICCV 2025
#16638

Towards Annotation-Free Evaluation: KPAScore for Human Keypoint Detection

Xiaoxiao Wang, Chunxiao Li, Peng Sun et al.

ICCV 2025
#16639

Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter

JianHui Zhang, Shen Cheng, Qirui Sun et al.

ICCV 2025arXiv:2510.13419
#16640

Agreement aware and dissimilarity oriented GLOM

Ru Zeng, Yan Song, Yang ZHANG et al.

ICCV 2025
#16641

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

Aoxiong Yin, Kai Shen, Yichong Leng et al.

ICCV 2025arXiv:2503.04606
#16642

Bridging Class Imbalance and Partial Labeling via Spectral-Balanced Energy Propagation for Skeleton-based Action Recognition

Yandan Wang, Chenqi Guo, Yinglong Ma et al.

ICCV 2025
#16643

MeasureXpert: Automatic Anthropometric Measurement Extraction from Two Unregistered, Partial, Posed, and Dressed Body Scans

Ran Zhao, Xinxin Dai, Pengpeng Hu et al.

ICCV 2025
#16644

ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting

Sandro Papais, Letian Wang, Brian Cheong et al.

ICCV 2025arXiv:2508.07089
#16645

PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning

Muhammad Anwar Ma'sum, Mahardhika Pratama, Savitha Ramasamy et al.

ICCV 2025arXiv:2507.12305
#16646

Dual Domain Control via Active Learning for Remote Sensing Domain Incremental Object Detection

Jiachen Sun, De Cheng, Xi Yang et al.

ICCV 2025
#16647

SUV: Suppressing Undesired Video Content via Semantic Modulation Based on Text Embeddings

Xiang Lv, Mingwen Shao, Lingzhuang Meng et al.

ICCV 2025
#16648

Enpowering Your Pansharpening Models with Generalizability: Unified Distribution is All You Need

Yongchuan Cui, Peng Liu, HUI ZHANG

ICCV 2025arXiv:2510.22217
#16649

From Sharp to Blur: Unsupervised Domain Adaptation for 2D Human Pose Estimation Under Extreme Motion Blur Using Event Cameras

Youngho Kim, Hoonhee Cho, Kuk-Jin Yoon

ICCV 2025arXiv:2507.22438
#16650

LLM Thought Divergence and Convergence for Dialogue-Based Image Generation Control

Hui Li

ICCV 2025
#16651

MemDistill: Distilling LiDAR Knowledge into Memory for Camera-Only 3D Object Detection

Donghyeon Kwon, Youngseok Yoon, Hyeongseok Son et al.

ICCV 2025
#16652

Cooperative Pseudo Labeling for Unsupervised Federated Classification

Kuangpu Guo, Lijun Sheng, Yongcan Yu et al.

ICCV 2025arXiv:2510.10100
#16653

MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models

Vittorio Pipoli, Alessia Saporita, Federico Bolelli et al.

ICCV 2025
#16654

CoStoDet-DDPM: Collaborative Training of Stochastic and Deterministic Models Improves Surgical Workflow Anticipation and Recognition

Kaixiang Yang, Xin Li, Qiang Li et al.

ICCV 2025arXiv:2503.10216
#16655

Exploring Weather-aware Aggregation and Adaptation for Semantic Segmentation under Adverse Conditions

Yuwen Pan, Rui Sun, Wangkai Li et al.

ICCV 2025
#16656

Factorized Learning for Temporally Grounded Video-Language Models

Wenzheng Zeng, Difei Gao, Mike Zheng Shou et al.

ICCV 2025arXiv:2512.24097
#16657

Unsupervised RGB-D Point Cloud Registration for Scenes with Low Overlap and Photometric Inconsistency

yejun Shou, Haocheng Wang, Lingfeng Shen et al.

ICCV 2025
#16658

DISTIL: Data-Free Inversion of Suspicious Trojan Inputs via Latent Diffusion

Hossein Mirzaei, Zeinab Taghavi, Sepehr Rezaee et al.

ICCV 2025arXiv:2507.22813
#16659

TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision

Ayush Gupta, Anirban Roy, Rama Chellappa et al.

ICCV 2025arXiv:2506.09445
#16660

DynFaceRestore: Balancing Fidelity and Quality in Diffusion-Guided Blind Face Restoration with Dynamic Blur-Level Mapping and Guidance

Huu Phu Do, Yu-Wei Chen, Yi-Cheng Liao et al.

ICCV 2025highlightarXiv:2507.13797
#16661

Gradient-Reweighted Adversarial Camouflage for Physical Object Detection Evasion

Jiawei Liang, Siyuan Liang, Tianrui Lou et al.

ICCV 2025
#16662

Training-free Geometric Image Editing on Diffusion Models

Hanshen Zhu, Zhen Zhu, Kaile Zhang et al.

ICCV 2025arXiv:2507.23300
#16663

ART: Adaptive Relation Tuning for Generalized Relation Prediction

Gopika Sudhakaran, Hikaru Shindo, Patrick Schramowski et al.

ICCV 2025arXiv:2507.23543
#16664

Sparsity Outperforms Low-Rank Projections in Few-Shot Adaptation

Nairouz Mrabah, Nicolas Richet, Ismail Ayed et al.

ICCV 2025arXiv:2504.12436
#16665

WINS: Winograd Structured Pruning for Fast Winograd Convolution

Cheonjun Park, Hyunjae Oh, Mincheol Park et al.

ICCV 2025highlight
#16666

MixA: A Mixed Attention approach with Stable Lightweight Linear Attention to enhance Efficiency of Vision Transformers at the Edge

Sabbir Ahmed, Jingtao Li, Weiming Zhuang et al.

ICCV 2025
#16667

Transparent Vision: A Theory of Hierarchical Invariant Representations

Shuren Qi, Yushu Zhang, CHAO WANG et al.

ICCV 2025
#16668

DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic

Munish Monga, Vishal Chudasama, Pankaj Wasnik et al.

ICCV 2025arXiv:2506.21260
#16669

RetinexMCNet: A Memory Controller Dominated Network for Low-Light Video Enhancement Based on Retinex

Meiao Wang, Xuejing Kang, Yaxi Lu et al.

ICCV 2025
#16670

Sliced Wasserstein Bridge for Open-Vocabulary Video Instance Segmentation

Zheyun Qin, Deng Yu, Chuanchen Luo et al.

ICCV 2025highlight
#16671

Efficient Event Camera Data Pretraining with Adaptive Prompt Fusion

Quanmin Liang, Qiang Li, Shuai Liu et al.

ICCV 2025
#16672

Lightweight Gradient-Aware Upscaling of 3D Gaussian Splatting Images

Simon Niedermayr, Christoph Neuhauser, Rüdiger Westermann

ICCV 2025arXiv:2503.14171
#16673

SAM Encoder Breach by Adversarial Simplicial Complex Triggers Downstream Model Failures

Yi Qin, Rui Wang, Tao Huang et al.

ICCV 2025arXiv:2508.06127
#16674

3D Gaussian Splatting Driven Multi-View Robust Physical Adversarial Camouflage Generation

Tianrui Lou, Xiaojun Jia, Siyuan Liang et al.

ICCV 2025arXiv:2507.01367
#16675

Head2Body: Body Pose Generation from Multi-sensory Head-mounted Inputs

Minh Tran, Hongda Mao, Qingshuang Chen et al.

ICCV 2025
#16676

Gaze-Language Alignment for Zero-Shot Prediction of Visual Search Targets from Human Gaze Scanpaths

Sounak Mondal, Naveen Sendhilnathan, Ting Zhang et al.

ICCV 2025
#16677

Looking in the Mirror: A Faithful Counterfactual Explanation Method for Interpreting Deep Image Classification Models

Townim Chowdhury, Vu Phan, Kewen Liao et al.

ICCV 2025arXiv:2509.16822
#16678

FLSeg: Enhancing Privacy and Robustness in Federated Learning under Heterogeneous Data via Model Segmentation

Zichun Su, Zhi Lu, Yutong Wu et al.

ICCV 2025
#16679

Self-Calibrating Gaussian Splatting for Large Field-of-View Reconstruction

Youming Deng, Wenqi Xian, Guandao Yang et al.

ICCV 2025highlight
#16680

LGA-Net: Learning Local and Global Affinities for Sparse Scribble based Image Colorization

Hongjin Lyu, Bo Li, Paul Rosin et al.

ICCV 2025
#16681

Gradient Decomposition and Alignment for Incremental Object Detection

Wenlong Luo, Shizhou Zhang, De Cheng et al.

ICCV 2025
#16682

PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency

Haotian Wang, Aoran Xiao, Xiaoqin Zhang et al.

ICCV 2025arXiv:2507.07374
#16683

MSQ: Memory-Efficient Bit Sparsification Quantization

Seokho Han, Seoyeon Yoon, Jinhee Kim et al.

ICCV 2025arXiv:2507.22349
#16684

SuMa: A Subspace Mapping Approach for Robust and Effective Concept Erasure in Text-to-Image Diffusion Models

Kien Nguyen, Anh Tran, Cuong Pham

ICCV 2025arXiv:2509.05625
#16685

ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization

Yuanhe Guo, Linxi Xie, Zhuoran Chen et al.

ICCV 2025arXiv:2510.18433
#16686

Recovering Parametric Scenes from Very Few Time-of-Flight Pixels

Carter Sifferman, Yiquan Li, Yiming Li et al.

ICCV 2025arXiv:2509.16132
#16687

Learning Visual Proxy for Compositional Zero-Shot Learning

Shiyu Zhang, Cheng Yan, Yang Liu et al.

ICCV 2025arXiv:2501.13859
#16688

MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video Understanding

Tongtong Cheng, Rongzhen Li, Yixin Xiong et al.

ICCV 2025arXiv:2507.06072
#16689

When and Where do Data Poisons Attack Textual Inversion?

Jeremy Styborski, Mingzhi Lyu, Jiayou Lu et al.

ICCV 2025arXiv:2507.10578
#16690

Clink! Chop! Thud! - Learning Object Sounds from Real-World Interactions

Mengyu Yang, Yiming Chen, Haozheng Pei et al.

ICCV 2025arXiv:2510.02313
#16691

Rethinking Few Shot CLIP Benchmarks: A Critical Analysis in the Inductive Setting

Alexey Kravets, Da Chen, Vinay Namboodiri

ICCV 2025arXiv:2507.20834
#16692

Discovering Divergent Representations between Text-to-Image Models

Lisa Dunlap, Trevor Darrell, Joseph Gonzalez et al.

ICCV 2025arXiv:2509.08940
#16693

Engage for All: Making Ordinary Image Descriptions Appealing Again!

Yuyan Chen, Yifan Jiang, Li Zhou et al.

ICCV 2025
#16694

AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation

Hao Li, Ju Dai, Feng Zhou et al.

ICCV 2025arXiv:2507.12001
#16695

Understanding Personal Concept in Open-Vocabulary Semantic Segmentation

Sunghyun Park, Jungsoo Lee, Shubhankar Borse et al.

ICCV 2025arXiv:2507.11030
#16696

Geometry Distributions

Biao Zhang, Jing Ren, Peter Wonka

ICCV 2025highlightarXiv:2411.16076
#16697

Trial-Oriented Visual Rearrangement

Yuyi Liu, Xinhang Song, Tianliang Qi et al.

ICCV 2025
#16698

Debiased Teacher for Day-to-Night Domain Adaptive Object Detection

Yiming Cui, Liang Li, Haibing YIN et al.

ICCV 2025
#16699

Towards Effective Foundation Model Adaptation for Extreme Cross-Domain Few-Shot Learning

Fei Zhou, Peng Wang, Lei Zhang et al.

ICCV 2025
#16700

Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval

Zhe Li, Lei Zhang, Zheren Fu et al.

ICCV 2025
#16701

FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling

qiusheng huang, Xiaohui Zhong, Xu Fan et al.

ICCV 2025highlightarXiv:2503.19940
#16702

UPP: Unified Point-Level Prompting for Robust Point Cloud Analysis

Zixiang Ai, Zhenyu Cui, Yuxin Peng et al.

ICCV 2025arXiv:2507.18997
#16703

Visual Interestingness Decoded: How GPT-4o Mirrors Human Interests

Fitim Abdullahu, Helmut Grabner

ICCV 2025arXiv:2510.13316
#16704

Towards Performance Consistency in Multi-Level Model Collaboration

Qi Li, Runpeng Yu, Xinchao Wang

ICCV 2025
#16705

Probabilistic Inertial Poser (ProbIP): Uncertainty-aware Human Motion Modeling from Sparse Inertial Sensors

Min Kim, Younho Jeon, Sungho Jo

ICCV 2025
#16706

SFUOD: Source-Free Unknown Object Detection

Keon-Hee Park, Seun-An Choe, Gyeong-Moon Park

ICCV 2025arXiv:2507.17373
#16707

ConstStyle: Robust Domain Generalization with Unified Style Transformation

Nam Duong Tran, Nam Nguyen Phuong, Hieu Pham et al.

ICCV 2025arXiv:2509.05975
#16708

SummDiff: Generative Modeling of Video Summarization with Diffusion

Kwanseok Kim, Jaehoon Hahm, Sumin Kim et al.

ICCV 2025highlightarXiv:2510.08458
#16709

RALoc: Enhancing Outdoor LiDAR Localization via Rotation Awareness

Yuyang Yang, Wen Li, Sheng Ao et al.

ICCV 2025highlight
#16710

ConsistentCity: Semantic Flow-guided Occupancy DiT for Temporally Consistent Driving Scene Synthesis

Benjin Zhu, Xiaogang Wang, Hongsheng Li

ICCV 2025
#16711

CLOT: Closed Loop Optimal Transport for Unsupervised Action Segmentation

Elena Bueno-Benito, Mariella Dimiccoli

ICCV 2025arXiv:2507.03539
#16712

Dual-Temporal Exemplar Representation Network for Video Semantic Segmentation

Xiaolong Xu, Lei Zhang, Jiayi Li et al.

ICCV 2025
#16713

Progressive Distribution Bridging: Unsupervised Adaptation for Large-scale Pre-trained Models via Adaptive Auxiliary Data

Weinan He, Yixin Zhang, Zilei Wang

ICCV 2025
#16714

Imbalance in Balance: Online Concept Balancing in Generation Models

Yukai Shi, Jiarong Ou, Rui Chen et al.

ICCV 2025arXiv:2507.13345
#16715

Vision-Language Interactive Relation Mining for Open-Vocabulary Scene Graph Generation

Yukuan Min, Muli Yang, Jinhao Zhang et al.

ICCV 2025
#16716

OrderChain: Towards General Instruct-Tuning for Stimulating the Ordinal Understanding Ability of MLLM

Jinhong Wang, Shuo Tong, Jintai CHEN et al.

ICCV 2025arXiv:2504.04801
#16717

Unified Open-World Segmentation with Multi-Modal Prompts

Yang Liu, Yufei Yin, Chenchen Jing et al.

ICCV 2025arXiv:2510.10524
#16718

PASG: A Closed-Loop Framework for Automated Geometric Primitive Extraction and Semantic Anchoring in Robotic Manipulation

Zhihao ZHU, Yifan Zheng, Siyu Pan et al.

ICCV 2025arXiv:2508.05976
#16719

MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation

Prerit Gupta, Jason Alexander Fotso-Puepi, Zhengyuan Li et al.

ICCV 2025arXiv:2508.16911
#16720

LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds

Lingteng Qiu, Xiaodong Gu, Peihao Li et al.

ICCV 2025
#16721

KOEnsAttack: Towards Efficient Data-Free Black-Box Adversarial Attacks via Knowledge-Orthogonalized Substitute Ensembles

Chaoyong Yang, Jia-Li Yin, Bin Chen et al.

ICCV 2025
#16722

RogSplat: Robust Gaussian Splatting via Generative Priors

Hanyang Kong, Xingyi Yang, Xinchao Wang

ICCV 2025
#16723

Not Only Vision: Evolve Visual Speech Recognition via Peripheral Information

Zhaoxin Yuan, Shuang Yang, Shiguang Shan et al.

ICCV 2025
#16724

FedAGC: Federated Continual Learning with Asymmetric Gradient Correction

Chengchao Zhang, Fanhua Shang, Hongying Liu et al.

ICCV 2025
#16725

Joint Learning of Pose Regression and Denoising Diffusion with Score Scaling Sampling for Category-level 6D Pose Estimation

Seunghyun Lee, Tae-Kyun Kim

ICCV 2025arXiv:2510.04125
#16726

Intra-modal and Cross-modal Synchronization for Audio-visual Deepfake Detection and Temporal Localization

Ashutosh Anshul, Shreyas Gopal, Deepu Rajan et al.

ICCV 2025
#16727

Enhancing Adversarial Transferability by Balancing Exploration and Exploitation with Gradient-Guided Sampling

Zenghao Niu, Weicheng Xie, Siyang Song et al.

ICCV 2025arXiv:2511.00411
#16728

Any-SSR: How Recursive Least Squares Works in Continual Learning of Large Language Model

Kai Tong, Kang Pan, Xiao Zhang et al.

ICCV 2025
#16729

MinCD-PnP: Learning 2D-3D Correspondences with Approximate Blind PnP

Pei An, Jiaqi Yang, Muyao Peng et al.

ICCV 2025arXiv:2507.15257
#16730

Federated Representation Angle Learning

Liping Yi, Han Yu, Gang Wang et al.

ICCV 2025
#16731

GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View Localization

Shaowen Tong, Zimin Xia, Alexandre Alahi et al.

ICCV 2025arXiv:2507.10935
#16732

GauUpdate: New Object Insertion in 3D Gaussian Fields with Consistent Global Illumination

Chengwei REN, Fan Zhang, Liangchao Xu et al.

ICCV 2025
#16733

Diffusion-based Source-biased Model for Single Domain Generalized Object Detection

Han Jiang, Wenfei Yang, Tianzhu Zhang et al.

ICCV 2025
#16734

Enhanced Event-based Dense Stereo via Cross-Sensor Knowledge Distillation

Haihao Zhang, Yunjian Zhang, Jianing Li et al.

ICCV 2025
#16735

Your Text Encoder Can Be An Object-Level Watermarking Controller

Naresh Kumar Devulapally, Mingzhen Huang, Vishal Asnani et al.

ICCV 2025arXiv:2503.11945
#16736

Music Grounding by Short Video

Zijie Xin, Minquan Wang, Jingyu Liu et al.

ICCV 2025arXiv:2408.16990
#16737

Enhanced Pansharpening via Quaternion Spatial-Spectral Interactions

Dong Li, Chunhui Luo, Yuanfei Bao et al.

ICCV 2025
#16738

Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing

Yongxin Guo, Lin Wang, Xiaoying Tang et al.

ICCV 2025arXiv:2405.16233
#16739

Scaling and Taming Adversarial Training with Synthetic Data

Juntao Wu, Xianting Huang, Yu Chen et al.

ICCV 2025
#16740

Instance-Level Video Depth in Groups Beyond Occlusions

Yuan Liang, Yang Zhou, Ziming Sun et al.

ICCV 2025
#16741

Flow Stochastic Segmentation Networks

Fabio De Sousa Ribeiro, Omar Todd, Charles Jones et al.

ICCV 2025arXiv:2507.18838
#16742

DisCoPatch: Taming Adversarially-driven Batch Statistics for Improved Out-of-Distribution Detection

Francisco Caetano, Christiaan Viviers, Luis Zavala-Mondragón et al.

ICCV 2025arXiv:2501.08005
#16743

Future-Aware Interaction Network For Motion Forecasting

Shijie Li, Chunyu Liu, Xun Xu et al.

ICCV 2025arXiv:2503.06565
#16744

Resolving Token-Space Gradient Conflicts: Token Space Manipulation for Transformer-Based Multi-Task Learning

Wooseong Jeong, Kuk-Jin Yoon

ICCV 2025arXiv:2507.07485
#16745

From Gaze to Movement: Predicting Visual Attention for Autonomous Driving Human-Machine Interaction based on Programmatic Imitation Learning

Yexin Huang, Yongbin Lin, Lishengsa Yue et al.

ICCV 2025
#16746

Intervening in Black Box: Concept Bottleneck Model for Enhancing Human Neural Network Mutual Understanding

Nuoye Xiong, Anqi Dong, Ning Wang et al.

ICCV 2025arXiv:2506.22803
#16747

DreamCube: RGB-D Panorama Generation via Multi-plane Synchronization

Yukun Huang, Yanning Zhou, Jianan Wang et al.

ICCV 2025
#16748

From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning

Sen Wang, Shao Zeng, Tianjun Gu et al.

ICCV 2025arXiv:2507.08380
#16749

MVTrajecter: Multi-View Pedestrian Tracking with Trajectory Motion Cost and Trajectory Appearance Cost

Taiga Yamane, Ryo Masumura, Satoshi Suzuki et al.

ICCV 2025arXiv:2509.01157
#16750

ScanEdit: Hierarchically-Guided Functional 3D Scan Editing

Mohamed El Amine Boudjoghra, Ivan Laptev, Angela Dai

ICCV 2025arXiv:2504.15049
#16751

Optical Model-Driven Sharpness Mapping for Autofocus in Small Depth-of-Field and Severe Defocus Scenarios

Chen-Liang Fan, Mingpei Cao, Chih-Chien Hung et al.

ICCV 2025
#16752

HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection

Fengzhe Zhou, Humphrey Shi

ICCV 2025
#16753

G2D: Boosting Multimodal Learning with Gradient-Guided Distillation

Mohammed Rakib, Arunkumar Bagavathi

ICCV 2025
#16754

Unified Video Generation via Next-Set Prediction in Continuous Domain

Zhanzhou Feng, Qingpei Guo, Xinyu Xiao et al.

ICCV 2025
#16755

Optimal Transport for Brain-Image Alignment: Unveiling Redundancy and Synergy in Neural Information Processing

Yang Xiao, Wang Lu, Jie Ji et al.

ICCV 2025arXiv:2503.10663
#16756

Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks

Jiawei Wang, Yushen Zuo, Yuanjun Chai et al.

ICCV 2025arXiv:2504.01308
#16757

Optimize Any Topology: A Foundation Model for Shape- and Resolution-Free Structural Topology Optimization

Amin Heyrani Nobari, Lyle Regenwetter, Cyril Picard et al.

NEURIPS 2025arXiv:2510.23667
#16758

Active Learning Meets Foundation Models: Fast Remote Sensing Data Annotation for Object Detection

Marvin Burges, Philipe Dias, Dalton Lunga et al.

ICCV 2025
#16759

Omni-scene Perception-oriented Point Cloud Geometry Enhancement for Coordinate Quantization

Wang Liu, Wei Gao

ICCV 2025
#16760

3BASiL: An Algorithmic Framework for Sparse plus Low-Rank Compression of LLMs

Mehdi Makni, Xiang Meng, Rahul Mazumder

NEURIPS 2025
#16761

Auto-Regressive Transformation for Image Alignment

Kanggeon Lee, Soochahn Lee, Kyoung Mu Lee

ICCV 2025arXiv:2505.04864
#16762

Training-Free Industrial Defect Generation with Diffusion Models

Ruyi Xu, Yen-Tzu Chiu, Tai-I Chen et al.

ICCV 2025
#16763

Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning

Zongyao Xue, Meina Kan, Shiguang Shan et al.

ICCV 2025
#16764

Zero-Shot Composed Image Retrieval via Dual-Stream Instruction-Aware Distillation

Wenliang Zhong, Rob Barton, Weizhi An et al.

ICCV 2025
#16765

More effort is needed to protect pedestrian privacy in the era of AI

Xingchen Zhang, Zixian Zhao

NEURIPS 2025oral
#16766

Test-Time Adaptive Object Detection with Foundation Model

Yingjie Gao, Yanan Zhang, Zhi Cai et al.

NEURIPS 2025arXiv:2510.25175
#16767

PolypSense3D: A Multi-Source Benchmark Dataset for Depth-Aware Polyp Size Measurement in Endoscopy

Ruyu Liu, Lin Wang, Zhou Mingming et al.

NEURIPS 2025
#16768

SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models

Stathis Galanakis, Alexandros Lattas, Stylianos Moschoglou et al.

ICCV 2025arXiv:2504.10716
#16769

Reconstructing Heterogeneous Biomolecules via Hierarchical Gaussian Mixtures and Part Discovery

Shayan Shekarforoush, David Lindell, Marcus Brubaker et al.

NEURIPS 2025arXiv:2506.09063
#16770

Disentangling misreporting from genuine adaptation in strategic settings: a causal approach

Dylan Zapzalka, Trenton Chang, Lindsay Warrenburg et al.

NEURIPS 2025
#16771

Connectome-Based Modelling Reveals Orientation Maps in the Drosophila Optic Lobe

Jia Nuo Liew, Shenghan Lin, Bowen Chen et al.

NEURIPS 2025
#16772

Online Multi-Class Selection with Group Fairness Guarantee

Faraz Zargari, Hossein Jazi, Lyndon Hallett et al.

NEURIPS 2025arXiv:2510.21055
#16773

Beyond Scalar Rewards: An Axiomatic Framework for Lexicographic MDPs

Mehran Shakerinava, Siamak Ravanbakhsh, Adam Oberman

NEURIPS 2025spotlightarXiv:2505.12049
#16774

Orthogonal Contrastive Learning for Multi-Representation fMRI Analysis

Tony Yousefnezhad

NEURIPS 2025oral
#16775

Near-Optimal Experiment Design in Linear non-Gaussian Cyclic Models

Ehsan Sharifian, Saber Salehkaleybar, Negar Kiyavash

NEURIPS 2025spotlightarXiv:2509.21423
#16776

Localized Data Shapley: Accelerating Valuation for Nearest Neighbor Algorithms

Guangyi Zhang, Yanhao Wang, Chengliang Chai et al.

NEURIPS 2025
#16777

UniDomain: Pretraining a Unified PDDL Domain from Real-World Demonstrations for Generalizable Robot Task Planning

Haoming Ye, Yunxiao Xiao, Cewu Lu et al.

NEURIPS 2025
#16778

No Object Is an Island: Enhancing 3D Semantic Segmentation Generalization with Diffusion Models

Fan Li, Xuan Wang, Xuanbin Wang et al.

NEURIPS 2025
#16779

AI Progress Should Be Measured by Capability-Per-Resource, Not Scale Alone: A Framework for Gradient-Guided Resource Allocation in LLMs

David McCoy, Yulun Wu, Zachary Butzin-Dozier

NEURIPS 2025arXiv:2511.01077
#16780

Don’t Give Up on Democratizing AI for the Wrong Reasons

Annette Zimmermann, Andrew Zeppa, Srijan Pandey et al.

NEURIPS 2025
#16781

Is Visual in-Context Learning for Compositional Medical Tasks within Reach?

Simon Reiß, Zdravko Marinov, Alexander Jaus et al.

ICCV 2025arXiv:2507.00868
#16782

InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling

Xiaoxue Chen, Bhargav Chandaka, Chih-Hao Lin et al.

ICCV 2025arXiv:2507.17613
#16783

SSRB: Direct Natural Language Querying to Massive Heterogeneous Semi-Structured Data

Xin Zhang, Mingxin Li, Yanzhao Zhang et al.

NEURIPS 2025
#16784

ChemX: A Collection of Chemistry Datasets for Benchmarking Automated Information Extraction

Anastasia Vepreva, Julia Razlivina, Mariia Eremeyeva et al.

NEURIPS 2025
#16785

UDC-VIT: A Real-World Video Dataset for Under-Display Cameras

Kyusu Ahn, JiSoo Kim, Sangik Lee et al.

ICCV 2025highlightarXiv:2501.18545
#16786

Task-Aware Prompt Gradient Projection for Parameter-Efficient Tuning Federated Class-Incremental Learning

Hualong Ke, Yachao Zhang, Jiangming Shi et al.

ICCV 2025
#16787

A Learning-Augmented Approach to Online Allocation Problems

Ilan Cohen, Debmalya Panigrahi

NEURIPS 2025
#16788

Reduction-based Pseudo-label Generation for Instance-dependent Partial Label Learning

Congyu Qiao, Ning Xu, Yihao Hu et al.

NEURIPS 2025arXiv:2410.20797
#16789

More Than Just Functional: LLM-as-a-Critique for Efficient Code Generation

Derui Zhu, Dingfan Chen, jinfu chen et al.

NEURIPS 2025
#16790

Memory-Augmented Potential Field Theory: A Framework for Adaptive Control in Non-Convex Domains

Dongzhe Zheng, Wenjie Mei

NEURIPS 2025arXiv:2509.19672
#16791

Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs

Wenjing Tang, Xinyu He, Yongxi Huang et al.

NEURIPS 2025arXiv:2506.02860
#16792

Learning to Plan Like the Human Brain via Visuospatial Perception and Semantic-Episodic Synergistic Decision-Making

Tianyuan Jia, Ziyu Li, Qing Li et al.

NEURIPS 2025
#16793

Dr. RAW: Towards General High-Level Vision from RAW with Efficient Task Conditioning

Wenjun Huang, Ziteng Cui, Yinqiang Zheng et al.

NEURIPS 2025
#16794

Predictable Scale (Part II) --- Farseer: A Refined Scaling Law in LLMs

Houyi Li, Wenzhen Zheng, Qiufeng Wang et al.

NEURIPS 2025spotlight
#16795

Cognitive Predictive Processing: A Human-inspired Framework for Adaptive Exploration in Open-World Reinforcement Learning

boheng liu, Ziyu Li, Chenghua Duan et al.

NEURIPS 2025
#16796

A unified framework for establishing the universal approximation of transformer-type architectures

Jingpu Cheng, Ting Lin, Zuowei Shen et al.

NEURIPS 2025arXiv:2506.23551
#16797

PlanarGS: High-Fidelity Indoor 3D Gaussian Splatting Guided by Vision-Language Planar Priors

Xirui Jin, Renbiao Jin, Boying Li et al.

NEURIPS 2025arXiv:2510.23930
#16798

Trust Region Reward Optimization and Proximal Inverse Reward Optimization Algorithm

Yang Chen, Menglin Zou, Jiaqi Zhang et al.

NEURIPS 2025arXiv:2509.23135
#16799

A Dynamic Learning Strategy for Dempster-Shafer Theory with Applications in Classification and Enhancement

Linlin Fan, Xingyu Liu, Mingliang Zhou et al.

NEURIPS 2025
#16800

RankSEG-RMA: An Efficient Segmentation Algorithm via Reciprocal Moment Approximation

Zixun Wang, Ben Dai

NEURIPS 2025arXiv:2510.15362