Most Cited 2025 "ventral stream selectivity" Papers


#21601

Gradient Extrapolation for Debiased Representation Learning

Ihab Asaad, Maha Shadaydeh, Joachim Denzler

ICCV 2025 • poster • arXiv:2503.13236
#21602

World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model

Yupeng Zheng, Pengxuan Yang, Zebin Xing et al.

ICCV 2025 • poster • arXiv:2507.00603
#21603

Customizing Domain Adapters for Domain Generalization

Yuyang Ji, Zeyi Huang, Haohan Wang et al.

ICCV 2025 • poster
#21604

EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing

Yizhang Zhu, Runzhi JIANG, Boyan Li et al.

COLM 2025 • paper
#21605

Soft Separation and Distillation: Toward Global Uniformity in Federated Unsupervised Learning

Hung-Chieh Fang, Hsuan-Tien Lin, Irwin King et al.

ICCV 2025 • poster • arXiv:2508.01251
#21606

Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image

Jerred Chen, Ronald Clark

ICCV 2025 • poster • arXiv:2503.17358
#21607

Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts

Zixuan Hu, Dongxiao Li, Xinzhu Ma et al.

ICCV 2025 • highlight • arXiv:2508.20488
#21608

Routing Mamba: Scaling State Space Models with Mixture-of-Experts Projection

Zheng Zhan, Liliang Ren, Shuohang Wang et al.

NEURIPS 2025 • poster • arXiv:2506.18145
#21609

Training Language Models to Generate Quality Code with Program Analysis Feedback

Feng Yao, Zilong Wang, Liyuan Liu et al.

NEURIPS 2025 • poster • arXiv:2505.22704
#21610

Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective

Hoang Phan, Tung Lam Tran, Quyen Tran et al.

ICCV 2025 • highlight • arXiv:2211.13723
#21611

Mixture of Inputs: Text Generation Beyond Discrete Token Sampling

Yufan Zhuang, Liyuan Liu, Chandan Singh et al.

NEURIPS 2025 • poster
#21612

Learning Null Geodesics for Gravitational Lensing Rendering in General Relativity

Mingyuan Sun, Zheng Fang, Jiaxu Wang et al.

ICCV 2025 • poster • arXiv:2507.15775
#21613

Object-centric Video Question Answering with Visual Grounding and Referring

Haochen Wang, Qirui Chen, Cilin Yan et al.

ICCV 2025 • poster • arXiv:2507.19599
#21614

Stochastic Regret Guarantees for Online Zeroth- and First-Order Bilevel Optimization

Parvin Nazari, Bojian Hou, Davoud Ataee Tarzanagh et al.

NEURIPS 2025 • poster • arXiv:2511.01126
#21615

NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache

Donghyun Son, Euntae Choi, Sungjoo Yoo

NEURIPS 2025 • poster • arXiv:2505.18231
#21616

Exploiting Frequency Dynamics for Enhanced Multimodal Event-based Action Recognition

Meiqi Cao, Xiangbo Shu, Xin Jiang et al.

ICCV 2025 • poster
#21617

3D Equivariant Visuomotor Policy Learning via Spherical Projection

Boce Hu, Dian Wang, David Klee et al.

NEURIPS 2025 • spotlight • arXiv:2505.16969
#21618

OmniCast: A Masked Latent Diffusion Model for Weather Forecasting Across Time Scales

Tung Nguyen, Tuan Pham, Troy Arcomano et al.

NEURIPS 2025 • poster • arXiv:2510.18707
#21619

How Far are AI-generated Videos from Simulating the 3D Visual World: A Learned 3D Evaluation Approach

Chirui CHANG, Jiahui Liu, Zhengzhe Liu et al.

ICCV 2025 • poster • arXiv:2406.19568
#21620

Parameter-Free Hypergraph Neural Network for Few-Shot Node Classification

Chaewoon Bae, Doyun Choi, Jaehyun Lee et al.

NEURIPS 2025 • poster • arXiv:2510.21462
#21621

Stop DDoS Attacking the Research Community with AI-Generated Survey Papers

Jianghao Lin, Rong Shan, Jiachen Zhu et al.

NEURIPS 2025 • oral • arXiv:2510.09686
#21622

WIPES: Wavelet-based Visual Primitives

Wenhao Zhang, Hao Zhu, Delong Wu et al.

ICCV 2025 • poster • arXiv:2508.12615
#21623

High-Performance Arithmetic Circuit Optimization via Differentiable Architecture Search

Xilin Xia, Jie Wang, Wanbo Zhang et al.

NEURIPS 2025 • spotlight
#21624

CoSMIC: Continual Self-supervised Learning for Multi-Domain Medical Imaging via Conditional Mutual Information Maximization

Yihang Liu, Ying Wen, Longzhen Yang et al.

ICCV 2025 • poster
#21625

Rethinking Approximate Gaussian Inference in Classification

Bálint Mucsányi, Nathaël Da Costa, Philipp Hennig

NEURIPS 2025 • poster • arXiv:2502.03366
#21626

Adv-SSL: Adversarial Self-Supervised Representation Learning with Theoretical Guarantees

Chenguang Duan, Yuling Jiao, Huazhen Lin et al.

NEURIPS 2025 • poster • arXiv:2408.08533
#21627

Information Retrieval Induced Safety Degradation in AI Agents

Cheng Yu, Benedikt Stroebl, Diyi Yang et al.

NEURIPS 2025 • poster • arXiv:2505.14215
#21628

Diffusion Curriculum: Synthetic-to-Real Data Curriculum via Image-Guided Diffusion

Yijun Liang, Shweta Bhardwaj, Tianyi Zhou

ICCV 2025 • poster • arXiv:2410.13674
#21629

Put CASH on Bandits: A Max K-Armed Problem for Automated Machine Learning

Amir Rezaei Balef, Claire Vernade, Katharina Eggensperger

NEURIPS 2025 • poster • arXiv:2505.05226
#21630

Advancing Textual Prompt Learning with Anchored Attributes

Zheng Li, Yibing Song, Ming-Ming Cheng et al.

ICCV 2025 • poster • arXiv:2412.09442
#21631

Memory-Integrated Reconfigurable Adapters: A Unified Framework for Settings with Multiple Tasks

Susmit Agrawal, Krishn Vishwas Kher, Saksham Mittal et al.

NEURIPS 2025 • poster • arXiv:2512.00940
#21632

Dual-Rate Dynamic Teacher for Source-Free Domain Adaptive Object Detection

Qi He, Xiao Wu, Jun-Yan He et al.

ICCV 2025 • poster
#21633

OV3D-CG: Open-vocabulary 3D Instance Segmentation with Contextual Guidance

Mingquan Zhou, Chen He, Ruiping Wang et al.

ICCV 2025 • poster
#21634

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Zhisheng Zhong, Chengyao Wang, Yuqi Liu et al.

ICCV 2025 • poster • arXiv:2412.09501
#21635

Enhancing Mamba Decoder with Bidirectional Interaction in Multi-Task Dense Prediction

Mang Cao, Sanping Zhou, Yizhe Li et al.

ICCV 2025 • poster • arXiv:2508.20376
#21636

SITE: towards Spatial Intelligence Thorough Evaluation

Wenqi Wang, Reuben Tan, Pengyue Zhu et al.

ICCV 2025 • poster • arXiv:2505.05456
#21637

SHIFT: Smoothing Hallucinations by Information Flow Tuning for Multimodal Large Language Models

Sudong Wang, Yunjian Zhang, Yao Zhu et al.

ICCV 2025 • poster
#21638

Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning

Marwa Abdulhai, Ryan Cheng, Donovan Clay et al.

NEURIPS 2025 • poster • arXiv:2511.00222
#21639

Precise Diffusion Inversion: Towards Novel Samples and Few-Step Models

Jing Zuo, Luoping Cui, Chuang Zhu et al.

NEURIPS 2025 • poster
#21640

Text2VDM: Text to Vector Displacement Maps for Expressive and Interactive 3D Sculpting

Hengyu Meng, Duotun Wang, Zhijing Shao et al.

ICCV 2025 • poster • arXiv:2502.20045
#21641

Mamba-3VL: Taming State Space Model for 3D Vision Language Learning

Yuan Wang, Yuxin Chen, Zhongang Qi et al.

ICCV 2025 • poster
#21642

Self supervised learning for in vivo localization of microelectrode arrays using raw local field potential

Tianxiao He, Malhar Patel, Chenyi Li et al.

NEURIPS 2025 • poster
#21643

MobileViCLIP: An Efficient Video-Text Model for Mobile Devices

Min Yang, Zihan Jia, Zhilin Dai et al.

ICCV 2025 • poster • arXiv:2508.07312
#21644

Collective Counterfactual Explanations: Balancing Individual Goals and Collective Dynamics

Ahmad-Reza Ehyaei, Ali Shirali, Samira Samadi

NEURIPS 2025 • poster • arXiv:2402.04579
#21645

MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking

Han Han, Wei Zhai, Yang Cao et al.

ICCV 2025 • poster • arXiv:2412.01300
#21646

Asynchronous Event Error-Minimizing Noise for Safeguarding Event Dataset

Ruofei WANG, Peiqi Duan, Boxin Shi et al.

ICCV 2025 • highlight • arXiv:2507.05728
#21647

Vector Contrastive Learning For Pixel-Wise Pretraining In Medical Vision

Yuting He, Shuo Li

ICCV 2025 • poster • arXiv:2506.20850
#21648

AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation

Itay Nakash, Nitay Calderon, Eyal Ben-David et al.

COLM 2025 • paper
#21649

Efficient Fine-Tuning of Large Models via Nested Low-Rank Adaptation

Lujun Li, Cheng Lin, Dezhi Li et al.

ICCV 2025 • poster
#21650

Dual-level Prototype Learning for Composite Degraded Image Restoration

Zhongze Wang, Haitao Zhao, Lujian Yao et al.

ICCV 2025 • poster
#21651

Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation

Shengfang ZHAI, Jiajun Li, Yue Liu et al.

ICCV 2025 • highlight • arXiv:2503.06453
#21652

Online Mixture of Experts: No-Regret Learning for Optimal Collective Decision-Making

Larkin Liu, Jalal Etesami

NEURIPS 2025 • poster • arXiv:2510.21788
#21653

GReg: Geometry-Aware Region Refinement for Sign Language Video Generation

Tongkai Shi, Lianyu Hu, Fanhua Shang et al.

ICCV 2025 • poster
#21654

FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing

Bizhu Wu, Jinheng Xie, Meidan Ding et al.

ICCV 2025 • poster • arXiv:2507.19850
#21655

Topology-aware Graph Diffusion Model with Persistent Homology

Joonhyuk Park, Donghyun Lee, Yujee Song et al.

NEURIPS 2025 • poster
#21656

Knowledge Transfer from Interaction Learning

Yilin Gao, Kangyi Chen, Zhongxing Peng et al.

ICCV 2025 • poster • arXiv:2509.18733
#21657

WIR3D: Visually-Informed and Geometry-Aware 3D Shape Abstraction

Richard Liu, Daniel Fu, Noah Tan et al.

ICCV 2025 • poster • arXiv:2505.04813
#21658

Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection

Jinglun Li, Kaixun Jiang, Zhaoyu Chen et al.

ICCV 2025 • highlight • arXiv:2507.10225
#21659

Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression

Shiyu Qin, Jinpeng Wang, Yimin Zhou et al.

ICCV 2025 • poster
#21660

SpectralAR: Spectral Autoregressive Visual Generation

Yuanhui Huang, Weiliang Chen, Wenzhao Zheng et al.

ICCV 2025 • poster • arXiv:2506.10962
#21661

Boosting Adversarial Transferability via Negative Hessian Trace Regularization

Yunfei Long, Zilin Tian, Liguo Zhang et al.

ICCV 2025 • poster
#21662

OneGT: One-Shot Geometry-Texture Neural Rendering for Head Avatars

Jinshu Chen, Bingchuan Li, Fan Zhang et al.

ICCV 2025 • poster
#21663

Unsupervised Visible-Infrared Person Re-identification under Unpaired Settings

Haoyu Yao, Bin Yang, Wenke Huang et al.

ICCV 2025 • poster
#21664

Adaptive Prompt Learning via Gaussian Outlier Synthesis for Out-of-distribution Detection

Yongkang Zhang, Dongyu She, Zhong Zhou

ICCV 2025 • poster
#21665

Traceable and Explainable Multimodal Large Language Models: An Information-Theoretic View

Zihan Huang, Junda Wu, Rohan Surana et al.

COLM 2025 • paper
#21666

A Differentiable Wave Optics Model for End-to-End Computational Imaging System Optimization

Chi-Jui Ho, Yash Belhe, Steve Rotenberg et al.

ICCV 2025 • poster • arXiv:2412.09774
#21667

OCK: Unsupervised Dynamic Video Prediction with Object-Centric Kinematics

YeonJi Song, Jaein Kim, Suhyung Choi et al.

ICCV 2025 • poster • arXiv:2404.18423
#21668

Intra-view and Inter-view Correlation Guided Multi-view Novel Class Discovery

Xinhang Wan, Jiyuan Liu, Qian Qu et al.

ICCV 2025 • poster • arXiv:2507.12029
#21669

HUST: High-Fidelity Unbiased Skin Tone Estimation via Texture Quantization

Zimin Ran, Xingyu Ren, Xiang An et al.

ICCV 2025 • poster
#21670

ProbMED: A Probabilistic Framework for Medical Multimodal Binding

Yuan Gao, Sangwook Kim, Jianzhong You et al.

ICCV 2025 • poster • arXiv:2509.25711
#21671

CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning

Duo Wu, Jinghe Wang, Yuan Meng et al.

ICCV 2025 • poster • arXiv:2411.16313
#21672

Dynamic Group Detection using VLM-augmented Temporal Groupness Graph

Kaname Yokoyama, Chihiro Nakatani, Norimichi Ukita

ICCV 2025 • poster • arXiv:2509.04758
#21673

CountSE: Soft Exemplar Open-set Object Counting

Shuai Liu, Peng Zhang, Shiwei Zhang et al.

ICCV 2025 • highlight
#21674

Accident Anticipation via Temporal Occurrence Prediction

Tianhao Zhao, Yiyang Zou, Zihao Mao et al.

NEURIPS 2025 • oral • arXiv:2510.22260
#21675

GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices

Xudong LU, Yinghao Chen, Renshou Wu et al.

ICCV 2025 • poster • arXiv:2503.06019
#21676

MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation

Xinyu Liu, Guolei Sun, Cheng Wang et al.

ICCV 2025 • poster • arXiv:2509.21265
#21677

Trans-Adapter: A Plug-and-Play Framework for Transparent Image Inpainting

Yuekun Dai, Haitian Li, Shangchen Zhou et al.

ICCV 2025 • poster • arXiv:2508.01098
#21678

Generating Physically Sound Designs from Text and a Set of Physical Constraints

Gregory Barber, Todd Henry, Mulugeta Haile

NEURIPS 2025 • poster
#21679

Generalization-Preserved Learning: Closing the Backdoor to Catastrophic Forgetting in Continual Deepfake Detection

Xueyi Zhang, Peiyin Zhu, Chengwei Zhang et al.

ICCV 2025 • poster
#21680

IGD: Instructional Graphic Design with Multimodal Layer Generation

Yadong Qu, Shancheng Fang, Yuxin Wang et al.

ICCV 2025 • poster • arXiv:2507.09910
#21681

Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection

Romain Thoreau, Valerio Marsocci, Dawa Derksen

ICCV 2025 • poster • arXiv:2503.09493
#21682

Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation

Toshinori Kitamura, Arnob Ghosh, Tadashi Kozuno et al.

NEURIPS 2025 • spotlight • arXiv:2502.10138
#21683

CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction

Yuanyuan Gao, Hao Li, Jiaqi Chen et al.

ICCV 2025 • poster • arXiv:2503.23044
#21684

AIRA: Activation-Informed Low-Rank Adaptation for Large Models

Lujun Li, Dezhi Li, Cheng Lin et al.

ICCV 2025 • poster
#21685

scGeneScope: A Treatment-Matched Single Cell Imaging and Transcriptomics Dataset and Benchmark for Treatment Response Modeling

Joel Dapello, Marcel Nassar, Ridvan Eksi et al.

NEURIPS 2025 • poster
#21686

Face Retouching with Diffusion Data Generation and Spectral Restorement

Zhidan Xu, Xiaoqin Zhang, Shijian Lu

ICCV 2025 • poster
#21687

Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder

Wonwoong Cho, Yan-Ying Chen, Matthew Klenk et al.

ICCV 2025 • highlight • arXiv:2503.11937
#21688

Class Token as Proxy: Optimal Transport-assisted Proxy Learning for Weakly Supervised Semantic Segmentation

Jian Wang, Tianhong Dai, Bingfeng Zhang et al.

ICCV 2025 • poster
#21689

LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation

Juzheng Zhang, Jiacheng You, Ashwinee Panda et al.

COLM 2025 • paper
#21690

SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations

Buyun Liang, Liangzu Peng, Jinqi Luo et al.

NEURIPS 2025 • poster • arXiv:2510.04398
#21691

3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt

Lukas Höllein, Aljaz Bozic, Michael Zollhöfer et al.

ICCV 2025 • poster • arXiv:2409.12892
#21692

Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach

Shijian Deng, Wentian Zhao, Yu-Jhe Li et al.

COLM 2025 • paper
#21693

GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene

Xiao Chen, Tai Wang, Quanyi Li et al.

ICCV 2025 • poster • arXiv:2505.20294
#21694

CA2C: A Prior-Knowledge-Free Approach for Robust Label Noise Learning via Asymmetric Co-learning and Co-training

Mengmeng Sheng, Zeren Sun, Tianfei Zhou et al.

ICCV 2025 • poster
#21695

Point Cloud Self-supervised Learning via 3D to Multi-view Masked Learner

Zhimin Chen, Xuewei Chen, Xiao Guo et al.

ICCV 2025 • poster • arXiv:2311.10887
#21696

MSA2: Multi-task Framework with Structure-aware and Style-adaptive Character Representation for Open-set Chinese Text Recognition

Yangfu Li, Hongjian Zhan, Qi Liu et al.

ICCV 2025 • poster
#21697

VaPR - Vision-language Preference alignment for Reasoning

Rohan Wadhawan, Fabrice Y Harel-Canada, Zi-Yi Dou et al.

COLM 2025 • paper
#21698

Neural Collapse under Gradient Flow on Shallow ReLU Networks for Orthogonally Separable Data

Hancheng Min, Zhihui Zhu, Rene Vidal

NEURIPS 2025 • poster • arXiv:2510.21078
#21699

MultiModal Action Conditioned Video Simulation

Yichen Li, Antonio Torralba

ICCV 2025 • poster
#21700

FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization

Hao Chen, Shell Xu Hu, Wayne Luk et al.

ICCV 2025 • poster • arXiv:2503.12649
#21701

VisRL: Intention-Driven Visual Perception via Reinforced Reasoning

Zhangquan Chen, Xufang Luo, Dongsheng Li

ICCV 2025 • poster • arXiv:2503.07523
#21702

ClearSight: Human Vision-Inspired Solutions for Event-Based Motion Deblurring

Xiaopeng LIN, Yulong Huang, Hongwei Ren et al.

ICCV 2025 • poster • arXiv:2501.15808
#21703

MotionBind: Multi-Modal Human Motion Alignment for Retrieval, Recognition, and Generation

Kaleab Kinfu, Rene Vidal

NEURIPS 2025 • oral
#21704

LEGO-Maker: A Semantic-Driven Algorithm for Text-to-3D Generation

Yifei Zhang, Lei Chen

ICCV 2025 • poster
#21705

First Attentions Last: Better Exploiting First Attentions for Efficient Parallel Training

Gyudong Kim, Hyukju Na, Jin Kim et al.

NEURIPS 2025 • poster
#21706

Dense Policy: Bidirectional Autoregressive Learning of Actions

Yue Su, Xinyu Zhan, Hongjie Fang et al.

ICCV 2025 • poster • arXiv:2503.13217
#21707

DOGR: Towards Versatile Visual Document Grounding and Referring

Yinan Zhou, Yuxin Chen, Haokun Lin et al.

ICCV 2025 • poster • arXiv:2411.17125
#21708

ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation

Xiwei Xuan, Ziquan Deng, Kwan-Liu Ma

ICCV 2025 • highlight • arXiv:2506.21233
#21709

MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos

Hongyi Zhou, Xiaogang Wang, Yulan Guo et al.

ICCV 2025 • poster • arXiv:2505.11868
#21710

Performing Defocus Deblurring by Modeling its Formation Process

Zhengbo Zhang, Lin Geng Foo, Hossein Rahmani et al.

ICCV 2025 • poster
#21711

Supervised Exploratory Learning for Long-Tailed Visual Recognition

Zhongquan Jian, Yanhao Chen, Wangyancheng Wangyancheng et al.

ICCV 2025 • poster
#21712

OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning

Yuan Liu, Saihui Hou, Saijie Hou et al.

ICCV 2025 • poster • arXiv:2503.11093
#21713

TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding

Zuhao Yang, Yingchen Yu, Yunqing Zhao et al.

ICCV 2025 • poster • arXiv:2508.01699
#21714

Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments

Liang Qin, Min Wang, Peiwei Li et al.

ICCV 2025 • poster
#21715

Large Stepsizes Accelerate Gradient Descent for Regularized Logistic Regression

Jingfeng Wu, Pierre Marion, Peter Bartlett

NEURIPS 2025 • poster • arXiv:2506.02336
#21716

GaussianReg: Rapid 2D/3D Registration for Emergency Surgery via Explicit 3D Modeling with Gaussian Primitives

Weihao Yu, Xiaoqing Guo, Xinyu Liu et al.

ICCV 2025 • poster
#21717

ArgoTweak: Towards Self-Updating HD Maps through Structured Priors

Lena Wild, Rafael Valencia, Patric Jensfelt

ICCV 2025 • poster • arXiv:2509.08764
#21718

SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations

Songchun Zhang, Huiyao Xu, Sitong Guo et al.

ICCV 2025 • poster • arXiv:2505.11992
#21719

Humans overrely on overconfident language models, across languages

Neil Rathi, Dan Jurafsky, Kaitlyn Zhou

COLM 2025 • paper
#21720

Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics

Keming Wu, Junwen Chen, Zhanhao Liang et al.

ICCV 2025 • poster
#21721

Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning

Haochen Zhang, Zhong Zheng, Lingzhou Xue

NEURIPS 2025 • poster • arXiv:2506.04626
#21722

FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning

Maximilian Hoefler, Karsten Mueller, Wojciech Samek

ICCV 2025 • poster
#21723

Visual Textualization for Image Prompted Object Detection

Yongjian Wu, Yang Zhou, Jiya Saiyin et al.

ICCV 2025 • poster • arXiv:2506.23785
#21724

LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs

Haoran Lou, Chunxiao Fan, Ziyan Liu et al.

ICCV 2025 • poster • arXiv:2507.00505
#21725

Don’t lie to your friends: Learning what you know from collaborative self-play

Jacob Eisenstein, Reza Aghajani, Adam Fisch et al.

COLM 2025 • paper
#21726

UFT: Unifying Supervised and Reinforcement Fine-Tuning

Mingyang Liu, Gabriele Farina, Asuman Ozdaglar

NEURIPS 2025 • poster • arXiv:2505.16984
#21727

SuperBPE: Space Travel for Language Models

Alisa Liu, Jonathan Hayase, Valentin Hofmann et al.

COLM 2025 • paper
#21728

From Queries to Criteria: Understanding How Astronomers Evaluate LLMs

Alina Hyk, Kiera McCormick, Mian Zhong et al.

COLM 2025 • paper
#21729

GMMamba: Group Masking Mamba for Whole Slide Image Classification

Tingting Zheng, Hongxun Yao, Kui Jiang et al.

ICCV 2025 • poster
#21730

RareCLIP: Rarity-aware Online Zero-shot Industrial Anomaly Detection

Jianfang He, Min Cao, Silong Peng et al.

ICCV 2025 • poster
#21731

Estimation of Stochastic Optimal Transport Maps

Sloan Nietert, Ziv Goldfeld

NEURIPS 2025 • poster • arXiv:2512.09499
#21732

Pro3D-Editor: A Progressive Framework for Consistent and Precise 3D Editing

Yang Zheng, Mengqi Huang, Nan Chen et al.

NEURIPS 2025 • poster
#21733

Temporal Rate Reduction Clustering for Human Motion Segmentation

Xianghan Meng, Zhengyu Tong, Zhiyuan Huang et al.

ICCV 2025 • poster • arXiv:2506.21249
#21734

Follow-the-Perturbed-Leader Nearly Achieves Best-of-Both-Worlds for the m-Set Semi-Bandit Problems

Jingxin Zhan, Yuchen Xin, Chenjie Sun et al.

NEURIPS 2025 • poster
#21735

Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery

Ming Hu, Zhengdi Yu, Feilong Tang et al.

NEURIPS 2025 • spotlight • arXiv:2505.17677
#21736

Separation for Better Integration: Disentangling Edge and Motion in Event-based Deblurring

Yufei Zhu, Hao Chen, Yongjian Deng et al.

ICCV 2025 • poster
#21737

Scalable Cross-View Sample Alignment for Multi-View Clustering with View Structure Similarity

Jun Wang, Zhenglai Li, Chang Tang et al.

NEURIPS 2025 • spotlight
#21738

Diversity-Enhanced Distribution Alignment for Dataset Distillation

Hongcheng Li, Yucan Zhou, Xiaoyan Gu et al.

ICCV 2025 • poster
#21739

Adapt Foundational Segmentation Models with Heterogeneous Searching Space

Li Yi, Jie Hu, Songan Zhang et al.

ICCV 2025 • poster
#21740

Think Twice: Test-Time Reasoning for Robust CLIP Zero-Shot Classification

Shenyu Lu, Zhaoying Pan, Xiaoqian Wang

ICCV 2025 • poster
#21741

Efficient Algorithms for Robust and Partial Semi-Discrete Optimal Transport

Pankaj Agarwal, Sharath Raghvendra, Pouyan Shirzadian et al.

NEURIPS 2025 • poster
#21742

Counting Stacked Objects

Corentin Dumery, Noa Ette, Aoxiang Fan et al.

ICCV 2025 • poster • arXiv:2411.19149
#21743

RankMatch: A Novel Approach to Semi-Supervised Label Distribution Learning Leveraging Rank Correlation between Labels

Zhiqiang Kou, Yucheng Xie, Hailin Wang et al.

NEURIPS 2025 • poster
#21744

Wavy Transformer

Satoshi Noguchi, Yoshinobu Kawahara

NEURIPS 2025 • poster • arXiv:2508.12787
#21745

GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling

Pinxin Liu, Luchuan Song, Junhua Huang et al.

ICCV 2025 • poster • arXiv:2501.18898
#21746

SDFormer: Vision-based 3D Semantic Scene Completion via SAM-assisted Dual-channel Voxel Transformer

Yujie Xue, Huilong Pi, Jiapeng Zhang et al.

ICCV 2025 • poster
#21747

TopoTTA: Topology-Enhanced Test-Time Adaptation for Tubular Structure Segmentation

Jiale Zhou, Wenhan Wang, Shikun Li et al.

ICCV 2025 • poster • arXiv:2508.00442
#21748

MagShield: Towards Better Robustness in Sparse Inertial Motion Capture Under Magnetic Disturbances

Yunzhe Shao, Xinyu Yi, Lu Yin et al.

ICCV 2025 • poster • arXiv:2506.22907
#21749

DeFSS: Image-to-Mask Denoising Learning for Few-shot Segmentation

Zishu Qin, Junhao Xu, Weifeng Ge

ICCV 2025 • poster
#21750

TAD-E2E: A Large-scale End-to-end Autonomous Driving Dataset

Chang Liu, mingxuzhu mingxuzhu, Zheyuan Zhang et al.

ICCV 2025 • poster
#21751

Photolithography Overlay Map Generation with Implicit Knowledge Distillation Diffusion Transformer

YuanFu Yang, Hsiu-Hui Hsiao

ICCV 2025 • poster
#21752

VehicleMAE: View-asymmetry Mutual Learning for Vehicle Re-identification Pre-training via Masked AutoEncoders

Qi Wang, Zeyu Zhang, Dong Wang et al.

ICCV 2025 • poster
#21753

Multi-scenario Overlapping Text Segmentation with Depth Awareness

Yang Liu, Xudong Xie, Yuliang Liu et al.

ICCV 2025 • poster
#21754

FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention

Xuan Ju, Weicai Ye, Quande Liu et al.

ICCV 2025 • poster
#21755

Learning Hierarchical Line Buffer for Image Processing

Jiacheng Li, Feiran Li, Daisuke Iso

ICCV 2025 • poster
#21756

Humans as Checkerboards: Calibrating Camera Motion Scale for World-Coordinate Human Mesh Recovery

Fengyuan Yang, Kerui Gu, Ha Linh Nguyen et al.

ICCV 2025 • poster • arXiv:2407.00574
#21757

GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion

Gwanghyun Kim, Xueting Li, Ye Yuan et al.

ICCV 2025 • poster • arXiv:2505.23085
#21758

Stereo Any Video: Temporally Consistent Stereo Matching

Junpeng Jing, Weixun Luo, Ye Mao et al.

ICCV 2025 • highlight • arXiv:2503.05549
#21759

Factor Decorrelation Enhanced Data Removal from Deep Predictive Models

Wenhao Yang, Lin Li, Xiaohui Tao et al.

NEURIPS 2025 • poster • arXiv:2509.23443
#21760

ViT-Split: Unleashing the Power of Vision Foundation Models via Efficient Splitting Heads

Yifan Li, Xin Li, Tianqin Li et al.

ICCV 2025 • poster • arXiv:2506.03433
#21761

Cycle-Consistent Learning for Joint Layout-to-Image Generation and Object Detection

Xinhao Cai, Qiuxia Lai, Gensheng Pei et al.

ICCV 2025 • poster
#21762

The Dual-Route Model of Induction

Sheridan Feucht, Eric Todd, Byron C Wallace et al.

COLM 2025 • paper
#21763

CarGait: Cross-Attention based Re-ranking for Gait recognition

Gavriel Habib, Noa Barzilay, Or Shimshi et al.

ICCV 2025 • poster • arXiv:2503.03501
#21764

StyleSRN: Scene Text Image Super-Resolution with Text Style Embedding

Shengrong Yuan, Runmin Wang, Ke Hao et al.

ICCV 2025 • poster
#21765

Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation

Zheng Gao, Jifei Song, Zhensong Zhang et al.

ICCV 2025 • poster
#21766

MixPrompt: Efficient Mixed Prompting for Multimodal Semantic Segmentation

Zhiwei Hao, Zhongyu Xiao, Jianyuan Guo et al.

NEURIPS 2025 • poster
#21767

Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition

Wenhan Wu, Zhishuai Guo, Chen Chen et al.

ICCV 2025 • poster • arXiv:2506.22179
#21768

Cross-Category Subjectivity Generalization for Style-Adaptive Sketch Re-ID

Zechao Hu, Zhengwei Yang, Hao Li et al.

ICCV 2025 • poster
#21769

Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation

Sanjana Ramprasad, Byron Wallace

NEURIPS 2025 • poster • arXiv:2411.16638
#21770

Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and Beyond

Xin Qiao, Matteo Poggi, Xing Wei et al.

ICCV 2025 • poster • arXiv:2511.01704
#21771

Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression

Hanqi Xiao, Yi-Lin Sung, Elias Stengel-Eskin et al.

COLM 2025 • paper
#21772

An Adaptive Quantum Circuit of Dempster's Rule of Combination for Uncertain Pattern Classification

Fuyuan Xiao, Yu Zhou, Witold Pedrycz

NEURIPS 2025 • poster
#21773

Discretized Gaussian Representation for Tomographic Reconstruction

Shaokai Wu, Yuxiang Lu, Yapan Guo et al.

ICCV 2025 • poster • arXiv:2411.04844
#21774

3D Test-time Adaptation via Graph Spectral Driven Point Shift

Xin Wei, Qin Yang, Yijie Fang et al.

ICCV 2025 • poster • arXiv:2507.18225
#21775

EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation

Zengyu Wan, Wei Zhai, Yang Cao et al.

ICCV 2025 • poster • arXiv:2503.11371
#21776

KDA: Knowledge Diffusion Alignment with Enhanced Context for Video Temporal Grounding

Ran Ran, Jiwei Wei, Shiyuan He et al.

ICCV 2025 • poster
#21777

VisNumBench: Evaluating Number Sense of Multimodal Large Language Models

Tengjin Weng, Jingyi Wang, Wenhao Jiang et al.

ICCV 2025 • poster • arXiv:2503.14939
#21778

STEP-DETR: Advancing DETR-based Semi-Supervised Object Detection with Super Teacher and Pseudo-Label Guided Text Queries

Tahira Shehzadi, Khurram Azeem Hashmi, Shalini Sarode et al.

ICCV 2025 • poster
#21779

Completing 3D Partial Assemblies with View-Consistent 2D-3D Correspondence

Weihao Wang, Yu Lan, Mingyu You et al.

ICCV 2025 • poster
#21780

Aligning Global Semantics and Local Textures in Generative Video Enhancement

Zhikai Chen, Fuchen Long, Zhaofan Qiu et al.

ICCV 2025 • poster
#21781

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling

Hayeon Kim, Ji Ha Jang, Se Young Chun

ICCV 2025 • poster • arXiv:2507.11061
#21782

AlignedGen: Aligning Style Across Generated Images

Jiexuan Zhang, Yiheng Du, Qian Wang et al.

NEURIPS 2025 • poster • arXiv:2509.17088
#21783

Seeing through Uncertainty: Robust Task-Oriented Optimization in Visual Navigation

Yiyuan Pan, Yunzhe XU, Zhe Liu et al.

NEURIPS 2025 • poster • arXiv:2510.00441
#21784

Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning

Yang Li, Aming WU, Zihao Zhang et al.

NEURIPS 2025 • poster • arXiv:2510.13307
#21785

EvidenceBench: A Benchmark for Extracting Evidence from Biomedical Papers

Jianyou Wang, Weili Cao, Kaicheng Wang et al.

COLM 2025 • paper
#21786

Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation

Guanyi Qin, Ziyue Wang, Daiyun Shen et al.

ICCV 2025 • highlight • arXiv:2507.18944
#21787

AIM: Amending Inherent Interpretability via Self-Supervised Masking

Eyad Alshami, Shashank Agnihotri, Bernt Schiele et al.

ICCV 2025 • highlight • arXiv:2508.11502
#21788

One Last Attention for Your Vision-Language Model

Liang Chen, Ghazi Shazan Ahmad, Tianjun Yao et al.

ICCV 2025 • poster • arXiv:2507.15480
#21789

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Shiven Sinha, Shashwat Goel, Ponnurangam Kumaraguru et al.

COLM 2025 • paper
#21790

RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS

Chuanyu Fu, Yuqi Zhang, Kunbin Yao et al.

ICCV 2025 • poster • arXiv:2506.02751
#21791

High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation

Runyang Feng, Hyung Jin Chang, Tze Ho Elden Tse et al.

ICCV 2025 • poster • arXiv:2510.11017
#21792

Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information

Junbo Zhao, Ting Zhang, Jiayu Sun et al.

ICCV 2025 • poster • arXiv:2503.05543
#21793

Partition-Then-Adapt: Combating Prediction Bias for Reliable Multi-Modal Test-Time Adaptation

Guowei Wang, Fan Lyu, Changxing Ding

NEURIPS 2025 • spotlight
#21794

Prior-Guided Diffusion Planning for Offline Reinforcement Learning

Donghyeon Ki, JunHyeok Oh, Seong-Woong Shim et al.

NEURIPS 2025 • poster • arXiv:2505.10881
#21795

Mitigating Catastrophic Overfitting in Fast Adversarial Training via Label Information Elimination

Chao Pan, Ke Tang, Li Qing et al.

ICCV 2025 • poster
#21796

Consistency Trajectory Matching for One-Step Generative Super-Resolution

Weiyi You, Mingyang Zhang, Leheng Zhang et al.

ICCV 2025 • poster • arXiv:2503.20349
#21797

Amodal Depth Anything: Amodal Depth Estimation in the Wild

Zhenyu Li, Mykola Lavreniuk, Jian Shi et al.

ICCV 2025 • poster • arXiv:2412.02336
#21798

Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Video Temporal Grounding

Jian Hu, Zixu Cheng, Shaogang Gong et al.

NEURIPS 2025 • oral
#21799

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment

Yifan Wang, Runjin Chen, Bolian Li et al.

COLM 2025 • paper
#21800

One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models

Hao Fang, Jiawei Kong, Wenbo Yu et al.

ICCV 2025 • poster • arXiv:2406.05491