Most Cited 2025 &quot;w4a4 quantization&quot; Papers

NEURIPS 2025arXiv:2503.09271

#14202

DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection

Chiara Cappellino, Gianluca Mancusi, Matteo Mosconi et al.

ICCV 2025arXiv:2509.14958

#14203

Seeing 3D Through 2D Lenses: 3D Few-Shot Class-Incremental Learning via Cross-Modal Geometric Rectification

Tuo Xiang, Xuemiao Xu, Bangzhen Liu et al.

#14204

Enhancing Zero-shot Object Counting via Text-guided Local Ranking and Number-evoked Global Attention

Shiwei Zhang, Qi Zhou, Wei Ke

ICLR 2025arXiv:2410.13054

#14205

Systems with Switching Causal Relations: A Meta-Causal Perspective

Moritz Willig, Tim Tobiasch, Florian Busch et al.

ICCV 2025arXiv:2507.08044

#14206

ConsNoTrainLoRA: Data-driven Weight Initialization of Low-rank Adapters using Constraints

Debasmit Das, Hyoungwoo Park, Munawar Hayat et al.

#14207

SINGER: Stochastic Network Graph Evolving Operator for High Dimensional PDEs

Mingquan Feng, Yixin Huang, Weixin Liao et al.

NEURIPS 2025arXiv:2507.11932

#14208

Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs

Mohammad Shahab Sepehri, Berk Tinaz, Zalan Fabian et al.

ICCV 2025arXiv:2506.22800

#14209

RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors

Sicong Du, Jiarun Liu, Qifeng Chen et al.

ICCV 2025arXiv:2507.00392

#14210

Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space

Yingping Liang, Yutao Hu, Wenqi Shao et al.

#14211

Pose-Guided Temporal Enhancement for Robust Low-Resolution Hand Reconstruction

Kaixin Fan, Pengfei Ren, Jingyu Wang et al.

NEURIPS 2025arXiv:2502.03654

#14212

Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics

Indrashis Das, Mahmoud Safari, Steven Adriaensen et al.

ICCV 2025arXiv:2510.12387

#14213

Scene Coordinate Reconstruction Priors

Wenjing Bian, Axel Barroso-Laguna, Tommaso Cavallari et al.

#14214

Revisiting Large-Scale Non-convex Distributionally Robust Optimization

Qi Zhang, Yi Zhou, Simon Khan et al.

ICCV 2025arXiv:2506.16852

#14215

Controllable and Expressive One-Shot Video Head Swapping

Chaonan Ji, Jinwei Qi, Peng Zhang et al.

NEURIPS 2025oralarXiv:2510.24965

#14216

Exponential Dynamic Energy Network for High Capacity Sequence Memory

Arjun Karuvally, Pichsinee Lertsaroj, Terrence Sejnowski et al.

#14217

Attribute-Missing Multi-view Graph Clustering

Bowen Zhao, Qianqian Wang, Zhengming Ding et al.

CVPR 2025highlightarXiv:2505.21377

#14218

Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility

Yidi Li, Jun Xiao, Zhengda Lu et al.

NEURIPS 2025arXiv:2512.14677

#14219

VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image

Sicheng Xu, Guojun Chen, Jiaolong Yang et al.

NEURIPS 2025arXiv:2508.10133

#14220

MANGO: Multimodal Attention-based Normalizing Flow Approach to Fusion Learning

Thanh-Dat Truong, Christophe Bobda, Nitin Agarwal et al.

ICLR 2025arXiv:2401.07085

#14221

Three Mechanisms of Feature Learning in a Linear Network

Yizhou Xu, Liu Ziyin

CVPR 2025arXiv:2502.19962

#14222

ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning

Quanxing Zha, Xin Liu, Shu-Juan Peng et al.

ICCV 2025arXiv:2508.05123

#14223

Latent Expression Generation for Referring Image Segmentation and Grounding

Seonghoon Yu, Junbeom Hong, Joonseok Lee et al.

CVPR 2025highlightarXiv:2503.09968

#14224

Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection

Zihao Zhang, Aming Wu, Yahong Han

ICCV 2025arXiv:2411.15513

#14225

SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation

Jiayuan Zhu, Junde Wu, Cheng Ouyang et al.

NEURIPS 2025oralarXiv:2506.06907

#14226

Uncertainty Estimation on Graphs with Structure Informed Stochastic Partial Differential Equations

Fred Xu, Thomas Markovich

CVPR 2025arXiv:2501.01235

#14227

SVFR: A Unified Framework for Generalized Video Face Restoration

Zhiyao Wang, Xu Chen, Chengming Xu et al.

NEURIPS 2025arXiv:2511.00124

#14228

Cross-fluctuation phase transitions reveal sampling dynamics in diffusion models

Sai Niranjan Ramachandran, Manish Krishan Lal, Suvrit Sra

ICCV 2025arXiv:2506.23152

#14229

DexH2R: A Benchmark for Dynamic Dexterous Grasping in Human-to-Robot Handover

Youzhuo Wang, jiayi ye, Chuyang Xiao et al.

NEURIPS 2025arXiv:2509.07027

#14230

Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models

Jisung Hwang, Jaihoon Kim, Minhyuk Sung

ICCV 2025arXiv:2505.10641

#14231

FRET: Feature Redundancy Elimination for Test Time Adaptation

Linjing You, Jiabao Lu, Xiayuan Huang et al.

NEURIPS 2025spotlightarXiv:2501.18962

#14232

Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Bootstrapping

Pu Yang, Yunzhen Feng, Ziyuan Chen et al.

NEURIPS 2025arXiv:2505.11816

#14233

Continuous Subspace Optimization for Continual Learning

Quan Cheng, Yuanyu Wan, Lingyu Wu et al.

CVPR 2025arXiv:2501.04293

#14234

TADFormer: Task-Adaptive Dynamic TransFormer for Efficient Multi-Task Learning

Seungmin Baek, Soyul Lee, Hayeon Jo et al.

NEURIPS 2025arXiv:2506.17124

#14235

When Can Model-Free Reinforcement Learning be Enough for Thinking?

Josiah Hanna, Nicholas Corrado

ICCV 2025highlightarXiv:2506.23618

#14236

TurboVSR: Fantastic Video Upscalers and Where to Find Them

Zhongdao Wang, Guodongfang Zhao, Jingjing Ren et al.

ICCV 2025arXiv:2503.17226

#14237

DDB: Diffusion Driven Balancing to Address Spurious Correlations

Aryan Yazdan Parast, Basim Azam, Naveed Akhtar

ICCV 2025arXiv:2510.07302

#14238

SpecGuard: Spectral Projection-based Advanced Invisible Watermarking

Inzamamul Alam, Md Islam, Simon Woo et al.

ICLR 2025arXiv:2501.14641

#14239

Towards Scalable Topological Regularizers

Wong Hiu-Tung, Darrick Lee, Hong Yan

ICCV 2025arXiv:2507.22522

#14240

Recognizing Actions from Robotic View for Natural Human-Robot Interaction

Ziyi Wang, Peiming Li, Hong Liu et al.

NEURIPS 2025arXiv:2508.15593

#14241

Inductive Domain Transfer In Misspecified Simulation-Based Inference

Ortal Senouf, Antoine Wehenkel, Cédric Vincent-Cuaz et al.

NEURIPS 2025arXiv:2507.17657

#14242

Attention (as Discrete-Time Markov) Chains

Yotam Erel, Olaf Dünkel, Rishabh Dabral et al.

#14243

IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular VideosC

Yuan Li, Ziqian Bai, Feitong Tan et al.

ICCV 2025arXiv:2507.04685

#14244

TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation

Changsong Lei, Yaqian Liang, Shaofeng Wang et al.

NEURIPS 2025spotlightarXiv:2510.26268

#14245

Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws

Lin Guo, Xiaoqing Luo, Wei Xie et al.

#14246

Multi-modal Contrastive Learning with Negative Sampling Calibration for Phenotypic Drug Discovery

Jiahua Rao, Hanjing Lin, Leyu Chen et al.

#14247

Proximal Mapping Loss: Understanding Loss Functions in Crowd Counting & Localization

Wei LIN, Jia Wan, Antoni Chan

ICCV 2025arXiv:2411.16167

#14248

Mind the Cost of Scaffold! Benign Clients May Even Become Accomplices of Backdoor Attack

Xingshuo Han, Xuanye Zhang, Xiang Lan et al.

NEURIPS 2025arXiv:2506.14271

#14249

Leader360V: A Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment

WEIMING ZHANG, Dingwen Xiao, Aobotao DAI et al.

NEURIPS 2025arXiv:2507.12318

#14250

Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models

Samuel Lavoie, Michael Noukhovitch, Aaron Courville

CVPR 2025arXiv:2412.09723

#14251

MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction

Xiaohao Xu, Feng Xue, Shibo Zhao et al.

ICCV 2025arXiv:2507.04123

#14252

Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge

Linshen Liu, Boyan Su, Junyue Jiang et al.

ICLR 2025arXiv:2410.03016

#14253

Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory

Alexander Levine, Peter Stone, Amy Zhang

NEURIPS 2025arXiv:2406.04772

#14254

REP: Resource-Efficient Prompting for Rehearsal-Free Continual Learning

Sungho Jeon, Xinyue Ma, Kwang In Kim et al.

NEURIPS 2025arXiv:2502.09933

#14255

MIR-Bench: Can Your LLM Recognize Complicated Patterns via Many-Shot In-Context Reasoning?

Kai Yan, Zhan Ling, Kang Liu et al.

ICCV 2025arXiv:2506.21037

#14256

Reinforcement Learning-Guided Data Selection via Redundancy Assessment

Suorong Yang, Peijia Li, Furao Shen et al.

NEURIPS 2025arXiv:2507.02800

#14257

Time-Masked Transformers with Lightweight Test-Time Adaptation for Neural Speech Decoding

Ebrahim Feghhi, Shreyas Kaasyap, Nima Hadidi et al.

ICLR 2025arXiv:2410.14055

#14258

Feedback Schrödinger Bridge Matching

Panagiotis Theodoropoulos, Nikolaos Komianos, Vincent Pacelli et al.

NEURIPS 2025arXiv:2510.21292

#14259

Additive Models Explained: A Computational Complexity Approach

Shahaf Bassan, Michal Moshkovitz, Guy Katz

ICCV 2025arXiv:2508.21695

#14260

Activation Subspaces for Out-of-Distribution Detection

Barış Zöngür, Robin Hesse, Stefan Roth

ICCV 2025highlightarXiv:2507.21049

#14261

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

Zedong Wang, Siyuan Li, Dan Xu

NEURIPS 2025arXiv:2505.16311

#14262

Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions

Marc Brooks, Gabriel Durham, Kihyuk Hong et al.

NEURIPS 2025arXiv:2511.11593

#14263

Sound Logical Explanations for Mean Aggregation Graph Neural Networks

Matthew Morris, Ian Horrocks

#14264

AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering

Jing Wang, Songhe Feng, Kristoffer Knutsen Wickstrøm et al.

NEURIPS 2025oralarXiv:2510.21110

#14265

Confounding Robust Deep Reinforcement Learning: A Causal Approach

Mingxuan Li, Junzhe Zhang, Elias Bareinboim

#14266

DynPose: Largely Improving the Efficiency of Human Pose Estimation by a Simple Dynamic Framework

Yalong Xu, Lin Zhao, Chen Gong et al.

ICCV 2025arXiv:2510.19622

#14267

Augmenting Moment Retrieval: Zero-Dependency Two-Stage Learning

Zhengxuan Wei, Jiajin Tang, Sibei Yang

#14268

PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation

Xinting Hu, Haoran Wang, Jan Lenssen et al.

NEURIPS 2025spotlightarXiv:2505.11309

#14269

Decomposing stimulus-specific sensory neural information via diffusion models

Steeve Laquitaine, Simone Azeglio, Carlo Paris et al.

ICLR 2025oralarXiv:2412.03496

#14270

TRENDy: Temporal Regression of Effective Nonlinear Dynamics

Matthew Ricci, Guy Pelc, Zoe Piran et al.

NEURIPS 2025spotlightarXiv:2506.00539

#14271

ARIA: Training Language Agents with Intention-driven Reward Aggregation

Ruihan Yang, yikai zhang, Aili Chen et al.

NEURIPS 2025arXiv:2505.17581

#14272

MODEM: A Morton-Order Degradation Estimation Mechanism for Adverse Weather Image Recovery

Hainuo Wang, Qiming Hu, Xiaojie Guo

#14273

Efficient Causal Decision Making with One-sided Feedback

Jianing Chu, Shu Yang, Wenbin Lu et al.

NEURIPS 2025arXiv:2510.11312

#14274

Nonlinearly Preconditioned Gradient Methods: Momentum and Stochastic Analysis

Konstantinos Oikonomidis, Jan Quan, Panagiotis Patrinos

#14275

PixelStitch: Structure-Preserving Pixel-Wise Bidirectional Warps for Unsupervised Image Stitching

Hengzhe Jin, Lang Nie, Chunyu Lin et al.

ICCV 2025highlightarXiv:2504.00139

#14276

SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection for SLAM

Yannick Burkhardt, Simon Schaefer, Stefan Leutenegger

NEURIPS 2025arXiv:2511.02347

#14277

LTD-Bench: Evaluating Large Language Models by Letting Them Draw

Liuhao Lin, Ke Li, Zihan Xu et al.

NEURIPS 2025arXiv:2505.19646

#14278

Energy-based generator matching: A neural sampler for general state space

Dongyeop Woo, Minsu Kim, Minkyu Kim et al.

CVPR 2025arXiv:2505.12685

#14279

Mamba-Adaptor: State Space Model Adaptor for Visual Recognition

Fei Xie, Jiahao Nie, Yujin Tang et al.

NEURIPS 2025arXiv:2502.05019

#14280

$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization

Rahul Vaze, Abhishek Sinha

NEURIPS 2025oralarXiv:2505.11930

#14281

The Logical Expressiveness of Temporal GNNs via Two-Dimensional Product Logics

Marco Sälzer, Przemyslaw Walega, Martin Lange

#14282

Serialization based Point Cloud Oversegmentation

chenghui Lu, Dilong Li, Jianlong Kwan et al.

ICCV 2025arXiv:2509.20022

#14283

PS3: A Multimodal Transformer Integrating Pathology Reports with Histology Images and Biological Pathways for Cancer Survival Prediction

Manahil Raza, Ayesha Azam, Talha Qaiser et al.

ICCV 2025arXiv:2503.17856

#14284

ClaraVid: A Holistic Scene Reconstruction Benchmark From Aerial Perspective With Delentropy-Based Complexity Profiling

Radu Beche, Sergiu Nedevschi

NEURIPS 2025arXiv:2511.01169

#14285

Web-Scale Collection of Video Data for 4D Animal Reconstruction

Brian Nlong Zhao, Jiajun Wu, Shangzhe Wu

NEURIPS 2025arXiv:2512.04310

#14286

RNNs perform task computations by dynamically warping neural representations

Arthur Pellegrino, Angus Chadwick

#14287

Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection

Qi Chen, Hu Ding

ICLR 2025arXiv:2405.14596

#14288

Linear Mode Connectivity in Differentiable Tree Ensembles

Ryuichi Kanoh, Mahito Sugiyama

#14289

Discrete Latent Plans via Semantic Skill Abstractions

Haobin Jiang, Wang, Zongqing Lu

CVPR 2025highlightarXiv:2501.01601

#14290

Few-shot Implicit Function Generation via Equivariance

Suizhi Huang, Xingyi Yang, Hongtao Lu et al.

#14291

Adaptive Energy Alignment for Accelerating Test-Time Adaptation

Wonjeong Choi, Do-Yeon Kim, Jungwuk Park et al.

NEURIPS 2025arXiv:2510.21315

#14292

Seemingly Redundant Modules Enhance Robust Odor Learning in Fruit Flies

HaiYang Li, Liao Yu, Qiang Yu et al.

ICCV 2025highlightarXiv:2507.06075

#14293

Discontinuity-aware Normal Integration for Generic Central Camera Models

Francesco Milano, Manuel Lopez-Antequera, Naina Dhingra et al.

ICCV 2025arXiv:2506.22531

#14294

Preserve Anything: Controllable Image Synthesis with Object Preservation

Prasen Kumar Sharma, Neeraj Matiyali, Siddharth Srivastava et al.

NEURIPS 2025arXiv:2505.14511

#14295

ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains

Guillaume Vray, Devavrat Tomar, Xufeng Gao et al.

#14296

Black Hole-Driven Identity Absorbing in Diffusion Models

Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung

ICCV 2025arXiv:2510.12679

#14297

MCOP: Multi-UAV Collaborative Occupancy Prediction

Zefu Lin, Wenbo Chen, Xiaojuan Jin et al.

ICCV 2025arXiv:2507.09910

#14298

IGD: Instructional Graphic Design with Multimodal Layer Generation

Yadong Qu, Shancheng Fang, Yuxin Wang et al.

#14299

SL2A-INR: Single-Layer Learnable Activation for Implicit Neural Representation

Reza Rezaeian, Moein Heidari, Reza Azad et al.

NEURIPS 2025arXiv:2510.19819

#14300

Is This Tracker On? A Benchmark Protocol for Dynamic Tracking

Ilona Demler, Saumya Chauhan, Georgia Gkioxari

NEURIPS 2025spotlightarXiv:2509.20211

#14301

Practical do-Shapley Explanations with Estimand-Agnostic Causal Inference

Álvaro Parafita, Tomas Garriga, Axel Brando et al.

#14302

3DIS: Depth-Driven Decoupled Image Synthesis for Universal Multi-Instance Generation

Dewei Zhou, Ji Xie, Zongxin Yang et al.

CVPR 2025arXiv:2505.07333

#14303

Link to the Past: Temporal Propagation for Fast 3D Human Reconstruction from Monocular Video

Marchellus Matthew, Nadhira Noor, In Kyu Park

NEURIPS 2025arXiv:2507.01737

#14304

HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion

Lin Wu, Zhixiang Chen, Jianglin Lan

NEURIPS 2025arXiv:2510.16869

#14305

No-Regret Online Autobidding Algorithms in First-price Auctions

Yilin LI, Yuan Deng, Wei Tang et al.

CVPR 2025arXiv:2503.13241

#14306

Sampling Innovation-Based Adaptive Compressive Sensing

Zhifu Tian, Tao Hu, Chaoyang Niu et al.

CVPR 2025arXiv:2505.19793

#14307

Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction

Li Fang, Hao Zhu, Longlong Chen et al.

ICLR 2025arXiv:2408.11760

#14308

R2Det: Exploring Relaxed Rotation Equivariance in 2D Object Detection

Zhiqiang Wu, Yingjie Liu, Hanlin Dong et al.

ICCV 2025arXiv:2507.19140

#14309

Balancing Conservatism and Aggressiveness: Prototype-Affinity Hybrid Network for Few-Shot Segmentation

Tianyu Zou, Shengwu Xiong, Ruilin Yao et al.

NEURIPS 2025arXiv:2511.03108

#14310

miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward

Azim Ospanov, Farzan Farnia, Roozbeh Yousefzadeh

ICCV 2025arXiv:2503.07249

#14311

Text-IRSTD: Leveraging Semantic Text to Promote Infrared Small Target Detection in Complex Scenes

Feng Huang, Shuyuan Zheng, Zhaobing Qiu et al.

#14312

Balanced Ranking with Relative Centrality: A multi-core periphery perspective

Chandra Sekhar Mukherjee, Jiapeng Zhang

ICCV 2025arXiv:2508.12330

#14313

DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection

Yuval Haitman, Oded Bialer

CVPR 2025arXiv:2303.16078

#14314

Practical Solutions to the Relative Pose of Three Calibrated Cameras

Charalambos Tzamos, Viktor Kocur, Yaqing Ding et al.

CVPR 2025arXiv:2503.21854

#14315

Foveated Instance Segmentation

Hongyi Zeng, Wenxuan Liu, Tianhua Xia et al.

ICLR 2025oralarXiv:2410.08439

#14316

Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamics

Josiah Kratz, Jacob Adamczyk

ICCV 2025arXiv:2504.09426

#14317

BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning

Shengao Wang, Arjun Chandra, Aoming Liu et al.

NEURIPS 2025oralarXiv:2410.16136

#14318

Modeling Dynamic Neural Activity by combining Naturalistic Video Stimuli and Stimulus-independent Latent Factors

Finn Schmidt, Polina Turishcheva, Suhas Shrinivasan et al.

NEURIPS 2025arXiv:2502.07480

#14319

Beyond Benign Overfitting in Nadaraya-Watson Interpolators

Daniel Barzilai, Guy Kornowski, Ohad Shamir

ICCV 2025arXiv:2507.15480

#14320

One Last Attention for Your Vision-Language Model

Liang Chen, Ghazi Shazan Ahmad, Tianjun Yao et al.

NEURIPS 2025arXiv:2505.10118

#14321

Why 1 + 1 < 1 in Visual Token Pruning: Beyond Naive Integration via Multi-Objective Balanced Covering

Yangfu Li, Hongjian Zhan, Tianyi Chen et al.

NEURIPS 2025oralarXiv:2507.06543

#14322

Token Bottleneck: One Token to Remember Dynamics

Taekyung Kim, Dongyoon Han, Byeongho Heo et al.

CVPR 2025arXiv:2503.18483

#14323

Explaining Domain Shifts in Language: Concept Erasing for Interpretable Image Classification

Zequn Zeng, Yudi Su, Jianqiao Sun et al.

CVPR 2025arXiv:2506.07750

#14324

Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation

Hyunsoo Kim, Donghyun Kim, Suhyun Kim

NEURIPS 2025arXiv:2509.16974

#14325

Hessian-guided Perturbed Wasserstein Gradient Flows for Escaping Saddle Points

Naoya Yamamoto, Juno Kim, Taiji Suzuki

ICCV 2025highlightarXiv:2508.11502

#14326

AIM: Amending Inherent Interpretability via Self-Supervised Masking

Eyad Alshami, Shashank Agnihotri, Bernt Schiele et al.

#14327

S⁴M: Boosting Semi-Supervised Instance Segmentation with SAM

Heeji Yoon, Heeseong Shin, Eunbeen Hong et al.

ICCV 2025arXiv:2504.20042

#14328

CompleteMe: Reference-based Human Image Completion

Yu-Ju Tsai, Brian Price, Qing Liu et al.

NEURIPS 2025oralarXiv:2507.21244

#14329

Bubbleformer: Forecasting Boiling with Transformers

Sheikh Md Shakeel Hassan, Xianwei Zou, Akash Dhruv et al.

ICCV 2025arXiv:2508.00649

#14330

Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights

Junhao Zheng, Jiahao Sun, Chenhao Lin et al.

ICCV 2025arXiv:2506.16827

#14331

Beyond Blur: A Fluid Perspective on Generative Diffusion Models

Grzegorz Gruszczynski, Jakub Meixner, Michał Włodarczyk et al.

ICLR 2025arXiv:2410.05655

#14332

Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning

Claire Chen, Shuze Liu, Shangtong Zhang

NEURIPS 2025arXiv:2411.16598

#14333

DiffBreak: Is Diffusion-Based Purification Robust?

Andre Kassis, Urs Hengartner, Yaoliang Yu

ICCV 2025arXiv:2412.06080

#14334

GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion

Karlo Koledic, Luka Petrovic, Ivan Marković et al.

#14335

Risk-Sensitive Variational Actor-Critic: A Model-Based Approach

Alonso Granados, Mohammadreza Ebrahimi, Jason Pacheco

#14336

CoSER: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation

Bonan Li, Zicheng Zhang, Xingyi Yang et al.

CVPR 2025highlight

ICCV 2025arXiv:2508.15439

#14337

Aligning Moments in Time using Video Queries

Yogesh Kumar, Uday Agarwal, Manish Gupta et al.

ICCV 2025highlightarXiv:2511.07409

#14338

DIMO: Diverse 3D Motion Generation for Arbitrary Objects

Linzhan Mou, Jiahui Lei, Chen Wang et al.

NEURIPS 2025arXiv:2507.02377

#14339

Sparse Gaussian Processes: Structured Approximations and Power-EP Revisited

Thang Bui, Michalis Titsias

ICCV 2025arXiv:2503.06569

#14340

Global-Aware Monocular Semantic Scene Completion with State Space Models

Shijie Li, Zhongyao Cheng, Rong Li et al.

NEURIPS 2025arXiv:2510.00296

#14341

Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT

Guy Bar-Shalom, Fabrizio Frasca, Yaniv Galron et al.

NEURIPS 2025arXiv:2510.19348

#14342

A Markov Decision Process for Variable Selection in Branch & Bound

Paul STRANG, Zacharie ALES, Côme Bissuel et al.

ICCV 2025arXiv:2505.19148

#14343

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing

Shengdong Han, Shangdong Yang, Yuxuan Li et al.

CVPR 2025highlightarXiv:2506.10182

#14344

Improving Personalized Search with Regularized Low-Rank Parameter Updates

Fiona Ryan, Josef Sivic, Fabian Caba Heilbron et al.

CVPR 2025highlightarXiv:2504.10158

#14345

COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts

Jiansheng Li, Xingxuan Zhang, Hao Zou et al.

NEURIPS 2025arXiv:2511.00116

#14346

LC-Opt: Benchmarking Reinforcement Learning and Agentic AI for End-to-End Liquid Cooling Optimization in Data Centers

Avisek Naug, Antonio Guillen-Perez, Vineet Kumar et al.

CVPR 2025arXiv:2503.12150

#14347

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis

Hongyu Sun, Qiuhong Ke, Ming Cheng et al.

CVPR 2025highlightarXiv:2411.16788

#14348

TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi

NEURIPS 2025oralarXiv:2511.14745

#14349

Look-Ahead Reasoning on Learning Platforms

Haiqing Zhu, Tijana Zrnic, Celestine Mendler-Dünner

ICCV 2025arXiv:2507.05256

#14350

SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

Jiahao Zhu, Zixuan Chen, Guangcong Wang et al.

NEURIPS 2025arXiv:2506.19839

#14351

Improving Progressive Generation with Decomposable Flow Matching

Moayed Haji-Ali, Willi Menapace, Ivan Skorokhodov et al.

CVPR 2025arXiv:2502.08646

#14352

Poly-Autoregressive Prediction for Modeling Interactions

Neerja Thakkar, Tara Sadjadpour, Jathushan Rajasegaran et al.

NEURIPS 2025arXiv:2502.12527

#14353

Comprehensive Assessment and Analysis for NSFW Content Erasure in Text-to-Image Diffusion models

Die Chen, Zhiwen Li, Cen Chen et al.

ICCV 2025arXiv:2503.17827

#14354

4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding

Wenxuan Zhu, Bing Li, Cheng Zheng et al.

#14355

Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection

Ting Li, Mao Ye, Tianwen Wu et al.

NEURIPS 2025arXiv:2505.15496

#14356

Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes

Hossein Zakerinia, Christoph Lampert

NEURIPS 2025arXiv:2510.20709

#14357

Separating the 'what' and 'how' of compositional computation to enable reuse and continual learning

Haozhe Shan, Sun Minni, Lea Duncker

NEURIPS 2025arXiv:2512.01372

#14358

Structured Spectral Reasoning for Frequency-Adaptive Multimodal Recommendation

Wei Yang, Rui Zhong, Yiqun Chen et al.

ICCV 2025arXiv:2411.13626

#14359

Principles of Visual Tokens for Efficient Video Understanding

Xinyue Hao, Li, Shreyank Gowda et al.

#14360

Graph Neural Network Combining Event Stream and Periodic Aggregation for Low-Latency Event-based Vision

Manon Dampfhoffer, Thomas Mesquida, Damien Joubert et al.

CVPR 2025highlight

CVPR 2025arXiv:2503.13915

#14361

Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization

Dongkwan Lee, Kyomin Hwang, Nojun Kwak

NEURIPS 2025oralarXiv:2509.18389

#14362

Towards Provable Emergence of In-Context Reinforcement Learning

Jiuqi Wang, Rohan Chandra, Shangtong Zhang

CVPR 2025highlightarXiv:2503.10000

#14363

MetricGrids: Arbitrary Nonlinear Approximation with Elementary Metric Grids based Implicit Neural Representation

Shu Wang, Yanbo Gao, Shuai Li et al.

NEURIPS 2025arXiv:2505.21749

#14364

Revisiting Bi-Linear State Transitions in Recurrent Neural Networks

Reza Ebrahimi, Roland Memisevic

NEURIPS 2025arXiv:2506.02964

#14365

FORLA: Federated Object-centric Representation Learning with Slot Attention

Guiqiu Liao, Matjaz Jogan, Eric Eaton et al.

ICCV 2025arXiv:2507.03434

#14366

Unlearning the Noisy Correspondence Makes CLIP More Robust

Haochen Han, Alex Jinpeng Wang, Peijun Ye et al.

NEURIPS 2025arXiv:2510.25657

#14367

Subgraph Federated Learning via Spectral Methods

Javad Aliakbari, Johan Oestman, Ashkan Panahi et al.

CVPR 2025arXiv:2505.09615

#14368

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

Yung-Hsuan Lai, Janek Ebbers, Yu-Chiang Frank Wang et al.

NEURIPS 2025arXiv:2512.06854

#14369

ArchPower: Dataset for Architecture-Level Power Modeling of Modern CPU Design

Qijun Zhang, Yao Lu, Mengming Li et al.

ICCV 2025arXiv:2410.13195

#14370

UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images

Jiamin WU, Kenkun Liu, Xiaoke Jiang et al.

CVPR 2025arXiv:2503.19846

#14371

Attention IoU: Examining Biases in CelebA using Attention Maps

Aaron Serianni, Tyler Zhu, Olga Russakovsky et al.

NEURIPS 2025arXiv:2510.27153

#14372

Exploring Landscapes for Better Minima along Valleys

Tong Zhao, Jiacheng Li, Yuanchang Zhou et al.

NEURIPS 2025arXiv:2502.18710

#14373

Bridging Critical Gaps in Convergent Learning: How Representational Alignment Evolves Across Layers, Training, and Distribution Shifts

Chaitanya Kapoor, Sudhanshu Srivastava, Meenakshi Khosla

NEURIPS 2025oralarXiv:2512.07599

#14374

Online Segment Any 3D Thing as Instance Tracking

Hanshi Wang, Cai Zijian, Jin Gao et al.

CVPR 2025arXiv:2503.18503

#14375

Deterministic Certification of Graph Neural Networks against Graph Poisoning Attacks with Arbitrary Perturbations

Jiate Li, Meng Pang, Yun Dong et al.

NEURIPS 2025arXiv:2502.07503

#14376

Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems

Ibrahim Alabdulmohsin, Xiaohua Zhai

ICCV 2025arXiv:2508.01098

#14377

Trans-Adapter: A Plug-and-Play Framework for Transparent Image Inpainting

Yuekun Dai, Haitian Li, Shangchen Zhou et al.

ICCV 2025arXiv:2507.12988

#14378

Variance-Based Pruning for Accelerating and Compressing Trained Networks

Uranik Berisha, Jens Mehnert, Alexandru Condurache

NEURIPS 2025spotlightarXiv:2601.09825

#14379

Eluder dimension: localise it!

Alireza Bakhtiari, Alex Ayoub, Samuel Robertson et al.

NEURIPS 2025arXiv:2505.09075

#14380

Risk Bounds For Distributional Regression

Carlos Misael Madrid Padilla, OSCAR HERNAN MADRID PADILLA, Sabyasachi Chatterjee

ICCV 2025arXiv:2507.20842

#14381

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Yuchen Liu, Yaoming Wang, Bowen Shi et al.

ICCV 2025arXiv:2507.22604

#14382

ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning

Xiefan Guo, Miaomiao Cui, Liefeng Bo et al.

ICCV 2025arXiv:2506.21884

#14383

UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields

Fabian Perez, Sara Rojas Martinez, Carlos Hinojosa et al.

#14384

Keep the Balance: A Parameter-Efficient Symmetrical Framework for RGB+X Semantic Segmentation

Jiaxin Cai, Jingze Su, Qi Li et al.

NEURIPS 2025oralarXiv:2506.03144

#14385

MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query

Wei Chow, Yuan Gao, Linfeng Li et al.

ICLR 2025arXiv:2501.16918

#14386

On Rollouts in Model-Based Reinforcement Learning

Bernd Frauenknecht, Devdutt Subhasish, Friedrich Solowjow et al.

CVPR 2025arXiv:2505.22458

#14387

Universal Domain Adaptation for Semantic Segmentation

Seun-An Choe, Keon Hee Park, Jinwoo Choi et al.

ICLR 2025arXiv:2501.14211

#14388

When GNNs meet symmetry in ILPs: an orbit-based feature augmentation approach

Qian Chen, Lei Li, Qian Li et al.

NEURIPS 2025oralarXiv:2502.09767

#14389

Non-Markovian Discrete Diffusion with Causal Language Models

Yangtian Zhang, Sizhuang He, Daniel Levine et al.

NEURIPS 2025arXiv:2506.02259

#14390

Stochastically Dominant Peer Prediction

Yichi Zhang, Shengwei Xu, Grant Schoenebeck et al.

ICLR 2025arXiv:2410.02490

#14391

Stochastic variance-reduced Gaussian variational inference on the Bures-Wasserstein manifold

Hoang Phuc Hau Luu, Hanlin Yu, Bernardo Williams et al.

#14392

Training Large Language Models for Retrieval-Augmented Question Answering through Backtracking Correction

Huawen Feng, ZekunYao, Junhao Zheng et al.

NEURIPS 2025arXiv:2509.25217

#14393

Learning to Condition: A Neural Heuristic for Scalable MPE Inference

Brij Malhotra, Shivvrat Arya, Tahrima Rahman et al.

NEURIPS 2025arXiv:2410.20035

#14394

Training the Untrainable: Introducing Inductive Bias via Representational Alignment

Vighnesh Subramaniam, David Mayo, Colin Conwell et al.

#14395

NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks

Chenyi Zhang, Ting Liu, Xiaochao Qu et al.

CVPR 2025highlight

ICCV 2025arXiv:2509.26231

#14396

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Jiayi Guo, Chuanhao Yan, Xingqian Xu et al.

NEURIPS 2025spotlightarXiv:2508.15183

#14397

Private Hyperparameter Tuning with Ex-Post Guarantee

Badih Ghazi, Pritish Kamath, Alexander Knop et al.

ICCV 2025arXiv:2511.06016

#14398

One-Shot Knowledge Transfer for Scalable Person Re-Identification

Longhua Li, Lei Qi, Xin Geng

NEURIPS 2025arXiv:2503.11544

#14399

AugGen: Synthetic Augmentation using Diffusion Models Can Improve Recognition

Parsa Rahimi, Damien Teney, Sébastien Marcel

ICCV 2025arXiv:2504.07827

#14400

HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss

Ke Zhang, Yi Huang, Wei Liu et al.