Most Cited 2025 &quot;hardware robotic control&quot; Papers

NEURIPS 2025arXiv:2506.12323

#12202

Doctor Approved: Generating Medically Accurate Skin Disease Images through AI-Expert Feedback

Janet Wang, Yunbei Zhang, Zhengming Ding et al.

NEURIPS 2025arXiv:2507.05526

#12203

Estimating Interventional Distributions with Uncertain Causal Graphs through Meta-Learning

Anish Dhir, Cristiana Diaconu, Valentinian Lungu et al.

NEURIPS 2025arXiv:2506.06290

#12204

CellCLIP - Learning Perturbation Effects in Cell Painting via Text-Guided Contrastive Learning

MingYu Lu, Ethan Weinberger, Chanwoo Kim et al.

NEURIPS 2025oralarXiv:2502.11806

#12205

Exploring the Translation Mechanism of Large Language Models

Hongbin Zhang, Kehai Chen, Xuefeng Bai et al.

ICCV 2025arXiv:2505.04963

#12206

ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis

Onkar Susladkar, Gayatri Deshmukh, Yalcin Tur et al.

ICCV 2025highlightarXiv:2505.15123

#12207

Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding

Huy Ta, Duy Anh Huynh, Yutong Xie et al.

ICCV 2025arXiv:2503.22668

#12208

Understanding Co-speech Gestures in-the-wild

Sindhu Hegde, K R Prajwal, Taein Kwon et al.

NEURIPS 2025arXiv:2503.21023

#12209

Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework

Thomson Yen, Andrew Siah, Haozhe Chen et al.

NEURIPS 2025arXiv:2506.06305

#12210

Template-Guided 3D Molecular Pose Generation via Flow Matching and Differentiable Optimization

Noémie Bergues, Arthur Carré, Paul Join-Lambert et al.

CVPR 2025highlightarXiv:2505.09393

#12211

UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units

Huakun Liu, Hiroki Ota, Xin Wei et al.

#12212

D2ST-Adapter: Disentangled-and-Deformable Spatio-Temporal Adapter for Few-shot Action Recognition

Wenjie Pei, Qizhong Tan, Guangming Lu et al.

ICCV 2025

ICCV 2025arXiv:2411.10503

#12213

Everything is a Video: Unifying Modalities through Next-Frame Prediction

G Thomas Hudson, Dean Slack, Thomas Winterbottom et al.

NEURIPS 2025arXiv:2510.18215

#12214

The Bias-Variance Tradeoff in Data-Driven Optimization: A Local Misspecification Perspective

Haixiang Lan, Luofeng Liao, Adam N. Elmachtoub et al.

NEURIPS 2025oralarXiv:2502.01218

#12215

Provable Ordering and Continuity in Vision-Language Pretraining for Generalizable Embodied Agents

Zhizhen Zhang, Lei Zhu, Zhen Fang et al.

NEURIPS 2025arXiv:2505.19947

#12216

MESS+: Dynamically Learned Inference-Time LLM Routing in Model Zoos with Service Level Guarantees

Herbert Woisetschläger, Ryan Zhang, Shiqiang Wang et al.

NEURIPS 2025spotlightarXiv:2505.23599

#12217

On Transferring Transferability: Towards a Theory for Size Generalization

Eitan Levin, Yuxin Ma, Mateo Diaz et al.

ICCV 2025arXiv:2507.15686

#12218

LINR-PCGC: Lossless Implicit Neural Representations for Point Cloud Geometry Compression

Wenjie Huang, Qi Yang, Shuting Xia et al.

ICCV 2025arXiv:2510.15022

#12219

LoRAverse: A Submodular Framework to Retrieve Diverse Adapters for Diffusion Models

Mert Sonmezer, Matthew Zheng, Pinar Yanardag

NEURIPS 2025arXiv:2501.16226

#12220

The Effect of Optimal Self-Distillation in Noisy Gaussian Mixture Model

Kaito Takanami, Takashi Takahashi, Ayaka Sakata

ICCV 2025arXiv:2503.06042

#12221

Improving SAM for Camouflaged Object Detection via Dual Stream Adapters

Jiaming Liu, Linghe Kong, Guihai Chen

NEURIPS 2025arXiv:2506.04775

#12222

Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards

Artin Tajdini, Jonathan Scarlett, Kevin Jamieson

NEURIPS 2025arXiv:2510.20295

#12223

Quantifying Distributional Invariance in Causal Subgraph for IRM-Free Graph Generalization

Yang Qiu, Yixiong Zou, Jun Wang et al.

NEURIPS 2025arXiv:2511.02225

#12224

Learning Interactive World Model for Object-Centric Reinforcement Learning

Fan Feng, Phillip Lippe, Sara Magliacane

ICCV 2025arXiv:2503.08512

#12225

SAS: Segment Any 3D Scene with Integrated 2D Priors

Zhuoyuan Li, Jiahao Lu, Jiacheng Deng et al.

ICCV 2025arXiv:2510.11962

#12226

MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics

Bowei Guo, Shengkun Tang, Cong Zeng et al.

ICCV 2025arXiv:2504.04029

#12227

Simultaneous Motion And Noise Estimation with Event Cameras

Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego

ICCV 2025arXiv:2503.16856

#12228

MMCR: Benchmarking Cross-Source Reasoning in Scientific Papers

Yang Tian, Zheng Lu, Mingqi Gao et al.

NEURIPS 2025arXiv:2505.18346

#12229

On the Mechanisms of Weak-to-Strong Generalization: A Theoretical Perspective

Behrad Moniri, Hamed Hassani

ICCV 2025highlightarXiv:2507.18944

#12230

Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation

Guanyi Qin, Ziyue Wang, Daiyun Shen et al.

NEURIPS 2025arXiv:2505.19415

#12231

MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models

Hang Hua, Ziyun Zeng, Yizhi Song et al.

ICCV 2025arXiv:2503.13652

#12232

Web Artifact Attacks Disrupt Vision Language Models

Maan Qraitem, Piotr Teterwak, Kate Saenko et al.

NEURIPS 2025oralarXiv:2505.21796

#12233

A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging

Sajad Khodadadian, Martin Zubeldia

NEURIPS 2025arXiv:2506.01582

#12234

Bayes optimal learning of attention-indexed models

Fabrizio Boncoraglio, Emanuele Troiani, Vittorio Erba et al.

NEURIPS 2025oralarXiv:2510.22860

#12235

Far from the Shallow: Brain-Predictive Reasoning Embedding through Residual Disentanglement

Linyang He, Tianjun Zhong, Richard Antonello et al.

ICCV 2025arXiv:2506.23282

#12236

Autoregressive Denoising Score Matching is a Good Video Anomaly Detector

hanwen Zhang, Congqi Cao, Qinyi Lv et al.

CVPR 2025arXiv:2412.06359

#12237

On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events

Jesse Hagenaars, Yilun Wu, Federico Paredes Valles et al.

#12238

Fast Monte Carlo Tree Diffusion: 100× Speedup via Parallel and Sparse Planning

Jaesik Yoon, Hyeonseo Cho, Yoshua Bengio et al.

#12239

Transformers for Mixed-type Event Sequences

Felix Draxler, Yang Meng, Kai Nelson et al.

NEURIPS 2025oral

NEURIPS 2025arXiv:2510.23111

#12240

Neural Emulator Superiority: When Machine Learning for PDEs Surpasses its Training Data

Felix Koehler, Nils Thuerey

ICCV 2025arXiv:2411.16719

#12241

Learn2Synth: Learning Optimal Data Synthesis Using Hypergradients for Brain Image Segmentation

Xiaoling Hu, Xiangrui Zeng, Oula Puonti et al.

NEURIPS 2025spotlightarXiv:2505.20355

#12242

GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Yeonjoon Jung, Daehyun Ahn, Hyungjun Kim et al.

CVPR 2025arXiv:2506.05175

#12243

Track Any Anomalous Object:A Granular Video Anomaly Detection Pipeline

Yuzhi Huang, Chenxin Li, Haitao Zhang et al.

ICCV 2025arXiv:2503.00429

#12244

DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing

Yang JingYi, Xun Lin, Zitong YU et al.

ICCV 2025arXiv:2507.15504

#12245

Quantifying and Narrowing the Unknown: Interactive Text-to-Video Retrieval via Uncertainty Minimization

Bingqing Zhang, Zhuo Cao, Heming Du et al.

ICCV 2025arXiv:2507.05678

#12246

LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion

Yisu Zhang, Chenjie Cao, Chaohui Yu et al.

ICCV 2025arXiv:2507.08416

#12247

InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes

Zesong Yang, Bangbang Yang, Wenqi Dong et al.

CVPR 2025arXiv:2503.12053

#12248

Ferret: An Efficient Online Continual Learning Framework under Varying Memory Constraints

Yuhao Zhou, Yuxin Tian, Jindi Lv et al.

CVPR 2025arXiv:2505.01008

#12249

Where's the Liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content

Haoyue Bai, Yiyou Sun, Wei Cheng et al.

ICCV 2025highlightarXiv:2505.23617

#12250

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory

Chenhao Zheng, Jieyu Zhang, Mohammadreza Salehi et al.

#12251

Incomplete Multi-modal Brain Tumor Segmentation via Learnable Sorting State Space Model

Zheyu Zhang, Yayuan Lu, Feipeng Ma et al.

NEURIPS 2025arXiv:2505.21124

#12252

UniFoil: A Universal Dataset of Airfoils in Transitional and Turbulent Regimes for Subsonic and Transonic Flows

Rohit Kanchi, Benjamin Melanson, Nithin Somasekharan et al.

CVPR 2025highlightarXiv:2503.00643

#12253

Deep Change Monitoring: A Hyperbolic Representative Learning Framework and a Dataset for Long-term Fine-grained Tree Change Detection

Yante Li, Hanwen Qi, Haoyu Chen et al.

NEURIPS 2025arXiv:2510.22158

#12254

Solving Continuous Mean Field Games: Deep Reinforcement Learning for Non-Stationary Dynamics

Lorenzo Magnino, Kai Shao, Zida Wu et al.

ICCV 2025arXiv:2503.05332

#12255

CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images

Jungho Lee, DongHyeong Kim, Dogyoon Lee et al.

NEURIPS 2025arXiv:2505.21852

#12256

A Provable Approach for End-to-End Safe Reinforcement Learning

Akifumi Wachi, Kohei Miyaguchi, Takumi Tanabe et al.

NEURIPS 2025spotlightarXiv:2511.22640

#12257

Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning

Riccardo De Santi, Marin Vlastelica, Ya-Ping Hsieh et al.

ICCV 2025arXiv:2408.09151

#12258

Timestep-Aware Diffusion Model for Extreme Image Rescaling

Ce Wang, Zhenyu Hu, Wanjie Sun et al.

NEURIPS 2025arXiv:2506.21590

#12259

Representation Consistency for Accurate and Coherent LLM Answer Aggregation

Junqi Jiang, Tom Bewley, Salim I. Amoukou et al.

NEURIPS 2025arXiv:2508.08211

#12260

SAEMark: Steering Personalized Multilingual LLM Watermarks with Sparse Autoencoders

Zhuohao Yu, Xingru Jiang, Weizheng Gu et al.

NEURIPS 2025arXiv:2505.18524

#12261

metaTextGrad: Automatically optimizing language model optimizers

Guowei Xu, Mert Yuksekgonul, Carlos Guestrin et al.

CVPR 2025arXiv:2503.18368

#12262

MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning

Xu Han, Yuan Tang, Jinfeng Xu et al.

NEURIPS 2025arXiv:2511.20273

#12263

Beyond Components: Singular Vector-Based Interpretability of Transformer Circuits

Areeb Ahmad, Abhinav Joshi, Ashutosh Modi

ICCV 2025arXiv:2508.02987

#12264

Adversarial Attention Perturbations for Large Object Detection Transformers

Zachary Yahn, Selim Tekin, Fatih Ilhan et al.

CVPR 2025arXiv:2504.19514

#12265

FSBench: A Figure Skating Benchmark for Advancing Artistic Sports Understanding

Rong Gao, Xin Liu, Zhuozhao Hu et al.

NEURIPS 2025arXiv:2502.02869

#12266

Towards Large-Scale In-Context Reinforcement Learning by Meta-Training in Randomized Worlds

Fan Wang, Pengtao Shao, Yiming Zhang et al.

CVPR 2025arXiv:2412.15396

#12267

Learning Visual Composition through Improved Semantic Guidance

Austin Stone, Hagen Soltau, Robert Geirhos et al.

NEURIPS 2025spotlightarXiv:2503.10799

#12268

Fixed-Point RNNs: Interpolating from Diagonal to Dense

Sajad Movahedi, Felix Sarnthein, Nicola Muca Cirone et al.

NEURIPS 2025arXiv:2511.18890

#12269

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Yonggan Fu, Xin Dong, Shizhe Diao et al.

NEURIPS 2025arXiv:2509.23564

#12270

Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment

Samuel (Min-Hsuan) Yeh, Sharon Li

#12271

JailbreakDiffBench: A Comprehensive Benchmark for Jailbreaking Diffusion Models

Xiaolong Jin, Zixuan Weng, Hanxi Guo et al.

ICCV 2025

NEURIPS 2025arXiv:2506.01374

#12272

REASONING COMPILER: LLM-Guided Optimizations for Efficient Model Serving

Annabelle Sujun Tang, Christopher Priebe, Rohan Mahapatra et al.

CVPR 2025arXiv:2503.18536

#12273

DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels

Erjian Guo, Zhen Zhao, Zicheng Wang et al.

CVPR 2025arXiv:2504.07853

#12274

V2V3D: View-to-View Denoised 3D Reconstruction for Light Field Microscopy

Jiayin Zhao, Zhenqi Fu, Tao Yu et al.

ICCV 2025arXiv:2503.08010

#12275

SKALD: Learning-Based Shot Assembly for Coherent Multi-Shot Video Creation

Chen Yi Lu, Mehrab Tanjim, Ishita Dasgupta et al.

NEURIPS 2025arXiv:2506.17796

#12276

SING: SDE Inference via Natural Gradients

Amber Hu, Henry Smith, Scott Linderman

NEURIPS 2025arXiv:2510.17313

#12277

Disentanglement Beyond Static vs. Dynamic: A Benchmark and Evaluation Framework for Multi-Factor Sequential Representations

Tal Barami, Nimrod Berman, Ilan Naiman et al.

NEURIPS 2025arXiv:2508.14927

#12278

AI Testing Should Account for Sophisticated Strategic Behaviour

Vojta Kovarik, Eric Chen, Sami Petersen et al.

CVPR 2025highlightarXiv:2503.01261

#12279

Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text

Guotao liang, Baoquan Zhang, Zhiyuan Wen et al.

NEURIPS 2025oralarXiv:2502.05295

#12280

GST-UNet: A Neural Framework for Spatiotemporal Causal Inference with Time-Varying Confounding

Miruna Oprescu, David Park, Xihaier Luo et al.

NEURIPS 2025spotlightarXiv:2504.15471

#12281

Bigram Subnetworks: Mapping to Next Tokens in Transformer Language Models

Tyler Chang, Benjamin Bergen

NEURIPS 2025arXiv:2505.22913

#12282

MUSTAFAR: Promoting Unstructured Sparsity for KV Cache Pruning in LLM Inference

Donghyeon Joo, Helya Hosseini, Ramyad Hadidi et al.

NEURIPS 2025arXiv:2510.11473

#12283

VA-GS: Enhancing the Geometric Representation of Gaussian Splatting via View Alignment

Qing Li, Huifang Feng, Xun Gong et al.

ICCV 2025arXiv:2508.00599

#12284

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Junzhe Lu, Jing Lin, Hongkun Dou et al.

CVPR 2025arXiv:2412.18355

#12285

Handling Spatial-Temporal Data Heterogeneity for Federated Continual Learning via Tail Anchor

Hao Yu, Xin Yang, Le Zhang et al.

NEURIPS 2025oralarXiv:2512.12461

#12286

Cross-Modal Representational Knowledge Distillation for Enhanced Spike-informed LFP Modeling

Eray Erturk, Saba Hashemi, Maryam Shanechi

CVPR 2025arXiv:2409.19601

#12287

Infighting in the Dark: Multi-Label Backdoor Attack in Federated Learning

Ye Li, Yanchao Zhao, chengcheng zhu et al.

ICCV 2025arXiv:2505.23186

#12288

HiGarment: Cross-modal Harmony Based Diffusion Model for Flat Sketch to Realistic Garment Image

Junyi Guo, Jingxuan Zhang, Fangyu Wu et al.

#12289

Robo2VLM: Improving Visual Question Answering using Large-Scale Robot Manipulation Data

Kaiyuan Eric Chen, Shuangyu Xie, Zehan Ma et al.

NEURIPS 2025arXiv:2506.06318

#12290

MoE-Gyro: Self-Supervised Over-Range Reconstruction and Denoising for MEMS Gyroscopes

Feiyang Pan, Shenghe Zheng, Chunyan Yin et al.

NEURIPS 2025oralarXiv:2506.22712

#12291

Generalized Linear Mode Connectivity for Transformers

Alexander Theus, Alessandro Cabodi, Sotiris Anagnostidis et al.

#12292

SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled Synthesis

Bangbang Zhou, Zuan Gao, Zixiao Wang et al.

NEURIPS 2025arXiv:2506.13688

#12293

What Happens During the Loss Plateau? Understanding Abrupt Learning in Transformers

Pulkit Gopalani, Wei Hu

NEURIPS 2025spotlightarXiv:2507.00361

#12294

Affine-Invariant Global Non-Asymptotic Convergence Analysis of BFGS under Self-Concordance

Qiujiang Jin, Aryan Mokhtari

ICCV 2025highlightarXiv:2504.06385

#12295

Fast Globally Optimal and Geometrically Consistent 3D Shape Matching

Paul Roetzer, Florian Bernard

#12296

Beyond Human Perception: Understanding Multi-Object World from Monocular View

Keyu Guo, Yongle Huang, Shijie Sun et al.

NEURIPS 2025oralarXiv:2505.21236

#12297

Breaking the Performance Ceiling in Reinforcement Learning requires Inference Strategies

Felix Chalumeau, Daniel Rajaonarivonivelomanantsoa, Ruan John de Kock et al.

NEURIPS 2025arXiv:2507.03707

#12298

CosmoBench: A Multiscale, Multiview, Multitask Cosmology Benchmark for Geometric Deep Learning

Teresa Huang, Richard Stiskalek, Jun-Young Lee et al.

NEURIPS 2025arXiv:2510.01243

#12299

Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing

Yisong Xiao, Aishan Liu, Siyuan Liang et al.

NEURIPS 2025spotlightarXiv:2505.12944

#12300

CALM-PDE: Continuous and Adaptive Convolutions for Latent Space Modeling of Time-dependent PDEs

Jan Hagnberger, Daniel Musekamp, Mathias Niepert

NEURIPS 2025arXiv:2506.12025

#12301

Unsupervised Learning for Optimal Transport plan prediction between unbalanced graphs

Sonia Mazelet, Rémi Flamary, Bertrand Thirion

NEURIPS 2025arXiv:2506.01599

#12302

Connecting Neural Models Latent Geometries with Relative Geodesic Representations

Hanlin Yu, Berfin Inal, Georgios Arvanitidis et al.

NEURIPS 2025oralarXiv:2412.06966

#12303

Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research

A. Feder Cooper, Christopher A. Choquette-Choo, Miranda Bogen et al.

#12304

BOE-ViT: Boosting Orientation Estimation with Equivariance in Self-Supervised 3D Subtomogram Alignment

Runmin Jiang, Jackson Daggett, Shriya Pingulkar et al.

CVPR 2025arXiv:2503.22984

#12305

Optimal Transport-Guided Source-Free Adaptation for Face Anti-Spoofing

Zhuowei Li, Tianchen Zhao, Xiang Xu et al.

NEURIPS 2025arXiv:2411.03270

#12306

Stable Matching with Ties: Approximation Ratios and Learning

Shiyun Lin, Simon Mauras, Nadav Merlis et al.

NEURIPS 2025arXiv:2510.13887

#12307

Incomplete Multi-view Clustering via Hierarchical Semantic Alignment and Cooperative Completion

Xiaojian Ding, Lin Zhao, Xian Li et al.

NEURIPS 2025oralarXiv:2505.23155

#12308

PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling

Xiao Yu, Yan Fang, Yao Zhao et al.

NEURIPS 2025arXiv:2509.15472

#12309

Efficient Multimodal Dataset Distillation via Generative Models

Zhenghao Zhao, Haoxuan Wang, Junyi Wu et al.

NEURIPS 2025arXiv:2505.13515

#12310

LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades

Yanan Li, Fanxu Meng, Muhan Zhang et al.

NEURIPS 2025arXiv:2507.05478

#12311

Dynamic Regret Reduces to Kernelized Static Regret

Andrew Jacobsen, Alessandro Rudi, Francesco Orabona et al.

#12312

GBC-Splat: Generalizable Gaussian-Based Clothed Human Digitalization under Sparse RGB Cameras

Hanzhang Tu, Zhanfeng Liao, Boyao Zhou et al.

CVPR 2025arXiv:2504.02168

#12313

MDP: Multidimensional Vision Model Pruning with Latency Constraint

Xinglong Sun, Barath Lakshmanan, Maying Shen et al.

#12314

DH-Set: Improving Vision-Language Alignment with Diverse and Hybrid Set-Embeddings Learning

Kun Zhang, Jingyu Li, Zhe Li et al.

ICCV 2025arXiv:2507.12933

#12315

DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization

Dongyeun Lee, jiwan hur, Hyounguk Shon et al.

NEURIPS 2025arXiv:2506.14652

#12316

Rigor in AI: Doing Rigorous AI Work Requires a Broader, Responsible AI-Informed Conception of Rigor

Alexandra Olteanu, Su Lin Blodgett, Agathe Balayn et al.

CVPR 2025arXiv:2505.08255

#12317

Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted

Shuaiwei Yuan, Junyu Dong, Yuezun Li

NEURIPS 2025arXiv:2507.03298

#12318

Dyn-O: Building Structured World Models with Object-Centric Representations

Zizhao Wang, Kaixin Wang, Li Zhao et al.

#12319

Hyperbolic Uncertainty-Aware Few-Shot Incremental Point Cloud Segmentation

Tanuj Sur, Samrat Mukherjee, Kaizer Rahaman et al.

NEURIPS 2025arXiv:2509.09802

#12320

Sparse Polyak: an adaptive step size rule for high-dimensional M-estimation

Tianqi Qiao, Marie Maros

NEURIPS 2025arXiv:2407.16139

#12321

Tackling Feature-Classifier Mismatch in Federated Learning via Prompt-Driven Feature Transformation

Xinghao Wu, Xuefeng Liu, Jianwei Niu et al.

NEURIPS 2025arXiv:2412.06740

#12322

Convolution Goes Higher-Order: A Biologically Inspired Mechanism Empowers Image Classification

Simone Azeglio, Olivier Marre, Peter Neri et al.

CVPR 2025arXiv:2503.08422

#12323

JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data

Runjian Chen, Wenqi Shao, Bo Zhang et al.

NEURIPS 2025arXiv:2504.06426

#12324

S'MoRE: Structural Mixture of Residual Experts for Parameter-Efficient LLM Fine-tuning

Hanqing Zeng, Yinglong Xia, Zhuokai Zhao et al.

NEURIPS 2025arXiv:2511.07250

#12325

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Tianhao Peng, Haochen Wang, Yuanxing Zhang et al.

ICCV 2025highlightarXiv:2510.15749

#12326

SEGA: A Stepwise Evolution Paradigm for Content-Aware Layout Generation with Design Prior

Bo Zhao, Haoran Wang, Jinghui Wang et al.

NEURIPS 2025arXiv:2506.20879

#12327

MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans

Shubhankar Borse, Seokeon Choi, Sunghyun Park et al.

NEURIPS 2025oralarXiv:2510.25207

#12328

Selective Learning for Deep Time Series Forecasting

Yisong Fu, Zezhi Shao, Chengqing Yu et al.

NEURIPS 2025arXiv:2510.08748

#12329

Conformal Risk Training: End-to-End Optimization of Conformal Risk Control

Christopher Yeh, Nicolas Christianson, Adam Wierman et al.

CVPR 2025arXiv:2503.00861

#12330

Zero-Shot Head Swapping in Real-World Scenarios

Sohyun Jeong, Taewoong Kang, Hyojin Jang et al.

#12331

Perceptual Video Compression with Neural Wrapping

Muhammad Umar Karim Khan, Aaron Chadha, Mohammad Ashraful Anam et al.

ICCV 2025arXiv:2506.02751

#12332

RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS

Chuanyu Fu, Yuqi Zhang, Kunbin Yao et al.

ICCV 2025arXiv:2504.01386

#12333

DALIP: Distribution Alignment-based Language-Image Pre-Training for Domain-Specific Data

Junjie Wu, Jiangtao Xie, Zhaolin Zhang et al.

ICCV 2025arXiv:2506.22907

#12334

MagShield: Towards Better Robustness in Sparse Inertial Motion Capture Under Magnetic Disturbances

Yunzhe Shao, Xinyu Yi, Lu Yin et al.

NEURIPS 2025arXiv:2508.11330

#12335

Noise Matters: Optimizing Matching Noise for Diffusion Classifiers

Yanghao Wang, Long Chen

NEURIPS 2025arXiv:2407.01344

#12336

Distributionally Robust Performative Optimization

Zhuangzhuang Jia, Yijie Wang, Roy Dong et al.

#12337

Improving Diffusion-based Inverse Algorithms under Few-Step Constraint via Linear Extrapolation

Jiawei Zhang, Ziyuan Liu, Leon Yan et al.

NEURIPS 2025

ICCV 2025arXiv:2405.14715

#12338

Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models

Young Kyun Jang, Ser-Nam Lim

NEURIPS 2025arXiv:2510.06699

#12339

A Diffusion Model for Regular Time Series Generation from Irregular Data with Completion and Masking

Gal Fadlon, Idan Arbiv, Nimrod Berman et al.

NEURIPS 2025arXiv:2412.00744

#12340

Open-World Drone Active Tracking with Goal-Centered Rewards

Haowei Sun, Jinwu Hu, Zhirui Zhang et al.

ICCV 2025arXiv:2504.12753

#12341

Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation

Siyu Chen, Ting Han, Changshe Zhang et al.

NEURIPS 2025arXiv:2506.07413

#12342

Variational Supervised Contrastive Learning

Ziwen Wang, Jiajun Fan, Thao Nguyen et al.

ICCV 2025arXiv:2510.18437

#12343

Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection

Ji Du, Xin WANG, Fangwei Hao et al.

NEURIPS 2025arXiv:2511.05245

#12344

ADPretrain: Advancing Industrial Anomaly Detection via Anomaly Representation Pretraining

Xincheng Yao, Yan Luo, Zefeng Qian et al.

NEURIPS 2025arXiv:2505.19742

#12345

HAODiff: Human-Aware One-Step Diffusion via Dual-Prompt Guidance

JUE GONG, Tingyu Yang, Jingkai Wang et al.

NEURIPS 2025spotlightarXiv:2510.19953

#12346

On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization

Shaocong Ma, Heng Huang

NEURIPS 2025oralarXiv:2505.21426

#12347

Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks

Francesco Cozzi, Marco Pangallo, Alan Perotti et al.

NEURIPS 2025arXiv:2505.16947

#12348

MixAT: Combining Continuous and Discrete Adversarial Training for LLMs

Csaba Dékány, Stefan Balauca, Dimitar I. Dimitrov et al.

CVPR 2025arXiv:2412.02635

#12349

MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis

Tianyu Wang, Jianming Zhang, Haitian Zheng et al.

ICCV 2025arXiv:2512.03508

#12350

Exploiting Domain Properties in Language-Driven Domain Generalization for Semantic Segmentation

Seogkyu Jeon, Kibeom Hong, Hyeran Byun

NEURIPS 2025arXiv:2506.09684

#12351

Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models

Haoyi Song, Ruihan Ji, Naichen Shi et al.

CVPR 2025arXiv:2411.16129

#12352

Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion

Jongseong Bae, Junwoo Ha, Ha Young Kim

NEURIPS 2025arXiv:2502.11583

#12353

Distributional Autoencoders Know the Score

Andrej Leban

NEURIPS 2025arXiv:2506.00846

#12354

Infinite-Width Limit of a Single Attention Layer: Analysis via Tensor Programs

Mana Sakai, Ryo Karakida, Masaaki Imaizumi

#12355

Deep Fair Multi-View Clustering with Attention KAN

HaiMing Xu, Qianqian Wang, Boyue Wang et al.

CVPR 2025highlight

NEURIPS 2025arXiv:2506.02846

#12356

PBR-SR: Mesh PBR Texture Super Resolution from 2D Image Priors

Yujin Chen, Yinyu Nie, Benjamin Ummenhofer et al.

ICCV 2025arXiv:2504.13178

#12357

Aligning Constraint Generation with Design Intent in Parametric CAD

Evan Casey, Tianyu Zhang, Shu Ishida et al.

#12358

Distinguish Then Exploit: Source-free Open Set Domain Adaptation via Weight Barcode Estimation and Sparse Label Assignment

Weiming Liu, Jun Dan, Fan Wang et al.

ICCV 2025arXiv:2506.21541

#12359

StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning

Chuxin Wang, Yixin Zha, Wenfei Yang et al.

NEURIPS 2025arXiv:2503.10679

#12360

LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss

Pau Rodriguez, Michal Klein, Eleonora Gualdoni et al.

ICCV 2025arXiv:2503.12764

#12361

Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion

Yidi Liu, Dong Li, Yuxin Ma et al.

ICCV 2025arXiv:2505.20469

#12362

CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting

Lei Tian, Xiaomin Li, Liqian Ma et al.

NEURIPS 2025spotlightarXiv:2502.02513

#12363

Diffusion Generative Modeling on Lie Group Representations

Marco Bertolini, Tuan Le, Djork-Arné Clevert

NEURIPS 2025arXiv:2502.14709

#12364

Group-Level Data Selection for Efficient Pretraining

Zichun Yu, Fei Peng, Jie Lei et al.

NEURIPS 2025oralarXiv:2509.08502

#12365

Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening

Piyush Nitin Bagad, Andrew Zisserman

#12366

The Generative Leap: Tight Sample Complexity for Efficiently Learning Gaussian Multi-Index Models

Alex Damian, Jason Lee, Joan Bruna

NEURIPS 2025oralarXiv:2506.20024

#12367

Elucidated Rolling Diffusion Models for Probabilistic Forecasting of Complex Dynamics

Salva Rühling Cachay, Miika Aittala, Karsten Kreis et al.

ICCV 2025arXiv:2507.02581

#12368

Structure-aware Semantic Discrepancy and Consistency for 3D Medical Image Self-supervised Learning

Tan Pan, Zhaorui Tan, Kaiyu Guo et al.

NEURIPS 2025arXiv:2510.21363

#12369

FairImagen: Post-Processing for Bias Mitigation in Text-to-Image Models

Zihao Fu, Ryan Brown, Shun Shao et al.

CVPR 2025arXiv:2506.07643

#12370

Synthetic Visual Genome

Jae Sung Park, Zixian Ma, Linjie Li et al.

NEURIPS 2025oralarXiv:2506.00329

#12371

Foresight: Adaptive Layer Reuse for Accelerated and High-Quality Text-to-Video Generation

Muhammad Adnan, Nithesh Kurella, Akhil Arunkumar et al.

ICCV 2025arXiv:2503.12720

#12372

Towards Open-World Generation of Stereo Images and Unsupervised Matching

Feng Qiao, Zhexiao Xiong, Eric Xing et al.

NEURIPS 2025arXiv:2509.21359

#12373

Influence Guided Context Selection for Effective Retrieval-Augmented Generation

Jiale Deng, Yanyan Shen, Ziyuan Pei et al.

CVPR 2025arXiv:2504.11295

#12374

Autoregressive Distillation of Diffusion Transformers

Yeongmin Kim, Sotiris Anagnostidis, Yuming Du et al.

NEURIPS 2025arXiv:2505.11081

#12375

ShiQ: Bringing back Bellman to LLMs

Pierre Clavier, Nathan Grinsztajn, Raphaël Avalos et al.

NEURIPS 2025arXiv:2502.03198

#12376

SimSort: A Data-Driven Framework for Spike Sorting by Large-Scale Electrophysiology Simulation

Yimu Zhang, Dongqi Han, Yansen Wang et al.

CVPR 2025arXiv:2503.03519

#12377

Do ImageNet-trained Models Learn Shortcuts? The Impact of Frequency Shortcuts on Generalization

Shunxin Wang, Raymond Veldhuis, Nicola Strisciuglio

CVPR 2025highlightarXiv:2503.01214

#12378

One-Step Event-Driven High-Speed Autofocus

Yuhan Bao, Shaohua Gao, Wenyong Li et al.

CVPR 2025arXiv:2411.10685

#12379

From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling

Jinhong Lin, Cheng-En Wu, Huanran Li et al.

#12380

Overcoming Shortcut Problem in VLM for Robust Out-of-Distribution Detection

Zhuo Xu, Xiang Xiang, Yifan Liang

CVPR 2025highlight

CVPR 2025arXiv:2503.23606

#12381

Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries

Wei Xu, Charlie Wagner, Junjie Luo et al.

ICCV 2025arXiv:2507.16213

#12382

Advancing Visual Large Language Model for Multi-granular Versatile Perception

Wentao Xiang, Haoxian Tan, Cong Wei et al.

ICCV 2025highlightarXiv:2506.23151

#12383

MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation

Vladislav Bargatin, Egor Chistov, Alexander Yakovenko et al.

CVPR 2025highlightarXiv:2503.18578

#12384

Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding

Tianyu Chen, Xingcheng Fu, Yisen Gao et al.

CVPR 2025arXiv:2506.15720

#12385

Tripartite Weight-Space Ensemble for Few-Shot Class-Incremental Learning

Juntae Lee, Munawar Hayat, Sungrack Yun

CVPR 2025arXiv:2410.06241

#12386

ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way

Jiazi Bu, Pengyang Ling, Pan Zhang et al.

ICCV 2025highlightarXiv:2508.00558

#12387

Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints

Jens U. Kreber, Joerg Stueckler

#12388

Feedback-Aware MCTS for Goal-Oriented Information Seeking

Harshita Chopra, Chirag Shah

NEURIPS 2025spotlightarXiv:2510.03163

#12389

ROGR: Relightable 3D Objects using Generative Relighting

Jiapeng Tang, Matthew Levine, Dor Verbin et al.

NEURIPS 2025arXiv:2505.17083

#12390

Scale-invariant attention

Ben Anson, Xi Wang, Laurence Aitchison

CVPR 2025arXiv:2503.06746

#12391

Color Alignment in Diffusion

Ka Chun SHUM, Binh-Son Hua, Thanh Nguyen et al.

ICCV 2025arXiv:2411.09145

#12392

Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos

Chengbo Yuan, Geng Chen, Li Yi et al.

NEURIPS 2025arXiv:2510.02833

#12393

Attack via Overfitting: 10-shot Benign Fine-tuning to Jailbreak LLMs

Zhixin Xie, Xurui Song, Jun Luo

ICCV 2025arXiv:2503.20349

#12394

Consistency Trajectory Matching for One-Step Generative Super-Resolution

Weiyi You, Mingyang Zhang, Leheng Zhang et al.

NEURIPS 2025arXiv:2510.26795

#12395

Scaling Image Geo-Localization to Continent Level

Philipp Lindenberger, Paul-Edouard Sarlin, Jan Hosang et al.

NEURIPS 2025arXiv:2506.16895

#12396

With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You

Fabian Gröger, Shuo Wen, Huyen Le et al.

NEURIPS 2025arXiv:2506.08249

#12397

RADAR: Benchmarking Language Models on Imperfect Tabular Data

Ken Gu, Zhihan Zhang, Kate Lin et al.

NEURIPS 2025arXiv:2410.08868

#12398

On the Convergence of Single-Timescale Actor-Critic

Navdeep Kumar, Priyank Agrawal, Giorgia Ramponi et al.

ICCV 2025arXiv:2504.03501

#12399

LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders

Ilan Naiman, Emanuel Baruch Baruch, Oron Anschel et al.

#12400

Seek Common Ground While Reserving Differences: Semi-Supervised Image-Text Sentiment Recognition

Wuyou Xia, Guoli Jia, Sicheng Zhao et al.