Most Cited 2025 &quot;stochastic multi-agent systems&quot; Papers

ICLR 2025arXiv:2405.18780

#8402

Certifying Counterfactual Bias in LLMs

Isha Chaudhary, Qian Hu, Manoj Kumar et al.

ICCV 2025arXiv:2503.19065

#8403

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

Zhongyu Yang, Jun Chen, Dannong Xu et al.

ICLR 2025arXiv:2502.01962

#8404

Memory Efficient Transformer Adapter for Dense Predictions

Dong Zhang, Rui Yan, Pingcheng Dong et al.

ICLR 2025arXiv:2410.10174

#8405

Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations

Julius Aka, Johannes Brunnemann, Jörg Eiden et al.

ICCV 2025arXiv:2507.00992

#8406

UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis

Yuanrui Wang, Cong Han, Yafei Li et al.

NEURIPS 2025oralarXiv:2505.13279

#8407

Event-Driven Dynamic Scene Depth Completion

Zhiqiang Yan, Jianhao Jiao, Zhengxue Wang et al.

NEURIPS 2025arXiv:2502.02216

#8408

Flatten Graphs as Sequences: Transformers are Scalable Graph Generators

Dexiong Chen, Markus Krimmel, Karsten Borgwardt

NEURIPS 2025spotlightarXiv:2507.14793

#8409

Flow Equivariant Recurrent Neural Networks

Andy Keller

ICCV 2025arXiv:2503.02304

#8410

A Token-level Text Image Foundation Model for Document Understanding

Tongkun Guan, Zining Wang, Pei Fu et al.

ICCV 2025arXiv:2503.19914

#8411

Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models

Sangwon Baik, Hyeonwoo Kim, Hanbyul Joo

CVPR 2025highlightarXiv:2411.15099

#8412

Context-Aware Multimodal Pretraining

Karsten Roth, Zeynep Akata, Dima Damen et al.

CVPR 2025arXiv:2503.16096

#8413

MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures

Lucas Morin, Valery Weber, Ahmed Nassar et al.

#8414

Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking

Hongkai Wei, YANG YANG, Shijie Sun et al.

#8415

Unity in Diversity: Video Editing via Gradient-Latent Purification

Junyu Gao, Kunlin Yang, Xuan Yao et al.

ICLR 2025arXiv:2410.02116

#8416

Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-training of Deep Networks

Siddharth Joshi, Jiayi Ni, Baharan Mirzasoleiman

NEURIPS 2025spotlightarXiv:2411.04105

#8417

A Implies B: Circuit Analysis in LLMs for Propositional Logical Reasoning

Guan Zhe Hong, Nishanth Dikkala, Enming Luo et al.

NEURIPS 2025arXiv:2505.24550

#8418

A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

Xiaoang Xu, Shuo Wang, Xu Han et al.

CVPR 2025arXiv:2503.12840

#8419

Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics

Chen Liu, Liying Yang, Peike Li et al.

ICCV 2025arXiv:2412.03215

#8420

Beyond [cls]: Exploring the True Potential of Masked Image Modeling Representations

Marcin Przewięźlikowski, Randall Balestriero, Wojciech Jasiński et al.

NEURIPS 2025arXiv:2503.08921

#8421

Revisiting Frank-Wolfe for Structured Nonconvex Optimization

Hoomaan Maskan, Yikun Hou, Suvrit Sra et al.

ICLR 2025arXiv:2503.10081

#8422

AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption

Joonsung Jeon, Woo Jae Kim, Suhyeon Ha et al.

ICLR 2025arXiv:2410.11055

#8423

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only

Jihan Yao, Wenxuan Ding, Shangbin Feng et al.

CVPR 2025arXiv:2504.20468

#8424

Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception

Yuanchen Wu, Lu Zhang, Hang Yao et al.

CVPR 2025arXiv:2503.20282

#8425

Faster Parameter-Efficient Tuning with Token Redundancy Reduction

Kwonyoung Kim, Jungin Park, Jin Kim et al.

CVPR 2025arXiv:2506.03737

#8426

ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices

Hao Yu, Tangyu Jiang, Shuning Jia et al.

ICCV 2025highlightarXiv:2411.15867

#8427

PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs

Teng Zhou, Xiaoyu Zhang, Yongchuan Tang

CVPR 2025arXiv:2411.12817

#8428

What Makes a Good Dataset for Knowledge Distillation?

Logan Frank, Jim Davis

CVPR 2025arXiv:2503.16218

#8429

Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts

Yu Cao, Zengqun Zhao, Ioannis Patras et al.

ICLR 2025arXiv:2407.00075

#8430

Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

Anton Xue, Avishree Khare, Rajeev Alur et al.

NEURIPS 2025arXiv:2505.19949

#8431

Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions

Siqi Kou, Qingyuan Tian, Hanwen Xu et al.

ICLR 2025arXiv:2502.16666

#8432

SBSC: Step-by-Step Coding for Improving Mathematical Olympiad Performance

Kunal Singh, Ankan Biswas, Sayandeep Bhowmick et al.

ICCV 2025arXiv:2504.09039

#8433

Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization

Li, Yang Xiao, Jie Ji et al.

CVPR 2025arXiv:2501.04815

#8434

Towards Generalizable Trajectory Prediction using Dual-Level Representation Learning and Adaptive Prompting

Kaouther Messaoud, Matthieu Cord, Alex Alahi

NEURIPS 2025oralarXiv:2512.08765

#8435

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Ruihang Chu, Yefei He, Zhekai Chen et al.

NEURIPS 2025arXiv:2410.21273

#8436

On Inductive Biases That Enable Generalization in Diffusion Transformers

Jie An, De Wang, Pengsheng Guo et al.

ICCV 2025arXiv:2508.04705

#8437

Occupancy Learning with Spatiotemporal Memory

Ziyang Leng, Jiawei Yang, Wenlong Yi et al.

CVPR 2025arXiv:2503.15851

#8438

Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion

Zhenglin Zhou, Fan Ma, Hehe Fan et al.

ICCV 2025arXiv:2503.09131

#8439

MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration

Zhehui Wu, Yong Chen, Naoto Yokoya et al.

#8440

Radio Frequency Ray Tracing with Neural Object Representation for Enhanced RF Modeling

Xingyu Chen, Zihao Feng, Kun Qian et al.

CVPR 2025arXiv:2512.14542

#8441

HiFi-Portrait: Zero-shot Identity-preserved Portrait Generation with High-fidelity Multi-face Fusion

Yifang Xu, BenXiang Zhai, Yunzhuo Sun et al.

ICCV 2025arXiv:2506.07725

#8442

ETA: Efficiency through Thinking Ahead, A Dual Approach to Self-Driving with Large Models

Shadi Hamdan, Chonghao Sima, Zetong Yang et al.

ICCV 2025arXiv:2412.13195

#8443

CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models

Gaoyang Zhang, Bingtao Fu, Qingnan Fan et al.

CVPR 2025arXiv:2506.08005

#8444

ZeroVO: Visual Odometry with Minimal Assumptions

Lei Lai, Zekai Yin, Eshed Ohn-Bar

CVPR 2025highlightarXiv:2403.11295

#8445

Order-One Rolling Shutter Cameras

Marvin Anas Hahn, Kathlén Kohn, Orlando Marigliano et al.

ICCV 2025highlightarXiv:2502.20158

#8446

Learning to Generalize without Bias for Open-Vocabulary Action Recognition

Yating Yu, Congqi Cao, Yifan Zhang et al.

#8447

Quality over Quantity in Attention Layers: When Adding More Heads Hurts

Noah Amsel, Gilad Yehudai, Joan Bruna

NEURIPS 2025spotlightarXiv:2406.16424

#8448

Memory-Enhanced Neural Solvers for Routing Problems

Felix Chalumeau, Refiloe Shabe, Noah De Nicola et al.

NEURIPS 2025spotlightarXiv:2503.19953

#8449

Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals

Stefan Stojanov, David Wendt, Seungwoo Kim et al.

ICLR 2025arXiv:2502.10288

#8450

Adversarial Mixup Unlearning

Zhuoyi Peng, Yixuan Tang, Yi Yang

NEURIPS 2025arXiv:2505.12003

#8451

Approximation theory for 1-Lipschitz ResNets

Davide Murari, Takashi Furuya, Carola-Bibiane Schönlieb

ICCV 2025arXiv:2406.09105

#8452

INS-MMBench: A Comprehensive Benchmark for Evaluating LVLMs' Performance in Insurance

Chenwei Lin, Hanjia Lyu, Xian Xu et al.

CVPR 2025arXiv:2505.19799

#8453

A Regularization-Guided Equivariant Approach for Image Restoration

Yulu Bai, Jiahong Fu, Qi Xie et al.

CVPR 2025arXiv:2505.11676

#8454

DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation

Ziyu Zhao, Xiaoguang Li, Lingjia Shi et al.

CVPR 2025arXiv:2411.17786

#8455

DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching

Emanuele Aiello, Umberto Michieli, Diego Valsesia et al.

CVPR 2025highlightarXiv:2503.03307

#8456

Full-DoF Egomotion Estimation for Event Cameras Using Geometric Solvers

Ji Zhao, Banglei Guan, Zibin Liu et al.

#8457

Flow-based Variational Mutual Information: Fast and Flexible Approximations

Caleb Dahlke, Jason Pacheco

NEURIPS 2025arXiv:2510.20155

#8458

PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding

Penghao Wang, Yiyang He, Xin Lv et al.

CVPR 2025arXiv:2410.05869

#8459

Believing is Seeing: Unobserved Object Detection using Generative Models

Subhransu S. Bhattacharjee, Dylan Campbell, Rahul Shome

CVPR 2025arXiv:2503.04718

#8460

Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation

David T. Hoffmann, Syed Haseeb Raza, Hanqiu Jiang et al.

ICCV 2025arXiv:2508.09949

#8461

Stable Diffusion Models are Secretly Good at Visual In-Context Learning

Trevine Oorloff, Vishwanath Sindagi, Wele Gedara Chaminda Bandara et al.

#8462

LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion.

Muchen Li, Sammy Christen, Chengde Wan et al.

#8463

GraphMimic: Graph-to-Graphs Generative Modeling from Videos for Policy Learning

Guangyan Chen, Te Cui, Meiling Wang et al.

ICLR 2025arXiv:2502.16593

#8464

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

Yubo Wang, Jianting Tang, Liu et al.

NEURIPS 2025spotlightarXiv:2510.01938

#8465

StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold

Zhizhong Li, Sina Sajadmanesh, Jingtao Li et al.

ICLR 2025arXiv:2502.10704

#8466

Occlusion-aware Non-Rigid Point Cloud Registration via Unsupervised Neural Deformation Correntropy

Mingyang Zhao, Gaofeng Meng, Dong-ming Yan

CVPR 2025arXiv:2411.14797

#8467

Continual SFT Matches Multimodal RLHF with Negative Supervision

Ke Zhu, Yu Wang, Yanpeng Sun et al.

NEURIPS 2025oralarXiv:2502.06309

#8468

Analog In-memory Training on General Non-ideal Resistive Elements: The Impact of Response Functions

Zhaoxian Wu, Quan Xiao, Tayfun Gokmen et al.

ICLR 2025arXiv:2405.16195

#8469

Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement Learning

Théo Vincent, Fabian Wahren, Jan Peters et al.

ICCV 2025arXiv:2412.19089

#8470

Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos

Changwoon Choi, Jeongjun Kim, Geonho Cha et al.

NEURIPS 2025arXiv:2505.21844

#8471

Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation

Mehrdad Noori, David OSOWIECHI, Gustavo Vargas Hakim et al.

ICLR 2025arXiv:2504.08638

#8472

Transformer Learns Optimal Variable Selection in Group-Sparse Classification

Chenyang Zhang, Xuran Meng, Yuan Cao

ICCV 2025arXiv:2507.23782

#8473

MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion

Zihan Wang, Jeff Tan, Tarasha Khurana et al.

ICCV 2025arXiv:2505.11868

#8474

MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos

Hongyi Zhou, Xiaogang Wang, Yulan Guo et al.

NEURIPS 2025arXiv:2412.02542

#8475

Unveiling Concept Attribution in Diffusion Models

Nguyen Hung-Quang, Hoang Phan, Khoa D Doan

CVPR 2025arXiv:2503.12507

#8476

Segment Any-Quality Images with Generative Latent Space Enhancement

Guangqian Guo, Yong Guo, Xuehui Yu et al.

NEURIPS 2025arXiv:2506.05285

#8477

RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion

Bardienus Duisterhof, Jan Oberst, Bowen Wen et al.

ICCV 2025arXiv:2504.10434

#8478

Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing

Taihang Hu, Linxuan Li, Kai Wang et al.

CVPR 2025highlightarXiv:2505.04656

#8479

MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation

Zilong Chen, Yikai Wang, Wenqiang Sun et al.

NEURIPS 2025arXiv:2505.15239

#8480

Neural Collapse is Globally Optimal in Deep Regularized ResNets and Transformers

Peter Súkeník, Christoph Lampert, Marco Mondelli

NEURIPS 2025spotlightarXiv:2503.02878

#8481

Language Models can Self-Improve at State-Value Estimation for Better Search

Ethan Mendes, Alan Ritter

CVPR 2025arXiv:2503.12460

#8482

Exploring Contextual Attribute Density in Referring Expression Counting

Zhicheng Wang, Zhiyu Pan, Zhan Peng et al.

CVPR 2025highlightarXiv:2506.07087

#8483

UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning

Weiqi Yan, Lvhai Chen, Huaijia Kou et al.

ICLR 2025arXiv:2410.16701

#8484

ClimaQA: An Automated Evaluation Framework for Climate Question Answering Models

Veeramakali Vignesh Manivannan, Yasaman Jafari, Srikar Eranky et al.

ICCV 2025highlightarXiv:2507.18678

#8485

Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting

Xingyu Miao, Haoran Duan, Quanhao Qian et al.

CVPR 2025arXiv:2412.18177

#8486

Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization

Sihao Liu, Yibo Yang, Xiaojie Li et al.

CVPR 2025arXiv:2503.06984

#8487

Synchronized Video-to-Audio Generation via Mel Quantization-Continuum Decomposition

Juncheng Wang, Chao Xu, Cheng Yu et al.

CVPR 2025arXiv:2504.06815

#8488

SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering

Hanxiao Sun, Yupeng Gao, Jin Xie et al.

#8489

SP2T: Sparse Proxy Attention for Dual-stream Point Transformer

Jiaxu Wan, Hong Zhang, Ziqi He et al.

ICCV 2025

NEURIPS 2025spotlightarXiv:2511.02276

#8490

Gradient-Variation Online Adaptivity for Accelerated Optimization with Hölder Smoothness

Yuheng Zhao, Yu-Hu Yan, Kfir Y. Levy et al.

ICCV 2025arXiv:2503.10959

#8491

OuroMamba: A Data-Free Quantization Framework for Vision Mamba

Akshat Ramachandran, Mingyu Lee, Huan Xu et al.

NEURIPS 2025arXiv:2506.05754

#8492

Constrained Sampling for Language Models Should Be Easy: An MCMC Perspective

Emmanuel Anaya Gonzalez, Sairam Vaidya, Kanghee Park et al.

NEURIPS 2025arXiv:2506.08388

#8493

Reinforcement Learning Teachers of Test Time Scaling

Edoardo Cetin, Tianyu Zhao, Yujin Tang

CVPR 2025arXiv:2503.18406

#8494

Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning

Sherry X. Chen, Misha Sra, Pradeep Sen

CVPR 2025arXiv:2405.18029

#8495

Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers?

Zebin You, Xinyu Zhang, Hanzhong Guo et al.

NEURIPS 2025arXiv:2504.06792

#8496

Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations

Zican Dong, Han Peng, Peiyu Liu et al.

ICCV 2025arXiv:2507.17402

#8497

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning

Jun Li, Jinpeng Wang, Chaolei Tan et al.

#8498

GS-DiT: Advancing Video Generation with Dynamic 3D Gaussian Fields through Efficient Dense 3D Point Tracking

Weikang Bian, Zhaoyang Huang, Xiaoyu Shi et al.

ICCV 2025arXiv:2503.05543

#8499

Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information

Junbo Zhao, Ting Zhang, Jiayu Sun et al.

#8500

Learning Video-Conditioned Policy on Unlabelled Data with Joint Embedding Predictive Transformer

Hao Luo, Zongqing Lu

CVPR 2025arXiv:2509.26025

#8501

PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution

Shian Du, Menghan Xia, Chang Liu et al.

CVPR 2025arXiv:2503.21777

#8502

Test-Time Visual In-Context Tuning

Jiahao Xie, Alessio Tonioni, Nathalie Rauschmayr et al.

NEURIPS 2025arXiv:2502.11525

#8503

Beyond Single-Task: Robust Multi-Task Length Generalization for LLMs

Yi Hu, Shijia Kang, Haotong Yang et al.

ICLR 2025arXiv:2310.05397

#8504

Enhancing Clustered Federated Learning: Integration of Strategies and Improved Methodologies

Yongxin Guo, Xiaoying Tang, Tao Lin

CVPR 2025arXiv:2411.12089

#8505

FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting

Fangyu Wu, Yuhao Chen

ICLR 2025arXiv:2503.14493

#8506

State Space Model Meets Transformer: A New Paradigm for 3D Object Detection

Chuxin Wang, Wenfei Yang, Xiang Liu et al.

CVPR 2025arXiv:2503.08421

#8507

Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels

Qiming Xia, Wenkai Lin, Haoen Xiang et al.

ICCV 2025arXiv:2408.08524

#8508

GS-ID: Illumination Decomposition on Gaussian Splatting via Adaptive Light Aggregation and Diffusion-Guided Material Priors

Kang DU, Zhihao Liang, Yulin Shen et al.

NEURIPS 2025arXiv:2412.20392

#8509

Defending Multimodal Backdoored Models by Repulsive Visual Prompt Tuning

Zhifang Zhang, Shuo He, Haobo Wang et al.

CVPR 2025arXiv:2411.18711

#8510

Evaluating Vision-Language Models as Evaluators in Path Planning

Mohamed Aghzal, Xiang Yue, Erion Plaku et al.

NEURIPS 2025arXiv:2505.20081

#8511

Inference-time Alignment in Continuous Space

Yige Yuan, Teng Xiao, Li Yunfan et al.

ICLR 2025arXiv:2410.09568

#8512

Second-Order Min-Max Optimization with Lazy Hessians

Lesi Chen, Chengchang Liu, Jingzhao Zhang

CVPR 2025arXiv:2504.02397

#8513

Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval

Boseung Jeong, Jicheol Park, Sungyeon Kim et al.

ICCV 2025arXiv:2508.05402

#8514

DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model

Rui Yu, Xianghang Zhang, Runkai Zhao et al.

NEURIPS 2025arXiv:2510.21542

#8515

HollowFlow: Efficient Sample Likelihood Evaluation using Hollow Message Passing

Johann Flemming Gloy, Simon Olsson

NEURIPS 2025arXiv:2510.06077

#8516

When Thinking Drifts: Evidential Grounding for Robust Video Reasoning

Romy Luo, Zihui (Sherry) Xue, Alex Dimakis et al.

CVPR 2025arXiv:2406.05826

#8517

PSBD: Prediction Shift Uncertainty Unlocks Backdoor Detection

Wei Li, Pin-Yu Chen, Sijia Liu et al.

NEURIPS 2025arXiv:2505.16985

#8518

Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation

Moru Liu, Hao Dong, Jessica Kelly et al.

NEURIPS 2025arXiv:2506.05596

#8519

Zero-shot protein stability prediction by inverse folding models: a free energy interpretation

Jes Frellsen, Maher Kassem, Tone Bengtsen et al.

NEURIPS 2025arXiv:2505.15807

#8520

The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation

Patrick Kahardipraja, Reduan Achtibat, Thomas Wiegand et al.

NEURIPS 2025arXiv:2505.13787

#8521

Preference Learning with Lie Detectors can Induce Honesty or Evasion

Chris Cundy, Adam Gleave

#8522

ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding

LinshuangDiao, Sensen Song, Yurong Qian et al.

NEURIPS 2025

CVPR 2025arXiv:2503.20724

#8523

Dynamic Motion Blending for Versatile Motion Editing

Nan Jiang, Hongjie Li, Ziye Yuan et al.

NEURIPS 2025oralarXiv:2506.09501

#8524

Understanding and Mitigating Numerical Sources of Nondeterminism in LLM Inference

Jiayi Yuan, Hao Li, Xinheng Ding et al.

NEURIPS 2025arXiv:2505.10881

#8525

Prior-Guided Diffusion Planning for Offline Reinforcement Learning

Donghyeon Ki, JunHyeok Oh, Seong-Woong Shim et al.

#8526

Hot-pluggable Federated Learning: Bridging General and Personalized FL via Dynamic Selection

Lei Shen, Zhenheng Tang, Lijun Wu et al.

ICLR 2025arXiv:2410.07571

#8527

How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?

Seongyun Lee, Geewook Kim, Jiyeon Kim et al.

NEURIPS 2025arXiv:2502.07193

#8528

Provably Efficient Online RLHF with One-Pass Reward Modeling

Long-Fei Li, Yu-Yang Qian, Peng Zhao et al.

CVPR 2025arXiv:2307.16375

#8529

UniAP: Unifying Inter- and Intra-Layer Automatic Parallelism by Mixed Integer Quadratic Programming

Hao Lin, Ke Wu, Jie Li et al.

ICCV 2025arXiv:2508.00823

#8530

IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation

Wenxuan Guo, Xiuwei Xu, Hang Yin et al.

NEURIPS 2025oralarXiv:2507.17664

#8531

Talk2Event: Grounded Understanding of Dynamic Scenes from Event Cameras

Lingdong Kong, Dongyue Lu, Alan Liang et al.

NEURIPS 2025spotlightarXiv:2508.12491

#8532

Cost-Aware Contrastive Routing for LLMs

Reza Shirkavand, Shangqian Gao, Peiran Yu et al.

CVPR 2025arXiv:2503.15898

#8533

Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions

Boran Wen, Dingbang Huang, Zichen Zhang et al.

CVPR 2025arXiv:2503.17814

#8534

LightLoc: Learning Outdoor LiDAR Localization at Light Speed

Wen Li, Chen Liu, Shangshu Yu et al.

NEURIPS 2025arXiv:2510.02745

#8535

Retrv-R1: A Reasoning-Driven MLLM Framework for Universal and Efficient Multimodal Retrieval

Lanyun Zhu, Deyi Ji, Tianrun Chen et al.

ICLR 2025arXiv:2408.15495

#8536

Remove Symmetries to Control Model Expressivity and Improve Optimization

Liu Ziyin, Yizhou Xu, Isaac Chuang

NEURIPS 2025oralarXiv:2506.04528

#8537

Hierarchical Implicit Neural Emulators

Ruoxi Jiang, Xiao Zhang, Karan Jakhar et al.

CVPR 2025highlightarXiv:2503.03265

#8538

Optimizing for the Shortest Path in Denoising Diffusion Model

Ping Chen, Xingpeng Zhang, Zhaoxiang Liu et al.

CVPR 2025highlightarXiv:2409.17993

#8539

SSHNet: Unsupervised Cross-modal Homography Estimation via Problem Reformulation and Split Optimization

Junchen Yu, Siyuan Cao, Runmin Zhang et al.

#8540

BiLoRA: Almost-Orthogonal Parameter Spaces for Continual Learning

Hao Zhu, Yifei Zhang, Junhao Dong et al.

CVPR 2025highlightarXiv:2503.15005

#8541

Universal Scene Graph Generation

Shengqiong Wu, Hao Fei, Tat-seng Chua

CVPR 2025arXiv:2503.18338

#8542

SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking

Wenrui Cai, Qingjie Liu, Yunhong Wang

CVPR 2025arXiv:2406.20085

#8543

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Yicheng Chen, Xiangtai Li, Yining Li et al.

#8544

Brain-Informed Fine-Tuning for Improved Multilingual Understanding in Language Models

Anuja Negi, SUBBAREDDY OOTA, Anwar Nunez-Elizalde et al.

NEURIPS 2025

#8545

Affine Steerable Equivariant Layer for Canonicalization of Neural Networks

Yikang Li, Yeqing Qiu, Yuxuan Chen et al.

ICLR 2025arXiv:2409.13949

#8546

Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM

Zheng Wei Lim, Nitish Gupta, Honglin Yu et al.

#8547

Enhanced then Progressive Fusion with View Graph for Multi-View Clustering

Zhibin Dong, Meng Liu, Siwei Wang et al.

ICCV 2025arXiv:2506.22246

#8548

EAMamba: Efficient All-Around Vision State Space Model for Image Restoration

Yu-Cheng Lin, Yu-Syuan Xu, Hao-Wei Chen et al.

CVPR 2025arXiv:2504.08125

#8549

Gen3DEval: Using vLLMs for Automatic Evaluation of Generated 3D Objects

Shalini Maiti, Lourdes Agapito, Filippos Kokkinos

CVPR 2025arXiv:2404.10620

#8550

PyTorchGeoNodes: Enabling Differentiable Shape Programs for 3D Shape Reconstruction

Sinisa Stekovic, Arslan Artykov, Stefan Ainetter et al.

NEURIPS 2025oralarXiv:2405.12179

#8551

PLEIADES: Building Temporal Kernels with Orthogonal Polynomials

Yan Ru Pei, Olivier Coenen

CVPR 2025arXiv:2505.21755

#8552

FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering

Chengyue Huang, Brisa Maneechotesuwan, Shivang Chopra et al.

NEURIPS 2025arXiv:2505.19209

#8553

MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search

Zonglin Yang, Wanhao Liu, Ben Gao et al.

NEURIPS 2025oralarXiv:2509.17955

#8554

Breaking the Discretization Barrier of Continuous Physics Simulation Learning

Fan Xu, Hao Wu, Nan Wang et al.

CVPR 2025arXiv:2503.19897

#8555

Scaling Down Text Encoders of Text-to-Image Diffusion Models

Lifu Wang, Daqing Liu, Xinchen Liu et al.

CVPR 2025arXiv:2506.01037

#8556

Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution

Shijun Shi, Jing Xu, Lijing Lu et al.

ICCV 2025arXiv:2505.05591

#8557

QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization

Yueh-Cheng Liu, Lukas Höllein, Matthias Nießner et al.

#8558

TrustMark: Robust Watermarking and Watermark Removal for Arbitrary Resolution Images

Tu Bui, Shruti Agarwal, John Collomosse

ICCV 2025

CVPR 2025arXiv:2411.11223

#8559

Efficient Transfer Learning for Video-language Foundation Models

Haoxing Chen, Zizheng Huang, Yan Hong et al.

CVPR 2025arXiv:2503.23702

#8560

3D Dental Model Segmentation with Geometrical Boundary Preserving

Shufan Xi, Zexian Liu, Junlin Chang et al.

ICLR 2025arXiv:2411.02275

#8561

Breaking the Reclustering Barrier in Centroid-based Deep Clustering

Lukas Miklautz, Timo Klein, Kevin Sidak et al.

NEURIPS 2025arXiv:2502.00198

#8562

Fairshare Data Pricing via Data Valuation for Large Language Models

Luyang Zhang, Cathy Jiao, Beibei Li et al.

CVPR 2025arXiv:2509.00332

#8563

CryptoFace: End-to-End Encrypted Face Recognition

Wei Ao, Vishnu Naresh Boddeti

CVPR 2025arXiv:2411.02818

#8564

LiVOS: Light Video Object Segmentation with Gated Linear Matching

Qin Liu, Jianfeng Wang, Zhengyuan Yang et al.

ICCV 2025arXiv:2412.19920

#8565

Not all Views are Created Equal: Analyzing Viewpoint Instabilities in Vision Foundation Models

Mateusz Michalkiewicz, Xinyue Bai, Mahsa Baktashmotlagh et al.

NEURIPS 2025oralarXiv:2506.00895

#8566

State-Covering Trajectory Stitching for Diffusion Planners

Kyowoon Lee, Jaesik Choi

CVPR 2025arXiv:2505.05505

#8567

Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation

Yiming Qin, Zhu Xu, Yang Liu

ICLR 2025arXiv:2412.16156

#8568

Personalized Representation from Personalized Generation

Shobhita Sundaram, Julia Chae, Yonglong Tian et al.

NEURIPS 2025spotlightarXiv:2503.19034

#8569

Color Conditional Generation with Sliced Wasserstein Guidance

Alexander Lobashev, Maria Larchenko, Dmitry Guskov

ICLR 2025arXiv:2403.07968

#8570

Do Deep Neural Network Solutions Form a Star Domain?

Ankit Sonthalia, Alexander Rubinstein, Ehsan Abbasnejad et al.

CVPR 2025arXiv:2411.17474

#8571

Probing the Mid-level Vision Capabilities of Self-Supervised Learning

Xuweiyi Chen, Markus Marks, Zezhou Cheng

NEURIPS 2025arXiv:2505.22310

#8572

From Dormant to Deleted: Tamper-Resistant Unlearning Through Weight-Space Regularization

Shoaib Ahmed Siddiqui, Adrian Weller, David Krueger et al.

CVPR 2025highlightarXiv:2503.05936

#8573

CASP: Compression of Large Multimodal Models Based on Attention Sparsity

Mohsen Gholami, Mohammad Akbari, Kevin Cannons et al.

ICLR 2025arXiv:2409.16453

#8574

Extending Mercer's expansion to indefinite and asymmetric kernels

Sungwoo Jeong, Alex Townsend

NEURIPS 2025arXiv:2502.15543

#8575

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

Pengcheng Huang, Zhenghao Liu, Yukun Yan et al.

NEURIPS 2025arXiv:2506.21170

#8576

Compressed and Smooth Latent Space for Text Diffusion Modeling

Viacheslav Meshchaninov, Egor Chimbulatov, Alexander Shabalin et al.

CVPR 2025arXiv:2403.10344

#8577

ViiNeuS: Volumetric Initialization for Implicit Neural Surface Reconstruction of Urban Scenes with Limited Image Overlap

Hala Djeghim, Nathan Piasco, Moussab Bennehar et al.

NEURIPS 2025arXiv:2505.12345

#8578

UniEdit: A Unified Knowledge Editing Benchmark for Large Language Models

Qizhou Chen, Dakan Wang, Taolin Zhang et al.

CVPR 2025arXiv:2503.06277

#8579

STiL: Semi-supervised Tabular-Image Learning for Comprehensive Task-Relevant Information Exploration in Multimodal Classification

Siyi Du, Xinzhe Luo, Declan ORegan et al.

NEURIPS 2025arXiv:2505.15093

#8580

Steering Generative Models with Experimental Data for Protein Fitness Optimization

Jason Yang, Wenda Chu, Daniel Khalil et al.

ICCV 2025arXiv:2411.14951

#8581

Morph: A Motion-free Physics Optimization Framework for Human Motion Generation

Zhuo Li, Mingshuang Luo, RuiBing Hou et al.

#8582

SplineGS: Learning Smooth Trajectories in Gaussian Splatting for Dynamic Scene Reconstruction

Jihwan Yoon, Sangbeom Han, Jaeseok Oh et al.

ICLR 2025oral

ICCV 2025highlightarXiv:2503.15671

#8583

CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image

Arindam Dutta, Meng Zheng, Zhongpai Gao et al.

CVPR 2025arXiv:2412.05161

#8584

DNF: Unconditional 4D Generation with Dictionary-based Neural Fields

Xinyi Zhang, Naiqi Li, Angela Dai

NEURIPS 2025arXiv:2506.13717

#8585

Contrastive Self-Supervised Learning As Neural Manifold Packing

Guanming Zhang, David Heeger, Stefano Martiniani

NEURIPS 2025arXiv:2505.23625

#8586

ZeroSep: Separate Anything in Audio with Zero Training

Chao Huang, Yuesheng Ma, Junxuan Huang et al.

#8587

$\texttt{STRCMP}$: Integrating Graph Structural Priors with Language Models for Combinatorial Optimization

Xijun Li, Jiexiang Yang, Jinghao Wang et al.

NEURIPS 2025

ICCV 2025arXiv:2507.01756

#8588

Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis

Peng Zheng, Junke Wang, Yi Chang et al.

CVPR 2025arXiv:2411.13549

#8589

Generating 3D-Consistent Videos from Unposed Internet Photos

Gene Chou, Kai Zhang, Sai Bi et al.

NEURIPS 2025arXiv:2505.11003

#8590

ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization

Bo Du, Xuekang Zhu, Xiaochen Ma et al.

CVPR 2025highlightarXiv:2503.08306

#8591

Reasoning in Visual Navigation of End-to-end Trained Agents: A Dynamical Systems Approach

Steeven JANNY, Hervé Poirier, Leonid Antsfeld et al.

CVPR 2025arXiv:2502.20981

#8592

Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection

Fuyun Wang, Tong Zhang, Yuanzhi Wang et al.

NEURIPS 2025spotlightarXiv:2508.00643

#8593

Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators

Albert Matveev, Sanmitra Ghosh, Aamal Hussain et al.

NEURIPS 2025arXiv:2509.16450

#8594

On the Existence and Complexity of Core-Stable Data Exchanges

Jiaxin Song, Pooja Kulkarni, Parnian Shahkar et al.

NEURIPS 2025arXiv:2311.01104

#8595

On the Convergence of Projected Policy Gradient for Any Constant Step Sizes

Jiacai Liu, Wenye Li, Dachao Lin et al.

CVPR 2025highlightarXiv:2503.17142

#8596

Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Models

Davide Berasi, Matteo Farina, Massimiliano Mancini et al.

CVPR 2025arXiv:2503.01175

#8597

HOP: Heterogeneous Topology-based Multimodal Entanglement for Co-Speech Gesture Generation

Hongye Cheng, Tianyu Wang, guangsi shi et al.

NEURIPS 2025arXiv:2309.17262

#8598

Estimation and Inference in Distributional Reinforcement Learning

Liangyu Zhang, Yang Peng, Jiadong Liang et al.

ICCV 2025arXiv:2504.06908

#8599

UKBOB: One Billion MRI Labeled Masks for Generalizable 3D Medical Image Segmentation

Emmanuelle Bourigault, Amir Jamaludin, Abdullah Hamdi

NEURIPS 2025arXiv:2505.13731

#8600

GeoRanker: Distance-Aware Ranking for Worldwide Image Geolocalization

Pengyue Jia, Seongheon Park, Song Gao et al.