Most Cited 2025 "stochastic multi-agent systems" Papers

22,274 papers found • Page 43 of 112

#8401

Computational Algebra with Attention: Transformer Oracles for Border Basis Algorithms

Hiroshi Kera, Nico Pelleriti, Yuki Ishihara et al.

NEURIPS 2025arXiv:2505.23696
4
citations
#8402

Certifying Counterfactual Bias in LLMs

Isha Chaudhary, Qian Hu, Manoj Kumar et al.

ICLR 2025arXiv:2405.18780
4
citations
#8403

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

Zhongyu Yang, Jun Chen, Dannong Xu et al.

ICCV 2025arXiv:2503.19065
4
citations
#8404

Memory Efficient Transformer Adapter for Dense Predictions

Dong Zhang, Rui Yan, Pingcheng Dong et al.

ICLR 2025arXiv:2502.01962
4
citations
#8405

Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations

Julius Aka, Johannes Brunnemann, Jörg Eiden et al.

ICLR 2025arXiv:2410.10174
4
citations
#8406

UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis

Yuanrui Wang, Cong Han, Yafei Li et al.

ICCV 2025arXiv:2507.00992
4
citations
#8407

Event-Driven Dynamic Scene Depth Completion

Zhiqiang Yan, Jianhao Jiao, Zhengxue Wang et al.

NEURIPS 2025oralarXiv:2505.13279
4
citations
#8408

Flatten Graphs as Sequences: Transformers are Scalable Graph Generators

Dexiong Chen, Markus Krimmel, Karsten Borgwardt

NEURIPS 2025arXiv:2502.02216
4
citations
#8409

Flow Equivariant Recurrent Neural Networks

Andy Keller

NEURIPS 2025spotlightarXiv:2507.14793
4
citations
#8410

A Token-level Text Image Foundation Model for Document Understanding

Tongkun Guan, Zining Wang, Pei Fu et al.

ICCV 2025arXiv:2503.02304
4
citations
#8411

Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models

Sangwon Baik, Hyeonwoo Kim, Hanbyul Joo

ICCV 2025arXiv:2503.19914
4
citations
#8412

Context-Aware Multimodal Pretraining

Karsten Roth, Zeynep Akata, Dima Damen et al.

CVPR 2025highlightarXiv:2411.15099
4
citations
#8413

MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures

Lucas Morin, Valery Weber, Ahmed Nassar et al.

CVPR 2025arXiv:2503.16096
4
citations
#8414

Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking

Hongkai Wei, YANG YANG, Shijie Sun et al.

CVPR 2025
4
citations
#8415

Unity in Diversity: Video Editing via Gradient-Latent Purification

Junyu Gao, Kunlin Yang, Xuan Yao et al.

CVPR 2025
4
citations
#8416

Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-training of Deep Networks

Siddharth Joshi, Jiayi Ni, Baharan Mirzasoleiman

ICLR 2025arXiv:2410.02116
4
citations
#8417

A Implies B: Circuit Analysis in LLMs for Propositional Logical Reasoning

Guan Zhe Hong, Nishanth Dikkala, Enming Luo et al.

NEURIPS 2025spotlightarXiv:2411.04105
4
citations
#8418

A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

Xiaoang Xu, Shuo Wang, Xu Han et al.

NEURIPS 2025arXiv:2505.24550
4
citations
#8419

Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics

Chen Liu, Liying Yang, Peike Li et al.

CVPR 2025arXiv:2503.12840
4
citations
#8420

Beyond [cls]: Exploring the True Potential of Masked Image Modeling Representations

Marcin Przewięźlikowski, Randall Balestriero, Wojciech Jasiński et al.

ICCV 2025arXiv:2412.03215
4
citations
#8421

Revisiting Frank-Wolfe for Structured Nonconvex Optimization

Hoomaan Maskan, Yikun Hou, Suvrit Sra et al.

NEURIPS 2025arXiv:2503.08921
4
citations
#8422

AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption

Joonsung Jeon, Woo Jae Kim, Suhyeon Ha et al.

ICLR 2025arXiv:2503.10081
4
citations
#8423

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only

Jihan Yao, Wenxuan Ding, Shangbin Feng et al.

ICLR 2025arXiv:2410.11055
4
citations
#8424

Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception

Yuanchen Wu, Lu Zhang, Hang Yao et al.

CVPR 2025arXiv:2504.20468
4
citations
#8425

Faster Parameter-Efficient Tuning with Token Redundancy Reduction

Kwonyoung Kim, Jungin Park, Jin Kim et al.

CVPR 2025arXiv:2503.20282
4
citations
#8426

ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices

Hao Yu, Tangyu Jiang, Shuning Jia et al.

CVPR 2025arXiv:2506.03737
4
citations
#8427

PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs

Teng Zhou, Xiaoyu Zhang, Yongchuan Tang

ICCV 2025highlightarXiv:2411.15867
4
citations
#8428

What Makes a Good Dataset for Knowledge Distillation?

Logan Frank, Jim Davis

CVPR 2025arXiv:2411.12817
4
citations
#8429

Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts

Yu Cao, Zengqun Zhao, Ioannis Patras et al.

CVPR 2025arXiv:2503.16218
4
citations
#8430

Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

Anton Xue, Avishree Khare, Rajeev Alur et al.

ICLR 2025arXiv:2407.00075
4
citations
#8431

Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions

Siqi Kou, Qingyuan Tian, Hanwen Xu et al.

NEURIPS 2025arXiv:2505.19949
4
citations
#8432

SBSC: Step-by-Step Coding for Improving Mathematical Olympiad Performance

Kunal Singh, Ankan Biswas, Sayandeep Bhowmick et al.

ICLR 2025arXiv:2502.16666
4
citations
#8433

Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization

Li, Yang Xiao, Jie Ji et al.

ICCV 2025arXiv:2504.09039
4
citations
#8434

Towards Generalizable Trajectory Prediction using Dual-Level Representation Learning and Adaptive Prompting

Kaouther Messaoud, Matthieu Cord, Alex Alahi

CVPR 2025arXiv:2501.04815
4
citations
#8435

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Ruihang Chu, Yefei He, Zhekai Chen et al.

NEURIPS 2025oralarXiv:2512.08765
4
citations
#8436

On Inductive Biases That Enable Generalization in Diffusion Transformers

Jie An, De Wang, Pengsheng Guo et al.

NEURIPS 2025arXiv:2410.21273
4
citations
#8437

Occupancy Learning with Spatiotemporal Memory

Ziyang Leng, Jiawei Yang, Wenlong Yi et al.

ICCV 2025arXiv:2508.04705
4
citations
#8438

Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion

Zhenglin Zhou, Fan Ma, Hehe Fan et al.

CVPR 2025arXiv:2503.15851
4
citations
#8439

MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration

Zhehui Wu, Yong Chen, Naoto Yokoya et al.

ICCV 2025arXiv:2503.09131
4
citations
#8440

Radio Frequency Ray Tracing with Neural Object Representation for Enhanced RF Modeling

Xingyu Chen, Zihao Feng, Kun Qian et al.

CVPR 2025
4
citations
#8441

HiFi-Portrait: Zero-shot Identity-preserved Portrait Generation with High-fidelity Multi-face Fusion

Yifang Xu, BenXiang Zhai, Yunzhuo Sun et al.

CVPR 2025arXiv:2512.14542
4
citations
#8442

ETA: Efficiency through Thinking Ahead, A Dual Approach to Self-Driving with Large Models

Shadi Hamdan, Chonghao Sima, Zetong Yang et al.

ICCV 2025arXiv:2506.07725
4
citations
#8443

CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models

Gaoyang Zhang, Bingtao Fu, Qingnan Fan et al.

ICCV 2025arXiv:2412.13195
4
citations
#8444

ZeroVO: Visual Odometry with Minimal Assumptions

Lei Lai, Zekai Yin, Eshed Ohn-Bar

CVPR 2025arXiv:2506.08005
4
citations
#8445

Order-One Rolling Shutter Cameras

Marvin Anas Hahn, Kathlén Kohn, Orlando Marigliano et al.

CVPR 2025highlightarXiv:2403.11295
4
citations
#8446

Learning to Generalize without Bias for Open-Vocabulary Action Recognition

Yating Yu, Congqi Cao, Yifan Zhang et al.

ICCV 2025highlightarXiv:2502.20158
4
citations
#8447

Quality over Quantity in Attention Layers: When Adding More Heads Hurts

Noah Amsel, Gilad Yehudai, Joan Bruna

ICLR 2025
4
citations
#8448

Memory-Enhanced Neural Solvers for Routing Problems

Felix Chalumeau, Refiloe Shabe, Noah De Nicola et al.

NEURIPS 2025spotlightarXiv:2406.16424
4
citations
#8449

Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals

Stefan Stojanov, David Wendt, Seungwoo Kim et al.

NEURIPS 2025spotlightarXiv:2503.19953
4
citations
#8450

Adversarial Mixup Unlearning

Zhuoyi Peng, Yixuan Tang, Yi Yang

ICLR 2025arXiv:2502.10288
4
citations
#8451

Approximation theory for 1-Lipschitz ResNets

Davide Murari, Takashi Furuya, Carola-Bibiane Schönlieb

NEURIPS 2025arXiv:2505.12003
4
citations
#8452

INS-MMBench: A Comprehensive Benchmark for Evaluating LVLMs' Performance in Insurance

Chenwei Lin, Hanjia Lyu, Xian Xu et al.

ICCV 2025arXiv:2406.09105
4
citations
#8453

A Regularization-Guided Equivariant Approach for Image Restoration

Yulu Bai, Jiahong Fu, Qi Xie et al.

CVPR 2025arXiv:2505.19799
4
citations
#8454

DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation

Ziyu Zhao, Xiaoguang Li, Lingjia Shi et al.

CVPR 2025arXiv:2505.11676
4
citations
#8455

DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching

Emanuele Aiello, Umberto Michieli, Diego Valsesia et al.

CVPR 2025arXiv:2411.17786
4
citations
#8456

Full-DoF Egomotion Estimation for Event Cameras Using Geometric Solvers

Ji Zhao, Banglei Guan, Zibin Liu et al.

CVPR 2025highlightarXiv:2503.03307
4
citations
#8457

Flow-based Variational Mutual Information: Fast and Flexible Approximations

Caleb Dahlke, Jason Pacheco

ICLR 2025
4
citations
#8458

PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding

Penghao Wang, Yiyang He, Xin Lv et al.

NEURIPS 2025arXiv:2510.20155
4
citations
#8459

Believing is Seeing: Unobserved Object Detection using Generative Models

Subhransu S. Bhattacharjee, Dylan Campbell, Rahul Shome

CVPR 2025arXiv:2410.05869
4
citations
#8460

Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation

David T. Hoffmann, Syed Haseeb Raza, Hanqiu Jiang et al.

CVPR 2025arXiv:2503.04718
4
citations
#8461

Stable Diffusion Models are Secretly Good at Visual In-Context Learning

Trevine Oorloff, Vishwanath Sindagi, Wele Gedara Chaminda Bandara et al.

ICCV 2025arXiv:2508.09949
4
citations
#8462

LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion.

Muchen Li, Sammy Christen, Chengde Wan et al.

CVPR 2025
4
citations
#8463

GraphMimic: Graph-to-Graphs Generative Modeling from Videos for Policy Learning

Guangyan Chen, Te Cui, Meiling Wang et al.

CVPR 2025
4
citations
#8464

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

Yubo Wang, Jianting Tang, Liu et al.

ICLR 2025arXiv:2502.16593
4
citations
#8465

StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold

Zhizhong Li, Sina Sajadmanesh, Jingtao Li et al.

NEURIPS 2025spotlightarXiv:2510.01938
4
citations
#8466

Occlusion-aware Non-Rigid Point Cloud Registration via Unsupervised Neural Deformation Correntropy

Mingyang Zhao, Gaofeng Meng, Dong-ming Yan

ICLR 2025arXiv:2502.10704
4
citations
#8467

Continual SFT Matches Multimodal RLHF with Negative Supervision

Ke Zhu, Yu Wang, Yanpeng Sun et al.

CVPR 2025arXiv:2411.14797
4
citations
#8468

Analog In-memory Training on General Non-ideal Resistive Elements: The Impact of Response Functions

Zhaoxian Wu, Quan Xiao, Tayfun Gokmen et al.

NEURIPS 2025oralarXiv:2502.06309
4
citations
#8469

Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement Learning

Théo Vincent, Fabian Wahren, Jan Peters et al.

ICLR 2025arXiv:2405.16195
4
citations
#8470

Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos

Changwoon Choi, Jeongjun Kim, Geonho Cha et al.

ICCV 2025arXiv:2412.19089
4
citations
#8471

Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation

Mehrdad Noori, David OSOWIECHI, Gustavo Vargas Hakim et al.

NEURIPS 2025arXiv:2505.21844
4
citations
#8472

Transformer Learns Optimal Variable Selection in Group-Sparse Classification

Chenyang Zhang, Xuran Meng, Yuan Cao

ICLR 2025arXiv:2504.08638
4
citations
#8473

MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion

Zihan Wang, Jeff Tan, Tarasha Khurana et al.

ICCV 2025arXiv:2507.23782
4
citations
#8474

MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos

Hongyi Zhou, Xiaogang Wang, Yulan Guo et al.

ICCV 2025arXiv:2505.11868
4
citations
#8475

Unveiling Concept Attribution in Diffusion Models

Nguyen Hung-Quang, Hoang Phan, Khoa D Doan

NEURIPS 2025arXiv:2412.02542
4
citations
#8476

Segment Any-Quality Images with Generative Latent Space Enhancement

Guangqian Guo, Yong Guo, Xuehui Yu et al.

CVPR 2025arXiv:2503.12507
4
citations
#8477

RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion

Bardienus Duisterhof, Jan Oberst, Bowen Wen et al.

NEURIPS 2025arXiv:2506.05285
4
citations
#8478

Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing

Taihang Hu, Linxuan Li, Kai Wang et al.

ICCV 2025arXiv:2504.10434
4
citations
#8479

MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation

Zilong Chen, Yikai Wang, Wenqiang Sun et al.

CVPR 2025highlightarXiv:2505.04656
4
citations
#8480

Neural Collapse is Globally Optimal in Deep Regularized ResNets and Transformers

Peter Súkeník, Christoph Lampert, Marco Mondelli

NEURIPS 2025arXiv:2505.15239
4
citations
#8481

Language Models can Self-Improve at State-Value Estimation for Better Search

Ethan Mendes, Alan Ritter

NEURIPS 2025spotlightarXiv:2503.02878
4
citations
#8482

Exploring Contextual Attribute Density in Referring Expression Counting

Zhicheng Wang, Zhiyu Pan, Zhan Peng et al.

CVPR 2025arXiv:2503.12460
4
citations
#8483

UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning

Weiqi Yan, Lvhai Chen, Huaijia Kou et al.

CVPR 2025highlightarXiv:2506.07087
4
citations
#8484

ClimaQA: An Automated Evaluation Framework for Climate Question Answering Models

Veeramakali Vignesh Manivannan, Yasaman Jafari, Srikar Eranky et al.

ICLR 2025arXiv:2410.16701
4
citations
#8485

Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting

Xingyu Miao, Haoran Duan, Quanhao Qian et al.

ICCV 2025highlightarXiv:2507.18678
4
citations
#8486

Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization

Sihao Liu, Yibo Yang, Xiaojie Li et al.

CVPR 2025arXiv:2412.18177
4
citations
#8487

Synchronized Video-to-Audio Generation via Mel Quantization-Continuum Decomposition

Juncheng Wang, Chao Xu, Cheng Yu et al.

CVPR 2025arXiv:2503.06984
4
citations
#8488

SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering

Hanxiao Sun, Yupeng Gao, Jin Xie et al.

CVPR 2025arXiv:2504.06815
4
citations
#8489

SP2T: Sparse Proxy Attention for Dual-stream Point Transformer

Jiaxu Wan, Hong Zhang, Ziqi He et al.

ICCV 2025
4
citations
#8490

Gradient-Variation Online Adaptivity for Accelerated Optimization with Hölder Smoothness

Yuheng Zhao, Yu-Hu Yan, Kfir Y. Levy et al.

NEURIPS 2025spotlightarXiv:2511.02276
4
citations
#8491

OuroMamba: A Data-Free Quantization Framework for Vision Mamba

Akshat Ramachandran, Mingyu Lee, Huan Xu et al.

ICCV 2025arXiv:2503.10959
4
citations
#8492

Constrained Sampling for Language Models Should Be Easy: An MCMC Perspective

Emmanuel Anaya Gonzalez, Sairam Vaidya, Kanghee Park et al.

NEURIPS 2025arXiv:2506.05754
4
citations
#8493

Reinforcement Learning Teachers of Test Time Scaling

Edoardo Cetin, Tianyu Zhao, Yujin Tang

NEURIPS 2025arXiv:2506.08388
4
citations
#8494

Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning

Sherry X. Chen, Misha Sra, Pradeep Sen

CVPR 2025arXiv:2503.18406
4
citations
#8495

Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers?

Zebin You, Xinyu Zhang, Hanzhong Guo et al.

CVPR 2025arXiv:2405.18029
4
citations
#8496

Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations

Zican Dong, Han Peng, Peiyu Liu et al.

NEURIPS 2025arXiv:2504.06792
4
citations
#8497

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning

Jun Li, Jinpeng Wang, Chaolei Tan et al.

ICCV 2025arXiv:2507.17402
4
citations
#8498

GS-DiT: Advancing Video Generation with Dynamic 3D Gaussian Fields through Efficient Dense 3D Point Tracking

Weikang Bian, Zhaoyang Huang, Xiaoyu Shi et al.

CVPR 2025
4
citations
#8499

Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information

Junbo Zhao, Ting Zhang, Jiayu Sun et al.

ICCV 2025arXiv:2503.05543
4
citations
#8500

Learning Video-Conditioned Policy on Unlabelled Data with Joint Embedding Predictive Transformer

Hao Luo, Zongqing Lu

ICLR 2025
4
citations
#8501

PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution

Shian Du, Menghan Xia, Chang Liu et al.

CVPR 2025arXiv:2509.26025
4
citations
#8502

Test-Time Visual In-Context Tuning

Jiahao Xie, Alessio Tonioni, Nathalie Rauschmayr et al.

CVPR 2025arXiv:2503.21777
4
citations
#8503

Beyond Single-Task: Robust Multi-Task Length Generalization for LLMs

Yi Hu, Shijia Kang, Haotong Yang et al.

NEURIPS 2025arXiv:2502.11525
4
citations
#8504

Enhancing Clustered Federated Learning: Integration of Strategies and Improved Methodologies

Yongxin Guo, Xiaoying Tang, Tao Lin

ICLR 2025arXiv:2310.05397
4
citations
#8505

FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting

Fangyu Wu, Yuhao Chen

CVPR 2025arXiv:2411.12089
4
citations
#8506

State Space Model Meets Transformer: A New Paradigm for 3D Object Detection

Chuxin Wang, Wenfei Yang, Xiang Liu et al.

ICLR 2025arXiv:2503.14493
4
citations
#8507

Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels

Qiming Xia, Wenkai Lin, Haoen Xiang et al.

CVPR 2025arXiv:2503.08421
4
citations
#8508

GS-ID: Illumination Decomposition on Gaussian Splatting via Adaptive Light Aggregation and Diffusion-Guided Material Priors

Kang DU, Zhihao Liang, Yulin Shen et al.

ICCV 2025arXiv:2408.08524
4
citations
#8509

Defending Multimodal Backdoored Models by Repulsive Visual Prompt Tuning

Zhifang Zhang, Shuo He, Haobo Wang et al.

NEURIPS 2025arXiv:2412.20392
4
citations
#8510

Evaluating Vision-Language Models as Evaluators in Path Planning

Mohamed Aghzal, Xiang Yue, Erion Plaku et al.

CVPR 2025arXiv:2411.18711
4
citations
#8511

Inference-time Alignment in Continuous Space

Yige Yuan, Teng Xiao, Li Yunfan et al.

NEURIPS 2025arXiv:2505.20081
4
citations
#8512

Second-Order Min-Max Optimization with Lazy Hessians

Lesi Chen, Chengchang Liu, Jingzhao Zhang

ICLR 2025arXiv:2410.09568
4
citations
#8513

Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval

Boseung Jeong, Jicheol Park, Sungyeon Kim et al.

CVPR 2025arXiv:2504.02397
4
citations
#8514

DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model

Rui Yu, Xianghang Zhang, Runkai Zhao et al.

ICCV 2025arXiv:2508.05402
4
citations
#8515

HollowFlow: Efficient Sample Likelihood Evaluation using Hollow Message Passing

Johann Flemming Gloy, Simon Olsson

NEURIPS 2025arXiv:2510.21542
4
citations
#8516

When Thinking Drifts: Evidential Grounding for Robust Video Reasoning

Romy Luo, Zihui (Sherry) Xue, Alex Dimakis et al.

NEURIPS 2025arXiv:2510.06077
4
citations
#8517

PSBD: Prediction Shift Uncertainty Unlocks Backdoor Detection

Wei Li, Pin-Yu Chen, Sijia Liu et al.

CVPR 2025arXiv:2406.05826
4
citations
#8518

Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation

Moru Liu, Hao Dong, Jessica Kelly et al.

NEURIPS 2025arXiv:2505.16985
4
citations
#8519

Zero-shot protein stability prediction by inverse folding models: a free energy interpretation

Jes Frellsen, Maher Kassem, Tone Bengtsen et al.

NEURIPS 2025arXiv:2506.05596
4
citations
#8520

The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation

Patrick Kahardipraja, Reduan Achtibat, Thomas Wiegand et al.

NEURIPS 2025arXiv:2505.15807
4
citations
#8521

Preference Learning with Lie Detectors can Induce Honesty or Evasion

Chris Cundy, Adam Gleave

NEURIPS 2025arXiv:2505.13787
4
citations
#8522

ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding

LinshuangDiao, Sensen Song, Yurong Qian et al.

NEURIPS 2025
4
citations
#8523

Dynamic Motion Blending for Versatile Motion Editing

Nan Jiang, Hongjie Li, Ziye Yuan et al.

CVPR 2025arXiv:2503.20724
4
citations
#8524

Understanding and Mitigating Numerical Sources of Nondeterminism in LLM Inference

Jiayi Yuan, Hao Li, Xinheng Ding et al.

NEURIPS 2025oralarXiv:2506.09501
4
citations
#8525

Prior-Guided Diffusion Planning for Offline Reinforcement Learning

Donghyeon Ki, JunHyeok Oh, Seong-Woong Shim et al.

NEURIPS 2025arXiv:2505.10881
4
citations
#8526

Hot-pluggable Federated Learning: Bridging General and Personalized FL via Dynamic Selection

Lei Shen, Zhenheng Tang, Lijun Wu et al.

ICLR 2025
4
citations
#8527

How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?

Seongyun Lee, Geewook Kim, Jiyeon Kim et al.

ICLR 2025arXiv:2410.07571
4
citations
#8528

Provably Efficient Online RLHF with One-Pass Reward Modeling

Long-Fei Li, Yu-Yang Qian, Peng Zhao et al.

NEURIPS 2025arXiv:2502.07193
4
citations
#8529

UniAP: Unifying Inter- and Intra-Layer Automatic Parallelism by Mixed Integer Quadratic Programming

Hao Lin, Ke Wu, Jie Li et al.

CVPR 2025arXiv:2307.16375
4
citations
#8530

IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation

Wenxuan Guo, Xiuwei Xu, Hang Yin et al.

ICCV 2025arXiv:2508.00823
4
citations
#8531

Talk2Event: Grounded Understanding of Dynamic Scenes from Event Cameras

Lingdong Kong, Dongyue Lu, Alan Liang et al.

NEURIPS 2025oralarXiv:2507.17664
4
citations
#8532

Cost-Aware Contrastive Routing for LLMs

Reza Shirkavand, Shangqian Gao, Peiran Yu et al.

NEURIPS 2025spotlightarXiv:2508.12491
4
citations
#8533

Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions

Boran Wen, Dingbang Huang, Zichen Zhang et al.

CVPR 2025arXiv:2503.15898
4
citations
#8534

LightLoc: Learning Outdoor LiDAR Localization at Light Speed

Wen Li, Chen Liu, Shangshu Yu et al.

CVPR 2025arXiv:2503.17814
4
citations
#8535

Retrv-R1: A Reasoning-Driven MLLM Framework for Universal and Efficient Multimodal Retrieval

Lanyun Zhu, Deyi Ji, Tianrun Chen et al.

NEURIPS 2025arXiv:2510.02745
4
citations
#8536

Remove Symmetries to Control Model Expressivity and Improve Optimization

Liu Ziyin, Yizhou Xu, Isaac Chuang

ICLR 2025arXiv:2408.15495
4
citations
#8537

Hierarchical Implicit Neural Emulators

Ruoxi Jiang, Xiao Zhang, Karan Jakhar et al.

NEURIPS 2025oralarXiv:2506.04528
4
citations
#8538

Optimizing for the Shortest Path in Denoising Diffusion Model

Ping Chen, Xingpeng Zhang, Zhaoxiang Liu et al.

CVPR 2025highlightarXiv:2503.03265
4
citations
#8539

SSHNet: Unsupervised Cross-modal Homography Estimation via Problem Reformulation and Split Optimization

Junchen Yu, Siyuan Cao, Runmin Zhang et al.

CVPR 2025highlightarXiv:2409.17993
4
citations
#8540

BiLoRA: Almost-Orthogonal Parameter Spaces for Continual Learning

Hao Zhu, Yifei Zhang, Junhao Dong et al.

CVPR 2025
4
citations
#8541

Universal Scene Graph Generation

Shengqiong Wu, Hao Fei, Tat-seng Chua

CVPR 2025highlightarXiv:2503.15005
4
citations
#8542

SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking

Wenrui Cai, Qingjie Liu, Yunhong Wang

CVPR 2025arXiv:2503.18338
4
citations
#8543

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Yicheng Chen, Xiangtai Li, Yining Li et al.

CVPR 2025arXiv:2406.20085
4
citations
#8544

Brain-Informed Fine-Tuning for Improved Multilingual Understanding in Language Models

Anuja Negi, SUBBAREDDY OOTA, Anwar Nunez-Elizalde et al.

NEURIPS 2025
4
citations
#8545

Affine Steerable Equivariant Layer for Canonicalization of Neural Networks

Yikang Li, Yeqing Qiu, Yuxuan Chen et al.

ICLR 2025
4
citations
#8546

Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM

Zheng Wei Lim, Nitish Gupta, Honglin Yu et al.

ICLR 2025arXiv:2409.13949
4
citations
#8547

Enhanced then Progressive Fusion with View Graph for Multi-View Clustering

Zhibin Dong, Meng Liu, Siwei Wang et al.

CVPR 2025
4
citations
#8548

EAMamba: Efficient All-Around Vision State Space Model for Image Restoration

Yu-Cheng Lin, Yu-Syuan Xu, Hao-Wei Chen et al.

ICCV 2025arXiv:2506.22246
4
citations
#8549

Gen3DEval: Using vLLMs for Automatic Evaluation of Generated 3D Objects

Shalini Maiti, Lourdes Agapito, Filippos Kokkinos

CVPR 2025arXiv:2504.08125
4
citations
#8550

PyTorchGeoNodes: Enabling Differentiable Shape Programs for 3D Shape Reconstruction

Sinisa Stekovic, Arslan Artykov, Stefan Ainetter et al.

CVPR 2025arXiv:2404.10620
4
citations
#8551

PLEIADES: Building Temporal Kernels with Orthogonal Polynomials

Yan Ru Pei, Olivier Coenen

NEURIPS 2025oralarXiv:2405.12179
4
citations
#8552

FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering

Chengyue Huang, Brisa Maneechotesuwan, Shivang Chopra et al.

CVPR 2025arXiv:2505.21755
4
citations
#8553

MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search

Zonglin Yang, Wanhao Liu, Ben Gao et al.

NEURIPS 2025arXiv:2505.19209
4
citations
#8554

Breaking the Discretization Barrier of Continuous Physics Simulation Learning

Fan Xu, Hao Wu, Nan Wang et al.

NEURIPS 2025oralarXiv:2509.17955
4
citations
#8555

Scaling Down Text Encoders of Text-to-Image Diffusion Models

Lifu Wang, Daqing Liu, Xinchen Liu et al.

CVPR 2025arXiv:2503.19897
4
citations
#8556

Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution

Shijun Shi, Jing Xu, Lijing Lu et al.

CVPR 2025arXiv:2506.01037
4
citations
#8557

QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization

Yueh-Cheng Liu, Lukas Höllein, Matthias Nießner et al.

ICCV 2025arXiv:2505.05591
4
citations
#8558

TrustMark: Robust Watermarking and Watermark Removal for Arbitrary Resolution Images

Tu Bui, Shruti Agarwal, John Collomosse

ICCV 2025
4
citations
#8559

Efficient Transfer Learning for Video-language Foundation Models

Haoxing Chen, Zizheng Huang, Yan Hong et al.

CVPR 2025arXiv:2411.11223
4
citations
#8560

3D Dental Model Segmentation with Geometrical Boundary Preserving

Shufan Xi, Zexian Liu, Junlin Chang et al.

CVPR 2025arXiv:2503.23702
4
citations
#8561

Breaking the Reclustering Barrier in Centroid-based Deep Clustering

Lukas Miklautz, Timo Klein, Kevin Sidak et al.

ICLR 2025arXiv:2411.02275
4
citations
#8562

Fairshare Data Pricing via Data Valuation for Large Language Models

Luyang Zhang, Cathy Jiao, Beibei Li et al.

NEURIPS 2025arXiv:2502.00198
4
citations
#8563

CryptoFace: End-to-End Encrypted Face Recognition

Wei Ao, Vishnu Naresh Boddeti

CVPR 2025arXiv:2509.00332
4
citations
#8564

LiVOS: Light Video Object Segmentation with Gated Linear Matching

Qin Liu, Jianfeng Wang, Zhengyuan Yang et al.

CVPR 2025arXiv:2411.02818
4
citations
#8565

Not all Views are Created Equal: Analyzing Viewpoint Instabilities in Vision Foundation Models

Mateusz Michalkiewicz, Xinyue Bai, Mahsa Baktashmotlagh et al.

ICCV 2025arXiv:2412.19920
4
citations
#8566

State-Covering Trajectory Stitching for Diffusion Planners

Kyowoon Lee, Jaesik Choi

NEURIPS 2025oralarXiv:2506.00895
4
citations
#8567

Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation

Yiming Qin, Zhu Xu, Yang Liu

CVPR 2025arXiv:2505.05505
4
citations
#8568

Personalized Representation from Personalized Generation

Shobhita Sundaram, Julia Chae, Yonglong Tian et al.

ICLR 2025arXiv:2412.16156
4
citations
#8569

Color Conditional Generation with Sliced Wasserstein Guidance

Alexander Lobashev, Maria Larchenko, Dmitry Guskov

NEURIPS 2025spotlightarXiv:2503.19034
4
citations
#8570

Do Deep Neural Network Solutions Form a Star Domain?

Ankit Sonthalia, Alexander Rubinstein, Ehsan Abbasnejad et al.

ICLR 2025arXiv:2403.07968
4
citations
#8571

Probing the Mid-level Vision Capabilities of Self-Supervised Learning

Xuweiyi Chen, Markus Marks, Zezhou Cheng

CVPR 2025arXiv:2411.17474
4
citations
#8572

From Dormant to Deleted: Tamper-Resistant Unlearning Through Weight-Space Regularization

Shoaib Ahmed Siddiqui, Adrian Weller, David Krueger et al.

NEURIPS 2025arXiv:2505.22310
4
citations
#8573

CASP: Compression of Large Multimodal Models Based on Attention Sparsity

Mohsen Gholami, Mohammad Akbari, Kevin Cannons et al.

CVPR 2025highlightarXiv:2503.05936
4
citations
#8574

Extending Mercer's expansion to indefinite and asymmetric kernels

Sungwoo Jeong, Alex Townsend

ICLR 2025arXiv:2409.16453
4
citations
#8575

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

Pengcheng Huang, Zhenghao Liu, Yukun Yan et al.

NEURIPS 2025arXiv:2502.15543
4
citations
#8576

Compressed and Smooth Latent Space for Text Diffusion Modeling

Viacheslav Meshchaninov, Egor Chimbulatov, Alexander Shabalin et al.

NEURIPS 2025arXiv:2506.21170
4
citations
#8577

ViiNeuS: Volumetric Initialization for Implicit Neural Surface Reconstruction of Urban Scenes with Limited Image Overlap

Hala Djeghim, Nathan Piasco, Moussab Bennehar et al.

CVPR 2025arXiv:2403.10344
4
citations
#8578

UniEdit: A Unified Knowledge Editing Benchmark for Large Language Models

Qizhou Chen, Dakan Wang, Taolin Zhang et al.

NEURIPS 2025arXiv:2505.12345
4
citations
#8579

STiL: Semi-supervised Tabular-Image Learning for Comprehensive Task-Relevant Information Exploration in Multimodal Classification

Siyi Du, Xinzhe Luo, Declan ORegan et al.

CVPR 2025arXiv:2503.06277
4
citations
#8580

Steering Generative Models with Experimental Data for Protein Fitness Optimization

Jason Yang, Wenda Chu, Daniel Khalil et al.

NEURIPS 2025arXiv:2505.15093
4
citations
#8581

Morph: A Motion-free Physics Optimization Framework for Human Motion Generation

Zhuo Li, Mingshuang Luo, RuiBing Hou et al.

ICCV 2025arXiv:2411.14951
4
citations
#8582

SplineGS: Learning Smooth Trajectories in Gaussian Splatting for Dynamic Scene Reconstruction

Jihwan Yoon, Sangbeom Han, Jaeseok Oh et al.

ICLR 2025oral
4
citations
#8583

CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image

Arindam Dutta, Meng Zheng, Zhongpai Gao et al.

ICCV 2025highlightarXiv:2503.15671
4
citations
#8584

DNF: Unconditional 4D Generation with Dictionary-based Neural Fields

Xinyi Zhang, Naiqi Li, Angela Dai

CVPR 2025arXiv:2412.05161
4
citations
#8585

Contrastive Self-Supervised Learning As Neural Manifold Packing

Guanming Zhang, David Heeger, Stefano Martiniani

NEURIPS 2025arXiv:2506.13717
4
citations
#8586

ZeroSep: Separate Anything in Audio with Zero Training

Chao Huang, Yuesheng Ma, Junxuan Huang et al.

NEURIPS 2025arXiv:2505.23625
4
citations
#8587

$\texttt{STRCMP}$: Integrating Graph Structural Priors with Language Models for Combinatorial Optimization

Xijun Li, Jiexiang Yang, Jinghao Wang et al.

NEURIPS 2025
4
citations
#8588

Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis

Peng Zheng, Junke Wang, Yi Chang et al.

ICCV 2025arXiv:2507.01756
4
citations
#8589

Generating 3D-Consistent Videos from Unposed Internet Photos

Gene Chou, Kai Zhang, Sai Bi et al.

CVPR 2025arXiv:2411.13549
4
citations
#8590

ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization

Bo Du, Xuekang Zhu, Xiaochen Ma et al.

NEURIPS 2025arXiv:2505.11003
4
citations
#8591

Reasoning in Visual Navigation of End-to-end Trained Agents: A Dynamical Systems Approach

Steeven JANNY, Hervé Poirier, Leonid Antsfeld et al.

CVPR 2025highlightarXiv:2503.08306
4
citations
#8592

Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection

Fuyun Wang, Tong Zhang, Yuanzhi Wang et al.

CVPR 2025arXiv:2502.20981
4
citations
#8593

Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators

Albert Matveev, Sanmitra Ghosh, Aamal Hussain et al.

NEURIPS 2025spotlightarXiv:2508.00643
4
citations
#8594

On the Existence and Complexity of Core-Stable Data Exchanges

Jiaxin Song, Pooja Kulkarni, Parnian Shahkar et al.

NEURIPS 2025arXiv:2509.16450
4
citations
#8595

On the Convergence of Projected Policy Gradient for Any Constant Step Sizes

Jiacai Liu, Wenye Li, Dachao Lin et al.

NEURIPS 2025arXiv:2311.01104
4
citations
#8596

Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Models

Davide Berasi, Matteo Farina, Massimiliano Mancini et al.

CVPR 2025highlightarXiv:2503.17142
4
citations
#8597

HOP: Heterogeneous Topology-based Multimodal Entanglement for Co-Speech Gesture Generation

Hongye Cheng, Tianyu Wang, guangsi shi et al.

CVPR 2025arXiv:2503.01175
4
citations
#8598

Estimation and Inference in Distributional Reinforcement Learning

Liangyu Zhang, Yang Peng, Jiadong Liang et al.

NEURIPS 2025arXiv:2309.17262
4
citations
#8599

UKBOB: One Billion MRI Labeled Masks for Generalizable 3D Medical Image Segmentation

Emmanuelle Bourigault, Amir Jamaludin, Abdullah Hamdi

ICCV 2025arXiv:2504.06908
4
citations
#8600

GeoRanker: Distance-Aware Ranking for Worldwide Image Geolocalization

Pengyue Jia, Seongheon Park, Song Gao et al.

NEURIPS 2025arXiv:2505.13731
4
citations