Most Cited 2025 &quot;multimodal&quot; Papers

NEURIPS 2025arXiv:2510.17131

#16802

GOOD: Training-Free Guided Diffusion Sampling for Out-of-Distribution Detection

Xin Gao, Jiyao Liu, Guanghao Li et al.

#16803

Feature Information Driven Position Gaussian Distribution Estimation for Tiny Object Detection

Jinghao Bian, Mingtao Feng, Weisheng Dong et al.

NEURIPS 2025arXiv:2510.22534

#16804

SRSR: Enhancing Semantic Accuracy in Real-World Image Super-Resolution with Spatially Re-Focused Text-Conditioning

Chen Chen, Majid Abdolshah, Violetta Shevchenko et al.

#16805

Adaptive and Multi-scale Affinity Alignment for Hierarchical Contrastive Learning

Jiawei Huang, Minming Li, Hu Ding

#16806

Learning Memory-Enhanced Improvement Heuristics for Flexible Job Shop Scheduling

Jiaqi Wang, Zhiguang Cao, Peng Zhao et al.

NEURIPS 2025arXiv:2510.24288

#16807

Problem-Parameter-Free Decentralized Bilevel Optimization

Zhiwei Zhai, Wenjing Yan, Ying-Jun Zhang

#16808

Robust Regression of General ReLUs with Queries

Ilias Diakonikolas, Daniel Kane, Mingchen Ma

CVPR 2025arXiv:2504.16499

#16809

PRaDA: Projective Radial Distortion Averaging

Daniil Sinitsyn, Linus Härenstam-Nielsen, Daniel Cremers

#16810

Towards Lossless Implicit Neural Representation via Bit Plane Decomposition

Woo Kyoung Han, Byeonghun Lee, Hyunmin Cho et al.

CVPR 2025arXiv:2502.21001

#16811

PossLoss: A Reliable and Sensitive Facial Landmark Detection Loss Function

Qikui Zhu

ICCV 2025arXiv:2411.16180

#16812

Event-boosted Deformable 3D Gaussians for Dynamic Scene Reconstruction

Wenhao Xu, Wenming Weng, Yueyi Zhang et al.

#16813

SIGMA: Refining Large Language Model Reasoning via Sibling-Guided Monte Carlo Augmentation

Yanwei Ren, Haotian Zhang, Fuxiang Wu et al.

NEURIPS 2025spotlightarXiv:2506.06470

#16814

AccidentalGS: 3D Gaussian Splatting from Accidental Camera Motion

Mao Mao, Xujie Shen, Guyuan Chen et al.

NEURIPS 2025arXiv:2510.19270

#16815

Social World Model-Augmented Mechanism Design Policy Learning

Xiaoyuan Zhang, Yizhe Huang, Chengdong Ma et al.

#16816

Regional Explanations: Bridging Local and Global Variable Importance

Salim I. Amoukou, Nicolas Brunel

NEURIPS 2025arXiv:2509.01200

#16817

SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation

Chenyang Le, Bing Han, Jinshun Li et al.

#16818

DUO: No Compromise to Accuracy Degradation

Jinda Jia, Cong Xie, Hanlin Lu et al.

#16819

Understanding Bias Terms in Neural Representations

Weixiang Zhang, Boxi Li, Shuzhao Xie et al.

#16820

Active Seriation: Efficient Ordering Recovery with Statistical Guarantees

James Cheshire, Yann Issartel

NEURIPS 2025arXiv:2512.11458

#16821

Boosting Skeleton-based Zero-Shot Action Recognition with Training-Free Test-Time Adaptation

Jingmin Zhu, Anqi Zhu, Hossein Rahmani et al.

#16822

X-Prompt: Generalizable Auto-Regressive Visual Learning with In-Context Prompting

Zeyi Sun, Ziyang Chu, Pan Zhang et al.

CVPR 2025highlightarXiv:2505.21335

#16823

Structure from Collision

Takuhiro Kaneko

#16824

Unlocking the Potential of Diffusion Priors in Blind Face Restoration

Yunqi Miao, Zhiyu Qu, Mingqi Gao et al.

ICCV 2025arXiv:2508.08556

#16825

Liberated-GS: 3D Gaussian Splatting Independent from SfM Point Clouds

Weihong Pan, Xiaoyu Zhang, Hongjia Zhai et al.

#16826

Searching Efficient Semantic Segmentation Architectures via Dynamic Path Selection

Yuxi Liu, Min Liu, Shuai Jiang et al.

#16827

DIFFSSR: Stereo Image Super-resolution Using Differential Transformer

Dafeng Zhang

ICCV 2025arXiv:2508.14604

#16828

UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling

Peiming Li, Ziyi Wang, Yulin Yuan et al.

#16829

Confusion-Driven Self-Supervised Progressively Weighted Ensemble Learning for Non-Exemplar Class Incremental Learning

Kai Hu, Zhang Yu, Yuan Zhang et al.

#16830

VETA-DiT: Variance-Equalized and Temporally Adaptive Quantization for Efficient 4-bit Diffusion Transformers

Qinkai XU, yijin liu, YangChen et al.

ICCV 2025arXiv:2507.04503

#16831

U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration

Xiaofan Li, Zhihao Xu, Chenming Wu et al.

#16832

Robust Dataset Condensation using Supervised Contrastive Learning

Nicole Kim, Hwanjun Song

#16833

CaliGCL: Calibrated Graph Contrastive Learning via Partitioned Similarity and Consistency Discrimination

Yuena Lin, Hao Wei, Hai-Chun Cai et al.

#16834

Dependency Matters: Enhancing LLM Reasoning with Explicit Knowledge Grounding

Xiangyu Wen, Min Li, Junhua Huang et al.

#16835

Computation and Memory-Efficient Model Compression with Gradient Reweighting

Zhiwei Li, Yuesen Liao, Binrui Wu et al.

ICCV 2025arXiv:2510.21654

#16836

CounterPC: Counterfactual Feature Realignment for Unsupervised Domain Adaptation on Point Clouds

Feng Yang, Yichao Cao, Xiu Su et al.

ICCV 2025highlight

#16837

Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging

Ying Xue, Jiaxi Jiang, Rayan Armani et al.

#16838

Backdooring Self-Supervised Contrastive Learning by Noisy Alignment

Tuo Chen, Jie Gui, Minjing Dong et al.

ICCV 2025arXiv:2508.14015

#16839

Dynamic Siamese Expansion Framework for Improving Robustness in Online Continual Learning

Fei Ye, Yulong Zhao, Qihe Liu et al.

NEURIPS 2025arXiv:2510.24234

#16840

Sparse Optimistic Information Directed Sampling

Ludovic Schwartz, Hamish Flynn, Gergely Neu

#16841

PlanU: Large Language Model Reasoning through Planning under Uncertainty

Ziwei Deng, Mian Deng, Chenjing Liang et al.

NEURIPS 2025arXiv:2510.18442

#16842

Automated Model Discovery via Multi-modal & Multi-step Pipeline

Lee Jung-Mok, Nam Hyeon-Woo, Moon Ye-Bin et al.

NEURIPS 2025arXiv:2509.25946

#16843

Rethinking Hebbian Principle: Low-Dimensional Structural Projection for Unsupervised Learning

Shikuang Deng, Jiayuan Zhang, Yuhang Wu et al.

NEURIPS 2025arXiv:2510.14810

#16844

Mitigating Occlusions in Virtual Try-On via A Simple-Yet-Effective Mask-Free Framework

Chenghu Du, Shengwu Xiong, junyin Wang et al.

NEURIPS 2025oralarXiv:2507.06645

#16845

Quantifying Uncertainty in Error Consistency: Towards Reliable Behavioral Comparison of Classifiers

Thomas Klein, Sascha Meyen, Wieland Brendel et al.

#16846

Topology-Aware Learning of Tubular Manifolds via SE(3)-Equivariant Network on Ball B-Spline Curve

Jingxuan Wang, Zhongke Wu, Wang et al.

NEURIPS 2025arXiv:2511.13911

#16847

Uncertainty-Calibrated Prediction of Randomly-Timed Biomarker Trajectories with Conformal Bands

Vasiliki Tassopoulou, Charis Stamouli, Haochang Shou et al.

#16848

From Complexity to Clarity: Analytical Expressions of Deep Neural Network Weights via Clifford Algebra and Convexity

Mert Pilanci

ICLR 2025

#16849

Accelerating Model-Free Optimization via Averaging of Cost Samples

Guido Carnevale, Giuseppe Notarstefano

NEURIPS 2025arXiv:2509.22793

#16850

LaViDa: A Large Diffusion Model for Vision-Language Understanding

Shufan Li, Konstantinos Kallidromitis, Hritik Bansal et al.

NEURIPS 2025spotlight

#16851

DEFT: Decompositional Efficient Fine-Tuning for Text-to-Image Models

Komal Kumar, Rao Anwer, Fahad Shahbaz Khan et al.

#16852

Neural Hamiltonian Diffusions for Modeling Structured Geometric Dynamics

Sungwoo Park

NEURIPS 2025arXiv:2506.09813

#16853

Metritocracy: Representative Metrics for Lite Benchmarks

Ariel Procaccia, Ben Schiffer, Serena Wang et al.

#16854

Adversarial Graph Fusion for Incomplete Multi-view Semi-supervised Learning with Tensorial Imputation

Zhangqi Jiang, Tingjin Luo, Xu Yang et al.

NEURIPS 2025arXiv:2509.15955

#16855

ComRank: Ranking Loss for Multi-Label Complementary Label Learning

Jing-Yi Zhu, Yi Gao, Miao Xu et al.

NEURIPS 2025arXiv:2509.21930

#16856

DynaNav: Dynamic Feature and Layer Selection for Efficient Visual Navigation

Jiahui Wang, Changhao Chen

#16857

$\Delta \mathrm{Energy}$: Optimizing Energy Change During Vision-Language Alignment Improves both OOD Detection and OOD Generalization

Lin Zhu, Yifeng Yang, Xinbing Wang et al.

NEURIPS 2025arXiv:2506.16806

#16858

FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation

Fan Yang, Yousong Zhu, Xin Li et al.

#16859

Language-Guided Salient Object Ranking

Fang Liu, Yuhao Liu, Ke Xu et al.

NEURIPS 2025arXiv:2510.22123

#16860

Learning 3D Anisotropic Noise Distributions Improves Molecular Force Fields

Xixian Liu, Rui Jiao, ZHIYUAN LIU et al.

#16861

DSCS: Fast CPDAG-Based Verification of Collapsible Submodels in High-Dimensional Bayesian Networks

Wentao Wu, Shiyuan He, Jianhua Guo

#16862

Hypergraph-Enhanced Contrastive Learning for Multi-View Clustering with Hyper-Laplacian Regularization

Zhibin Gu, weili wang

#16863

Instruction-based Image Editing with Planning, Reasoning, and Generation

Liya Ji, Chenyang Qi, Qifeng Chen

NEURIPS 2025arXiv:2507.11060

#16864

Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing

Yilmazcan Ozyurt, Tunaberk Almaci, Stefan Feuerriegel et al.

#16865

On the Sample Complexity of Differentially Private Policy Optimization

Yi He, Xingyu Zhou

NEURIPS 2025arXiv:2510.21060

#16866

Ascent Fails to Forget

Ioannis Mavrothalassitis, Pol Puigdemont, Noam Levi et al.

NEURIPS 2025arXiv:2509.26427

#16867

Tensor-aggregated LoRA in Federated Fine-tuning

Zhixuan Li, Binqian Xu, Xiangbo Shu et al.

#16868

Generalizing Single-Frame Supervision to Event-Level Understanding for Video Anomaly Detection

Junxi Chen, Liang Li, Yunbin Tu et al.

#16869

PointGAC: Geometric-Aware Codebook for Masked Point Modeling

Abiao Li, Chenlei Lv, Guofeng Mei et al.

NEURIPS 2025arXiv:2511.16673

#16870

NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses

Jing Wen, Alex Schwing, Shenlong Wang

#16871

Breaking the Compression Ceiling: Data-Free Pipeline for Ultra-Efficient Delta Compression

Xiaohui Wang, Peng Ye, Chenyu Huang et al.

NEURIPS 2025arXiv:2505.13563

#16872

AdvEDM: Fine-grained Adversarial Attack against VLM-based Embodied Agents

Yichen Wang, Hangtao Zhang, Hewen Pan et al.

#16873

GeGS-PCR: Fast and Robust Color 3D Point Cloud Registration with Two-Stage Geometric-3DGS Fusion

Jiayi Tian, Haiduo Huang, Tian Xia et al.

#16874

Elastic Robust Unlearning of Specific Knowledge in Large Language Models

Yize Sui, Jing Ren, Wenjing Yang et al.

#16875

From Pose to Muscle: Multimodal Learning for Piano Hand Muscle Electromyography

RUOFAN LIU, YICHEN PENG, Takanori Oku et al.

#16876

End-to-End Low-Light Enhancement for Object Detection with Learned Metadata from RAWs

Xuelin Shen, Haifeng Jiao, Yitong Wang et al.

NEURIPS 2025arXiv:2502.04580

#16877

Technical Debt in In-Context Learning: Diminishing Efficiency in Long Context

Taejong Joo, Diego Klabjan

#16878

ShoeFit: A New Dataset and Dual-image-stream DiT Framework for Virtual Footwear Try-On

Yuhan Li, Zhiyu Jin, Yifan Tong et al.

ICCV 2025arXiv:2510.18521

#16879

RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation

Junwen Huang, Shishir Reddy Vutukur, Peter Yu et al.

#16880

A Gradient Guidance Perspective on Stepwise Preference Optimization for Diffusion Models

Joshua Tian Jin Tee, Hee Suk Yoon, Abu Hanif Muhammad Syarubany et al.

#16881

Retrieval is Not Enough: Enhancing RAG through Test-Time Critique and Optimization

Jiaqi Wei, Hao Zhou, Xiang Zhang et al.

NEURIPS 2025arXiv:2510.21267

#16882

Relieving the Over-Aggregating Effect in Graph Transformers

Junshu Sun, Wanxing Chang, Chenxue Yang et al.

#16883

Beyond Generation: A Diffusion-based Low-level Feature Extractor for Detecting AI-generated Images

Nan Zhong, Haoyu Chen, Yiran Xu et al.

#16884

Bias Mitigation in Graph Diffusion Models

Meng Yu, Kun Zhan

ICLR 2025

#16885

Statistical Inference for Decentralized Federated Learning

Jia Gu, Songxi Chen

ICCV 2025arXiv:2508.06546

#16886

Statistical Confidence Rescoring for Robust 3D Scene Graph Generation from Multi-View Images

Qi Xun Yeo, Yanyan Li, Gim Hee Lee

#16887

Neural Multi-View Self-Calibrated Photometric Stereo without Photometric Stereo Cues

Xu Cao, Takafumi Taketomi

ICCV 2025arXiv:2507.23162

#16888

GMV: A Unified and Efficient Graph Multi-View Learning Framework

Qipeng zhu, Jie Chen, Jian Pu et al.

NEURIPS 2025arXiv:2501.15127

#16889

Versatile differentially private learning for general loss functions

Qilong Lu, Songxi Chen, Yumou Qiu

#16890

Constrained Linear Thompson Sampling

Aditya Gangrade, Venkatesh Saligrama

NEURIPS 2025arXiv:2503.02043

#16891

S2D-LFE: Sparse-to-Dense Light Field Event Generation

Yutong Liu, Wenming Weng, Yueyi Zhang et al.

CVPR 2025arXiv:2503.18784

#16892

Leveraging Perturbation Robustness to Enhance Out-of-Distribution Detection

Wenxi Chen, Raymond A. Yeh, Shaoshuai Mou et al.

#16893

On the Stability and Generalization of Meta-Learning: the Impact of Inner-Levels

Wenjun Ding, Jingling Liu, Lixing Chen et al.

NEURIPS 2025arXiv:2510.16807

#16894

Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads

Zhoutong Wu, Yuan Zhang, Yiming Dong et al.

#16895

ICLScan: Detecting Backdoors in Black-Box Large Language Models via Targeted In-context Illumination

Xiaoyi Pang, Xuanyi Hao, Song Guo et al.

#16896

Leveraging Debiased Cross-modal Attention Maps and Code-based Reasoning for Zero-shot Referring Expression Comprehension

Juntao Chen, Wen Shen, Zhihua Wei et al.

ICCV 2025arXiv:2508.16121

#16897

Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables

Wontae Kim, Keuntek Lee, Nam Ik Cho

#16898

NeurIPS should lead scientific consensus on AI policy

Rishi Bommasani

NEURIPS 2025oralarXiv:2510.00075

#16899

World Models Should Prioritize the Unification of Physical and Social Dynamics

Xiaoyuan Zhang, Chengdong Ma, Yizhe Huang et al.

NEURIPS 2025arXiv:2510.21219

#16900

Sample-Conditional Coverage in Split-Conformal Prediction

John Duchi

ICCV 2025arXiv:2508.20376

#16901

Enhancing Mamba Decoder with Bidirectional Interaction in Multi-Task Dense Prediction

Mang Cao, Sanping Zhou, Yizhe Li et al.

#16902

Noise-Robustness Through Noise: A Framework combining Asymmetric LoRA with Poisoning MoE

Zhaokun Wang, Jinyu Guo, Jingwen Pu et al.

NEURIPS 2025arXiv:2505.23868

#16903

MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting

jun huang, Ting Liu, Yihang Wu et al.

CVPR 2025arXiv:2506.23482

#16904

Setting $\varepsilon$ is not the Issue in Differential Privacy

Edwige Cyffers

NEURIPS 2025arXiv:2511.06305

#16905

Diffusion-Classifier Synergy: Reward-Aligned Learning via Mutual Boosting Loop for FSCIL

Ruitao Wu, Yifan Zhao, Guangyao Chen et al.

NEURIPS 2025arXiv:2510.03608

#16906

S$^2$M-Former: Spiking Symmetric Mixing Branchformer for Brain Auditory Attention Detection

Jiaqi Wang, Zhengyu Ma, Xiongri Shen et al.

NEURIPS 2025arXiv:2508.05164

#16907

HDR Image Generation via Gain Map Decomposed Diffusion

Yuanshen Guan, Ruikang Xu, Yinuo Liao et al.

NEURIPS 2025oralarXiv:2507.00163

#16908

Prompting as Scientific Inquiry

Ari Holtzman, Chenhao Tan

#16909

PolarNeXt: Rethink Instance Segmentation with Polar Representation

Jiacheng Sun, Xinghong Zhou, Yiqiang Wu et al.

NEURIPS 2025arXiv:2505.15133

#16910

DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer

Haiduo Huang, Jiangcheng Song, Yadong Zhang et al.

#16911

The Adaptive Complexity of Minimizing Relative Fisher Information

Huanjian Zhou, Masashi Sugiyama

#16912

HPSERec: A Hierarchical Partitioning and Stepwise Enhancement Framework for Long-tailed Sequential Recommendation

Xiaolong Xu, Xudong Zhao, Haolong Xiang et al.

#16913

Accurate KV Cache Eviction via Anchor Direction Projection for Efficient LLM Inference

Zijie Geng, Jie Wang, Ziqi Liu et al.

ICCV 2025highlightarXiv:2508.10896

#16914

ESSENTIAL: Episodic and Semantic Memory Integration for Video Class-Incremental Learning

Jongseo Lee, Kyungho Bae, Kyle Min et al.

#16915

Teaching AI the Anatomy Behind the Scan: Addressing Anatomical Flaws in Medical Image Segmentation with Learnable Prior

Young Seok Jeon, Hongfei Yang, Huazhu Fu et al.

ICCV 2025arXiv:2403.18878

#16916

CaliMatch: Adaptive Calibration for Improving Safe Semi-supervised Learning

Jinsoo Bae, Seoung Bum Kim, Hyungrok Do

ICCV 2025arXiv:2508.00922

#16917

Label Shift Meets Online Learning: Ensuring Consistent Adaptation with Universal Dynamic Regret

Yucong Dai, Shilin Gu, Ruidong Fan et al.

CVPR 2025highlight

#16918

EPA: Boosting Event-based Video Frame Interpolation with Perceptually Aligned Learning

Yuhan Liu, LingHui Fu, Zhen Yang et al.

NEURIPS 2025arXiv:2505.15870

#16919

Satellites Reveal Mobility: A Commuting Origin-destination Flow Generator for Global Cities

Can Rong, Xin Zhang, Yanxin Xi et al.

#16920

Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning

Félix Lefebvre, Gael Varoquaux

NEURIPS 2025arXiv:2507.00965

#16921

Simultaneous Statistical Inference for Off-Policy Evaluation in Reinforcement Learning

Tianpai Luo, Xinyuan Fan, Weichi Wu

#16922

Causal Discovery and Inference through Next-Token Prediction

Eivinas Butkus, Nikolaus Kriegeskorte

NEURIPS 2025oralarXiv:2510.17245

#16923

On Efficiency-Effectiveness Trade-off of Diffusion-based Recommenders

Wenyu Mao, Jiancan Wu, Guoqing Hu et al.

#16924

Improved Sampling Algorithms for Lévy-Itô Diffusion Models

Vadim Popov, Assel Yermekova, Tasnima Sadekova et al.

ICLR 2025

#16925

Covering Multiple Objectives with a Small Set of Solutions Using Bayesian Optimization

Natalie Maus, Kyurae Kim, Yimeng Zeng et al.

NEURIPS 2025arXiv:2501.19342

#16926

EDFFDNet: Towards Accurate and Efficient Unsupervised Multi-Grid Image Registration

Haokai Zhu, Bo Qu, Si-Yuan Cao et al.

ICCV 2025arXiv:2509.07662

#16927

Yo’Chameleon: Personalized Vision and Language Generation

Thao Nguyen, Krishna Kumar Singh, Jing Shi et al.

#16928

ROLL: Robust Noisy Pseudo-label Learning for Multi-View Clustering with Noisy Correspondence

Yuan Sun, Yongxiang Li, Zhenwen Ren et al.

CVPR 2025highlight

#16929

Dual-S3D: Hierarchical Dual-Path Selective SSM-CNN for High-Fidelity Implicit Reconstruction

Luoxi Zhang, Pragyan Shrestha, Yu Zhou et al.

NEURIPS 2025arXiv:2510.18258

#16930

NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective

Xiaohan Qin, Xiaoxing Wang, Ning Liao et al.

#16931

FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction

Donghyun Lee, Dawoon Jeong, Jae W. Lee et al.

ICCV 2025arXiv:2507.23480

#16932

RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather

Yuran Wang, Yingping Liang, Yutao Hu et al.

ICCV 2025arXiv:2507.01653

#16933

Bridging Viewpoint Gaps: Geometric Reasoning Boosts Semantic Correspondence

Qiyang Qian, Hansheng Chen, Masayoshi Tomizuka et al.

#16934

MMGeo: Multimodal Compositional Geo-Localization for UAVs

Yuxiang Ji, Boyong He, Zhuoyue Tan et al.

NEURIPS 2025arXiv:2505.15439

#16935

FRN: Fractal-Based Recursive Spectral Reconstruction Network

Ge Meng, Zhongnan Cai, Ruizhe Chen et al.

#16936

On the SAC-BL Algorithm for Anomaly Detection

Xinsong Ma, Jie Wu, Weiwei Liu

NEURIPS 2025oralarXiv:2503.11245

#16937

L2RSI: Cross-view LiDAR-based Place Recognition for Large-scale Urban Scenes via Remote Sensing Imagery

Ziwei Shi, Xiaoran Zhang, Wenjing Xu et al.

#16938

Steering Information Utility in Key-Value Memory for Language Model Post-Training

Chunyuan Deng, Ruidi Chang, Hanjie Chen

NEURIPS 2025arXiv:2507.05158

#16939

Resounding Acoustic Fields with Reciprocity

Zitong Lan, Yiduo Hao, Mingmin Zhao

NEURIPS 2025arXiv:2510.20602

#16940

WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image

Jiwoo Park, Tae Choi, Youngjun Jun et al.

ICCV 2025arXiv:2506.23518

#16941

ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples

Shijie Huang, Yiren Song, Yuxuan Zhang et al.

#16942

LithoSim: A Large, Holistic Lithography Simulation Benchmark for AI-Driven Semiconductor Manufacturing

Hongquan He, Zhen Wang, Jingya Wang et al.

#16943

A3GS: Arbitrary Artistic Style into Arbitrary 3D Gaussian Splatting

Zhiyuan Fang, Rengan Xie, Xuancheng Jin et al.

#16944

Asynchronous Collaborative Graph Representation for Frames and Events

Dianze Li, Jianing Li, Xu Liu et al.

#16945

Large Scene Generation with Cube-Absorb Discrete Diffusion

Qianjiang Hu, Wei Hu

ICCV 2025arXiv:2510.24052

#16946

SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration

Jongsuk Kim, Jae Young Lee, Gyojin Han et al.

#16947

STAR-Bets: Sequential TArget-Recalculating Bets for Tighter Confidence Intervals

Vaclav Voracek, Francesco Orabona

NEURIPS 2025oralarXiv:2510.25173

#16948

D$^2$GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction

Kejing Xia, Jidong Jia, Ke Jin et al.

#16949

Learning to Watermark: A Selective Watermarking Framework for Large Language Models via Multi-Objective Optimization

Chenrui Wang, Junyi Shu, Billy Chiu et al.

NEURIPS 2025arXiv:2510.15976

#16950

PC-Net: Weakly Supervised Compositional Moment Retrieval via Proposal-Centric Network

Mingyao Zhou, Hao Sun, Wei Xie et al.

NEURIPS 2025arXiv:2511.05935

#16951

Interaction-Centric Knowledge Infusion and Transfer for Open Vocabulary Scene Graph Generation

Lin Li, Chuhan ZHANG, Dong Zhang et al.

#16952

Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers

Yunshan Zhong, Yuyao Zhou, Yuxin Zhang et al.

ICCV 2025arXiv:2412.16553

#16953

Local Learning for Covariate Selection in Nonparametric Causal Effect Estimation with Latent Variables

Zheng Li, Xichen Guo, Feng Xie et al.

NEURIPS 2025arXiv:2411.16315

#16954

GSOT3D: Towards Generic 3D Single Object Tracking in the Wild

Yifan Jiao, Yunhao Li, Junhua Ding et al.

ICCV 2025arXiv:2412.02129

#16955

OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS

Han Ling, Yinghui Sun, Xian Xu et al.

ICCV 2025arXiv:2508.01239

#16956

Theory-Inspired Deep Multi-View Multi-Label Learning with Incomplete Views and Noisy Labels

Quanjiang Li, Tingjin Luo, Jiahui Liao

#16957

Compress & Cache: Vision token compression for efficient generation and retrieval

Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos

#16958

Gaussian-based World Model: Gaussian Priors for Voxel-Based Occupancy Prediction and Future Motion Prediction

Tuo Feng, Wenguan Wang, Yi Yang

#16959

HOMO-Feature: Cross-Arbitrary-Modal Image Matching with Homomorphism of Organized Major Orientation

Chenzhong Gao, Wei Li, Desheng Weng

NEURIPS 2025arXiv:2506.03750

#16960

MoodAngels: A Retrieval-augmented Multi-agent Framework for Psychiatry Diagnosis

Mengxi Xiao, Ben Liu, He Li et al.

#16961

CO2-Net: A Physics-Informed Spatio-Temporal Model for Global Surface CO2 Reconstruction

Hao Zheng, Yuting Zheng, Hanbo Huang et al.

#16962

High Dynamic Range Imaging with Time-Encoding Spike Camera

Zhenkun Zhu, Ruiqin Xiong, Jiyu Xie et al.

#16963

Robustifying Zero-Shot Vision Language Models by Subspaces Alignment

Junhao Dong, Piotr Koniusz, Liaoyuan Feng et al.

#16964

OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-time Emotional Speech Synthesis

Run Luo, Ting-En Lin, Haonan Zhang et al.

#16965

Adaptive Fission: Post-training Encoding for Low-latency Spike Neural Networks

Yizhou Jiang, Feng Chen, Yihan Li et al.

NEURIPS 2025arXiv:2506.02453

#16966

PAID: Pairwise Angular-Invariant Decomposition for Continual Test-Time Adaptation

Kunyu Wang, Xueyang Fu, Yuanfei Bao et al.

#16967

Hierarchical Variational Test-Time Prompt Generation for Zero-Shot Generalization

Zhaoyang Wu, Fang Liu, Licheng Jiao et al.

NEURIPS 2025arXiv:2510.19506

#16968

Lookahead Routing for Large Language Models

Canbin Huang, Tianyuan Shi, Yuhua Zhu et al.

#16969

Accelerating Block Coordinate Descent for LLM Finetuning via Landscape Expansion

Qijun Luo, Yifei Shen, Liangzu Peng et al.

#16970

Point-MaDi: Masked Autoencoding with Diffusion for Point Cloud Pre-training

Xiaoyang Xiao, Runzhao Yao, Zhiqiang Tian et al.

#16971

Generation as Search Operator for Test-Time Scaling of Diffusion-based Combinatorial Optimization

Yang Li, Lvda Chen, Haonan Wang et al.

#16972

Improving Semi-Supervised Semantic Segmentation with Sliced-Wasserstein Feature Alignment and Uniformity

Chen Yi Lu, Kasra Derakhshandeh, Somali Chaterji

NEURIPS 2025oralarXiv:2512.03678

#16973

Feature-aware Modulation for Learning from Temporal Tabular Data

Haorun Cai, Han-Jia Ye

#16974

Hierarchical Adaptive Filtering Network for Text Image Specular Highlight Removal

Zhi Jiang, Jingbo Hu, Ling Zhang et al.

CVPR 2025arXiv:2505.23763

#16975

Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch

Aneeshan Sain, Subhajit Maity, Pinaki Nath Chowdhury et al.

#16976

HERA: Hybrid Explicit Representation for Ultra-Realistic Head Avatars

Hongrui Cai, Yuting Xiao, Xuan Wang et al.

NEURIPS 2025oralarXiv:2509.23802

#16977

STAIR: Addressing Stage Misalignment through Temporal-Aligned Preference Reinforcement Learning

Yao Luan, Ni Mu, Yiqin Yang et al.

#16978

DAA*: Deep Angular A Star for Image-based Path Planning

Zhiwei Xu

ICCV 2025arXiv:2507.09305

#16979

Zero-Shot Detection of LLM-Generated Text via Implicit Reward Model

Runheng Liu, Heyan Huang, Xingchen Xiao et al.

NEURIPS 2025arXiv:2509.22807

#16980

MTRec: Learning to Align with User Preferences via Mental Reward Models

Mengchen Zhao, Yifan Gao, Yaqing Hou et al.

#16981

Enhancing Contrastive Learning with Variable Similarity

Haowen Cui, Shuo Chen, Jun Li et al.

NEURIPS 2025spotlight

#16982

Unifying Reconstruction and Density Estimation via Invertible Contraction Mapping in One-Class Classification

Xiaolei Wang, Tianhong Dai, Huihui Bai et al.

#16983

VPR-Cloak: A First Look at Privacy Cloak Against Visual Place Recognition

Shuting Dong, Mingzhi Chen, Feng Lu et al.

ICCV 2025arXiv:2506.21880

#16984

Physical Degradation Model-Guided Interferometric Hyperspectral Reconstruction with Unfolding Transformer

Yuansheng Li, Yunhao Zou, Linwei Chen et al.

#16985

Purity Law for Neural Routing Problem Solvers with Enhanced Generalizability

Wenzhao Liu, Haoran Li, Congying Han et al.

ICCV 2025arXiv:2509.17712

#16986

RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion

Geonho Bang, Minjae Seong, Jisong Kim et al.

#16987

Multi-Modal Interactive Agent Layer for Few-Shot Universal Cross-Domain Retrieval and Beyond

Kaixiang Chen, Pengfei Fang, hui xue

#16988

Price of Parsimony: Complexity of Fourier Sparsity Testing

Arijit Ghosh, Manmatha Roy

#16989

CrypticBio: A Large Multimodal Dataset for Visually Confusing Species

Georgiana Manolache, Gerard Schouten, Joaquin Vanschoren

#16990

Leveraging Panoptic Scene Graph for Evaluating Fine-Grained Text-to-Image Generation

Xueqing Deng, Linjie Yang, Qihang Yu et al.

#16991

Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment

Renye Yan, Jikang Cheng, Yaozhong Gan et al.

#16992

SGSST: Scaling Gaussian Splatting Style Transfer

Bruno Galerne, Jianling WANG, Lara Raad et al.

#16993

Listening to the Brain: Multi-Band sEEG Auditory Reconstruction via Dynamic Spatio-Temporal Hypergraphs

Xueyi Zhang, Ruicong Wang, Jialu Sun et al.

#16994

Unified Medical Lesion Segmentation via Self-referring Indicator

Shijie Chang, Xiaoqi Zhao, Lihe Zhang et al.

#16995

Forensic-MoE: Exploring Comprehensive Synthetic Image Detection Traces with Mixture of Experts

Mingqi Fang, Ziguang Li, Lingyun Yu et al.

#16996

FastJSMA: Accelerating Jacobian-based Saliency Map Attacks through Gradient Decoupling

Zhenghao Gao, Shengjie Xu, Zijing Li et al.

#16997

GSRecon: Efficient Generalizable Gaussian Splatting for Surface Reconstruction from Sparse Views

Hang Yang, Le Hui, Jianjun Qian et al.