Most Cited 2025 "lora tuning" Papers

22,274 papers found • Page 19 of 112

#3601

NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models

Sung-Yeon Park, Can Cui, Yunsheng Ma et al.

ICCV 2025arXiv:2503.12772
13
citations
#3602

HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models

Pei Lin

AAAI 2025paperarXiv:2312.04867
13
citations
#3603

Large Language Models Miss the Multi-agent Mark

Emanuele La Malfa, Gabriele La Malfa, Samuele Marro et al.

NEURIPS 2025arXiv:2505.21298
13
citations
#3604

Teaching Language Models to Evolve with Users: Dynamic Profile Modeling for Personalized Alignment

Weixiang Zhao, Xingyu Sui, Yulin Hu et al.

NEURIPS 2025arXiv:2505.15456
13
citations
#3605

ParetoFlow: Guided Flows in Multi-Objective Optimization

Ye Yuan, Can Chen, Christopher Pal et al.

ICLR 2025arXiv:2412.03718
13
citations
#3606

LUDVIG: Learning-Free Uplifting of 2D Visual Features to Gaussian Splatting Scenes

Juliette Marrie, Romain Menegaux, Michael Arbel et al.

ICCV 2025arXiv:2410.14462
13
citations
#3607

Active Evaluation Acquisition for Efficient LLM Benchmarking

Yang Li, Jie Ma, Miguel Ballesteros et al.

ICML 2025arXiv:2410.05952
13
citations
#3608

Rethinking Spiking Neural Networks from an Ensemble Learning Perspective

Yongqi Ding, Lin Zuo, Mengmeng Jing et al.

ICLR 2025oralarXiv:2502.14218
13
citations
#3609

Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning

Minheng Ni, Zhengyuan Yang, Linjie Li et al.

NEURIPS 2025arXiv:2505.19702
13
citations
#3610

Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction

Weirong Chen, Ganlin Zhang, Felix Wimbauer et al.

ICCV 2025arXiv:2504.14516
13
citations
#3611

LBM: Latent Bridge Matching for Fast Image-to-Image Translation

Clément Chadebec, Onur Tasar, Sanjeev Sreetharan et al.

ICCV 2025highlightarXiv:2503.07535
13
citations
#3612

Neuroplastic Expansion in Deep Reinforcement Learning

Jiashun Liu, Johan S Obando Ceron, Aaron Courville et al.

ICLR 2025arXiv:2410.07994
13
citations
#3613

Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search

Max Liu, Chan-Hung Yu, Wei-Hsu Lee et al.

ICLR 2025arXiv:2405.16450
13
citations
#3614

Inverse problems with experiment-guided AlphaFold

Sai Advaith Maddipatla, Nadav Bojan, Meital Bojan et al.

ICML 2025arXiv:2502.09372
13
citations
#3615

Port-Hamiltonian Architectural Bias for Long-Range Propagation in Deep Graph Networks

Simon Heilig, Alessio Gravina, Alessandro Trenta et al.

ICLR 2025arXiv:2405.17163
13
citations
#3616

LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws

Prasanna Mayilvahanan, Thaddäus Wiedemer, Sayak Mallick et al.

ICML 2025arXiv:2502.12120
13
citations
#3617

Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models

Thao Nguyen, Yang Li, Olga Golovneva et al.

COLM 2025paperarXiv:2506.04689
13
citations
#3618

Linguini: A benchmark for language-agnostic linguistic reasoning

Eduardo Sánchez, Belen Alastruey, Christophe Ropers et al.

NEURIPS 2025arXiv:2409.12126
13
citations
#3619

MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation

Pingrui Zhang, Xianqiang Gao, Yuhan Wu et al.

ICCV 2025arXiv:2503.11081
13
citations
#3620

Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation

Lokesh Veeramacheneni, Moritz Wolter, Hilde Kuehne et al.

ICLR 2025arXiv:2312.15289
13
citations
#3621

Geometry Informed Tokenization of Molecules for Language Model Generation

Xiner Li, Limei Wang, Youzhi Luo et al.

ICML 2025arXiv:2408.10120
13
citations
#3622

Zero-Shot Monocular Scene Flow Estimation in the Wild

Yiqing Liang, Abhishek Badki, Hang Su et al.

CVPR 2025arXiv:2501.10357
13
citations
#3623

Bayesian scaling laws for in-context learning

Aryaman Arora, Dan Jurafsky, Christopher Potts et al.

COLM 2025paperarXiv:2410.16531
13
citations
#3624

ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models

Ozgur Kara, Krishna Kumar Singh, Feng Liu et al.

CVPR 2025arXiv:2505.07652
13
citations
#3625

VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning

Xueqing Wu, Yuheng Ding, Bingxuan Li et al.

CVPR 2025arXiv:2412.02172
13
citations
#3626

Policy Filtration for RLHF to Mitigate Noise in Reward Models

Chuheng Zhang, Wei Shen, Li Zhao et al.

ICML 2025arXiv:2409.06957
13
citations
#3627

Temporally Consistent Object-Centric Learning by Contrasting Slots

Anna Manasyan, Maximilian Seitzer, Filip Radovic et al.

CVPR 2025arXiv:2412.14295
13
citations
#3628

Causal Composition Diffusion Model for Closed-loop Traffic Generation

Haohong Lin, Xin Huang, Tung Phan-Minh et al.

CVPR 2025arXiv:2412.17920
13
citations
#3629

MotionPro: A Precise Motion Controller for Image-to-Video Generation

Zhongwei Zhang, Fuchen Long, Zhaofan Qiu et al.

CVPR 2025arXiv:2505.20287
13
citations
#3630

Decision Information Meets Large Language Models: The Future of Explainable Operations Research

Yansen Zhang, Qingcan Kang, Wing Yin YU et al.

ICLR 2025arXiv:2502.09994
13
citations
#3631

PDE-Controller: LLMs for Autoformalization and Reasoning of PDEs

Mauricio Soroco, Jialin Song, Mengzhou Xia et al.

ICML 2025arXiv:2502.00963
13
citations
#3632

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Jing Tan, Shuai Yang, Tong Wu et al.

NEURIPS 2025arXiv:2412.03552
13
citations
#3633

InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models

Cong Wei, Yujie Zhong, yingsen zeng et al.

ICCV 2025arXiv:2412.14006
13
citations
#3634

LucidPPN: Unambiguous Prototypical Parts Network for User-centric Interpretable Computer Vision

Mateusz Pach, Koryna Lewandowska, Jacek Tabor et al.

ICLR 2025arXiv:2405.14331
13
citations
#3635

Distributional Diffusion Models with Scoring Rules

Valentin De Bortoli, Alexandre Galashov, J Swaroop Guntupalli et al.

ICML 2025arXiv:2502.02483
13
citations
#3636

Detect Anything 3D in the Wild

Hanxue Zhang, Haoran Jiang, Qingsong Yao et al.

ICCV 2025arXiv:2504.07958
13
citations
#3637

UniNet: A Contrastive Learning-guided Unified Framework with Feature Selection for Anomaly Detection

Shun Wei, Jielin Jiang, Xiaolong Xu

CVPR 2025
13
citations
#3638

Learning Few-Step Diffusion Models by Trajectory Distribution Matching

Yihong Luo, Tianyang Hu, Jiacheng Sun et al.

ICCV 2025arXiv:2503.06674
13
citations
#3639

RelCon: Relative Contrastive Learning for a Motion Foundation Model for Wearable Data

Maxwell Xu, Jaya Narain, Gregory Darnell et al.

ICLR 2025arXiv:2411.18822
13
citations
#3640

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLMs

Xinyu Fang, Zhijian Chen, Kai Lan et al.

ICCV 2025arXiv:2503.14478
13
citations
#3641

MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion

Zebin He, Mx Yang, Shuhui Yang et al.

ICCV 2025highlightarXiv:2503.10289
13
citations
#3642

Exploring More from Multiple Gait Modalities for Human Identification

Dongyang Jin, Chao Fan, Weihua Chen et al.

AAAI 2025paperarXiv:2412.11495
13
citations
#3643

Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation

Tiansheng Wen, Yifei Wang, Zequn Zeng et al.

ICML 2025oralarXiv:2503.01776
13
citations
#3644

The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise

Shuze Daniel Liu, Shuhang Chen, Shangtong Zhang

NEURIPS 2025oralarXiv:2401.07844
13
citations
#3645

BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems

Andy Zhang, Joey Ji, Celeste Menders et al.

NEURIPS 2025arXiv:2505.15216
13
citations
#3646

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Jiale Cheng, Xiao Liu, Cunxiang Wang et al.

ICLR 2025arXiv:2412.11605
13
citations
#3647

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Jiarui Yao, Yifan Hao, Hanning Zhang et al.

NEURIPS 2025arXiv:2505.02391
13
citations
#3648

TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

Xiang Li, Yunshi Lan, Chao Yang

AAAI 2025paperarXiv:2402.13125
13
citations
#3649

Instant Adversarial Purification with Adversarial Consistency Distillation

Chun Tong Lei, Hon Ming Yam, Zhongliang Guo et al.

CVPR 2025arXiv:2408.17064
13
citations
#3650

Speech Robust Bench: A Robustness Benchmark For Speech Recognition

Muhammad Shah, David Solans Noguero, Mikko Heikkilä et al.

ICLR 2025arXiv:2403.07937
13
citations
#3651

Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets

Ning LU, Shengcai Liu, Jiahao Wu et al.

ICML 2025arXiv:2505.12038
13
citations
#3652

OmniAudio: Generating Spatial Audio from 360-Degree Video

Huadai Liu, Tianyi Luo, Kaicheng Luo et al.

ICML 2025arXiv:2504.14906
13
citations
#3653

Growing a Twig to Accelerate Large Vision-Language Models

Zhenwei Shao, Mingyang Wang, Zhou Yu et al.

ICCV 2025arXiv:2503.14075
13
citations
#3654

ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments

Hojae Han, seung-won hwang, Rajhans Samdani et al.

ICLR 2025arXiv:2502.19852
13
citations
#3655

Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation

Eliot Xing, Vernon Luk, Jean Oh

ICLR 2025arXiv:2412.12089
13
citations
#3656

Boosting Segment Anything Model Towards Open-Vocabulary Learning

Xumeng Han, Longhui Wei, Xuehui Yu et al.

AAAI 2025paperarXiv:2312.03628
13
citations
#3657

OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Yiren Song, Cheng Liu, Mike Zheng Shou

NEURIPS 2025arXiv:2505.18445
13
citations
#3658

METASCENES: Towards Automated Replica Creation for Real-world 3D Scans

Huangyue Yu, Baoxiong Jia, Yixin Chen et al.

CVPR 2025arXiv:2505.02388
13
citations
#3659

ATLAS: Autoformalizing Theorems through Lifting, Augmentation, and Synthesis of Data

Xiaoyang Liu, Kangjie Bao, Jiashuo Zhang et al.

NEURIPS 2025arXiv:2502.05567
13
citations
#3660

Semi-supervised Concept Bottleneck Models

Lijie Hu, Tianhao Huang, Huanyi Xie et al.

ICCV 2025arXiv:2406.18992
13
citations
#3661

Epsilon-VAE: Denoising as Visual Decoding

Long Zhao, Sanghyun Woo, Ziyu Wan et al.

ICML 2025arXiv:2410.04081
13
citations
#3662

Training a Generally Curious Agent

Fahim Tajwar, Yiding Jiang, Abitha Thankaraj et al.

ICML 2025oralarXiv:2502.17543
13
citations
#3663

Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization

Zhanfeng Mo, Long-Kai Huang, Sinno Jialin Pan

ICLR 2025
13
citations
#3664

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Minki Kang, Jongwon Jeong, Seanie Lee et al.

NEURIPS 2025spotlightarXiv:2505.17612
13
citations
#3665

Large language models can learn and generalize steganographic chain-of-thought under process supervision

ROBERT MC CARTHY, Joey SKAF, Luis Ibanez-Lissen et al.

NEURIPS 2025arXiv:2506.01926
13
citations
#3666

Identifying Query-Relevant Neurons in Large Language Models for Long-Form Texts

Lihu Chen, Adam Dejl, Francesca Toni

AAAI 2025paperarXiv:2406.10868
13
citations
#3667

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Enshu Liu, Xuefei Ning, Yu Wang et al.

ICLR 2025arXiv:2412.17153
13
citations
#3668

VPO: Aligning Text-to-Video Generation Models with Prompt Optimization

Jiale Cheng, Ruiliang Lyu, Xiaotao Gu et al.

ICCV 2025arXiv:2503.20491
13
citations
#3669

MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization

Hengjia Li, Lifan Jiang, Xi Xiao et al.

ICCV 2025arXiv:2503.12689
13
citations
#3670

Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology

Siyuan Yan, Ming Hu, Yiwen Jiang et al.

ICCV 2025highlightarXiv:2503.14911
13
citations
#3671

CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring

Benjamin Arnav, Pablo Bernabeu-Perez, Nathan Helm-Burger et al.

NEURIPS 2025arXiv:2505.23575
13
citations
#3672

InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting

Chenxin Li, Hengyu Liu, Zhiwen Fan et al.

ICLR 2025
13
citations
#3673

DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution

Xingyuan Li, Zirui Wang, Yang Zou et al.

CVPR 2025arXiv:2503.01187
13
citations
#3674

Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty

Meera Hahn, Wenjun Zeng, Nithish Kannen et al.

ICML 2025arXiv:2412.06771
13
citations
#3675

Imputation for prediction: beware of diminishing returns.

Marine Le Morvan, Gael Varoquaux

ICLR 2025arXiv:2407.19804
13
citations
#3676

Unlearn and Burn: Adversarial Machine Unlearning Requests Destroy Model Accuracy

Yangsibo Huang, Daogao Liu, Lynn Chua et al.

ICLR 2025arXiv:2410.09591
13
citations
#3677

Towards a Universal Synthetic Video Detector: From Face or Background Manipulations to Fully AI-Generated Content

Rohit Kundu, Hao Xiong, Vishal Mohanty et al.

CVPR 2025arXiv:2412.12278
13
citations
#3678

$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs

Vlad Sobal, Mark Ibrahim, Randall Balestriero et al.

ICLR 2025arXiv:2407.18134
13
citations
#3679

Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching

Bin Wang, Fan Wu, Linke Ouyang et al.

CVPR 2025arXiv:2409.03643
13
citations
#3680

X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP

Hanxun Huang, Sarah Erfani, Yige Li et al.

ICML 2025arXiv:2505.05528
13
citations
#3681

Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-LLM Systems

Shangbin Feng, Zifeng Wang, Palash Goyal et al.

NEURIPS 2025arXiv:2502.04510
13
citations
#3682

Audio-Visual Instance Segmentation

Ruohao Guo, Xianghua Ying, Yaru Chen et al.

CVPR 2025arXiv:2310.18709
13
citations
#3683

Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion

Zhiqiang Yan, Zhengxue Wang, Kun Wang et al.

CVPR 2025arXiv:2412.19225
13
citations
#3684

GWM: Towards Scalable Gaussian World Models for Robotic Manipulation

Guanxing Lu, Baoxiong Jia, Puhao Li et al.

ICCV 2025arXiv:2508.17600
13
citations
#3685

Ambient Diffusion Omni: Training Good Models with Bad Data

Giannis Daras, Adrian Rodriguez-Munoz, Adam Klivans et al.

NEURIPS 2025spotlightarXiv:2506.10038
13
citations
#3686

Grounding Continuous Representations in Geometry: Equivariant Neural Fields

David Wessels, David Knigge, Riccardo Valperga et al.

ICLR 2025arXiv:2406.05753
13
citations
#3687

Object-level Geometric Structure Preserving for Natural Image Stitching

Wenxiao Cai, Wankou Yang

AAAI 2025paperarXiv:2402.12677
13
citations
#3688

Geometry Aware Operator Transformer as an efficient and accurate neural surrogate for PDEs on arbitrary domains

Shizheng Wen, Arsh Kumbhat, Levi Lingsch et al.

NEURIPS 2025arXiv:2505.18781
13
citations
#3689

Universal Sharpness Dynamics in Neural Network Training: Fixed Point Analysis, Edge of Stability, and Route to Chaos

Dayal Singh Kalra, Tianyu He, Maissam Barkeshli

ICLR 2025arXiv:2311.02076
13
citations
#3690

StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation

Shangjin Zhai, Zhichao Ye, Jialin Liu et al.

CVPR 2025arXiv:2501.05763
13
citations
#3691

MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices

Jianwen Jiang, Gaojie Lin, Zhengkun Rong et al.

CVPR 2025arXiv:2407.05712
13
citations
#3692

FSTA-SNN:Frequency-Based Spatial-Temporal Attention Module for Spiking Neural Networks

Kairong Yu, Tianqing Zhang, Hongwei Wang et al.

AAAI 2025paper
13
citations
#3693

Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation

Qi Lv, Hao Li, Xiang Deng et al.

CVPR 2025arXiv:2503.10743
13
citations
#3694

A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models

Yiming Chen, yuan zhang, Yin Liu et al.

ICML 2025arXiv:2502.07222
13
citations
#3695

From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspective

Chen Zhao, Zhizhou Chen, Yunzhe Xu et al.

CVPR 2025arXiv:2503.13165
13
citations
#3696

SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography

Xuanyu Zhang, Jiarui Meng, Zhipei Xu et al.

ICLR 2025arXiv:2503.06118
13
citations
#3697

HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution

Yuxuan Jiang, Ho Man Kwan, jasmine peng et al.

CVPR 2025arXiv:2412.03748
13
citations
#3698

RoboTron-Mani: All-in-One Multimodal Large Model for Robotic Manipulation

Feng yan, Fanfan Liu, Yiyang Huang et al.

ICCV 2025arXiv:2412.07215
13
citations
#3699

Learning Transformer-based World Models with Contrastive Predictive Coding

Maxime Burchi, Radu Timofte

ICLR 2025oralarXiv:2503.04416
13
citations
#3700

Progressive Focused Transformer for Single Image Super-Resolution

Wei Long, Xingyu Zhou, Leheng Zhang et al.

CVPR 2025arXiv:2503.20337
13
citations
#3701

Knowledge Is Power: Harnessing Large Language Models for Enhanced Cognitive Diagnosis

Zhiang Dong, Jingyuan Chen, Fei Wu

AAAI 2025paperarXiv:2502.05556
13
citations
#3702

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11)

Tianyi Zhang, Mohsen Hariri, Shaochen (Henry) Zhong et al.

NEURIPS 2025arXiv:2504.11651
13
citations
#3703

Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage

Md Rafi Ur Rashid, Jing Liu, Toshiaki Koike-Akino et al.

AAAI 2025paperarXiv:2408.17354
13
citations
#3704

Surprising Effectiveness of pretraining Ternary Language Model at Scale

Ayush Kaushal, Tejas Vaidhya, Arnab Mondal et al.

ICLR 2025arXiv:2407.12327
13
citations
#3705

Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning

Mushui Liu, Fangtai Wu, Bozheng Li et al.

AAAI 2025paperarXiv:2408.12469
13
citations
#3706

DeepGate4: Efficient and Effective Representation Learning for Circuit Design at Scale

Ziyang Zheng, Shan Huang, Jianyuan Zhong et al.

ICLR 2025arXiv:2502.01681
13
citations
#3707

BadJudge: Backdoor Vulnerabilities of LLM-As-A-Judge

Terry Tong, Fei Wang, Zhe Zhao et al.

ICLR 2025arXiv:2503.00596
13
citations
#3708

Targeted Attack Improves Protection against Unauthorized Diffusion Customization

Boyang Zheng, Chumeng Liang, Xiaoyu Wu

ICLR 2025arXiv:2310.04687
13
citations
#3709

Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model

Chaochen Gao, Xing W, Qi Fu et al.

ICLR 2025arXiv:2405.19846
13
citations
#3710

C-CLIP: Multimodal Continual Learning for Vision-Language Model

Wenzhuo Liu, Fei Zhu, Longhui Wei et al.

ICLR 2025
13
citations
#3711

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Makoto Shing, Kou Misaki, Han Bao et al.

ICLR 2025oralarXiv:2501.16937
13
citations
#3712

Instruction-Following Pruning for Large Language Models

Bairu Hou, Qibin Chen, Jianyu Wang et al.

ICML 2025arXiv:2501.02086
13
citations
#3713

ACC-Collab: An Actor-Critic Approach to Multi-Agent LLM Collaboration

Andrew Estornell, Jean-Francois Ton, Yuanshun Yao et al.

ICLR 2025arXiv:2411.00053
13
citations
#3714

Cost-efficient Collaboration between On-device and Cloud Language Models

Avanika Narayan, Dan Biderman, Sabri Eyuboglu et al.

ICML 2025arXiv:2502.15964
13
citations
#3715

VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models

Lisa Dunlap, Krishna Mandal, trevor darrell et al.

ICLR 2025arXiv:2410.12851
13
citations
#3716

The Last Iterate Advantage: Empirical Auditing and Principled Heuristic Analysis of Differentially Private SGD

Milad Nasr, Thomas Steinke, Borja Balle et al.

ICLR 2025arXiv:2410.06186
13
citations
#3717

Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code

Augusto B. Corrêa, André G. Pereira, Jendrik Seipp

NEURIPS 2025arXiv:2503.18809
13
citations
#3718

Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward

Yanming Wan, Jiaxing Wu, Marwa Abdulhai et al.

NEURIPS 2025arXiv:2504.03206
13
citations
#3719

Learning Molecular Representation in a Cell

Gang Liu, Srijit Seal, John Arevalo et al.

ICLR 2025arXiv:2406.12056
13
citations
#3720

Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models

Pit Neitemeier, Björn Deiseroth, Constantin Eichenberg et al.

ICLR 2025arXiv:2501.10322
13
citations
#3721

Simplifying DINO via Coding Rate Regularization

Ziyang Wu, Jingyuan Zhang, Druv Pai et al.

ICML 2025arXiv:2502.10385
13
citations
#3722

HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding

Shehreen Azad, Vibhav Vineet, Yogesh S. Rawat

CVPR 2025arXiv:2503.08585
13
citations
#3723

MoDec-GS: Global-to-Local Motion Decomposition and Temporal Interval Adjustment for Compact Dynamic 3D Gaussian Splatting

Sangwoon Kwak, Joonsoo Kim, Jun Young Jeong et al.

CVPR 2025arXiv:2501.03714
13
citations
#3724

Measuring Non-Adversarial Reproduction of Training Data in Large Language Models

Michael Aerni, Javier Rando, Edoardo Debenedetti et al.

ICLR 2025arXiv:2411.10242
13
citations
#3725

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning

Kongcheng Zhang, QI YAO, Shunyu Liu et al.

NEURIPS 2025arXiv:2506.08745
13
citations
#3726

Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation

Yuyang Wanyan, Xi Zhang, Haiyang Xu et al.

NEURIPS 2025arXiv:2506.04614
13
citations
#3727

AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models

Ziyin Zhou, Yunpeng Luo, Yuanchen Wu et al.

ICCV 2025arXiv:2507.02664
13
citations
#3728

Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration

Zhixuan Shen, Haonan Luo, Kexun Chen et al.

AAAI 2025paperarXiv:2412.18292
13
citations
#3729

Faster Algorithms for Structured Linear and Kernel Support Vector Machines

Yuzhou Gu, Zhao Song, Lichen Zhang

ICLR 2025arXiv:2307.07735
13
citations
#3730

Detecting High-Stakes Interactions with Activation Probes

Alex McKenzie, Urja Pawar, Phil Blandfort et al.

NEURIPS 2025arXiv:2506.10805
13
citations
#3731

Automatic Curriculum Expert Iteration for Reliable LLM Reasoning

Zirui Zhao, Hanze Dong, Amrita Saha et al.

ICLR 2025arXiv:2410.07627
13
citations
#3732

Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace

Jinluan Yang, Anke Tang, Didi Zhu et al.

ICLR 2025arXiv:2410.13910
13
citations
#3733

GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting

Junzhe Jiang, Chun Gu, Yurui Chen et al.

ICLR 2025arXiv:2501.13971
13
citations
#3734

Learning local equivariant representations for quantum operators

YinZhangHao Zhou, Zixi Gan, Shishir Pandey et al.

ICLR 2025arXiv:2407.06053
13
citations
#3735

A Unifying Framework for Representation Learning

Shaden Alshammari, John Hershey, Axel Feldmann et al.

ICLR 2025arXiv:2504.16929
13
citations
#3736

Pre-Training Graph Neural Networks on Molecules by Using Subgraph-Conditioned Graph Information Bottleneck

Van Thuy Hoang, O-Joun Lee

AAAI 2025paperarXiv:2412.15589
13
citations
#3737

DropoutGS: Dropping Out Gaussians for Better Sparse-view Rendering

Yexing Xu, Longguang Wang, Minglin Chen et al.

CVPR 2025arXiv:2504.09491
13
citations
#3738

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers

Gangwei Xu, Haotong Lin, Hongcheng Luo et al.

NEURIPS 2025arXiv:2510.07316
13
citations
#3739

Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing

Ruiyi Wang, Yushuo Zheng, Zicheng Zhang et al.

CVPR 2025arXiv:2503.19262
13
citations
#3740

BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization

Xueyang Zhou, Guiyao Tie, Guowen Zhang et al.

NEURIPS 2025arXiv:2505.16640
13
citations
#3741

Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning

Tianle Xia, Liang Ding, Guojia Wan et al.

AAAI 2025paperarXiv:2405.01649
13
citations
#3742

What Makes a Maze Look Like a Maze?

Joy Hsu, Jiayuan Mao, Joshua B Tenenbaum et al.

ICLR 2025arXiv:2409.08202
13
citations
#3743

FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks

Luca Della Libera, Francesco Paissan, Cem Subakan et al.

NEURIPS 2025arXiv:2502.04465
13
citations
#3744

High-Fidelity Simultaneous Speech-To-Speech Translation

Tom Labiausse, Laurent Mazaré, Edouard Grave et al.

ICML 2025arXiv:2502.03382
13
citations
#3745

Latent Thought Models with Variational Bayes Inference-Time Computation

Deqian Kong, Minglu Zhao, Dehong Xu et al.

ICML 2025arXiv:2502.01567
13
citations
#3746

TANGO: Training-free Embodied AI Agents for Open-world Tasks

Filippo Ziliotto, Tommaso Campari, Luciano Serafini et al.

CVPR 2025arXiv:2412.10402
13
citations
#3747

VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models

Xinan He, Yue Zhou, Bing Fan et al.

NEURIPS 2025arXiv:2503.06142
13
citations
#3748

Probabilistic Conformal Prediction with Approximate Conditional Validity

Vincent Plassier, Alexander Fishkov, Mohsen Guizani et al.

ICLR 2025arXiv:2407.01794
13
citations
#3749

ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics

Junchao Zhu, Ruining Deng, Tianyuan Yao et al.

CVPR 2025arXiv:2412.03026
13
citations
#3750

Post-hoc Reward Calibration: A Case Study on Length Bias

Zeyu Huang, Zihan Qiu, zili wang et al.

ICLR 2025arXiv:2409.17407
13
citations
#3751

One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

Yinghui Li, Jiayi Kuang, Haojing Huang et al.

ICML 2025arXiv:2502.10454
13
citations
#3752

AI-Researcher: Autonomous Scientific Innovation

Jiabin Tang, Lianghao Xia, Zhonghang Li et al.

NEURIPS 2025spotlightarXiv:2505.18705
13
citations
#3753

CoRA: Collaborative Information Perception by Large Language Model’s Weights for Recommendation

Yuting Liu, Jinghao Zhang, Yizhou Dang et al.

AAAI 2025paperarXiv:2408.10645
13
citations
#3754

Personalized Visual Instruction Tuning

Renjie Pi, Jianshu Zhang, Tianyang Han et al.

ICLR 2025arXiv:2410.07113
13
citations
#3755

UFM: A Simple Path towards Unified Dense Correspondence with Flow

Yuchen Zhang, Nikhil Keetha, Chenwei Lyu et al.

NEURIPS 2025arXiv:2506.09278
13
citations
#3756

FairGP: A Scalable and Fair Graph Transformer Using Graph Partitioning

Renqiang Luo, Huafei Huang, Ivan Lee et al.

AAAI 2025paperarXiv:2412.10669
13
citations
#3757

Bridging Traffic State and Trajectory for Dynamic Road Network and Trajectory Representation Learning

Chengkai Han, Jingyuan Wang, Yongyao Wang et al.

AAAI 2025paperarXiv:2502.06870
13
citations
#3758

Differential learning kinetics govern the transition from memorization to generalization during in-context learning

Alex Nguyen, Gautam Reddy Nallamala

ICLR 2025arXiv:2412.00104
13
citations
#3759

FlashMask: Efficient and Rich Mask Extension of FlashAttention

Guoxia Wang, Jinle Zeng, Xiyuan Xiao et al.

ICLR 2025arXiv:2410.01359
13
citations
#3760

Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting

Jingru Fei, Kun Yi, Wei Fan et al.

AAAI 2025paperarXiv:2501.17216
13
citations
#3761

CipherPrune: Efficient and Scalable Private Transformer Inference

Yancheng Zhang, Jiaqi Xue, Mengxin Zheng et al.

ICLR 2025arXiv:2502.16782
13
citations
#3762

KLay: Accelerating Arithmetic Circuits for Neurosymbolic AI

Jaron Maene, Vincent Derkinderen, Pedro Zuidberg Dos Martires

ICLR 2025arXiv:2410.11415
13
citations
#3763

RhythmMamba: Fast, Lightweight, and Accurate Remote Physiological Measurement

Bochao Zou, Zizheng Guo, Xiaocheng Hu et al.

AAAI 2025paperarXiv:2404.06483
13
citations
#3764

This Time is Different: An Observability Perspective on Time Series Foundation Models

Ben Cohen, Emaad Khwaja, Youssef Doubli et al.

NEURIPS 2025arXiv:2505.14766
13
citations
#3765

Large Language-Geometry Model: When LLM meets Equivariance

Zongzhao Li, Jiacheng Cen, Bing Su et al.

ICML 2025arXiv:2502.11149
13
citations
#3766

CoRe: Benchmarking LLMs’ Code Reasoning Capabilities through Static Analysis Tasks

Danning Xie, Mingwei Zheng, Xuwei Liu et al.

NEURIPS 2025spotlightarXiv:2507.05269
13
citations
#3767

ProSec: Fortifying Code LLMs with Proactive Security Alignment

Xiangzhe Xu, Zian Su, Jinyao Guo et al.

ICML 2025arXiv:2411.12882
13
citations
#3768

ETTA: Elucidating the Design Space of Text-to-Audio Models

Sang-gil Lee, Zhifeng Kong, ARUSHI GOEL et al.

ICML 2025arXiv:2412.19351
13
citations
#3769

Can Classic GNNs Be Strong Baselines for Graph-level Tasks? Simple Architectures Meet Excellence

Yuankai Luo, Lei Shi, Xiao-Ming Wu

ICML 2025arXiv:2502.09263
13
citations
#3770

Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries

Junhyuck Kim, Jongho Park, Jaewoong Cho et al.

ICML 2025arXiv:2412.08890
13
citations
#3771

POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation

Lanyun Zhu, Tianrun Chen, Qianxiong Xu et al.

CVPR 2025arXiv:2504.00640
13
citations
#3772

CoLLM: A Large Language Model for Composed Image Retrieval

Chuong Huynh, Jinyu Yang, Ashish Tawari et al.

CVPR 2025arXiv:2503.19910
13
citations
#3773

Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering

Xingrui Wang, Wufei Ma, Angtian Wang et al.

ICLR 2025oralarXiv:2406.00622
13
citations
#3774

Local Conditional Controlling for Text-to-Image Diffusion Models

Yibo Zhao, Liang Peng, Yang Yang et al.

AAAI 2025paperarXiv:2312.08768
13
citations
#3775

Flow matching achieves almost minimax optimal convergence

Kenji Fukumizu, Taiji Suzuki, Noboru Isobe et al.

ICLR 2025arXiv:2405.20879
13
citations
#3776

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

Xueyi Liu, Jianibieke Adalibieke, Qianwei Han et al.

ICLR 2025arXiv:2502.09614
13
citations
#3777

MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-text Decoding

Weikang Qiu, Zheng Huang, Haoyu Hu et al.

ICML 2025arXiv:2502.15786
13
citations
#3778

GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection

Jinqing Zhang, Yanan Zhang, Yunlong Qi et al.

AAAI 2025paperarXiv:2409.01816
13
citations
#3779

A Closer Look at TabPFN v2: Understanding Its Strengths and Extending Its Capabilities

Han-Jia Ye, Si-Yang Liu, Wei-Lun (Harry) Chao

NEURIPS 2025arXiv:2502.17361
13
citations
#3780

Standardizing Structural Causal Models

Weronika Ormaniec, Scott Sussex, Lars Lorch et al.

ICLR 2025arXiv:2406.11601
13
citations
#3781

ObjectMover: Generative Object Movement with Video Prior

Xin Yu, Tianyu Wang, Soo Ye Kim et al.

CVPR 2025arXiv:2503.08037
13
citations
#3782

Contextual Integrity in LLMs via Reasoning and Reinforcement Learning

Guangchen (Eric) Lan, Huseyin A. Inan, Sahar Abdelnabi et al.

NEURIPS 2025arXiv:2506.04245
13
citations
#3783

Topology of Reasoning: Understanding Large Reasoning Models through Reasoning Graph Properties

Gouki Minegishi, Hiroki Furuta, Takeshi Kojima et al.

NEURIPS 2025arXiv:2506.05744
13
citations
#3784

Efficient Inference for Large Language Model-based Generative Recommendation

Xinyu Lin, Chaoqun Yang, Wenjie Wang et al.

ICLR 2025arXiv:2410.05165
13
citations
#3785

Worse than Zero-shot? A Fact-Checking Dataset for Evaluating the Robustness of RAG Against Misleading Retrievals

Linda Zeng, Rithwik Gupta, Divij Motwani et al.

NEURIPS 2025arXiv:2502.16101
13
citations
#3786

AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-based Person Re-Identification

Huy Nguyen, Kien Nguyen Thanh, Akila Pemasiri et al.

CVPR 2025arXiv:2503.08121
13
citations
#3787

Repulsive Latent Score Distillation for Solving Inverse Problems

Nicolas Zilberstein, Morteza Mardani, Santiago Segarra

ICLR 2025arXiv:2406.16683
13
citations
#3788

KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment

Yuxing Lu, Wei Wu, Xukai Zhao et al.

NEURIPS 2025spotlightarXiv:2502.06472
13
citations
#3789

DefMamba: Deformable Visual State Space Model

Leiye Liu, Miao Zhang, Jihao Yin et al.

CVPR 2025arXiv:2504.05794
13
citations
#3790

TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training

Felix Krause, Timy Phan, Ming Gui et al.

ICCV 2025arXiv:2501.04765
13
citations
#3791

Intelligence at the Edge of Chaos

Shiyang Zhang, Aakash Patel, Syed Rizvi et al.

ICLR 2025arXiv:2410.02536
13
citations
#3792

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Kaiwen Zheng, Yongxin Chen, Huayu Chen et al.

ICML 2025spotlightarXiv:2503.01103
13
citations
#3793

DVP-MVS: Synergize Depth-Edge and Visibility Prior for Multi-View Stereo

Zhenlong Yuan, Jinguo Luo, Fei Shen et al.

AAAI 2025paperarXiv:2412.11578
13
citations
#3794

Solving Inequality Proofs with Large Language Models

Jiayi Sheng, Luna Lyu, Jikai Jin et al.

NEURIPS 2025spotlightarXiv:2506.07927
13
citations
#3795

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

Xize Cheng, Siqi Zheng, zehan wang et al.

ICLR 2025arXiv:2410.21269
13
citations
#3796

UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation

Rui Tian, Mingfei Gao, Mingze Xu et al.

NEURIPS 2025arXiv:2505.14682
12
citations
#3797

HashAttention: Semantic Sparsity for Faster Inference

Aditya Desai, Shuo Yang, Alejandro Cuadron et al.

ICML 2025arXiv:2412.14468
12
citations
#3798

ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning

Zhenyang Liu, Yikai Wang, Sixiao Zheng et al.

CVPR 2025arXiv:2503.23297
12
citations
#3799

Proxy Denoising for Source-Free Domain Adaptation

Song Tang, Wenxin Su, Yan Gan et al.

ICLR 2025arXiv:2406.01658
12
citations
#3800

SymmCompletion: High-Fidelity and High-Consistency Point Cloud Completion with Symmetry Guidance

Hongyu Yan, Zijun Li, Kunming Luo et al.

AAAI 2025paperarXiv:2503.18007
12
citations