Most Cited 2025 "plug-and-play control" Papers

22,274 papers found • Page 27 of 112

#5201

Diff3DS: Generating View-Consistent 3D Sketch via Differentiable Curve Rendering

Yibo Zhang, Lihong Wang, Changqing Zou et al.

ICLR 2025 • arXiv:2405.15305
9 citations
#5202

REGENT: A Retrieval-Augmented Generalist Agent That Can Act In-Context in New Environments

Kaustubh Sridhar, Souradeep Dutta, Dinesh Jayaraman et al.

ICLR 2025 • arXiv:2412.04759
9 citations
#5203

GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

Sarkar Snigdha Sarathi Das, Ryo Kamoi, Bo Pang et al.

ICLR 2025 • arXiv:2412.09722
9 citations
#5204

Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues

Tao He, Lizi Liao, Yixin Cao et al.

AAAI 2025 (paper) • arXiv:2412.14584
9 citations
#5205

The Elicitation Game: Evaluating Capability Elicitation Techniques

Felix Hofstätter, Teun van der Weij, Jayden Teoh et al.

ICML 2025 • arXiv:2502.02180
9 citations
#5206

Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph

Xujian Liang, Zhaoquan Gu

AAAI 2025 (paper) • arXiv:2501.14300
9 citations
#5207

Near, far: Patch-ordering enhances vision foundation models' scene understanding

Valentinos Pariza, Mohammadreza Salehi, Gertjan J Burghouts et al.

ICLR 2025 • arXiv:2408.11054
9 citations
#5208

CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception

Senkang Hu, Yihang Tao, Guowen Xu et al.

AAAI 2025 (paper) • arXiv:2412.12000
9 citations
#5209

Beyond Sequence: Impact of Geometric Context for RNA Property Prediction

Junjie Xu, Artem Moskalev, Tommaso Mansi et al.

ICLR 2025 • arXiv:2410.11933
9 citations
#5210

Understanding and Improving Length Generalization in Recurrent Models

Ricardo Buitrago Ruiz, Albert Gu

ICML 2025 • arXiv:2507.02782
9 citations
#5211

(Almost Full) EFX for Three (and More) Types of Agents

Pratik Ghosal, Vishwa Prakash HV, Prajakta Nimbhorkar et al.

AAAI 2025 (paper) • arXiv:2301.10632
9 citations
#5212

Constrained Fair and Efficient Allocations

Benjamin Cookson, Soroush Ebadian, Nisarg Shah

AAAI 2025 (paper) • arXiv:2411.00133
9 citations
#5213

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Zhengfeng Lai, Vasileios Saveris, Chen Chen et al.

ICLR 2025 • arXiv:2410.02740
9 citations
#5214

FloNa: Floor Plan Guided Embodied Visual Navigation

Jiaxin Li, Weiqi Huang, Zan Wang et al.

AAAI 2025 (paper) • arXiv:2412.18335
9 citations
#5215

Counterfactual Generative Modeling with Variational Causal Inference

Yulun Wu, Louis McConnell, Claudia Iriondo

ICLR 2025 • arXiv:2410.12730
9 citations
#5216

KPL: Training-Free Medical Knowledge Mining of Vision-Language Models

Jiaxiang Liu, Tianxiang Hu, Jiawei Du et al.

AAAI 2025 (paper) • arXiv:2501.11231
9 citations
#5217

TimeCHEAT: A Channel Harmony Strategy for Irregularly Sampled Multivariate Time Series Analysis

Jiexi Liu, Meng Cao, Songcan Chen

AAAI 2025 (paper) • arXiv:2412.12886
9 citations
#5218

Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning

Chongyi Zheng, Jens Tuyls, Joanne Peng et al.

ICLR 2025 • arXiv:2412.08021
9 citations
#5219

Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation

Sicong Liu, Yang Shu, Chenjuan Guo et al.

ICLR 2025 (oral) • arXiv:2503.21200
9 citations
#5220

Explain Yourself, Briefly! Self-Explaining Neural Networks with Concise Sufficient Reasons

Shahaf Bassan, Ron Eliav, Shlomit Gur

ICLR 2025 • arXiv:2502.03391
9 citations
#5221

Semi-Supervised Multi-View Multi-Label Learning with View-Specific Transformer and Enhanced Pseudo-Label

Quanjiang Li, Tingjin Luo, Mingdie Jiang et al.

AAAI 2025 (paper)
9 citations
#5222

Improved Finite-Particle Convergence Rates for Stein Variational Gradient Descent

Sayan Banerjee, Krishna Balasubramanian, Promit Ghosal

ICLR 2025 • arXiv:2409.08469
9 citations
#5223

HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Ling Yang, Xinchen Zhang, Ye Tian et al.

NeurIPS 2025 • arXiv:2502.12148
9 citations
#5224

Highly Compressed Tokenizer Can Generate Without Training

Lukas Lao Beyer, Tianhong Li, Xinlei Chen et al.

ICML 2025 • arXiv:2506.08257
9 citations
#5225

Multi-Granular Multimodal Clue Fusion for Meme Understanding

Li Zheng, Hao Fei, Ting Dai et al.

AAAI 2025 (paper) • arXiv:2503.12560
9 citations
#5226

Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems

Junyi Ye, Jingyi Gu, Xinyun Zhao et al.

AAAI 2025 (paper) • arXiv:2410.18336
9 citations
#5227

One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models

Yutao Zhu, Zhaoheng Huang, Zhicheng Dou et al.

AAAI 2025 (paper) • arXiv:2405.19670
9 citations
#5228

Intermediate Layer Classifiers for OOD generalization

Arnas Uselis, Seong Joon Oh

ICLR 2025 • arXiv:2504.05461
9 citations
#5229

LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning About Actions

Adam Ishay, Joohyung Lee

AAAI 2025 (paper) • arXiv:2501.00830
9 citations
#5230

TabFlex: Scaling Tabular Learning to Millions with Linear Attention

Yuchen Zeng, Tuan Dinh, Wonjun Kang et al.

ICML 2025 (spotlight) • arXiv:2506.05584
9 citations
#5231

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Yekun Chai, Haoran Sun, Huang Fang et al.

ICLR 2025 (oral) • arXiv:2410.02743
9 citations
#5232

Loss Functions and Operators Generated by f-Divergences

Vincent Roulet, Tianlin Liu, Nino Vieillard et al.

ICML 2025 • arXiv:2501.18537
9 citations
#5233

Extractive Structures Learned in Pretraining Enable Generalization on Finetuned Facts

Jiahai Feng, Stuart Russell, Jacob Steinhardt

ICML 2025 • arXiv:2412.04614
9 citations
#5234

GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions

Heda Zuo, Weitao You, Junxian Wu et al.

AAAI 2025 (paper) • arXiv:2501.09972
9 citations
#5235

MAGE: Model-Level Graph Neural Networks Explanations via Motif-based Graph Generation

Zhaoning Yu, Hongyang Gao

ICLR 2025 • arXiv:2405.12519
9 citations
#5236

Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning

Anh Tong, Thanh Nguyen-Tang, Dongeun Lee et al.

ICLR 2025 • arXiv:2503.01329
9 citations
#5237

Realistic Evaluation of Deep Partial-Label Learning Algorithms

Wei Wang, Dong-Dong Wu, Jindong Wang et al.

ICLR 2025 • arXiv:2502.10184
9 citations
#5238

$\gamma$-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models

Yaxin Luo, Gen Luo, Jiayi Ji et al.

ICLR 2025
9 citations
#5239

Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations

Brian Zheng, Alisa Liu, Orevaoghene Ahia et al.

NeurIPS 2025 (spotlight) • arXiv:2506.19004
9 citations
#5240

ADIFF: Explaining audio difference using natural language

Soham Deshmukh, Shuo Han, Rita Singh et al.

ICLR 2025 • arXiv:2502.04476
9 citations
#5241

Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs

Hao Fang, Changle Zhou, Jiawei Kong et al.

NeurIPS 2025 • arXiv:2505.19678
9 citations
#5242

QuaDiM: A Conditional Diffusion Model For Quantum State Property Estimation

Yehui Tang, Mabiao Long, Junchi Yan

ICLR 2025
9 citations
#5243

A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach

Swetha Ganesh, Washim Mondal, Vaneet Aggarwal

ICML 2025 • arXiv:2407.18878
9 citations
#5244

A Generalist Intracortical Motor Decoder

Joel Ye, Fabio Rizzoglio, Xuan Ma et al.

NeurIPS 2025
9 citations
#5245

From Specificity to Generality: Revisiting Generalizable Artifacts in Detecting Face Deepfakes

Long Ma, Zhiyuan Yan, Jin Xu et al.

NeurIPS 2025 • arXiv:2504.04827
9 citations
#5246

Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations

Decheng Liu, Zongqi Wang, Chunlei Peng et al.

AAAI 2025 (paper) • arXiv:2407.14367
9 citations
#5247

MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation

Zhiwei Yang, Yucong Meng, Kexue Fu et al.

AAAI 2025 (paper) • arXiv:2412.11076
9 citations
#5248

Stochastic Forward–Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets

Haoye Lu, Qifan Wu, Yaoliang Yu

ICML 2025 • arXiv:2502.05446
9 citations
#5249

Synthesizing Privacy-Preserving Text Data via Finetuning *without* Finetuning Billion-Scale LLMs

Bowen Tan, Zheng Xu, Eric Xing et al.

ICML 2025 • arXiv:2503.12347
9 citations
#5250

Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding

Xin Gu, Yaojie Shen, Chenxi Luo et al.

ICLR 2025 (oral) • arXiv:2502.11168
9 citations
#5251

Dialogue Without Limits: Constant-Sized KV Caches for Extended Response in LLMs

Ravi Ghadia, Avinash Kumar, Gaurav Jain et al.

ICML 2025 • arXiv:2503.00979
9 citations
#5252

Adversarial Generative Flow Network for Solving Vehicle Routing Problems

Ni Zhang, Jingfeng Yang, Zhiguang Cao et al.

ICLR 2025 • arXiv:2503.01931
9 citations
#5253

MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition

Yang Yang, Xunde Dong, Yupeng Qiang

AAAI 2025 (paper) • arXiv:2502.12478
9 citations
#5254

MUSE: Mamba Is Efficient Multi-scale Learner for Text-video Retrieval

Haoran Tang, Meng Cao, Jinfa Huang et al.

AAAI 2025 (paper) • arXiv:2408.10575
9 citations
#5255

Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning

Hui-Yue Yang, Hui Chen, Ao Wang et al.

AAAI 2025 (paper) • arXiv:2411.17217
9 citations
#5256

DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes

Yiyuan Liang, Zhiying Yan, Liqun Chen et al.

AAAI 2025 (paper) • arXiv:2412.19458
9 citations
#5257

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Zheyang Xiong, Jack Cai, John Cooper et al.

ICML 2025 (spotlight) • arXiv:2410.05603
9 citations
#5258

Be More Diverse than the Most Diverse: Optimal Mixtures of Generative Models via Mixture-UCB Bandit Algorithms

Parham Rezaei, Farzan Farnia, Cheuk Ting Li

ICLR 2025 • arXiv:2412.17622
9 citations
#5259

Deep Linear Network Training Dynamics from Random Initialization: Data, Width, Depth, and Hyperparameter Transfer

Blake Bordelon, Cengiz Pehlevan

ICML 2025 • arXiv:2502.02531
9 citations
#5260

Can Textual Gradient Work in Federated Learning?

Minghui Chen, Ruinan Jin, Wenlong Deng et al.

ICLR 2025 • arXiv:2502.19980
9 citations
#5261

Value-Based Deep RL Scales Predictably

Oleh Rybkin, Michal Nauman, Preston Fu et al.

ICML 2025 • arXiv:2502.04327
9 citations
#5262

VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion

Meng Wang, Huilong Pi, Ruihui Li et al.

AAAI 2025 (paper) • arXiv:2503.06219
9 citations
#5263

A Training-Free Sub-quadratic Cost Transformer Model Serving Framework with Hierarchically Pruned Attention

Heejun Lee, Geon Park, Youngwan Lee et al.

ICLR 2025 • arXiv:2406.09827
9 citations
#5264

Feature Denoising Diffusion Model for Blind Image Quality Assessment

Xudong Li, Yan Zhang, Yunhang Shen et al.

AAAI 2025 (paper) • arXiv:2401.11949
9 citations
#5265

Chaos Meets Attention: Transformers for Large-Scale Dynamical Prediction

Yi He, Yiming Yang, Xiaoyuan Cheng et al.

ICML 2025 • arXiv:2504.20858
9 citations
#5266

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction

Ziyu Tang, Weicai Ye, Yifan Wang et al.

ICLR 2025 • arXiv:2408.12598
9 citations
#5267

LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering

Jonas Kulhanek, Marie-Julie Rakotosaona, Fabian Manhardt et al.

NeurIPS 2025 (spotlight) • arXiv:2505.23158
9 citations
#5268

Do Large Language Models Truly Understand Geometric Structures?

Xiaofeng Wang, Yiming Wang, Wenhong Zhu et al.

ICLR 2025 • arXiv:2501.13773
9 citations
#5269

Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class

Annie D'souza, Swetha M, Sunita Sarawagi

AAAI 2025 (paper) • arXiv:2412.15657
9 citations
#5270

Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation

Gao Peng, Le Zhuo, Dongyang Liu et al.

ICLR 2025 (oral)
9 citations
#5271

Gumbel Counterfactual Generation From Language Models

Shauli Ravfogel, Anej Svete, Vésteinn Snæbjarnarson et al.

ICLR 2025 • arXiv:2411.07180
9 citations
#5272

An Interpretable N-gram Perplexity Threat Model for Large Language Model Jailbreaks

Valentyn Boreiko, Alexander Panfilov, Václav Voráček et al.

ICML 2025 • arXiv:2410.16222
9 citations
#5273

ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model

Qi Zang, Jiayi Yang, Shuang Wang et al.

AAAI 2025 (paper) • arXiv:2412.15541
9 citations
#5274

C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory Prediction

Zichen Wang, Hao Miao, Senzhang Wang et al.

AAAI 2025 (paper) • arXiv:2412.13231
9 citations
#5275

Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen

Alessandro Palma, Till Richter, Hanyi Zhang et al.

ICLR 2025 • arXiv:2407.11734
9 citations
#5276

Fast Training of Sinusoidal Neural Fields via Scaling Initialization

Taesun Yeom, Sangyoon Lee, Jaeho Lee

ICLR 2025 • arXiv:2410.04779
9 citations
#5277

TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings

Alexander Shabalin, Viacheslav Meshchaninov, Egor Chimbulatov et al.

AAAI 2025 (paper) • arXiv:2402.19097
9 citations
#5278

Sortformer: A Novel Approach for Permutation-Resolved Speaker Supervision in Speech-to-Text Systems

Taejin Park, Ivan Medennikov, Kunal Dhawan et al.

ICML 2025 • arXiv:2409.06656
9 citations
#5279

HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models

Zhifeng Xie, Hao Li, Huiming Ding et al.

AAAI 2025 (paper) • arXiv:2401.07450
9 citations
#5280

Leveraging Large Vision-Language Model as User Intent-Aware Encoder for Composed Image Retrieval

Zelong Sun, Dong Jing, Guoxing Yang et al.

AAAI 2025 (paper) • arXiv:2412.11087
8 citations
#5281

Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models

Jingcheng Deng, Zihao Wei, Liang Pang et al.

ICLR 2025 • arXiv:2405.15349
8 citations
#5282

Does learning the right latent variables necessarily improve in-context learning?

Sarthak Mittal, Eric Elmoznino, Léo Gagnon et al.

ICML 2025 • arXiv:2405.19162
8 citations
#5283

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

Yaxin Luo, Zhaoyi Li, Jiacheng Liu et al.

NeurIPS 2025 • arXiv:2505.24878
8 citations
#5284

Temporal Difference Flows

Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni et al.

ICML 2025 (oral) • arXiv:2503.09817
8 citations
#5285

CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning

Qiwei Li, Jiahuan Zhou

AAAI 2025 (paper) • arXiv:2412.08929
8 citations
#5286

STAR: Synthesis of Tailored Architectures

Armin Thomas, Rom Parnichkun, Alexander Amini et al.

ICLR 2025 • arXiv:2411.17800
8 citations
#5287

Accelerated Over-Relaxation Heavy-Ball Method: Achieving Global Accelerated Convergence with Broad Generalization

Jingrong Wei, Long Chen

ICLR 2025 • arXiv:2406.09772
8 citations
#5288

Incomplete Multi-view Deep Clustering with Data Imputation and Alignment

Jiyuan Liu, Xinwang Liu, Xinhang Wan et al.

NeurIPS 2025
8 citations
#5289

DenseGrounding: Improving Dense Language-Vision Semantics for Ego-centric 3D Visual Grounding

Henry Zheng, Hao Shi, Qihang Peng et al.

ICLR 2025 • arXiv:2505.04965
8 citations
#5290

Enhancing Large Language Model Performance with Gradient-Based Parameter Selection

Haoling Li, Xin Zhang, Xiao Liu et al.

AAAI 2025 (paper) • arXiv:2406.15330
8 citations
#5291

ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids

Hannes Stärk, Bowen Jing, Tomas Geffner et al.

ICLR 2025 • arXiv:2503.05025
8 citations
#5292

TIGeR: Unifying Text-to-Image Generation and Retrieval with Large Multimodal Models

Leigang Qu, Haochuan Li, Tan Wang et al.

ICLR 2025 • arXiv:2406.05814
8 citations
#5293

DataMan: Data Manager for Pre-training Large Language Models

Ru Peng, Kexin Yang, Yawen Zeng et al.

ICLR 2025 • arXiv:2502.19363
8 citations
#5294

A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized Certainty Equivalents

Kaiwen Wang, Dawen Liang, Nathan Kallus et al.

ICML 2025 • arXiv:2403.06323
8 citations
#5295

Differentially Private Steering for Large Language Model Alignment

Anmol Goel, Yaxi Hu, Iryna Gurevych et al.

ICLR 2025 • arXiv:2501.18532
8 citations
#5296

Unlocking the Potential of Reverse Distillation for Anomaly Detection

Xinyue Liu, Jianyuan Wang, Biao Leng et al.

AAAI 2025 (paper) • arXiv:2412.07579
8 citations
#5297

Disentangling and Integrating Relational and Sensory Information in Transformer Architectures

Awni Altabaa, John Lafferty

ICML 2025 • arXiv:2405.16727
8 citations
#5298

Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models

Bingdong Li, Zixiang Di, Yongfan Lu et al.

AAAI 2025 (paper) • arXiv:2405.08674
8 citations
#5299

Elucidating the Design Space of Multimodal Protein Language Models

Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang et al.

ICML 2025 (spotlight) • arXiv:2504.11454
8 citations
#5300

AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment

Yuqin Cao, Xiongkuo Min, Yixuan Gao et al.

ICML 2025 • arXiv:2501.18314
8 citations
#5301

HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation

Tengfei Liu, Jiapu Wang, Yongli Hu et al.

AAAI 2025 (paper) • arXiv:2412.11070
8 citations
#5302

Enhancing Federated Domain Adaptation with Multi-Domain Prototype-Based Federated Fine-Tuning

Jingyuan Zhang, Yiyang Duan, Shuaicheng Niu et al.

ICLR 2025 • arXiv:2410.07738
8 citations
#5303

Multi-Marginal Stochastic Flow Matching for High-Dimensional Snapshot Data at Irregular Time Points

Justin Lee, Behnaz Moradi-Jamei, Heman Shakeri

ICML 2025 • arXiv:2508.04351
8 citations
#5304

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Roman Abramov, Felix Steinbauer, Gjergji Kasneci

ICML 2025 • arXiv:2504.20752
8 citations
#5305

Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences

Alan Amin, Nate Gruver, Yilun Kuang et al.

ICLR 2025 • arXiv:2412.07763
8 citations
#5306

Conformal Prediction Sets Can Cause Disparate Impact

Jesse Cresswell, Bhargava Kumar, Yi Sui et al.

ICLR 2025 • arXiv:2410.01888
8 citations
#5307

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Jie Cheng, Ruixi Qiao, Ma Yingwei et al.

ICLR 2025 (oral) • arXiv:2410.00564
8 citations
#5308

Embedding Safety into RL: A New Take on Trust Region Methods

Nikola Milosevic, Johannes Müller, Nico Scherf

ICML 2025 • arXiv:2411.02957
8 citations
#5309

MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation

Trung X. Pham, Tri Ton, Chang Yoo

ICLR 2025 (oral) • arXiv:2410.02130
8 citations
#5310

Fast and Slow Streams for Online Time Series Forecasting Without Information Leakage

Ying-yee Ava Lau, Zhiwen Shao, Dit-Yan Yeung

ICLR 2025 (oral)
8 citations
#5311

VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization

Tao Liu, Ziyang Ma, Qi Chen et al.

AAAI 2025 (paper) • arXiv:2412.09892
8 citations
#5312

Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility

Martin Kuo, Jingyang Zhang, Jianyi Zhang et al.

ICLR 2025 • arXiv:2502.17591
8 citations
#5313

Data Taggants: Dataset Ownership Verification Via Harmless Targeted Data Poisoning

Wassim Bouaziz, Nicolas Usunier, El-Mahdi El-Mhamdi

ICLR 2025 • arXiv:2410.09101
8 citations
#5314

FaceMe: Robust Blind Face Restoration with Personal Identification

Siyu Liu, Zheng-Peng Duan, Jia OuYang et al.

AAAI 2025 (paper) • arXiv:2501.05177
8 citations
#5315

Does Training with Synthetic Data Truly Protect Privacy?

Yunpeng Zhao, Jie Zhang

ICLR 2025 • arXiv:2502.12976
8 citations
#5316

Modality-Specialized Synergizers for Interleaved Vision-Language Generalists

Zhiyang Xu, Minqian Liu, Ying Shen et al.

ICLR 2025 • arXiv:2407.03604
8 citations
#5317

Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding

Eric Lei, Hamed Hassani, Shirin Saeedi Bidokhti

ICLR 2025 • arXiv:2403.07320
8 citations
#5318

GravMAD: Grounded Spatial Value Maps Guided Action Diffusion for Generalized 3D Manipulation

Yangtao Chen, Zixuan Chen, Junhui Yin et al.

ICLR 2025 • arXiv:2409.20154
8 citations
#5319

Unlocking the Power of Function Vectors for Characterizing and Mitigating Catastrophic Forgetting in Continual Instruction Tuning

Gangwei Jiang, Caigao Jiang, Zhaoyi Li et al.

ICLR 2025 • arXiv:2502.11019
8 citations
#5320

Robustness of Quantum Algorithms for Nonconvex Optimization

Weiyuan Gong, Chenyi Zhang, Tongyang Li

ICLR 2025 • arXiv:2212.02548
8 citations
#5321

Accurate Link Prediction for Edge-Incomplete Graphs via PU Learning

Junghun Kim, Ka Hyun Park, Hoyoung Yoon et al.

AAAI 2025 (paper) • arXiv:2405.11911
8 citations
#5322

Efficiently Serving Large Multimodal Models Using EPD Disaggregation

Gursimran Singh, Xinglu Wang, Yifan Hu et al.

ICML 2025 • arXiv:2501.05460
8 citations
#5323

Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning

Yinglun Xu, Qi Zeng, Gagandeep Singh

ICLR 2025 • arXiv:2205.14842
8 citations
#5324

A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

Shi Fu, Yingjie Wang, Yuzhu Chen et al.

ICLR 2025 • arXiv:2502.18865
8 citations
#5325

LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh

Jing Wen, Alex Schwing, Shenlong Wang

ICLR 2025 • arXiv:2502.09617
8 citations
#5326

Visual Attention Never Fades: Selective Progressive Attention ReCalibration for Detailed Image Captioning in Multimodal Large Language Models

Mingi Jung, Saehyung Lee, Eunji Kim et al.

ICML 2025 • arXiv:2502.01419
8 citations
#5327

Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation

Xie Tianyidan, Rui Ma, Qian Wang et al.

AAAI 2025 (paper) • arXiv:2404.18598
8 citations
#5328

Free Hunch: Denoiser Covariance Estimation for Diffusion Models Without Extra Costs

Severi Rissanen, Markus Heinonen, Arno Solin

ICLR 2025 • arXiv:2410.11149
8 citations
#5329

Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets

Haoran He, Can Chang, Huazhe Xu et al.

ICLR 2025 • arXiv:2406.01150
8 citations
#5330

Injecting Universal Jailbreak Backdoors into LLMs in Minutes

Zhuowei Chen, Qiannan Zhang, Shichao Pei

ICLR 2025 • arXiv:2502.10438
8 citations
#5331

Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs

Yaniv Nikankin, Dana Arad, Yossi Gandelsman et al.

NeurIPS 2025 • arXiv:2506.09047
8 citations
#5332

Growth Inhibitors for Suppressing Inappropriate Image Concepts in Diffusion Models

Die Chen, Zhiwen Li, Mingyuan Fan et al.

ICLR 2025 • arXiv:2408.01014
8 citations
#5333

From Kernels to Features: A Multi-Scale Adaptive Theory of Feature Learning

Noa Rubin, Kirsten Fischer, Javed Lindner et al.

ICML 2025 • arXiv:2502.03210
8 citations
#5334

Accessing Vision Foundation Models via ImageNet-1K

Yitian Zhang, Xu Ma, Yue Bai et al.

ICLR 2025 • arXiv:2407.10366
8 citations
#5335

The Canary’s Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text

Matthieu Meeus, Lukas Wutschitz, Santiago Zanella-Beguelin et al.

ICML 2025 • arXiv:2502.14921
8 citations
#5336

Bayesian Optimization via Continual Variational Last Layer Training

Paul Brunzema, Mikkel Jordahn, John Willes et al.

ICLR 2025 • arXiv:2412.09477
8 citations
#5337

Episodic Novelty Through Temporal Distance

Yuhua Jiang, Qihan Liu, Yiqin Yang et al.

ICLR 2025 (oral) • arXiv:2501.15418
8 citations
#5338

A Two-Stage Learning-to-Defer Approach for Multi-Task Learning

Yannis Montreuil, Shu Heng Yeo, Axel Carlier et al.

ICML 2025 • arXiv:2410.15729
8 citations
#5339

Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Chenhui Hu, Pengfei Cao, Yubo Chen et al.

AAAI 2025 (paper) • arXiv:2408.07413
8 citations
#5340

Towards Bridging Generalization and Expressivity of Graph Neural Networks

Shouheng Li, Floris Geerts, Dongwoo Kim et al.

ICLR 2025 • arXiv:2410.10051
8 citations
#5341

CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation

Jie Liu, Pan Zhou, Yingjun Du et al.

ICLR 2025 • arXiv:2411.04679
8 citations
#5342

Learning Evolving Tools for Large Language Models

Guoxin Chen, Zhong Zhang, Xin Cong et al.

ICLR 2025 • arXiv:2410.06617
8 citations
#5343

Fair Submodular Cover

Wenjing Chen, Shuo Xing, Samson Zhou et al.

ICLR 2025 • arXiv:2407.04804
8 citations
#5344

Preference Diffusion for Recommendation

Shuo Liu, An Zhang, Guoqing Hu et al.

ICLR 2025 • arXiv:2410.13117
8 citations
#5345

Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis

Chengzhi Liu, Zile Huang, Zhe Chen et al.

AAAI 2025 (paper) • arXiv:2502.11724
8 citations
#5346

Principled Algorithms for Optimizing Generalized Metrics in Binary Classification

Anqi Mao, Mehryar Mohri, Yutao Zhong

ICML 2025 • arXiv:2512.23133
8 citations
#5347

Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models

Minh-Tung Luu, Younghwan Lee, Donghoon Lee et al.

ICML 2025 • arXiv:2506.12822
8 citations
#5348

RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection

Jingtong Yue, Zhiwei Lin, Xin Lin et al.

ICLR 2025 • arXiv:2502.13071
8 citations
#5349

Better than Your Teacher: LLM Agents that learn from Privileged AI Feedback

Sanjiban Choudhury, Paloma Sodhi

ICLR 2025 • arXiv:2410.05434
8 citations
#5350

Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments

Marharyta Domnich, Julius Välja, Rasmus Moorits Veski et al.

AAAI 2025 (paper) • arXiv:2410.21131
8 citations
#5351

De-mark: Watermark Removal in Large Language Models

Ruibo Chen, Yihan Wu, Junfeng Guo et al.

ICML 2025 • arXiv:2410.13808
8 citations
#5352

Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation

Jingyu Liu, Beidi Chen, Ce Zhang

ICML 2025 • arXiv:2502.02789
8 citations
#5353

Balancing the Scales: A Theoretical and Algorithmic Framework for Learning from Imbalanced Data

Corinna Cortes, Anqi Mao, Mehryar Mohri et al.

ICML 2025 • arXiv:2502.10381
8 citations
#5354

Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images

Yihui Li, Chengxin Lv, Hongyu Yang et al.

AAAI 2025 (paper) • arXiv:2501.14231
8 citations
#5355

EgoPrivacy: What Your First-Person Camera Says About You?

Yijiang Li, Genpei Zhang, Jiacheng Cheng et al.

ICML 2025 • arXiv:2506.12258
8 citations
#5356

Offline-to-Online Hyperparameter Transfer for Stochastic Bandits

Dravyansh Sharma, Arun Suggala

AAAI 2025 (paper) • arXiv:2501.02926
8 citations
#5357

Unsupervised Audio-Visual Segmentation with Modality Alignment

Swapnil Bhosale, Haosen Yang, Diptesh Kanojia et al.

AAAI 2025 (paper) • arXiv:2403.14203
8 citations
#5358

Towards Understanding Why Label Smoothing Degrades Selective Classification and How to Fix It

Guoxuan Xia, Olivier Laurent, Gianni Franchi et al.

ICLR 2025 • arXiv:2403.14715
8 citations
#5359

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Sreyan Ghosh, Sonal Kumar, Zhifeng Kong et al.

ICLR 2025 • arXiv:2410.02056
8 citations
#5360

Meta-Black-Box-Optimization through Offline Q-function Learning

Zeyuan Ma, Zhiguang Cao, Zhou Jiang et al.

ICML 2025 • arXiv:2505.02010
8 citations
#5361

Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP

Yating Yu, Congqi Cao, Yueran Zhang et al.

AAAI 2025 (paper) • arXiv:2412.09895
8 citations
#5362

MergeNet: Knowledge Migration Across Heterogeneous Models, Tasks, and Modalities

Kunxi Li, Tianyu Zhan, Kairui Fu et al.

AAAI 2025 (paper) • arXiv:2404.13322
8 citations
#5363

OpenViewer: Openness-Aware Multi-View Learning

Shide Du, Zihan Fang, Yanchao Tan et al.

AAAI 2025 (paper) • arXiv:2412.12596
8 citations
#5364

World Knowledge-Enhanced Reasoning Using Instruction-Guided Interactor in Autonomous Driving

Mingliang Zhai, Cheng Li, Zengyuan Guo et al.

AAAI 2025 (paper) • arXiv:2412.06324
8 citations
#5365

Understanding High-Dimensional Bayesian Optimization

Leonard Papenmeier, Matthias Poloczek, Luigi Nardi

ICML 2025 • arXiv:2502.09198
8 citations
#5366

Decoding Game: On Minimax Optimality of Heuristic Text Generation Strategies

Sijin Chen, Omar Hagrass, Jason Klusowski

ICLR 2025 • arXiv:2410.03968
8 citations
#5367

Compositional simulation-based inference for time series

Manuel Gloeckler, Shoji Toyota, Kenji Fukumizu et al.

ICLR 2025 • arXiv:2411.02728
8 citations
#5368

Outsourced Diffusion Sampling: Efficient Posterior Inference in Latent Spaces of Generative Models

Siddarth Venkatraman, Mohsin Hasan, Minsu Kim et al.

ICML 2025 • arXiv:2502.06999
8 citations
#5369

Efficient stagewise pretraining via progressive subnetworks

Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu et al.

ICLR 2025 • arXiv:2402.05913
8 citations
#5370

GraphCL: Graph-based Clustering for Semi-Supervised Medical Image Segmentation

Mengzhu Wang, Houcheng Su, Jiao Li et al.

ICML 2025 • arXiv:2411.13147
8 citations
#5371

Stable Mean Teacher for Semi-supervised Video Action Detection

Akash Kumar, Sirshapan Mitra, Yogesh Singh Rawat

AAAI 2025 (paper) • arXiv:2412.07072
8 citations
#5372

The Belief State Transformer

Edward Hu, Kwangjun Ahn, Qinghua Liu et al.

ICLR 2025 • arXiv:2410.23506
8 citations
#5373

Do We Need to Verify Step by Step? Rethinking Process Supervision from a Theoretical Perspective

Zeyu Jia, Alexander Rakhlin, Tengyang Xie

ICML 2025 • arXiv:2502.10581
8 citations
#5374

Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic

Ruochen Jin, Bojian Hou, Jiancong Xiao et al.

ICLR 2025 • arXiv:2407.07089
8 citations
#5375

Topograph: An Efficient Graph-Based Framework for Strictly Topology Preserving Image Segmentation

Laurin Lux, Alexander H Berger, Alexander Weers et al.

ICLR 2025 • arXiv:2411.03228
8 citations
#5376

ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL

Yang Qin, Chao Chen, Zhihang Fu et al.

ICLR 2025 • arXiv:2412.10138
8 citations
#5377

Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension

Yaxian Wang, Henghui Ding, Shuting He et al.

AAAI 2025 (paper) • arXiv:2501.01416
8 citations
#5378

SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning

Minjun Kim, Jongjin Kim, U Kang

ICLR 2025
8 citations
#5379

Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction

Quan Zhang, Yuxin Qi, Xi Tang et al.

AAAI 2025 (paper) • arXiv:2501.11124
8 citations
#5380

Video Action Differencing

James Burgess, Xiaohan Wang, Yuhui Zhang et al.

ICLR 2025 • arXiv:2503.07860
8 citations
#5381

CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation

Matan Rusanovsky, Or Hirschorn, Shai Avidan

ICLR 2025 • arXiv:2406.00384
8 citations
#5382

SplatFormer: Point Transformer for Robust 3D Gaussian Splatting

Yutong Chen, Marko Mihajlovic, Xiyi Chen et al.

ICLR 2025 • arXiv:2411.06390
8 citations
#5383

DCBM: Data-Efficient Visual Concept Bottleneck Models

Katharina Prasse, Patrick Knab, Sascha Marton et al.

ICML 2025 • arXiv:2412.11576
8 citations
#5384

An All-Atom Generative Model for Designing Protein Complexes

Ruizhe Chen, Dongyu Xue, Xiangxin Zhou et al.

ICML 2025 • arXiv:2504.13075
8 citations
#5385

Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence

Wenbo Huang, Jinghui Zhang, Guang Li et al.

AAAI 2025 (paper) • arXiv:2412.07481
8 citations
#5386

Gradient descent with generalized Newton’s method

Zhiqi Bu, Shiyun Xu

ICLR 2025 • arXiv:2407.02772
8 citations
#5387

Glauber Generative Model: Discrete Diffusion Models via Binary Classification

Harshit Varma, Dheeraj Nagaraj, Karthikeyan Shanmugam

ICLR 2025 • arXiv:2405.17035
8 citations
#5388

Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry

Zhaoxing Zhang, Junda Cheng, Gangwei Xu et al.

AAAI 2025 (paper) • arXiv:2412.16923
8 citations
#5389

Neural Context Flows for Meta-Learning of Dynamical Systems

Roussel Desmond Nzoyem, David Barton, Tom Deakin

ICLR 2025 • arXiv:2405.02154
8 citations
#5390

From Attention to Activation: Unraveling the Enigmas of Large Language Models

Prannay Kaul, Chengcheng Ma, Ismail Elezi et al.

ICLR 2025 • arXiv:2410.17174
8 citations
#5391

Evaluating LLM Reasoning in the Operations Research Domain with ORQA

Mahdi Mostajabdaveh, Timothy Tin Long Yu, Samarendra Chandan Bindu Dash et al.

AAAI 2025 (paper) • arXiv:2412.17874
8 citations
#5392

Boosting Fine-Grained Visual Anomaly Detection with Coarse-Knowledge-Aware Adversarial Learning

Qingqing Fang, Qinliang Su, Wenxi Lv et al.

AAAI 2025 (paper) • arXiv:2412.12850
8 citations
#5393

How many samples are needed to train a deep neural network?

Pegah Golestaneh, Mahsa Taheri, Johannes Lederer

ICLR 2025 • arXiv:2405.16696
8 citations
#5394

Position: The Future of Bayesian Prediction Is Prior-Fitted

Samuel Gabriel Müller, Arik Reuter, Noah Hollmann et al.

ICML 2025 • arXiv:2505.23947
8 citations
#5395

A transfer learning framework for weak to strong generalization

Seamus Somerstep, Felipe Maia Polo, Moulinath Banerjee et al.

ICLR 2025
8 citations
#5396

DuMo: Dual Encoder Modulation Network for Precise Concept Erasure

Feng Han, Kai Chen, Chao Gong et al.

AAAI 2025 (paper) • arXiv:2501.01125
8 citations
#5397

Learning Chaos In A Linear Way

Xiaoyuan Cheng, Yi He, Yiming Yang et al.

ICLR 2025 • arXiv:2503.14702
8 citations
#5398

Interactive Speculative Planning: Enhance Agent Efficiency through Co-design of System and User Interface

Wenyue Hua, Mengting Wan, Jagannath Vadrevu et al.

ICLR 2025 • arXiv:2410.00079
8 citations
#5399

VIoTGPT: Learning to Schedule Vision Tools Towards Intelligent Video Internet of Things

Yaoyao Zhong, Mengshi Qi, Rui Wang et al.

AAAI 2025 (paper)
8 citations
#5400

DISCO: learning to DISCover an evolution Operator for multi-physics-agnostic prediction

Rudy Morel, Jiequn Han, Edouard Oyallon

ICML 2025 (oral) • arXiv:2504.19496
8 citations