Most Cited 2025 "reinforcement learning exploration" Papers

22,274 papers found • Page 27 of 112

#5201

Anchor Learning with Potential Cluster Constraints for Multi-view Clustering

Yawei Chen, Huibing Wang, Jinjia Peng et al.

AAAI 2025paperarXiv:2412.16519
5
citations
#5202

MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs

Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi et al.

ICML 2025arXiv:2502.10436
5
citations
#5203

Tree-Sliced Wasserstein Distance with Nonlinear Projection

Thanh Tran, Viet Hoang Tran, Thanh Chu et al.

ICML 2025arXiv:2505.00968
5
citations
#5204

Finding Shared Decodable Concepts and their Negations in the Brain

Cory Efird, Alex Murphy, Joel Zylberberg et al.

ICLR 2025arXiv:2405.17663
5
citations
#5205

Generative Medical Segmentation

Jiayu Huo, Xi Ouyang, Sébastien Ourselin et al.

AAAI 2025paperarXiv:2403.18198
5
citations
#5206

Noise-Resilient Symbolic Regression with Dynamic Gating Reinforcement Learning

Chenglu Sun, Shuo Shen, Wenzhi Tao et al.

AAAI 2025paperarXiv:2501.01085
5
citations
#5207

E(3)-equivariant models cannot learn chirality: Field-based molecular generation

Alexandru Dumitrescu, Dani Korpela, Markus Heinonen et al.

ICLR 2025arXiv:2402.15864
5
citations
#5208

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Linh Tran, Wei Sun, Stacy Patterson et al.

ICLR 2025arXiv:2501.13904
5
citations
#5209

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

Tom A. Lamb, Adam Davies, Alasdair J Paren et al.

ICML 2025arXiv:2410.22944
4
citations
#5210

PABBO: Preferential Amortized Black-Box Optimization

Xinyu Zhang, Daolang Huang, Samuel Kaski et al.

ICLR 2025arXiv:2503.00924
4
citations
#5211

ALLVB: All-in-One Long Video Understanding Benchmark

Xichen Tan, Yuanjing Luo, Yunfan Ye et al.

AAAI 2025paperarXiv:2503.07298
4
citations
#5212

A Training-free Synthetic Data Selection Method for Semantic Segmentation

Hao Tang, Siyue Yu, Jian Pang et al.

AAAI 2025paperarXiv:2501.15201
4
citations
#5213

CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step

Zheyuan Liu, Munan Ning, Qihui Zhang et al.

NEURIPS 2025arXiv:2507.04451
4
citations
#5214

Flow-based Variational Mutual Information: Fast and Flexible Approximations

Caleb Dahlke, Jason Pacheco

ICLR 2025
4
citations
#5215

Debiased Distillation for Consistency Regularization

Lu Wang, Liuchi Xu, Xiong Yang et al.

AAAI 2025paper
4
citations
#5216

Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues

Mingshen Wang, Zhao Zhang, Feng Li et al.

AAAI 2025paperarXiv:2409.14330
4
citations
#5217

GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers

Guang Liang, Xinyao Liu, Jianxin Wu

NEURIPS 2025arXiv:2506.11784
4
citations
#5218

TokenMatcher: Diverse Tokens Matching for Unsupervised Visible-Infrared Person Re-Identification

Xiao Wang, Lekai Liu, Bin Yang et al.

AAAI 2025paper
4
citations
#5219

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Xingrui Wang, Xin Li, Yaosi Hu et al.

AAAI 2025paperarXiv:2412.10275
4
citations
#5220

GCD: Advancing Vision-Language Models for Incremental Object Detection via Global Alignment and Correspondence Distillation

Xu Wang, Zilei Wang, Zihan Lin

AAAI 2025paper
4
citations
#5221

Enhancing Fine-Grained Vision-Language Pretraining with Negative Augmented Samples

Yeyuan Wang, Dehong Gao, Lei Yi et al.

AAAI 2025paperarXiv:2412.10029
4
citations
#5222

Efficient Training of Neural Stochastic Differential Equations by Matching Finite Dimensional Distributions

Jianxin Zhang, Josh Viktorov, Doosan Jung et al.

ICLR 2025arXiv:2410.03973
4
citations
#5223

ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors

Tianxin Huang, Zhiwen Yan, Yuyang Zhao et al.

ICLR 2025arXiv:2404.06814
4
citations
#5224

A Unified Loss for Handling Inter-Class and Intra-Class Imbalance in Medical Image Segmentation

Feilong Xu, Feiyang Yang, Xiongfei Li et al.

AAAI 2025paper
4
citations
#5225

SBSC: Step-by-Step Coding for Improving Mathematical Olympiad Performance

Kunal Singh, Ankan Biswas, Sayandeep Bhowmick et al.

ICLR 2025arXiv:2502.16666
4
citations
#5226

Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol

Pai Liu, Lingfeng Zhao, Shivangi Agarwal et al.

NEURIPS 2025arXiv:2502.08021
4
citations
#5227

AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption

Joonsung Jeon, Woo Jae Kim, Suhyeon Ha et al.

ICLR 2025arXiv:2503.10081
4
citations
#5228

RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype Learning

Kanghoon Yoon, Kibum Kim, Jaehyeong Jeon et al.

AAAI 2025paperarXiv:2412.12788
4
citations
#5229

Action-Agnostic Point-Level Supervision for Temporal Action Detection

Shuhei M. Yoshida, Takashi Shibata, Makoto Terao et al.

AAAI 2025paperarXiv:2412.21205
4
citations
#5230

Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-training of Deep Networks

Siddharth Joshi, Jiayi Ni, Baharan Mirzasoleiman

ICLR 2025arXiv:2410.02116
4
citations
#5231

Transformers Handle Endogeneity in In-Context Linear Regression

Haodong Liang, Krishna Balasubramanian, Lifeng Lai

ICLR 2025arXiv:2410.01265
4
citations
#5232

Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding

Wenbo Zhang, Lu Zhang, Ping Hu et al.

AAAI 2025paperarXiv:2411.19551
4
citations
#5233

FedRTS: Federated Robust Pruning via Combinatorial Thompson Sampling

Hong Huang, Jinhai Yang, Yuan Chen et al.

NEURIPS 2025arXiv:2501.19122
4
citations
#5234

Microcanonical Langevin Ensembles: Advancing the Sampling of Bayesian Neural Networks

Emanuel Sommer, Jakob Robnik, Giorgi Nozadze et al.

ICLR 2025arXiv:2502.06335
4
citations
#5235

Cross-PCR: A Robust Cross-Source Point Cloud Registration Framework

Guiyu Zhao, Zhentao Guo, Zewen Du et al.

AAAI 2025paperarXiv:2412.18873
4
citations
#5236

Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization

kaiyuan Li, Xiaoyue Chen, Chen Gao et al.

NEURIPS 2025arXiv:2505.22038
4
citations
#5237

Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration

Yunshuai Zhou, Junbo Qiao, Jincheng Liao et al.

AAAI 2025paperarXiv:2412.08939
4
citations
#5238

Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation

Rong Tang, Lizhen Lin, Yun Yang

ICLR 2025arXiv:2409.20124
4
citations
#5239

Improving the Lower Bound in Branch-and-Bound Algorithms for MaxSAT

Shuolin Li, Chu-Min Li, Jordi Coll et al.

AAAI 2025paper
4
citations
#5240

Multi-Label Test-Time Adaptation with Bound Entropy Minimization

Xiangyu Wu, Feng Yu, Yang Yang et al.

ICLR 2025arXiv:2502.03777
4
citations
#5241

Neural Reasoning for Sure Through Constructing Explainable Models

Tiansi Dong, Mateja Jamnik, Pietro Liò

AAAI 2025paper
4
citations
#5242

Beyond Graph Convolution: Multimodal Recommendation with Topology-aware MLPs

Junjie Huang, Jiarui Qin, Yong Yu et al.

AAAI 2025paperarXiv:2412.11747
4
citations
#5243

HI-DR: Exploiting Health Status-Aware Attention and an EHR Graph+ for Effective Medication Recommendation

Taeri Kim, Jiho Heo, Hyunjoon Kim et al.

AAAI 2025paper
4
citations
#5244

LLaMaFlex: Many-in-one LLMs via Generalized Pruning and Weight Sharing

Ruisi Cai, Saurav Muralidharan, Hongxu Yin et al.

ICLR 2025
4
citations
#5245

COPER: Correlation-based Permutations for Multi-View Clustering

Ran Eisenberg, Jonathan Svirsky, Ofir Lindenbaum

ICLR 2025
4
citations
#5246

UniFORM: Towards Unified Framework for Anomaly Detection on Graphs

Chuancheng Song, Xixun Lin, Hanyang Shen et al.

AAAI 2025paper
4
citations
#5247

SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training

Nie Lin, Takehiko Ohkawa, Yifei Huang et al.

ICLR 2025arXiv:2502.15251
4
citations
#5248

Pioneer: Physics-informed Riemannian Graph ODE for Entropy-increasing Dynamics

Li Sun, Ziheng Zhang, Zixi Wang et al.

AAAI 2025paperarXiv:2502.03236
4
citations
#5249

A Unified Framework for Forward and Inverse Problems in Subsurface Imaging using Latent Space Translations

Naveen Gupta, Medha Sawhney, Arka Daw et al.

ICLR 2025arXiv:2410.11247
4
citations
#5250

Advantage-Guided Distillation for Preference Alignment in Small Language Models

Shiping Gao, Fanqi Wan, Jiajian Guo et al.

ICLR 2025arXiv:2502.17927
4
citations
#5251

SeniorTalk: A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors

chen yang, Hui Wang, Shiyao Wang et al.

NEURIPS 2025arXiv:2503.16578
4
citations
#5252

Disentangling Tabular Data Towards Better One-Class Anomaly Detection

Jianan Ye, Zhaorui Tan, Yijie Hu et al.

AAAI 2025paperarXiv:2411.07574
4
citations
#5253

Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space

Xiaoyan Yu, Yifan Wei, Shuaishuai Zhou et al.

AAAI 2025paperarXiv:2412.10712
4
citations
#5254

L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression

Junxuan Zhang, Zhengxue Cheng, Yan Zhao et al.

AAAI 2025paperarXiv:2412.16642
4
citations
#5255

ExcluIR: Exclusionary Neural Information Retrieval

Wenhao Zhang, Mengqi Zhang, Shiguang Wu et al.

AAAI 2025paperarXiv:2404.17288
4
citations
#5256

Tokenphormer: Structure-aware Multi-token Graph Transformer for Node Classification

Zijie Zhou, Zhaoqi Lu, Xuekai Wei et al.

AAAI 2025paperarXiv:2412.15302
4
citations
#5257

Towards Resilient Safety-driven Unlearning for Diffusion Models against Downstream Fine-tuning

Boheng Li, Renjie Gu, Junjie Wang et al.

NEURIPS 2025arXiv:2507.16302
4
citations
#5258

EF2X Exists for Four Agents

Arash Ashuri, Vasilis Gkatzelis, Alkmini Sgouritsa

AAAI 2025paperarXiv:2412.00254
4
citations
#5259

The Value of Recall in Extensive-Form Games

Ratip Emin Berker, Emanuel Tewolde, Ioannis Anagnostides et al.

AAAI 2025paperarXiv:2412.19659
4
citations
#5260

Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning

Wesley Suttle, Aamodh Suresh, Carlos Nieto-Granda

ICLR 2025oralarXiv:2502.04141
4
citations
#5261

Every Bit Helps: Achieving the Optimal Distortion with a Few Queries

Soroush Ebadian, Nisarg Shah

AAAI 2025paper
4
citations
#5262

Improved Maximin Share Approximations for Chores by Bin Packing

Jugal Garg, Xin Huang, Erel Segal-Halevi

AAAI 2025paperarXiv:2411.04391
4
citations
#5263

Continuous Diffusion Model for Language Modeling

Jaehyeong Jo, Sung Ju Hwang

NEURIPS 2025arXiv:2502.11564
4
citations
#5264

Optimal Bounds for Dissatisfaction in Perpetual Voting

Alexander Kozachinskiy, Alexander Shen, Tomasz Steifer

AAAI 2025paperarXiv:2501.01969
4
citations
#5265

ConSense: Continually Sensing Human Activity with WiFi via Growing and Picking

Rong Li, Tao Deng, Siwei Feng et al.

AAAI 2025paperarXiv:2502.17483
4
citations
#5266

Nonasymptotic Analysis of Stochastic Gradient Descent with the Richardson–Romberg Extrapolation

Marina Sheshukova, Denis Belomestny, Alain Oliviero Durmus et al.

ICLR 2025arXiv:2410.05106
4
citations
#5267

Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner

Aizierjiang Aiersilan

AAAI 2025paperarXiv:2412.18086
4
citations
#5268

MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration

Yishuai Cai, Xinglin Chen, Zhongxuan Cai et al.

AAAI 2025paperarXiv:2502.18072
4
citations
#5269

How to Train Your LLM Web Agent: A Statistical Diagnosis

Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza et al.

NEURIPS 2025arXiv:2507.04103
4
citations
#5270

Towards Audio-Visual Navigation in Noisy Environments: A Large-Scale Benchmark Dataset and an Architecture Considering Multiple Sound-Sources

Zhanbo Shi, Lin Zhang, Linfei Li et al.

AAAI 2025paper
4
citations
#5271

NaviFormer: A Spatio-Temporal Context-Aware Transformer for Object Navigation

Wei Xie, Haobo Jiang, Yun Zhu et al.

AAAI 2025paper
4
citations
#5272

Learning to engineer protein flexibility

Petr Kouba, Joan Planas-Iglesias, Jiri Damborsky et al.

ICLR 2025arXiv:2412.18275
4
citations
#5273

A Practical Approach to Causal Inference over Time

Martina Cinquini, Isacco Beretta, Salvatore Ruggieri et al.

AAAI 2025paperarXiv:2410.10502
4
citations
#5274

Nesterov acceleration in benignly non-convex landscapes

Kanan Gupta, Stephan Wojtowytsch

ICLR 2025arXiv:2410.08395
4
citations
#5275

MamKO: Mamba-based Koopman operator for modeling and predictive control

Zhaoyang Li, Minghao Han, Xunyuan Yin

ICLR 2025
4
citations
#5276

Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks

Zi Wang, Divyam Anshumaan, Ashish Hooda et al.

ICLR 2025arXiv:2410.04234
4
citations
#5277

Prot2Text-V2: Protein Function Prediction with Multimodal Contrastive Alignment

Xiao Fei, Michail Chatzianastasis, Sarah Carneiro et al.

NEURIPS 2025arXiv:2505.11194
4
citations
#5278

Active Fourier Auditor for Estimating Distributional Properties of ML Models

Ayoub Ajarra, Bishwamittra Ghosh, Debabrota Basu

AAAI 2025paperarXiv:2410.08111
4
citations
#5279

Efficient and Accurate Explanation Estimation with Distribution Compression

Hubert Baniecki, Giuseppe Casalicchio, Bernd Bischl et al.

ICLR 2025arXiv:2406.18334
4
citations
#5280

When Witnesses Defend: A Witness Graph Topological Layer for Adversarial Graph Learning

Naheed Anjum Arafat, Debabrota Basu, Yulia Gel et al.

AAAI 2025paperarXiv:2409.14161
4
citations
#5281

Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement

Chenxu Wu, Qingpeng Kong, Zihang Jiang et al.

ICLR 2025oralarXiv:2501.13514
4
citations
#5282

EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing Models

Yupeng Chen, Penglin Chen, Xiaoyu Zhang et al.

AAAI 2025paperarXiv:2409.09668
4
citations
#5283

Unveiling the Threat of Fraud Gangs to Graph Neural Networks: Multi-Target Graph Injection Attacks Against GNN-Based Fraud Detectors

Jinhyeok Choi, Heehyeon Kim, Joyce Jiyoung Whang

AAAI 2025paperarXiv:2412.18370
4
citations
#5284

A Similarity Paradigm Through Textual Regularization Without Forgetting

Fangming Cui, Jan Fong, Rongfei Zeng et al.

AAAI 2025paperarXiv:2502.14376
4
citations
#5285

Learning to Generate Gradients for Test-Time Adaptation via Test-Time Training Layers

Qi Deng, Shuaicheng Niu, Ronghao Zhang et al.

AAAI 2025paperarXiv:2412.16901
4
citations
#5286

Bayesian Low-Rank Learning (Bella): A Practical Approach to Bayesian Neural Networks

Bao Gia Doan, Afshar Shamsi, Xiao-Yu Guo et al.

AAAI 2025paperarXiv:2407.20891
4
citations
#5287

Dataset Ownership Verification in Contrastive Pre-trained Models

Yuechen Xie, Jie Song, Mengqi Xue et al.

ICLR 2025arXiv:2502.07276
4
citations
#5288

HyPoGen: Optimization-Biased Hypernetworks for Generalizable Policy Generation

Hanxiang Ren, Li Sun, Xulong Wang et al.

ICLR 2025
4
citations
#5289

SADBA: Self-Adaptive Distributed Backdoor Attack Against Federated Learning

Jun Feng, Yuzhe Lai, Hong Sun et al.

AAAI 2025paper
4
citations
#5290

Universality of Real Minimal Complexity Reservoir

Robert Simon Fong, Boyu Li, Peter Tino

AAAI 2025paperarXiv:2408.08071
4
citations
#5291

DivGCL: A Graph Contrastive Learning Model for Diverse Recommendation

Wenwen Gong, Yangliao Geng, Dan Zhang et al.

AAAI 2025paper
4
citations
#5292

RegMixMatch: Optimizing Mixup Utilization in Semi-Supervised Learning

Haorong Han, Jidong Yuan, Chixuan Wei et al.

AAAI 2025paperarXiv:2412.10741
4
citations
#5293

SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Song Duong, Florian Le Bronnec, Alexandre Allauzen et al.

ICLR 2025arXiv:2502.13674
4
citations
#5294

MeshMask: Physics-Based Simulations with Masked Graph Neural Networks

Paul Garnier, Vincent Lannelongue, Jonathan Viquerat et al.

ICLR 2025arXiv:2501.08738
4
citations
#5295

NextBestPath: Efficient 3D Mapping of Unseen Environments

Shiyao Li, Antoine Guedon, Clémentin Boittiaux et al.

ICLR 2025arXiv:2502.05378
4
citations
#5296

DeepSN: A Sheaf Neural Framework for Influence Maximization

Asela Hevapathige, Qing Wang, Ahad N. Zehmakan

AAAI 2025paperarXiv:2412.12416
4
citations
#5297

Generalization Analysis for Deep Contrastive Representation Learning

Nong Minh Hieu, Antoine Ledent, Yunwen Lei et al.

AAAI 2025paperarXiv:2412.12014
4
citations
#5298

MotionDreamer: One-to-Many Motion Synthesis with Localized Generative Masked Transformer

Yilin Wang, chuan guo, Yuxuan Mu et al.

ICLR 2025oralarXiv:2504.08959
4
citations
#5299

Meta-Dynamical State Space Models for Integrative Neural Data Analysis

Ayesha Vermani, Josue Nassar, Hyungju Jeon et al.

ICLR 2025arXiv:2410.05454
4
citations
#5300

Memory Mosaics at scale

Jianyu Zhang, Leon Bottou

NEURIPS 2025oralarXiv:2507.03285
4
citations
#5301

Backdoor Attack on Propagation-based Rumor Detectors

Di Jin, Yujun Zhang, Bingdao Feng et al.

AAAI 2025paper
4
citations
#5302

WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Siyu Zhou, Tianyi Zhou, Yijun Yang et al.

NEURIPS 2025
4
citations
#5303

Semi-Supervised Online Cross-Modal Hashing

Xiao Kang, Xingbo Liu, Xuening Zhang et al.

AAAI 2025paper
4
citations
#5304

Quantitative Approximation for Neural Operators in Nonlinear Parabolic Equations

Takashi Furuya, Koichi Taniguchi, Satoshi Okuda

ICLR 2025arXiv:2410.02151
4
citations
#5305

Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control

Bruce D. Lee, Leonardo F. Toso, Thomas T. Zhang et al.

AAAI 2025paperarXiv:2407.05781
4
citations
#5306

On the Learn-to-Optimize Capabilities of Transformers in In-Context Sparse Recovery

Renpu Liu, Ruida Zhou, Cong Shen et al.

ICLR 2025arXiv:2410.13981
4
citations
#5307

STAFF: Speculative Coreset Selection for Task-Specific Fine-tuning

Xiaoyu Zhang, Juan Zhai, Shiqing Ma et al.

ICLR 2025
4
citations
#5308

AgentMixer: Multi-Agent Correlated Policy Factorization

Zhiyuan Li, Wenshuai Zhao, Lijun Wu et al.

AAAI 2025paperarXiv:2401.08728
4
citations
#5309

Towards Scalable and Deep Graph Neural Networks via Noise Masking

Yuxuan Liang, Wentao Zhang, Zeang Sheng et al.

AAAI 2025paperarXiv:2412.14602
4
citations
#5310

ReX: A Framework for Incorporating Temporal Information in Model-Agnostic Local Explanation Techniques

Junhao Liu, Xin Zhang

AAAI 2025paperarXiv:2209.03798
4
citations
#5311

AeroGTO: An Efficient Graph-Transformer Operator for Learning Large-Scale Aerodynamics of 3D Vehicle Geometries

Pengwei Liu, Pengkai Wang, Xingyu Ren et al.

AAAI 2025paper
4
citations
#5312

3D-MolT5: Leveraging Discrete Structural Information for Molecule-Text Modeling

Qizhi Pei, Rui Yan, Kaiyuan Gao et al.

ICLR 2025arXiv:2406.05797
4
citations
#5313

Visual Reinforcement Learning with Residual Action

Zhenxian Liu, Peixi Peng, Yonghong Tian

AAAI 2025paper
4
citations
#5314

Epsilon: Exploring Comprehensive Visual-Semantic Projection for Multi-Label Zero-Shot Learning

Ziming Liu, Jingcai Guo, Song Guo et al.

AAAI 2025paperarXiv:2408.12253
4
citations
#5315

Erasing Concept Combination from Text-to-Image Diffusion Model

hongyi nie, Quanming Yao, Yang Liu et al.

ICLR 2025
4
citations
#5316

AGMixup: Adaptive Graph Mixup for Semi-supervised Node Classification

Weigang Lu, Ziyu Guan, Wei Zhao et al.

AAAI 2025paperarXiv:2412.08144
4
citations
#5317

Unlocking Global Optimality in Bilevel Optimization: A Pilot Study

Quan Xiao, Tianyi Chen

ICLR 2025arXiv:2408.16087
4
citations
#5318

Tackling Data Corruption in Offline Reinforcement Learning via Sequence Modeling

Jiawei Xu, Rui Yang, Shuang Qiu et al.

ICLR 2025oralarXiv:2407.04285
4
citations
#5319

Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model

Huan Ma, Yan Zhu, Changqing Zhang et al.

AAAI 2025paperarXiv:2403.00376
4
citations
#5320

Attribute-based Visual Reprogramming for Vision-Language Models

Chengyi Cai, Zesheng Ye, Lei Feng et al.

ICLR 2025arXiv:2501.13982
4
citations
#5321

Benchmarking and Understanding Compositional Relational Reasoning of LLMs

Ruikang Ni, Da Xiao, Qingye Meng et al.

AAAI 2025paperarXiv:2412.12841
4
citations
#5322

DualDynamics: Synergizing Implicit and Explicit Methods for Robust Irregular Time Series Analysis

YongKyung Oh, Dong-Young Lim, Sungil Kim

AAAI 2025paperarXiv:2401.04979
4
citations
#5323

Efficient Few-Shot Neural Architecture Search by Counting the Number of Nonlinear Functions

Youngmin Oh, Hyunju Lee, Bumsub Ham

AAAI 2025paperarXiv:2412.14678
4
citations
#5324

SOLA-GCL: Subgraph-Oriented Learnable Augmentation Method for Graph Contrastive Learning

Tianhao Peng, Xuhong Li, Haitao Yuan et al.

AAAI 2025paperarXiv:2503.10100
4
citations
#5325

ImProver: Agent-Based Automated Proof Optimization

Riyaz Ahuja, Jeremy Avigad, Prasad Tetali et al.

ICLR 2025arXiv:2410.04753
4
citations
#5326

On Corruption-Robustness in Performative Reinforcement Learning

Vasilis Pollatos, Debmalya Mandal, Goran Radanovic

AAAI 2025paperarXiv:2505.05609
4
citations
#5327

Enhancing SQL Query Generation with Neurosymbolic Reasoning

Henrijs Princis, Cristina David, Alan Mycroft

AAAI 2025paperarXiv:2408.13888
4
citations
#5328

Enhancing Masked Time-Series Modeling via Dropping Patches

Tianyu Qiu, Yi Xie, Hao Niu et al.

AAAI 2025paperarXiv:2412.15315
4
citations
#5329

Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization

Timofei Gritsaev, Nikita Morozov, Sergey Samsonov et al.

ICLR 2025arXiv:2410.15474
4
citations
#5330

User Preference Meets Pareto-Optimality in Multi-Objective Bayesian Optimization

Joshua Hang Sai Ip, Ankush Chakrabarty, Ali Mesbah et al.

AAAI 2025paperarXiv:2502.06971
4
citations
#5331

Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models

Eunseop Yoon, Hee Suk Yoon, Mark Hasegawa-Johnson et al.

ICLR 2025arXiv:2507.04976
4
citations
#5332

Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount

Yanbiao Ma, Wei Dai, Jiayi Chen

ICLR 2025arXiv:2502.03852
4
citations
#5333

ConceptSearch: Towards Efficient Program Search Using LLMs for Abstraction and Reasoning Corpus (ARC)

Kartik Singhal, Gautam Shroff

AAAI 2025paperarXiv:2412.07322
4
citations
#5334

Single-View Graph Contrastive Learning with Soft Neighborhood Awareness

Qingqiang Sun, Chaoqi Chen, Ziyue Qiao et al.

AAAI 2025paperarXiv:2412.09261
4
citations
#5335

Lasso Bandit with Compatibility Condition on Optimal Arm

Harin Lee, Taehyun Hwang, Min-hwan Oh

ICLR 2025arXiv:2406.00823
4
citations
#5336

Guaranteed Generation from Large Language Models

Minbeom Kim, Thibaut Thonet, Jos Rozen et al.

ICLR 2025arXiv:2410.06716
4
citations
#5337

HyperMixer: Specializable Hypergraph Channel Mixing for Long-term Multivariate Time Series Forecasting

Changyuan Tian, Zhicong Lu, Zequn Zhang et al.

AAAI 2025paper
4
citations
#5338

An LLM-Empowered Adaptive Evolutionary Algorithm for Multi-Component Deep Learning Systems

Haoxiang Tian, Xingshuo Han, Guoquan Wu et al.

AAAI 2025paperarXiv:2501.00829
4
citations
#5339

Conditional Diffusion Models Based Conditional Independence Testing

Yanfeng Yang, Shuai Li, Yingjie Zhang et al.

AAAI 2025paperarXiv:2412.11744
4
citations
#5340

Solving Partial Differential Equations via Radon Neural Operator

Wenbin Lu, Yihan Chen, Junnan Xu et al.

NEURIPS 2025
4
citations
#5341

Direct Post-Training Preference Alignment for Multi-Agent Motion Generation Model Using Implicit Feedback from Pre-training Demonstrations

Thomas Tian, Kratarth Goel

ICLR 2025arXiv:2503.20105
4
citations
#5342

SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition

Haoran Zhang, Xiangdong Su, Xingxiang Zhou et al.

AAAI 2025paper
4
citations
#5343

Max-Mahalanobis Anchors Guidance for Multi-View Clustering

Pei Zhang, Yuangang Pan, Siwei Wang et al.

AAAI 2025paper
4
citations
#5344

Shh, don't say that! Domain Certification in LLMs

Cornelius Emde, Alasdair Paren, Preetham Arvind et al.

ICLR 2025arXiv:2502.19320
4
citations
#5345

CoInD: Enabling Logical Compositions in Diffusion Models

Sachit Gaudi, Gautam Sreekumar, Vishnu Boddeti

ICLR 2025arXiv:2503.01145
4
citations
#5346

API Pack: A Massive Multi-Programming Language Dataset for API Call Generation

Gavin (Zhen) Guo, Adriana Meza Soria, Wei Sun et al.

ICLR 2025arXiv:2402.09615
4
citations
#5347

Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment

Minh-Quan Le, Gaurav Mittal, Tianjian Meng et al.

ICLR 2025arXiv:2502.05153
4
citations
#5348

GraSP: Simple Yet Effective Graph Similarity Predictions

Haoran Zheng, Jieming Shi, Renchi Yang

AAAI 2025paperarXiv:2412.09968
4
citations
#5349

Group Downsampling with Equivariant Anti-aliasing

Md Ashiqur Rahman, Raymond A. Yeh

ICLR 2025arXiv:2504.17258
4
citations
#5350

Reward Learning from Multiple Feedback Types

Yannick Metz, Andras Geiszl, Raphaël Baur et al.

ICLR 2025arXiv:2502.21038
4
citations
#5351

CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning

Quanmin Wei, Penglin Dai, Wei Li et al.

AAAI 2025paperarXiv:2502.10705
4
citations
#5352

Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization

Sascha Marton, Tim Grams, Florian Vogt et al.

ICLR 2025arXiv:2408.08761
4
citations
#5353

Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference

Ke Yi, Zengke Liu, jianwei zhang et al.

ICLR 2025arXiv:2409.20361
4
citations
#5354

KAES: Multi-aspect Shared Knowledge Finding and Aligning for Cross-prompt Automated Scoring of Essay Traits

Xia Li, Wenjing Pan

AAAI 2025paper
4
citations
#5355

Convergence and Implicit Bias of Gradient Descent on Continual Linear Classification

Hyunji Jung, Hanseul Cho, Chulhee Yun

ICLR 2025arXiv:2504.12712
4
citations
#5356

Filling Memory Gaps: Enhancing Continual Semantic Parsing via SQL Syntax Variance-Guided LLMs Without Real Data Replay

Ruiheng Liu, Jinyu Zhang, Yanqi Song et al.

AAAI 2025paperarXiv:2412.07246
4
citations
#5357

Boosting Multiple Views for pretrained-based Continual Learning

Quyen Tran, Tung Lam Tran, Khanh Doan et al.

ICLR 2025
4
citations
#5358

Mental-Perceiver: Audio-Textual Multi-Modal Learning for Estimating Mental Disorders

Jinghui Qin, Changsong Liu, Tianchi Tang et al.

AAAI 2025paperarXiv:2408.12088
4
citations
#5359

Divide-Solve-Combine: An Interpretable and Accurate Prompting Framework for Zero-shot Multi-Intent Detection

Libo Qin, Qiguang Chen, Jingxuan Zhou et al.

AAAI 2025paper
4
citations
#5360

Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness

Qi Zhang, Yifei Wang, Jingyi Cui et al.

ICLR 2025arXiv:2410.21331
4
citations
#5361

Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution

Simiao Li, Yun Zhang, Wei Li et al.

ICLR 2025arXiv:2404.02573
4
citations
#5362

Control-oriented Clustering of Visual Latent Representation

Han Qi, Haocheng Yin, Heng Yang

ICLR 2025arXiv:2410.05063
4
citations
#5363

eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels

Alexander DeRieux, Walid Saad

ICLR 2025arXiv:2405.17486
4
citations
#5364

Fine-tuning can Help Detect Pretraining Data from Large Language Models

Hengxiang Zhang, Songxin Zhang, Bingyi Jing et al.

ICLR 2025arXiv:2410.10880
4
citations
#5365

Kernel-based Optimally Weighted Conformal Time-Series Prediction

Jonghyeok Lee, Chen Xu, Yao Xie

ICLR 2025
4
citations
#5366

Operationalising Rawlsian Ethics for Fairness in Norm Learning Agents

Jessica Woodgate, Paul Marshall, Nirav Ajmeri

AAAI 2025paperarXiv:2412.15163
4
citations
#5367

Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion

Kaizhe Hu, Zihang Rui, Yao He et al.

ICLR 2025arXiv:2411.04919
4
citations
#5368

Generating Physical Dynamics under Priors

Zihan Zhou, Xiaoxue Wang, Tianshu Yu

ICLR 2025arXiv:2409.00730
4
citations
#5369

Testing Causal Models with Hidden Variables in Polynomial Delay via Conditional Independencies

Hyunchai Jeong, Adiba Ejaz, Jin Tian et al.

AAAI 2025paperarXiv:2409.14593
4
citations
#5370

An Online Learning Theory of Trading-Volume Maximization

Tommaso Cesari, Roberto Colomboni

ICLR 2025
4
citations
#5371

Conformal Inference of Individual Treatment Effects Using Conditional Density Estimates

Baozhen Wang, Xingye Qiao

AAAI 2025paperarXiv:2501.14933
4
citations
#5372

Decentralized Sporadic Federated Learning: A Unified Algorithmic Framework with Convergence Guarantees

Shahryar Zehtabi, Dong-Jun Han, Rohit Parasnis et al.

ICLR 2025arXiv:2402.03448
4
citations
#5373

Multi-Draft Speculative Sampling: Canonical Decomposition and Theoretical Limits

Ashish Khisti, MohammadReza Ebrahimi, Hassan Dbouk et al.

ICLR 2025arXiv:2410.18234
4
citations
#5374

Decentralized Federated Learning with Model Caching on Mobile Agents

Xiaoyu Wang, Guojun Xiong, Houwei Cao et al.

AAAI 2025paperarXiv:2408.14001
4
citations
#5375

Do Deep Neural Network Solutions Form a Star Domain?

Ankit Sonthalia, Alexander Rubinstein, Ehsan Abbasnejad et al.

ICLR 2025arXiv:2403.07968
4
citations
#5376

ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning

Xiangru Tang, Tianyu Hu, Muyang Ye et al.

ICLR 2025
4
citations
#5377

Personalized Representation from Personalized Generation

Shobhita Sundaram, Julia Chae, Yonglong Tian et al.

ICLR 2025arXiv:2412.16156
4
citations
#5378

Revisiting Interpolation for Noisy Label Correction

Yuanzhuo Xu, Xiaoguang Niu, Jie Yang et al.

AAAI 2025paper
4
citations
#5379

DCT-CryptoNets: Scaling Private Inference in the Frequency Domain

Arjun Roy, Kaushik Roy

ICLR 2025arXiv:2408.15231
4
citations
#5380

Words in Motion: Extracting Interpretable Control Vectors for Motion Transformers

Omer Sahin Tas, Royden Wagner

ICLR 2025arXiv:2406.11624
4
citations
#5381

JAMUN: Bridging Smoothed Molecular Dynamics and Score-Based Learning for Conformational Ensemble Generation

Ameya Daigavane, Bodhi Vani, Darcy Davidson et al.

NEURIPS 2025
4
citations
#5382

SRA-CL: Semantic Retrieval Augmented Contrastive Learning for Sequential Recommendation

Ziqiang Cui, Yunpeng Weng, Xing Tang et al.

NEURIPS 2025
4
citations
#5383

Hot-pluggable Federated Learning: Bridging General and Personalized FL via Dynamic Selection

Lei Shen, Zhenheng Tang, Lijun Wu et al.

ICLR 2025
4
citations
#5384

Distilling Dataset into Neural Field

Donghyeok Shin, HeeSun Bae, Gyuwon Sim et al.

ICLR 2025arXiv:2503.04835
4
citations
#5385

Guided Score identity Distillation for Data-Free One-Step Text-to-Image Generation

Mingyuan Zhou, Zhendong Wang, Huangjie Zheng et al.

ICLR 2025arXiv:2406.01561
4
citations
#5386

MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding

Zhicheng Zhang, Wuyou Xia, Chenxi Zhao et al.

ICML 2025spotlightarXiv:2507.04635
4
citations
#5387

Linear Mixture Distributionally Robust Markov Decision Processes

Zhishuai Liu, Pan Xu

NEURIPS 2025arXiv:2505.18044
4
citations
#5388

On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists

Dongyang Fan, Bettina Messmer, Nikita Doikov et al.

ICML 2025arXiv:2409.13931
4
citations
#5389

Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs

Haoming Yang, Ke Ma, Xiaojun Jia et al.

ICML 2025arXiv:2505.02862
4
citations
#5390

Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

Ali Behrouz, Ali Parviz, Mahdi Karami et al.

ICML 2025arXiv:2411.15671
4
citations
#5391

Transformer Learns Optimal Variable Selection in Group-Sparse Classification

Chenyang Zhang, Xuran Meng, Yuan Cao

ICLR 2025arXiv:2504.08638
4
citations
#5392

One-Step Diffusion-Based Image Compression with Semantic Distillation

Naifu Xue, Zhaoyang Jia, Jiahao Li et al.

NEURIPS 2025arXiv:2505.16687
4
citations
#5393

FACTER: Fairness-Aware Conformal Thresholding and Prompt Engineering for Enabling Fair LLM-Based Recommender Systems

Arya Fayyazi, Mehdi Kamal, Massoud Pedram

ICML 2025arXiv:2502.02966
4
citations
#5394

Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning

Bryan L. M. de Oliveira, Luana G. B. Martins, Bruno Brandão et al.

ICML 2025arXiv:2410.14038
4
citations
#5395

DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis

Yinghao Li, Rithesh Kumar, Zeyu Jin

ICML 2025oralarXiv:2410.11097
4
citations
#5396

OpenHOI: Open-World Hand-Object Interaction Synthesis with Multimodal Large Language Model

Zhenhao Zhang, Ye Shi, Lingxiao Yang et al.

NEURIPS 2025oralarXiv:2505.18947
4
citations
#5397

Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner

Chenyou Fan, Chenjia Bai, Zhao Shan et al.

ICML 2025arXiv:2409.19949
4
citations
#5398

Quality over Quantity in Attention Layers: When Adding More Heads Hurts

Noah Amsel, Gilad Yehudai, Joan Bruna

ICLR 2025
4
citations
#5399

Captured by Captions: On Memorization and its Mitigation in CLIP Models

Wenhao Wang, Adam Dziedzic, Grace Kim et al.

ICLR 2025arXiv:2502.07830
4
citations
#5400

Flow Distillation Sampling: Regularizing 3D Gaussians with Pre-trained Matching Priors

Lin-Zhuo Chen, Kangjie Liu, Youtian Lin et al.

ICLR 2025arXiv:2502.07615
4
citations