Most Cited ICLR "llm unlearning" Papers


#2401

CircuitFusion: Multimodal Circuit Representation Learning for Agile Chip Design

Wenji Fang, Shang Liu, Jing Wang et al.

ICLR 2025 • arXiv:2505.02168
14 citations
#2402

Overcoming Lower-Level Constraints in Bilevel Optimization: A Novel Approach with Regularized Gap Functions

Wei Yao, Haian Yin, Shangzhi Zeng et al.

ICLR 2025 • arXiv:2406.01992
14 citations
#2403

Self-supervised Pocket Pretraining via Protein Fragment-Surroundings Alignment

Bowen Gao, Yinjun Jia, Yuanle Mo et al.

ICLR 2024 • arXiv:2310.07229
14 citations
#2404

DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing

William June Suk Choi, Kyungmin Lee, Jongheon Jeong et al.

ICLR 2025 • arXiv:2410.05694
14 citations
#2405

Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation

Zhi Cen, Huaijin Pi, Sida Peng et al.

ICLR 2025 • arXiv:2502.20370
14 citations
#2406

CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models

Song Wang, Peng Wang, Tong Zhou et al.

ICLR 2025 • arXiv:2407.02408
14 citations
#2407

Bounding the Expected Robustness of Graph Neural Networks Subject to Node Feature Attacks

Yassine Abbahaddou, Sofiane Ennadir, Johannes Lutzeyer et al.

ICLR 2024 • arXiv:2404.17947
14 citations
#2408

Mechanistic Permutability: Match Features Across Layers

Nikita Balagansky, Ian Maksimov, Daniil Gavrilov

ICLR 2025 • arXiv:2410.07656
14 citations
#2409

ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement

Xiangyu Peng, Congying Xia, Xinyi Yang et al.

ICLR 2025 • arXiv:2410.02108
14 citations
#2410

CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired Transformer

Yang Liu, Zinan Zheng, Jiashun Cheng et al.

ICLR 2025 • oral • arXiv:2502.19750
14 citations
#2411

KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval

Marah I Abdin, Suriya Gunasekar, Varun Chandrasekaran et al.

ICLR 2024 • arXiv:2310.15511
14 citations
#2412

Neural Active Learning Beyond Bandits

Yikun Ban, Ishika Agarwal, Ziwei Wu et al.

ICLR 2024 • arXiv:2404.12522
14 citations
#2413

Online GNN Evaluation Under Test-time Graph Distribution Shifts

Xin Zheng, Dongjin Song, Qingsong Wen et al.

ICLR 2024 • spotlight • arXiv:2403.09953
14 citations
#2414

The Power of LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions

Stefan Sylvius Wagner, Maike Behrendt, Marc Ziegele et al.

ICLR 2025 • arXiv:2406.12480
14 citations
#2415

CNN Kernels Can Be the Best Shapelets

Eric Qu, Yansen Wang, Xufang Luo et al.

ICLR 2024
14 citations
#2416

Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning

Zenan Li, Zhaoyu Li, Wen Tang et al.

ICLR 2025 • arXiv:2502.13834
14 citations
#2417

Exploring Local Memorization in Diffusion Models via Bright Ending Attention

Chen Chen, Daochang Liu, Mubarak Shah et al.

ICLR 2025 • arXiv:2410.21665
14 citations
#2418

Neural Contractive Dynamical Systems

Hadi Beik Mohammadi, Søren Hauberg, Georgios Arvanitidis et al.

ICLR 2024 • spotlight • arXiv:2401.09352
14 citations
#2419

Implicit Search via Discrete Diffusion: A Study on Chess

Jiacheng Ye, Zhenyu Wu, Jiahui Gao et al.

ICLR 2025 • arXiv:2502.19805
14 citations
#2420

Large Scale Knowledge Washing

Yu Wang, Ruihan Wu, Zexue He et al.

ICLR 2025 • arXiv:2405.16720
14 citations
#2421

Mixture of Parrots: Experts improve memorization more than reasoning

Samy Jelassi, Clara Mohri, David Brandfonbrener et al.

ICLR 2025 • arXiv:2410.19034
14 citations
#2422

Lifting Architectural Constraints of Injective Flows

Peter Sorrenson, Felix Draxler, Armand Rousselot et al.

ICLR 2024 • arXiv:2306.01843
14 citations
#2423

Mixture of Attentions For Speculative Decoding

Matthieu Zimmer, Milan Gritta, Gerasimos Lampouras et al.

ICLR 2025 • arXiv:2410.03804
14 citations
#2424

Quantifying Generalization Complexity for Large Language Models

Zhenting Qi, Hongyin Luo, Xuliang Huang et al.

ICLR 2025 • arXiv:2410.01769
14 citations
#2425

Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation

Abdelrahman Eldesokey, Peter Wonka

ICLR 2025 • arXiv:2408.14819
14 citations
#2426

Weighted-Reward Preference Optimization for Implicit Model Fusion

Ziyi Yang, Fanqi Wan, Longguang Zhong et al.

ICLR 2025 • arXiv:2412.03187
14 citations
#2427

Provable Reward-Agnostic Preference-Based Reinforcement Learning

Wenhao Zhan, Masatoshi Uehara, Wen Sun et al.

ICLR 2024 • spotlight • arXiv:2305.18505
14 citations
#2428

UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models

Xin Xu, Jiaxin Zhang, Tianhao Chen et al.

ICLR 2025 • arXiv:2501.13766
14 citations
#2429

Ultra-Sparse Memory Network

Zihao Huang, Qiyang Min, Hongzhi Huang et al.

ICLR 2025 • arXiv:2411.12364
14 citations
#2430

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami et al.

ICLR 2024 • arXiv:2311.18207
14 citations
#2431

Protein Multimer Structure Prediction via Prompt Learning

Ziqi Gao, Xiangguo Sun, Zijing Liu et al.

ICLR 2024 • arXiv:2402.18813
14 citations
#2432

Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Models

Jinxu Lin, Linwei Tao, Minjing Dong et al.

ICLR 2025 • arXiv:2410.18639
14 citations
#2433

Discrete Diffusion Schrödinger Bridge Matching for Graph Transformation

Jun Hyeong Kim, Seonghwan Kim, Seokhyun Moon et al.

ICLR 2025 • arXiv:2410.01500
14 citations
#2434

Transport meets Variational Inference: Controlled Monte Carlo Diffusions

Francisco Vargas, Shreyas Padhy, Denis Blessing et al.

ICLR 2024 • arXiv:2307.01050
14 citations
#2435

Pre-training with Random Orthogonal Projection Image Modeling

Maryam Haghighat, Peyman Moghadam, Shaheer Mohamed et al.

ICLR 2024 • spotlight • arXiv:2310.18737
14 citations
#2436

Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity

Runyu Zhang, Yang Hu, Na Li

ICLR 2024 • arXiv:2306.11626
14 citations
#2437

SimpleTM: A Simple Baseline for Multivariate Time Series Forecasting

Hui Chen, Viet Luong, Lopamudra Mukherjee et al.

ICLR 2025 • oral
14 citations
#2438

PAC Prediction Sets Under Label Shift

Wenwen Si, Sangdon Park, Insup Lee et al.

ICLR 2024 • arXiv:2310.12964
13 citations
#2439

RelCon: Relative Contrastive Learning for a Motion Foundation Model for Wearable Data

Maxwell Xu, Jaya Narain, Gregory Darnell et al.

ICLR 2025 • arXiv:2411.18822
13 citations
#2440

FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data

Zikai Xiao, Zihan Chen, Liyinglan Liu et al.

ICLR 2024 • arXiv:2401.08977
13 citations
#2441

Look, Remember and Reason: Grounded Reasoning in Videos with Language Models

Apratim Bhattacharyya, Sunny Panchal, Reza Pourreza et al.

ICLR 2024 • oral • arXiv:2306.17778
13 citations
#2442

Adaptive Self-training Framework for Fine-grained Scene Graph Generation

Kibum Kim, Kanghoon Yoon, Yeonjun In et al.

ICLR 2024 • arXiv:2401.09786
13 citations
#2443

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Makoto Shing, Kou Misaki, Han Bao et al.

ICLR 2025 • oral • arXiv:2501.16937
13 citations
#2444

MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models

Jingwei Xu, Junyu Lai, Yunpeng Huang

ICLR 2025 • arXiv:2405.13053
13 citations
#2445

Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting

Xinlu Zhang, Shiyang Li, Xianjun Yang et al.

ICLR 2024 • arXiv:2305.12723
13 citations
#2446

Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment

Chenhang Cui, An Zhang, Yiyang Zhou et al.

ICLR 2025 • arXiv:2410.14148
13 citations
#2447

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Jiale Cheng, Xiao Liu, Cunxiang Wang et al.

ICLR 2025 • arXiv:2412.11605
13 citations
#2448

Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data

Chongyi Zheng, Benjamin Eysenbach, Homer Walke et al.

ICLR 2024 • spotlight • arXiv:2306.03346
13 citations
#2449

CipherPrune: Efficient and Scalable Private Transformer Inference

Yancheng Zhang, Jiaqi Xue, Mengxin Zheng et al.

ICLR 2025 • arXiv:2502.16782
13 citations
#2450

FlashMask: Efficient and Rich Mask Extension of FlashAttention

Guoxia Wang, Jinle Zeng, Xiyuan Xiao et al.

ICLR 2025 • arXiv:2410.01359
13 citations
#2451

Deep Learning Alternatives Of The Kolmogorov Superposition Theorem

Leonardo Ferreira Guilhoto, Paris Perdikaris

ICLR 2025 • arXiv:2410.01990
13 citations
#2452

High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws

Muhammed Ildiz, Halil Gozeten, Ege Taga et al.

ICLR 2025 • arXiv:2410.18837
13 citations
#2453

Re-Imagining Multimodal Instruction Tuning: A Representation View

Yiyang Liu, James Liang, Ruixiang Tang et al.

ICLR 2025 • arXiv:2503.00723
13 citations
#2454

Understanding Reconstruction Attacks with the Neural Tangent Kernel and Dataset Distillation

Noel Loo, Ramin Hasani, Mathias Lechner et al.

ICLR 2024 • arXiv:2302.01428
13 citations
#2455

Harnessing Density Ratios for Online Reinforcement Learning

Philip Amortila, Dylan Foster, Nan Jiang et al.

ICLR 2024 • spotlight • arXiv:2401.09681
13 citations
#2456

Repulsive Latent Score Distillation for Solving Inverse Problems

Nicolas Zilberstein, Morteza Mardani, Santiago Segarra

ICLR 2025 • arXiv:2406.16683
13 citations
#2457

ODE Discovery for Longitudinal Heterogeneous Treatment Effects Inference

Krzysztof Kacprzyk, Samuel Holt, Jeroen Berrevoets et al.

ICLR 2024 • spotlight • arXiv:2403.10766
13 citations
#2458

The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise

Yuanhao Ban, Ruochen Wang, Tianyi Zhou et al.

ICLR 2025 • arXiv:2406.01970
13 citations
#2459

DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation

Yueru Luo, Shuguang Cui, Zhen Li

ICLR 2024 • arXiv:2406.16072
13 citations
#2460

LucidPPN: Unambiguous Prototypical Parts Network for User-centric Interpretable Computer Vision

Mateusz Pach, Koryna Lewandowska, Jacek Tabor et al.

ICLR 2025 • arXiv:2405.14331
13 citations
#2461

Rethinking Spiking Neural Networks from an Ensemble Learning Perspective

Yongqi Ding, Lin Zuo, Mengmeng Jing et al.

ICLR 2025 • oral • arXiv:2502.14218
13 citations
#2462

GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting

Junzhe Jiang, Chun Gu, Yurui Chen et al.

ICLR 2025 • arXiv:2501.13971
13 citations
#2463

LipSim: A Provably Robust Perceptual Similarity Metric

Sara Ghazanfari, Alexandre Araujo, Prashanth Krishnamurthy et al.

ICLR 2024 • arXiv:2310.18274
13 citations
#2464

On the hardness of learning under symmetries

Bobak Kiani, Thien Le, Hannah Lawrence et al.

ICLR 2024 • spotlight • arXiv:2401.01869
13 citations
#2465

Neuroplastic Expansion in Deep Reinforcement Learning

Jiashun Liu, Johan S Obando Ceron, Aaron Courville et al.

ICLR 2025 • arXiv:2410.07994
13 citations
#2466

SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter Pretraining of LLMs

Mohammad Mozaffari, Amir Yazdanbakhsh, Zhao Zhang et al.

ICLR 2025 • arXiv:2405.16325
13 citations
#2467

Learning local equivariant representations for quantum operators

YinZhangHao Zhou, Zixi Gan, Shishir Pandey et al.

ICLR 2025 • arXiv:2407.06053
13 citations
#2468

I-PHYRE: Interactive Physical Reasoning

Shiqian Li, Kewen Wu, Chi Zhang et al.

ICLR 2024 • arXiv:2312.03009
13 citations
#2469

Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation

Lokesh Veeramacheneni, Moritz Wolter, Hilde Kuehne et al.

ICLR 2025 • arXiv:2312.15289
13 citations
#2470

Be Aware of the Neighborhood Effect: Modeling Selection Bias under Interference

Haoxuan Li, Chunyuan Zheng, Sihao Ding et al.

ICLR 2024 • arXiv:2404.19620
13 citations
#2471

Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection

Guangsheng Bao, Yanbin Zhao, Juncai He et al.

ICLR 2025 • arXiv:2412.11506
13 citations
#2472

Port-Hamiltonian Architectural Bias for Long-Range Propagation in Deep Graph Networks

Simon Heilig, Alessio Gravina, Alessandro Trenta et al.

ICLR 2025 • arXiv:2405.17163
13 citations
#2473

On Characterizing the Trade-off in Invariant Representation Learning

Vishnu Boddeti, Sepehr Dehdashtian, Bashir Sadeghi

ICLR 2024 • arXiv:2109.03386
13 citations
#2474

Adding Conditional Control to Diffusion Models with Reinforcement Learning

Yulai Zhao, Masatoshi Uehara, Gabriele Scalia et al.

ICLR 2025 • arXiv:2406.12120
13 citations
#2475

Speech Robust Bench: A Robustness Benchmark For Speech Recognition

Muhammad Shah, David Solans Noguero, Mikko Heikkilä et al.

ICLR 2025 • arXiv:2403.07937
13 citations
#2476

MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field

Kaizhi Yang, Xiaoshuai Zhang, Zhiao Huang et al.

ICLR 2024 • spotlight • arXiv:2303.05703
13 citations
#2477

ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments

Hojae Han, Seung-won Hwang, Rajhans Samdani et al.

ICLR 2025 • arXiv:2502.19852
13 citations
#2478

BadJudge: Backdoor Vulnerabilities of LLM-As-A-Judge

Terry Tong, Fei Wang, Zhe Zhao et al.

ICLR 2025 • arXiv:2503.00596
13 citations
#2479

Efficient Sharpness-Aware Minimization for Molecular Graph Transformer Models

Yili Wang, Kaixiong Zhou, Ninghao Liu et al.

ICLR 2024 • arXiv:2406.13137
13 citations
#2480

Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection

Qinyu Zhao, Ming Xu, Kartik Gupta et al.

ICLR 2024 • arXiv:2402.00865
13 citations
#2481

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Enshu Liu, Xuefei Ning, Yu Wang et al.

ICLR 2025 • arXiv:2412.17153
13 citations
#2482

Light Schrödinger Bridge

Alexander Korotin, Nikita Gushchin, Evgeny Burnaev

ICLR 2024 • arXiv:2310.01174
13 citations
#2483

Imputation for prediction: beware of diminishing returns.

Marine Le Morvan, Gael Varoquaux

ICLR 2025 • arXiv:2407.19804
13 citations
#2484

Generalized Principal-Agent Problem with a Learning Agent

Tao Lin, Yiling Chen

ICLR 2025 • arXiv:2402.09721
13 citations
#2485

Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance

Giung Nam, Byeongho Heo, Juho Lee

ICLR 2024 • arXiv:2404.00860
13 citations
#2486

Retro-fallback: retrosynthetic planning in an uncertain world

Austin Tripp, Krzysztof Maziarz, Sarah Lewis et al.

ICLR 2024 • arXiv:2310.09270
13 citations
#2487

Grounding Continuous Representations in Geometry: Equivariant Neural Fields

David Wessels, David Knigge, Riccardo Valperga et al.

ICLR 2025 • arXiv:2406.05753
13 citations
#2488

Universal Sharpness Dynamics in Neural Network Training: Fixed Point Analysis, Edge of Stability, and Route to Chaos

Dayal Singh Kalra, Tianyu He, Maissam Barkeshli

ICLR 2025 • arXiv:2311.02076
13 citations
#2489

InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting

Chenxin Li, Hengyu Liu, Zhiwen Fan et al.

ICLR 2025
13 citations
#2490

CALICO: Self-Supervised Camera-LiDAR Contrastive Pre-training for BEV Perception

Jiachen Sun, Haizhong Zheng, Qingzhao Zhang et al.

ICLR 2024 • arXiv:2306.00349
13 citations
#2491

VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models

Lisa Dunlap, Krishna Mandal, Trevor Darrell et al.

ICLR 2025 • arXiv:2410.12851
13 citations
#2492

A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models

Enshu Liu, Xuefei Ning, Huazhong Yang et al.

ICLR 2024 • arXiv:2312.07243
13 citations
#2493

Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models

Pit Neitemeier, Björn Deiseroth, Constantin Eichenberg et al.

ICLR 2025 • arXiv:2501.10322
13 citations
#2494

Measuring Non-Adversarial Reproduction of Training Data in Large Language Models

Michael Aerni, Javier Rando, Edoardo Debenedetti et al.

ICLR 2025 • arXiv:2411.10242
13 citations
#2495

SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography

Xuanyu Zhang, Jiarui Meng, Zhipei Xu et al.

ICLR 2025 • arXiv:2503.06118
13 citations
#2496

Reward-Free Curricula for Training Robust World Models

Marc Rigter, Minqi Jiang, Ingmar Posner

ICLR 2024 • arXiv:2306.09205
13 citations
#2497

What Makes a Maze Look Like a Maze?

Joy Hsu, Jiayuan Mao, Joshua B Tenenbaum et al.

ICLR 2025 • arXiv:2409.08202
13 citations
#2498

Jailbreak Antidote: Runtime Safety-Utility Balance via Sparse Representation Adjustment in Large Language Models

Guobin Shen, Dongcheng Zhao, Yiting Dong et al.

ICLR 2025 • arXiv:2410.02298
13 citations
#2499

Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining

Daouda Sow, Herbert Woisetschläger, Saikiran Bulusu et al.

ICLR 2025 • arXiv:2502.06733
13 citations
#2500

Automatic Curriculum Expert Iteration for Reliable LLM Reasoning

Zirui Zhao, Hanze Dong, Amrita Saha et al.

ICLR 2025 • arXiv:2410.07627
13 citations
#2501

Personalized Visual Instruction Tuning

Renjie Pi, Jianshu Zhang, Tianyang Han et al.

ICLR 2025 • arXiv:2410.07113
13 citations
#2502

Conformal Inductive Graph Neural Networks

Soroush H. Zargarbashi, Aleksandar Bojchevski

ICLR 2024 • arXiv:2407.09173
13 citations
#2503

Measuring Vision-Language STEM Skills of Neural Models

Jianhao Shen, Ye Yuan, Srbuhi Mirzoyan et al.

ICLR 2024 • arXiv:2402.17205
13 citations
#2504

Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering

Xingrui Wang, Wufei Ma, Angtian Wang et al.

ICLR 2025 • oral • arXiv:2406.00622
13 citations
#2505

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Juno Kim, Kakei Yamamoto, Kazusato Oko et al.

ICLR 2024 • spotlight • arXiv:2312.01127
13 citations
#2506

Ward: Provable RAG Dataset Inference via LLM Watermarks

Nikola Jovanović, Robin Staab, Maximilian Baader et al.

ICLR 2025 • arXiv:2410.03537
13 citations
#2507

Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation

Eliot Xing, Vernon Luk, Jean Oh

ICLR 2025 • arXiv:2412.12089
13 citations
#2508

PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks

Junwei Su, Difan Zou, Chuan Wu

ICLR 2024 • oral • arXiv:2402.04284
13 citations
#2509

ParetoFlow: Guided Flows in Multi-Objective Optimization

Ye Yuan, Can Chen, Christopher Pal et al.

ICLR 2025 • arXiv:2412.03718
13 citations
#2510

Attacking Perceptual Similarity Metrics

Abhijay Ghildyal, Feng Liu

ICLR 2024 • arXiv:2305.08840
13 citations
#2511

Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization

Zhanfeng Mo, Long-Kai Huang, Sinno Jialin Pan

ICLR 2025
13 citations
#2512

$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs

Vlad Sobal, Mark Ibrahim, Randall Balestriero et al.

ICLR 2025 • arXiv:2407.18134
13 citations
#2513

Unlearn and Burn: Adversarial Machine Unlearning Requests Destroy Model Accuracy

Yangsibo Huang, Daogao Liu, Lynn Chua et al.

ICLR 2025 • arXiv:2410.09591
13 citations
#2514

C-CLIP: Multimodal Continual Learning for Vision-Language Model

Wenzhuo Liu, Fei Zhu, Longhui Wei et al.

ICLR 2025
13 citations
#2515

Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes

Zhilu Zhang, Haoyu Wang, Shuai Liu et al.

ICLR 2024 • arXiv:2310.01840
13 citations
#2516

Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping

Ziye Huang, Haoqi Yuan, Yuhui Fu et al.

ICLR 2025 • arXiv:2410.02475
13 citations
#2517

Post-hoc Reward Calibration: A Case Study on Length Bias

Zeyu Huang, Zihan Qiu, Zili Wang et al.

ICLR 2025 • arXiv:2409.17407
13 citations
#2518

In-context Exploration-Exploitation for Reinforcement Learning

Zhenwen Dai, Federico Tomasi, Sina Ghiassian

ICLR 2024 • arXiv:2403.06826
13 citations
#2519

Learning Transformer-based World Models with Contrastive Predictive Coding

Maxime Burchi, Radu Timofte

ICLR 2025 • oral • arXiv:2503.04416
13 citations
#2520

From Posterior Sampling to Meaningful Diversity in Image Restoration

Noa Cohen, Hila Manor, Yuval Bahat et al.

ICLR 2024 • arXiv:2310.16047
13 citations
#2521

Surprising Effectiveness of pretraining Ternary Language Model at Scale

Ayush Kaushal, Tejas Vaidhya, Arnab Mondal et al.

ICLR 2025 • arXiv:2407.12327
13 citations
#2522

Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search

Max Liu, Chan-Hung Yu, Wei-Hsu Lee et al.

ICLR 2025 • arXiv:2405.16450
13 citations
#2523

ACC-Collab: An Actor-Critic Approach to Multi-Agent LLM Collaboration

Andrew Estornell, Jean-Francois Ton, Yuanshun Yao et al.

ICLR 2025 • arXiv:2411.00053
13 citations
#2524

MAP: Multi-Human-Value Alignment Palette

Xinran Wang, Qi Le, Ammar Ahmed et al.

ICLR 2025 • arXiv:2410.19198
13 citations
#2525

A Unifying Framework for Representation Learning

Shaden Alshammari, John Hershey, Axel Feldmann et al.

ICLR 2025 • arXiv:2504.16929
13 citations
#2526

The Last Iterate Advantage: Empirical Auditing and Principled Heuristic Analysis of Differentially Private SGD

Milad Nasr, Thomas Steinke, Borja Balle et al.

ICLR 2025 • arXiv:2410.06186
13 citations
#2527

Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace

Jinluan Yang, Anke Tang, Didi Zhu et al.

ICLR 2025 • arXiv:2410.13910
13 citations
#2528

Standardizing Structural Causal Models

Weronika Ormaniec, Scott Sussex, Lars Lorch et al.

ICLR 2025 • arXiv:2406.11601
13 citations
#2529

BruSLeAttack: A Query-Efficient Score-Based Black-Box Sparse Adversarial Attack

Quoc Viet Vo, Ehsan Abbasnejad, Damith Ranasinghe

ICLR 2024 • arXiv:2404.05311
13 citations
#2530

FedWon: Triumphing Multi-domain Federated Learning Without Normalization

Weiming Zhuang, Lingjuan Lyu

ICLR 2024 • arXiv:2306.05879
13 citations
#2531

Probabilistic Conformal Prediction with Approximate Conditional Validity

Vincent Plassier, Alexander Fishkov, Mohsen Guizani et al.

ICLR 2025 • arXiv:2407.01794
13 citations
#2532

Learning Molecular Representation in a Cell

Gang Liu, Srijit Seal, John Arevalo et al.

ICLR 2025 • arXiv:2406.12056
13 citations
#2533

DeepGate4: Efficient and Effective Representation Learning for Circuit Design at Scale

Ziyang Zheng, Shan Huang, Jianyuan Zhong et al.

ICLR 2025 • arXiv:2502.01681
13 citations
#2534

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

Xueyi Liu, Jianibieke Adalibieke, Qianwei Han et al.

ICLR 2025 • arXiv:2502.09614
13 citations
#2535

Differential learning kinetics govern the transition from memorization to generalization during in-context learning

Alex Nguyen, Gautam Reddy Nallamala

ICLR 2025 • arXiv:2412.00104
13 citations
#2536

KLay: Accelerating Arithmetic Circuits for Neurosymbolic AI

Jaron Maene, Vincent Derkinderen, Pedro Zuidberg Dos Martires

ICLR 2025 • arXiv:2410.11415
13 citations
#2537

MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models

Jiachun Li, Pengfei Cao, Zhuoran Jin et al.

ICLR 2025 • arXiv:2410.09542
13 citations
#2538

Flow matching achieves almost minimax optimal convergence

Kenji Fukumizu, Taiji Suzuki, Noboru Isobe et al.

ICLR 2025 • arXiv:2405.20879
13 citations
#2539

Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models

Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal

ICLR 2024 • arXiv:2310.05861
13 citations
#2540

An Efficient Tester-Learner for Halfspaces

Aravind Gollakota, Adam Klivans, Konstantinos Stavropoulos et al.

ICLR 2024 • arXiv:2302.14853
13 citations
#2541

Decision Information Meets Large Language Models: The Future of Explainable Operations Research

Yansen Zhang, Qingcan Kang, Wing Yin Yu et al.

ICLR 2025 • arXiv:2502.09994
13 citations
#2542

On a Connection Between Imitation Learning and RLHF

Teng Xiao, Yige Yuan, Mingxiao Li et al.

ICLR 2025 • arXiv:2503.05079
13 citations
#2543

Efficient Inference for Large Language Model-based Generative Recommendation

Xinyu Lin, Chaoqun Yang, Wenjie Wang et al.

ICLR 2025 • arXiv:2410.05165
13 citations
#2544

Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model

Chaochen Gao, Xing W, Qi Fu et al.

ICLR 2025 • arXiv:2405.19846
13 citations
#2545

Imitation Learning from Observation with Automatic Discount Scheduling

Yuyang Liu, Weijun Dong, Yingdong Hu et al.

ICLR 2024 • arXiv:2310.07433
13 citations
#2546

Targeted Attack Improves Protection against Unauthorized Diffusion Customization

Boyang Zheng, Chumeng Liang, Xiaoyu Wu

ICLR 2025 • arXiv:2310.04687
13 citations
#2547

Intelligence at the Edge of Chaos

Shiyang Zhang, Aakash Patel, Syed Rizvi et al.

ICLR 2025 • arXiv:2410.02536
13 citations
#2548

Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing

Xinyu Hu, Pengfei Tang, Simiao Zuo et al.

ICLR 2024 • arXiv:2310.13855
13 citations
#2549

HYPO: Hyperspherical Out-Of-Distribution Generalization

Haoyue Bai, Yifei Ming, Julian Katz-Samuels et al.

ICLR 2024 • arXiv:2402.07785
13 citations
#2550

Faster Algorithms for Structured Linear and Kernel Support Vector Machines

Yuzhou Gu, Zhao Song, Lichen Zhang

ICLR 2025 • arXiv:2307.07735
13 citations
#2551

Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Gate

Byung Hyun Lee, Sungjin Lim, Seunggyu Lee et al.

ICLR 2025 • arXiv:2506.22806
13 citations
#2552

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

Xize Cheng, Siqi Zheng, Zehan Wang et al.

ICLR 2025 • arXiv:2410.21269
13 citations
#2553

PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation

Sang-Hoon Lee, Ha-Yeong Choi, Seong-Whan Lee

ICLR 2025 • arXiv:2408.07547
13 citations
#2554

Prediction Error-based Classification for Class-Incremental Learning

Michał Zając, Tinne Tuytelaars, Gido M van de Ven

ICLR 2024 • arXiv:2305.18806
13 citations
#2555

Skill Expansion and Composition in Parameter Space

Tenglong Liu, Jianxiong Li, Yinan Zheng et al.

ICLR 2025 • arXiv:2502.05932
12 citations
#2556

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

Jiachun Pan, Jun Hao Liew et al.

ICLR 2024 • arXiv:2307.10711
12 citations
#2557

Prompt Risk Control: A Rigorous Framework for Responsible Deployment of Large Language Models

Thomas Zollo, Todd Morrill, Zhun Deng et al.

ICLR 2024 • arXiv:2311.13628
12 citations
#2558

From Commands to Prompts: LLM-based Semantic File System for AIOS

Zeru Shi, Kai Mei, Mingyu Jin et al.

ICLR 2025 • arXiv:2410.11843
12 citations
#2559

P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering

Chuyu Zhang, Hui Ren, Xuming He

ICLR 2024 • arXiv:2401.09266
12 citations
#2560

Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers

Awni Altabaa, Taylor Webb, Jonathan Cohen et al.

ICLR 2024 • arXiv:2304.00195
12 citations
#2561

Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations

Shaocong Ma, Heng Huang

ICLR 2025 • arXiv:2510.19975
12 citations
#2562

Differentially Private SGD Without Clipping Bias: An Error-Feedback Approach

Xinwei Zhang, Zhiqi Bu, Steven Wu et al.

ICLR 2024 • arXiv:2311.14632
12 citations
#2563

LeanAgent: Lifelong Learning for Formal Theorem Proving

Adarsh Kumarappan, Mohit Tiwari, Peiyang Song et al.

ICLR 2025 • arXiv:2410.06209
12 citations
#2564

Language-Informed Visual Concept Learning

Sharon Lee, Yunzhi Zhang, Shangzhe Wu et al.

ICLR 2024 • arXiv:2312.03587
12 citations
#2565

MaestroMotif: Skill Design from Artificial Intelligence Feedback

Martin Klissarov, Mikael Henaff, Roberta Raileanu et al.

ICLR 2025 • arXiv:2412.08542
12 citations
#2566

In-context Time Series Predictor

Jiecheng Lu, Yan Sun, Shihao Yang

ICLR 2025 • arXiv:2405.14982
12 citations
#2567

Conditional Instrumental Variable Regression with Representation Learning for Causal Inference

Debo Cheng, Ziqi Xu, Jiuyong Li et al.

ICLR 2024 • arXiv:2310.01865
12 citations
#2568

A Simple and Scalable Representation for Graph Generation

Yunhui Jang, Seul Lee, Sungsoo Ahn

ICLR 2024 • arXiv:2312.02230
12 citations
#2569

Backdoor Contrastive Learning via Bi-level Trigger Optimization

Weiyu Sun, Xinyu Zhang, Hao Lu et al.

ICLR 2024 • arXiv:2404.07863
12 citations
#2570

Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization

Zhe Li, Bicheng Ying, Zidong Liu et al.

ICLR 2025 • arXiv:2405.15861
12 citations
#2571

Noise Stability Optimization for Finding Flat Minima: A Hessian-based Regularization Approach

Haotian Ju, Hongyang Zhang, Dongyue Li

ICLR 2025 • arXiv:2306.08553
12 citations
#2572

Unbounded: A Generative Infinite Game of Character Life Simulation

Jialu Li, Yuanzhen Li, Neal Wadhwa et al.

ICLR 2025 • arXiv:2410.18975
12 citations
#2573

Generative Monoculture in Large Language Models

Fan Wu, Emily Black, Varun Chandrasekaran

ICLR 2025 • arXiv:2407.02209
12 citations
#2574

Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation

Anqi Li, Feng Li, Yuxi Liu et al.

ICLR 2025 • arXiv:2406.00758
12 citations
#2575

SmartRAG: Jointly Learn RAG-Related Tasks From the Environment Feedback

Jingsheng Gao, Linxu Li, Ke Ji et al.

ICLR 2025 • arXiv:2410.18141
12 citations
#2576

Linear Log-Normal Attention with Unbiased Concentration

Yury Nahshan, Joseph Kampeas, Emir Haleva

ICLR 2024 • arXiv:2311.13541
12 citations
#2577

P-SpikeSSM: Harnessing Probabilistic Spiking State Space Models for Long-Range Dependency Tasks

Malyaban Bal, Abhronil Sengupta

ICLR 2025 • arXiv:2406.02923
12 citations
#2578

Retrieval is Accurate Generation

Bowen Cao, Deng Cai, Leyang Cui et al.

ICLR 2024 • arXiv:2402.17532
12 citations
#2579

Illusory Attacks: Information-theoretic detectability matters in adversarial attacks

Tim Franzmeyer, Stephen McAleer, Joao F. Henriques et al.

ICLR 2024 • spotlight • arXiv:2207.10170
12 citations
#2580

Stable Anisotropic Regularization

William Rudman, Carsten Eickhoff

ICLR 2024 • arXiv:2305.19358
12 citations
#2581

Trusted Multi-View Classification via Evolutionary Multi-View Fusion

Xinyan Liang, Pinhan Fu, Yuhua Qian et al.

ICLR 2025
12 citations
#2582

DiffPuter: Empowering Diffusion Models for Missing Data Imputation

Hengrui Zhang, Liancheng Fang, Qitian Wu et al.

ICLR 2025 • arXiv:2405.20690
12 citations
#2583

Variance Reduced Halpern Iteration for Finite-Sum Monotone Inclusions

Xufeng Cai, Ahmet Alacaoglu, Jelena Diakonikolas

ICLR 2024 • arXiv:2310.02987
12 citations
#2584

MADGEN: Mass-Spec attends to De Novo Molecular generation

Yinkai Wang, Xiaohui Chen, Liping Liu et al.

ICLR 2025 • arXiv:2501.01950
12 citations
#2585

Edge Prompt Tuning for Graph Neural Networks

Xingbo Fu, Yinhan He, Jundong Li

ICLR 2025 • arXiv:2503.00750
12 citations
#2586

Learning Multi-Index Models with Neural Networks via Mean-Field Langevin Dynamics

Alireza Mousavi-Hosseini, Denny Wu, Murat A Erdogdu

ICLR 2025 • arXiv:2408.07254
12 citations
#2587

The Optimization Landscape of SGD Across the Feature Learning Strength

Alexander Atanasov, Alexandru Meterez, James Simon et al.

ICLR 2025 • arXiv:2410.04642
12 citations
#2588

Training-free LLM-generated Text Detection by Mining Token Probability Sequences

Yihuai Xu, Yongwei Wang, Yifei Bi et al.

ICLR 2025 • oral • arXiv:2410.06072
12 citations
#2589

TLDR: Token-Level Detective Reward Model for Large Vision Language Models

Deqing Fu, Tong Xiao, Rui Wang et al.

ICLR 2025 • arXiv:2410.04734
12 citations
#2590

Consistent Flow Distillation for Text-to-3D Generation

Runjie Yan, Yinbo Chen, Xiaolong Wang

ICLR 2025 • arXiv:2501.05445
12 citations
#2591

DAFA: Distance-Aware Fair Adversarial Training

Hyungyu Lee, Saehyung Lee, Hyemi Jang et al.

ICLR 2024 • arXiv:2401.12532
12 citations
#2592

Can Watermarked LLMs be Identified by Users via Crafted Prompts?

Aiwei Liu, Sheng Guan, Yiming Liu et al.

ICLR 2025 • arXiv:2410.03168
12 citations
#2593

Generalizability of Adversarial Robustness Under Distribution Shifts

Bernard Ghanem, Kumail Alhamoud, Hasan Hammoud et al.

ICLR 2024
12 citations
#2594

Stable Segment Anything Model

Qi Fan, Xin Tao, Lei Ke et al.

ICLR 2025 • arXiv:2311.15776
12 citations
#2595

Denoising Task Difficulty-based Curriculum for Training Diffusion Models

Jin-Young Kim, Hyojun Go, Soonwoo Kwon et al.

ICLR 2025 • arXiv:2403.10348
12 citations
#2596

Tuning Frequency Bias of State Space Models

Annan Yu, Dongwei Lyu, Soon Hoe Lim et al.

ICLR 2025 • arXiv:2410.02035
12 citations
#2597

Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization

Xi Lin, Yilu Liu, Xiaoyuan Zhang et al.

ICLR 2025 • arXiv:2405.19650
12 citations
#2598

Advancing the Lower Bounds: an Accelerated, Stochastic, Second-order Method with Optimal Adaptation to Inexactness

Artem Agafonov, Dmitry Kamzolov, Alexander Gasnikov et al.

ICLR 2024 • arXiv:2309.01570
12 citations
#2599

3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation

Chen Zhao, Tong Zhang, Mathieu Salzmann

ICLR 2024 • arXiv:2310.03534
12 citations
#2600

Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment

Huayu Chen, Hang Su, Peize Sun et al.

ICLR 2025 • arXiv:2410.09347
12 citations