Most Cited ICLR "deep reinforcement learning" Papers

6,124 papers found • Page 14 of 31

#2601

Probabilistic Geometric Principal Component Analysis with application to neural data

Han-Lin Hsieh, Maryam Shanechi

ICLR 2025posterarXiv:2509.18469
2
citations
#2602

Enhancing Document Understanding with Group Position Embedding: A Novel Approach to Incorporate Layout Information

Yuke Zhu, Yue Zhang, Dongdong Liu et al.

ICLR 2025poster
2
citations
#2603

ProtoSnap: Prototype Alignment For Cuneiform Signs

Rachel Mikulinsky, Morris Alper, Shai Gordin et al.

ICLR 2025posterarXiv:2502.00129
2
citations
#2604

Language-Assisted Feature Transformation for Anomaly Detection

EungGu Yun, Heonjin Ha, Yeongwoo Nam et al.

ICLR 2025posterarXiv:2503.01184
2
citations
#2605

Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models

Jungwon Park, Jungmin Ko, Dongnam Byun et al.

ICLR 2025posterarXiv:2412.02237
2
citations
#2606

Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration

Heyang Zhao, Xingrui Yu, David Bossens et al.

ICLR 2025posterarXiv:2506.20307
2
citations
#2607

RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs

Xi Xie, Yuebo Luo, Hongwu Peng et al.

ICLR 2025posterarXiv:2409.00822
2
citations
#2608

Action abstractions for amortized sampling

Oussama Boussif, Léna Ezzine, Joseph Viviano et al.

ICLR 2025posterarXiv:2410.15184
2
citations
#2609

Global Convergence of Policy Gradient in Average Reward MDPs

Navdeep Kumar, Yashaswini Murthy, Itai Shufaro et al.

ICLR 2025poster
2
citations
#2610

AutoG: Towards automatic graph construction from tabular data

Zhikai Chen, Han Xie, Jian Zhang et al.

ICLR 2025posterarXiv:2501.15282
2
citations
#2611

An Efficient Framework for Crediting Data Contributors of Diffusion Models

MingYu Lu, Chris Lin, Chanwoo Kim et al.

ICLR 2025posterarXiv:2407.03153
2
citations
#2612

TFG-Flow: Training-free Guidance in Multimodal Generative Flow

Haowei Lin, Shanda Li, Haotian Ye et al.

ICLR 2025posterarXiv:2501.14216
2
citations
#2613

A Poincaré Inequality and Consistency Results for Signal Sampling on Large Graphs

Thien Le, Luana Ruiz, Stefanie Jegelka

ICLR 2024spotlightarXiv:2311.10610
2
citations
#2614

PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance

Qijun Gan, Song Wang, Shengtao Wu et al.

ICLR 2025posterarXiv:2406.09326
2
citations
#2615

Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models

Hoang Khoi Nguyen Do, Truc Nguyen, Malik Hassanaly et al.

ICLR 2025posterarXiv:2503.06413
2
citations
#2616

Gradient correlation is a key ingredient to accelerate SGD with momentum

Julien Hermant, Marien Renaud, Jean-François Aujol et al.

ICLR 2025posterarXiv:2410.07870
2
citations
#2617

Flash Inference: Near Linear Time Inference for Long Convolution Sequence Models and Beyond

Costin-Andrei Oncescu, Sanket Jayant Purandare, Stratos Idreos et al.

ICLR 2025posterarXiv:2410.12982
2
citations
#2618

Towards Characterizing Domain Counterfactuals for Invertible Latent Causal Models

Zeyu Zhou, Ruqi Bai, Sean Kulinski et al.

ICLR 2024posterarXiv:2306.11281
2
citations
#2619

Learning system dynamics without forgetting

Xikun ZHANG, Dongjin Song, Yushan Jiang et al.

ICLR 2025posterarXiv:2407.00717
2
citations
#2620

Fast unsupervised ground metric learning with tree-Wasserstein distance

Kira Michaela Düsterwald, Samo Hromadka, Makoto Yamada

ICLR 2025posterarXiv:2411.07432
2
citations
#2621

Rational Decision-Making Agent with Learning Internal Utility Judgment

Yining Ye, Xin Cong, Shizuo Tian et al.

ICLR 2025poster
2
citations
#2622

Bidirectional Temporal Diffusion Model for Temporally Consistent Human Animation

Tserendorj Adiya, Jae Shin Yoon, Jung Eun Lee et al.

ICLR 2024oralarXiv:2307.00574
2
citations
#2623

The Breakdown of Gaussian Universality in Classification of High-dimensional Linear Factor Mixtures

Xiaoyi MAI, Zhenyu Liao

ICLR 2025posterarXiv:2410.05609
2
citations
#2624

Interpreting Global Perturbation Robustness of Image Models using Axiomatic Spectral Importance Decomposition

Róisín Luo, James McDermott, Colm O'Riordan

ICLR 2025posterarXiv:2408.01139
2
citations
#2625

How Low Can You Go? Searching for the Intrinsic Dimensionality of Complex Networks using Metric Node Embeddings

Nikolaos Nakis, Niels Raunkjær Holm, Andreas Lyhne Fiehn et al.

ICLR 2025posterarXiv:2503.01723
2
citations
#2626

Learning semilinear neural operators: A unified recursive framework for prediction and data assimilation.

Ashutosh Singh, Ricardo Borsoi, Deniz Erdogmus et al.

ICLR 2024oral
2
citations
#2627

Minimal Variance Model Aggregation: A principled, non-intrusive, and versatile integration of black box models

Theo Bourdais, Houman Owhadi

ICLR 2025posterarXiv:2409.17267
2
citations
#2628

Bayesian Analysis of Combinatorial Gaussian Process Bandits

Jack Sandberg, Niklas Åkerblom, Morteza Haghir Chehreghani

ICLR 2025posterarXiv:2312.12676
2
citations
#2629

Efficient Distribution Matching of Representations via Noise-Injected Deep InfoMax

Ivan Butakov, Alexander Semenenko, Alexander Tolmachev et al.

ICLR 2025posterarXiv:2410.06993
2
citations
#2630

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Yifei Xing, Xiangyuan Lan, Ruiping Wang et al.

ICLR 2025posterarXiv:2410.05938
2
citations
#2631

A Multiscale Frequency Domain Causal Framework for Enhanced Pathological Analysis

Xiaoyu Cui, Weixing Chen, Jiandong Su

ICLR 2025poster
2
citations
#2632

Can a Large Language Model be a Gaslighter?

Wei Li, Luyao Zhu, Yang Song et al.

ICLR 2025posterarXiv:2410.09181
2
citations
#2633

Out-of-distribution Generalization for Total Variation based Invariant Risk Minimization

Yuanchao Wang, Zhao-Rong Lai, Tianqi Zhong

ICLR 2025posterarXiv:2502.19665
2
citations
#2634

Discovering Group Structures via Unitary Representation Learning

Dongsung Huh

ICLR 2025poster
2
citations
#2635

Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness

Maayan Ehrenberg, Roy Ganz, Nir Rosenfeld

ICLR 2025posterarXiv:2406.11458
2
citations
#2636

Debiasing Mini-Batch Quadratics for Applications in Deep Learning

Lukas Nicola Tatzel, Bálint Mucsányi, Osane Hackel et al.

ICLR 2025posterarXiv:2410.14325
2
citations
#2637

Generalization of Scaled Deep ResNets in the Mean-Field Regime

Yihang Chen, Fanghui Liu, Yiping Lu et al.

ICLR 2024spotlightarXiv:2403.09889
2
citations
#2638

Exploring the Camera Bias of Person Re-identification

Myungseo Song, Jin-Woo Park, Jong-Seok Lee

ICLR 2025posterarXiv:2502.10195
2
citations
#2639

Stabilizing Backpropagation Through Time to Learn Complex Physics

Patrick Schnell, Nils Thuerey

ICLR 2024oralarXiv:2405.02041
2
citations
#2640

Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data

Xinran Liu, Yikun Bai, Rocio Diaz Martin et al.

ICLR 2025posterarXiv:2411.06055
2
citations
#2641

AI2TALE: An Innovative Information Theory-based Approach for Learning to Localize Phishing Attacks

Van Nguyen, Tingmin Wu, Xingliang YUAN et al.

ICLR 2025poster
2
citations
#2642

CL-DiffPhyCon: Closed-loop Diffusion Control of Complex Physical Systems

Long Wei, Haodong Feng, Yuchen Yang et al.

ICLR 2025posterarXiv:2408.03124
2
citations
#2643

DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking head Video Generation

Hanbo Cheng, Limin Lin, Chenyu Liu et al.

ICLR 2025posterarXiv:2410.13726
2
citations
#2644

Isometric Regularization for Manifolds of Functional Data

Hyeongjun Heo, Seonghun Oh, JaeYong Lee et al.

ICLR 2025poster
2
citations
#2645

NetInfoF Framework: Measuring and Exploiting Network Usable Information

Meng-Chieh Lee, Haiyang Yu, Jian Zhang et al.

ICLR 2024spotlightarXiv:2402.07999
2
citations
#2646

Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping

Tianhao Wu, Jing Yang, Zhilin Guo et al.

ICLR 2025posterarXiv:2405.12069
2
citations
#2647

Rethinking Classifier Re-Training in Long-Tailed Recognition: Label Over-Smooth Can Balance

Siyu Sun, Han Lu, Jiangtong Li et al.

ICLR 2025poster
2
citations
#2648

Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Transformers

Shaobo Wang, Hongxuan Tang, Mingyang Wang et al.

ICLR 2025posterarXiv:2410.21815
2
citations
#2649

Beyond Circuit Connections: A Non-Message Passing Graph Transformer Approach for Quantum Error Mitigation

Tianyi Bao, Xinyu Ye, Hang Ruan et al.

ICLR 2025poster
2
citations
#2650

From Search to Sampling: Generative Models for Robust Algorithmic Recourse

Prateek Garg, Lokesh Nagalapatti, Sunita Sarawagi

ICLR 2025posterarXiv:2505.07351
2
citations
#2651

An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning

Haoran Xu, Shuozhe Li, Harshit Sikchi et al.

ICLR 2025posterarXiv:2504.13368
2
citations
#2652

Fine-tuning with Reserved Majority for Noise Reduction

Shuyang Jiang, Yusheng Liao, Ya Zhang et al.

ICLR 2025poster
2
citations
#2653

SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators

Rasoul Shafipour, David Harrison, Maxwell Horton et al.

ICLR 2025posterarXiv:2410.10714
2
citations
#2654

Enhancing Uncertainty Estimation and Interpretability with Bayesian Non-negative Decision Layer

XINYUE HU, Zhibin Duan, Bo Chen et al.

ICLR 2025posterarXiv:2505.22199
2
citations
#2655

Salvage: Shapley-distribution Approximation Learning Via Attribution Guided Exploration for Explainable Image Classification

Mehdi Naouar, Hanne Raum, Jens Rahnfeld et al.

ICLR 2025poster
2
citations
#2656

MIND over Body: Adaptive Thinking using Dynamic Computation

Mrinal Mathur, Barak Pearlmutter, Sergey Plis

ICLR 2025poster
2
citations
#2657

An Asynchronous Bundle Method for Distributed Learning Problems

Daniel Cederberg, Xuyang Wu, Stephen Boyd et al.

ICLR 2025poster
2
citations
#2658

Weighted Point Set Embedding for Multimodal Contrastive Learning Toward Optimal Similarity Metric

Toshimitsu Uesaka, Taiji Suzuki, Yuhta Takida et al.

ICLR 2025posterarXiv:2404.19228
2
citations
#2659

Uncovering Gaps in How Humans and LLMs Interpret Subjective Language

Erik Jones, Arjun Patrawala, Jacob Steinhardt

ICLR 2025posterarXiv:2503.04113
2
citations
#2660

Data-centric Prediction Explanation via Kernelized Stein Discrepancy

Mahtab Sarvmaili, Hassan Sajjad, Ga Wu

ICLR 2025posterarXiv:2403.15576
2
citations
#2661

Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations

Yupei Yang, Biwei Huang, Fan Feng et al.

ICLR 2025posterarXiv:2407.20651
2
citations
#2662

Bisimulation Metric for Model Predictive Control

Yutaka Shimizu, Masayoshi Tomizuka

ICLR 2025posterarXiv:2410.04553
2
citations
#2663

Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models

Yongjin Yang, Sihyeon Kim, Hojung Jung et al.

ICLR 2025posterarXiv:2410.10166
2
citations
#2664

Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models

Hao-Chien Hsueh, Wen-Hsiao Peng, Ching-Chun Huang

ICLR 2025posterarXiv:2511.16904
2
citations
#2665

KooNPro: A Variance-Aware Koopman Probabilistic Model Enhanced by Neural Process for Time Series Forecasting

Ronghua Zheng, Hanru Bai, Weiyang Ding

ICLR 2025oral
2
citations
#2666

FLOPS: Forward Learning with OPtimal Sampling

Tao Ren, Zishi Zhang, Jinyang Jiang et al.

ICLR 2025posterarXiv:2410.05966
2
citations
#2667

Graph-based Document Structure Analysis

Yufan Chen, Ruiping Liu, Junwei Zheng et al.

ICLR 2025posterarXiv:2502.02501
2
citations
#2668

PINP: Physics-Informed Neural Predictor with latent estimation of fluid flows

Huaguan Chen, Yang Liu, Hao Sun

ICLR 2025oralarXiv:2504.06070
2
citations
#2669

Non-Stationary Dueling Bandits Under a Weighted Borda Criterion

Joe Suk, Arpit Agarwal

ICLR 2025posterarXiv:2403.12950
2
citations
#2670

Teaching Human Behavior Improves Content Understanding Abilities Of VLMs

SOMESH SINGH, Harini S I, Yaman Singla et al.

ICLR 2025poster
2
citations
#2671

Weakly Supervised Video Scene Graph Generation via Natural Language Supervision

Kibum Kim, Kanghoon Yoon, Yeonjun In et al.

ICLR 2025oralarXiv:2502.15370
2
citations
#2672

Wavelet-based Positional Representation for Long Context

Yui Oka, Taku Hasegawa, Kyosuke Nishida et al.

ICLR 2025posterarXiv:2502.02004
2
citations
#2673

Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density

Sabine Susstrunk, Mathieu Salzmann, Chen Liu et al.

ICLR 2025posterarXiv:2407.08659
2
citations
#2674

Beyond Surface Structure: A Causal Assessment of LLMs' Comprehension ability

Yujin Han, Lei Xu, Sirui Chen et al.

ICLR 2025posterarXiv:2411.19456
2
citations
#2675

Identification of Intermittent Temporal Latent Process

Yuke Li, Yujia Zheng, Guangyi Chen et al.

ICLR 2025oral
2
citations
#2676

MIRACLE 3D: Memory-efficient Integrated Robust Approach for Continual Learning on 3D Point Clouds via Shape Model Construction

Hossein Resani, Behrooz Nasihatkon

ICLR 2025poster
2
citations
#2677

The Directionality of Optimization Trajectories in Neural Networks

Sidak Pal Singh, Bobby He, Thomas Hofmann et al.

ICLR 2025poster
2
citations
#2678

Zero-Shot Natural Language Explanations

Fawaz Sammani, Nikos Deligiannis

ICLR 2025poster
2
citations
#2679

Enhance Multi-View Classification Through Multi-Scale Alignment and Expanded Boundary

Yuena Lin, Yiyuan Wang, Gengyu Lyu et al.

ICLR 2025poster
2
citations
#2680

Learning on One Mode: Addressing Multi-modality in Offline Reinforcement Learning

Mianchu Wang, Yue Jin, Giovanni Montana

ICLR 2025posterarXiv:2412.03258
2
citations
#2681

Preference Elicitation for Offline Reinforcement Learning

Alizée Pace, Bernhard Schölkopf, Gunnar Ratsch et al.

ICLR 2025posterarXiv:2406.18450
2
citations
#2682

Masked Distillation Advances Self-Supervised Transformer Architecture Search

Caixia Yan, Xiaojun Chang, Zhihui Li et al.

ICLR 2024poster
2
citations
#2683

CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion

Joshua Kazdan, Hao Sun, Jiaqi Han et al.

ICLR 2025posterarXiv:2409.07025
2
citations
#2684

Exact Community Recovery under Side Information: Optimality of Spectral Algorithms

Julia Gaudio, Nirmit Joshi

ICLR 2025posterarXiv:2406.13075
2
citations
#2685

AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements

Adriana-Eufrosina Bora, Pierre-Luc St-Charles, Mirko Bronzi et al.

ICLR 2025posterarXiv:2502.07022
2
citations
#2686

CBMA: Improving Conformal Prediction through Bayesian Model Averaging

Pankaj Bhagwat, Linglong Kong, Bei Jiang

ICLR 2025posterarXiv:2511.16924
2
citations
#2687

Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning

Yan Scholten, Stephan Günnemann

ICLR 2025posterarXiv:2410.09878
2
citations
#2688

Bounds on $L_p$ Errors in Density Ratio Estimation via $f$-Divergence Loss Functions

Yoshiaki Kitazawa

ICLR 2025poster
2
citations
#2689

OptionZero: Planning with Learned Options

Po-Wei Huang, Pei-Chiun Peng, Hung Guei et al.

ICLR 2025posterarXiv:2502.16634
2
citations
#2690

A Statistical Approach for Controlled Training Data Detection

Zirui Hu, Yingjie Wang, Zheng Zhang et al.

ICLR 2025poster
2
citations
#2691

SEBRA : Debiasing through Self-Guided Bias Ranking

Adarsh Kappiyath, Abhra Chaudhuri, AJAY JAISWAL et al.

ICLR 2025posterarXiv:2501.18277
2
citations
#2692

Query-based Knowledge Transfer for Heterogeneous Learning Environments

Norah Alballa, Wenxuan Zhang, Ziquan Liu et al.

ICLR 2025posterarXiv:2504.09205
2
citations
#2693

Geometry of Long-Tailed Representation Learning: Rebalancing Features for Skewed Distributions

Lingjie Yi, Michael Yao, Weimin Lyu et al.

ICLR 2025poster
2
citations
#2694

INFER: A Neural-symbolic Model For Extrapolation Reasoning on Temporal Knowledge Graph

Ningyuan Li, Haihong E, Tianyu Yao et al.

ICLR 2025oral
2
citations
#2695

Toward Efficient Multi-Agent Exploration With Trajectory Entropy Maximization

Tianxu Li, Kun Zhu

ICLR 2025poster
2
citations
#2696

Learning 3D Perception from Others' Predictions

Jinsu Yoo, Zhenyang Feng, Tai-Yu Pan et al.

ICLR 2025posterarXiv:2410.02646
2
citations
#2697

FedLWS: Federated Learning with Adaptive Layer-wise Weight Shrinking

Changlong Shi, Jinmeng Li, He Zhao et al.

ICLR 2025posterarXiv:2503.15111
2
citations
#2698

Test-time Adaptation for Image Compression with Distribution Regularization

Kecheng Chen, Pingping Zhang, Tiexin Qin et al.

ICLR 2025posterarXiv:2410.12191
2
citations
#2699

Dreamweaver: Learning Compositional World Models from Pixels

Junyeob Baek, Yi-Fu Wu, Gautam Singh et al.

ICLR 2025posterarXiv:2501.14174
2
citations
#2700

Learning from Aggregate responses: Instance Level versus Bag Level Loss Functions

Adel Javanmard, Lin Chen, Vahab Mirrokni et al.

ICLR 2024posterarXiv:2401.11081
2
citations
#2701

Prevalence of Negative Transfer in Continual Reinforcement Learning: Analyses and a Simple Baseline

Hongjoon Ahn, Jinu Hyeon, Youngmin Oh et al.

ICLR 2025poster
2
citations
#2702

Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood

Qingmao Yao, Zhichao Lei, Tianyuan Chen et al.

ICLR 2025posterarXiv:2506.08417
2
citations
#2703

GLOMA: Global Video Text Spotting with Morphological Association

Han Wang, Yanjie Wang, Yang Li et al.

ICLR 2025oral
2
citations
#2704

DEPT: Decoupled Embeddings for Pre-training Language Models

Alex Iacob, Lorenzo Sani, Meghdad Kurmanji et al.

ICLR 2025posterarXiv:2410.05021
2
citations
#2705

Inner Information Analysis Algorithm for Deep Neural Network based on Community

Guipeng Lan, Shuai Xiao, Meng Xi et al.

ICLR 2025poster
2
citations
#2706

Loss Landscape of Shallow ReLU-like Neural Networks: Stationary Points, Saddle Escape, and Network Embedding

Frank Zhengqing Wu, Berfin Simsek, François Ged

ICLR 2025posterarXiv:2402.05626
2
citations
#2707

Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers

Yuchen Liang, Peizhong Ju, Yingbin Liang et al.

ICLR 2025posterarXiv:2410.13746
2
citations
#2708

Differentiable Rule Induction from Raw Sequence Inputs

Kun Gao, Katsumi Inoue, Yongzhi Cao et al.

ICLR 2025poster
2
citations
#2709

Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

Ziping Xu, Zifan Xu, Runxuan Jiang et al.

ICLR 2024posterarXiv:2403.01636
2
citations
#2710

VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation

Kuo-Han Hung, Pang-Chi Lo, Jia-Fong Yeh et al.

ICLR 2025posterarXiv:2405.16545
2
citations
#2711

Residual Deep Gaussian Processes on Manifolds

Kacper Wyrwal, Andreas Krause, Viacheslav (Slava) Borovitskiy

ICLR 2025posterarXiv:2411.00161
2
citations
#2712

Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels

Zhizheng Liu, Joe Lin, Wayne Wu et al.

ICLR 2025posterarXiv:2410.07500
2
citations
#2713

3D-SPATIAL MULTIMODAL MEMORY

Xueyan Zou, Yuchen Song, Ri-Zhao Qiu et al.

ICLR 2025posterarXiv:2503.16413
2
citations
#2714

LLM-based Typed Hyperresolution for Commonsense Reasoning with Knowledge Bases

Armin Toroghi, Ali Pesaranghader, Tanmana Sadhu et al.

ICLR 2025poster
2
citations
#2715

MAGNet: Motif-Agnostic Generation of Molecules from Scaffolds

Leon Hetzel, Johanna Sommer, Bastian Rieck et al.

ICLR 2025poster
2
citations
#2716

Graph Neural Networks Can (Often) Count Substructures

Paolo Pellizzoni, Till Schulz, Karsten Borgwardt

ICLR 2025poster
2
citations
#2717

Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits

Yuwei Luo, Mohsen Bayati

ICLR 2025posterarXiv:2306.14872
2
citations
#2718

Swing-by Dynamics in Concept Learning and Compositional Generalization

Yongyi Yang, Core Francisco Park, Ekdeep Singh Lubana et al.

ICLR 2025posterarXiv:2410.08309
2
citations
#2719

Enabling Lanuguage Models to Implicitly Learn Self-Improvement

Ziqi Wang, Le Hou, Tianjian Lu et al.

ICLR 2024poster
2
citations
#2720

Learning from Imperfect Human Feedback: A Tale from Corruption-Robust Dueling

Yuwei Cheng, Fan Yao, Xuefeng Liu et al.

ICLR 2025posterarXiv:2405.11204
2
citations
#2721

Measuring And Improving Engagement of Text-to-Image Generation Models

Varun Khurana, Yaman Singla, Jayakumar Subramanian et al.

ICLR 2025poster
2
citations
#2722

Confidence Elicitation: A New Attack Vector for Large Language Models

Brian Formento, Chuan Sheng Foo, See-Kiong Ng

ICLR 2025posterarXiv:2502.04643
2
citations
#2723

Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs

Jie Zhang, Zhongqi Wang, Mengqi Lei et al.

ICLR 2025posterarXiv:2406.18849
2
citations
#2724

HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning

Ayano Hiranaka, Shang-Fu Chen, Chieh-Hsin Lai et al.

ICLR 2025posterarXiv:2410.05116
2
citations
#2725

Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning

Hanlin Yang, Jian Yao, Weiming Liu et al.

ICLR 2025oralarXiv:2410.15910
2
citations
#2726

How Much is Unseen Depends Chiefly on Information About the Seen

Seongmin Lee, Marcel Boehme

ICLR 2025posterarXiv:2402.05835
2
citations
#2727

Uncertainty modeling for fine-tuned implicit functions

Anna Susmelj, Mael Macuglia, Natasa Tagasovska et al.

ICLR 2025posterarXiv:2406.12082
2
citations
#2728

cryoSPHERE: Single-Particle HEterogeneous REconstruction from cryo EM

Gabriel Claude Jean Ducrocq, Lukas Grunewald, Sebastian Westenhoff et al.

ICLR 2025posterarXiv:2407.01574
2
citations
#2729

Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?

Yutong Yin, Zhaoran Wang

ICLR 2025posterarXiv:2501.15857
2
citations
#2730

REVISITING MULTI-PERMUTATION EQUIVARIANCE THROUGH THE LENS OF IRREDUCIBLE REPRESENTATIONS

Yonatan Sverdlov, Ido Springer, Nadav Dym

ICLR 2025posterarXiv:2410.06665
2
citations
#2731

Demystifying Topological Message-Passing with Relational Structures: A Case Study on Oversquashing in Simplicial Message-Passing

Diaaeldin Taha, James Chapman, Marzieh Eidi et al.

ICLR 2025posterarXiv:2506.06582
2
citations
#2732

Hyperbolic Genome Embeddings

Raiyan Khan, Philippe Chlenski, Itsik Pe'er

ICLR 2025posterarXiv:2507.21648
2
citations
#2733

Deriving Causal Order from Single-Variable Interventions: Guarantees & Algorithm

Mathieu Chevalley, Patrick Schwab, Arash Mehrjou

ICLR 2025posterarXiv:2405.18314
2
citations
#2734

Lambda-Skip Connections: the architectural component that prevents Rank Collapse

Federico Arangath Joseph, Jerome Sieber, Melanie Zeilinger et al.

ICLR 2025posterarXiv:2410.10609
2
citations
#2735

COFlowNet: Conservative Constraints on Flows Enable High-Quality Candidate Generation

Yudong Zhang, Xuan Yu, Xu Wang et al.

ICLR 2025poster
2
citations
#2736

Physics-informed Temporal Difference Metric Learning for Robot Motion Planning

Ruiqi Ni, zherong pan, Ahmed Hussain Qureshi

ICLR 2025oralarXiv:2505.05691
2
citations
#2737

Self-Attention-Based Contextual Modulation Improves Neural System Identification

Isaac Lin, Tianye Wang, Shang Gao et al.

ICLR 2025posterarXiv:2406.07843
2
citations
#2738

Supervised and Semi-Supervised Diffusion Maps with Label-Driven Diffusion

Harel Mendelman, Ronen Talmon

ICLR 2025poster
2
citations
#2739

Predicting the Energy Landscape of Stochastic Dynamical System via Physics-informed Self-supervised Learning

Ruikun Li, Huandong Wang, Qingmin Liao et al.

ICLR 2025posterarXiv:2502.16828
2
citations
#2740

Causal Graph Transformer for Treatment Effect Estimation Under Unknown Interference

Anpeng Wu, Haiyi Qiu, Zhengming Chen et al.

ICLR 2025poster
2
citations
#2741

Storybooth: Training-Free Multi-Subject Consistency for Improved Visual Storytelling

Jaskirat Singh, Junshen K Chen, Jonas Kohler et al.

ICLR 2025posterarXiv:2504.05800
2
citations
#2742

Accurate and Scalable Graph Neural Networks via Message Invariance

Zhihao Shi, Jie Wang, Zhiwei Zhuang et al.

ICLR 2025posterarXiv:2502.19693
2
citations
#2743

Easing Training Process of Rectified Flow Models Via Lengthening Inter-Path Distance

Shifeng Xu, Yanzhu Liu, Adams Kong

ICLR 2025poster
2
citations
#2744

Adaptive backtracking for faster optimization

Joao V. Cavalcanti, Laurent Lessard, Ashia Wilson

ICLR 2025poster
2
citations
#2745

EcoFace: Audio-Visual Emotional Co-Disentanglement Speech-Driven 3D Talking Face Generation

Jiajian Xie, Shengyu Zhang, Mengze Li et al.

ICLR 2025poster
2
citations
#2746

GeSubNet: Gene Interaction Inference for Disease Subtype Network Generation

Ziwei Yang, Zheng Chen, XIN LIU et al.

ICLR 2025posterarXiv:2410.13178
2
citations
#2747

A Non-Contrastive Learning Framework for Sequential Recommendation with Preference-Preserving Profile Generation

Huimin Zeng, Xiaojie Wang, Anoop Jain et al.

ICLR 2025poster
2
citations
#2748

Tight Lower Bounds under Asymmetric High-Order Hölder Smoothness and Uniform Convexity

Cedar Site Bai, Brian Bullins

ICLR 2025posterarXiv:2409.10773
2
citations
#2749

SoftMatcha: A Soft and Fast Pattern Matcher for Billion-Scale Corpus Searches

Hiroyuki Deguchi, Go Kamoda, Yusuke Matsushita et al.

ICLR 2025posterarXiv:2503.03703
2
citations
#2750

Towards Marginal Fairness Sliced Wasserstein Barycenter

Khai Nguyen, Hai Nguyen, Nhat Ho

ICLR 2025posterarXiv:2405.07482
2
citations
#2751

PaLD: Detection of Text Partially Written by Large Language Models

Eric Lei, Hsiang Hsu, Chun-Fu Chen

ICLR 2025poster
2
citations
#2752

GSBA$^K$: $top$-$K$ Geometric Score-based Black-box Attack

Md Farhamdur Reza, Richeng Jin, Tianfu Wu et al.

ICLR 2025posterarXiv:2503.12827
2
citations
#2753

Revealing the 3D Cosmic Web through Gravitationally Constrained Neural Fields

Brandon Zhao, Aviad Levis, Liam Connor et al.

ICLR 2025posterarXiv:2504.15262
2
citations
#2754

PharmacoMatch: Efficient 3D Pharmacophore Screening via Neural Subgraph Matching

Daniel Rose, Oliver Wieder, Thomas Seidel et al.

ICLR 2025posterarXiv:2409.06316
2
citations
#2755

Sharper Guarantees for Learning Neural Network Classifiers with Gradient Methods

Hossein Taheri, Christos Thrampoulidis, Arya Mazumdar

ICLR 2025posterarXiv:2410.10024
2
citations
#2756

Long-time asymptotics of noisy SVGD outside the population limit

Victor Priser, PASCAL BIANCHI, Adil Salim

ICLR 2025posterarXiv:2406.11929
2
citations
#2757

Learning to Help in Multi-Class Settings

Yu Wu, Yansong Li, Zeyu Dong et al.

ICLR 2025posterarXiv:2501.13810
2
citations
#2758

An Auditing Test to Detect Behavioral Shift in Language Models

Leo Richter, Xuanli He, Pasquale Minervini et al.

ICLR 2025oralarXiv:2410.19406
2
citations
#2759

From Promise to Practice: Realizing High-performance Decentralized Training

Zesen Wang, Jiaojiao Zhang, Xuyang Wu et al.

ICLR 2025posterarXiv:2410.11998
2
citations
#2760

Learning multi-modal generative models with permutation-invariant encoders and tighter variational objectives

Marcel Hirt, Domenico Campolo, Victoria Leong et al.

ICLR 2025posterarXiv:2309.00380
2
citations
#2761

The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model

Jiawei Chen, Wentao Chen, Jing Su et al.

ICLR 2025posterarXiv:2412.07298
2
citations
#2762

GMValuator: Similarity-based Data Valuation for Generative Models

Jiaxi Yang, Wenlong Deng, Benlin Liu et al.

ICLR 2025posterarXiv:2304.10701
2
citations
#2763

Adaptive Shrinkage Estimation for Personalized Deep Kernel Regression in Modeling Brain Trajectories

Vasiliki Tassopoulou, Haochang Shou, Christos Davatzikos

ICLR 2025posterarXiv:2504.08840
2
citations
#2764

Beyond Next Token Prediction: Patch-Level Training for Large Language Models

Chenze Shao, Fandong Meng, Jie Zhou

ICLR 2025posterarXiv:2407.12665
2
citations
#2765

TopoGaussian: Inferring Internal Topology Structures from Visual Clues

Xiaoyu Xiong, Changyu Hu, Chunru Lin et al.

ICLR 2025posterarXiv:2503.12343
2
citations
#2766

REBIND: Enhancing Ground-state Molecular Conformation Prediction via Force-Based Graph Rewiring

Taewon Kim, Hyunjin Seo, Sungsoo Ahn et al.

ICLR 2025poster
2
citations
#2767

Physiome-ODE: A Benchmark for Irregularly Sampled Multivariate Time-Series Forecasting Based on Biological ODEs

Christian Klötergens, Vijaya Krishna Yalavarthi, Randolf Scholz et al.

ICLR 2025posterarXiv:2502.07489
2
citations
#2768

Asymptotic Analysis of Two-Layer Neural Networks after One Gradient Step under Gaussian Mixtures Data with Structure

Samet Demir, Zafer Dogan

ICLR 2025posterarXiv:2503.00856
2
citations
#2769

UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation

Tao Zhang, Jinyong Wen, Zhen Chen et al.

ICLR 2025posterarXiv:2502.02257
2
citations
#2770

Learned Reference-based Diffusion Sampler for multi-modal distributions

Maxence Noble, Louis Grenioux, Marylou Gabrié et al.

ICLR 2025poster
2
citations
#2771

Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model

Siyu Chen, Beining Wu, Miao Lu et al.

ICLR 2025poster
2
citations
#2772

Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations

Xiaolin Sun, Zizhan Zheng

ICLR 2024posterarXiv:2403.04050
2
citations
#2773

Bonsai: Gradient-free Graph Condensation for Node Classification

Mridul Gupta, Samyak Jain, Vansh Ramani et al.

ICLR 2025posterarXiv:2410.17579
2
citations
#2774

Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning

Jingyang Li, Jiachun Pan, Vincent Tan et al.

ICLR 2025posterarXiv:2410.11206
2
citations
#2775

Interference Among First-Price Pacing Equilibria: A Bias and Variance Analysis

Luofeng Liao, Christian Kroer, Sergei Leonenkov et al.

ICLR 2025posterarXiv:2402.07322
2
citations
#2776

Privately Counting Partially Ordered Data

Matthew Joseph, Mónica Ribero, Alexander Yu

ICLR 2025posterarXiv:2410.06881
2
citations
#2777

Efficient Biological Data Acquisition through Inference Set Design

Ihor Neporozhnii, Julien Roy, Emmanuel Bengio et al.

ICLR 2025posterarXiv:2410.19631
2
citations
#2778

L-WISE: Boosting Human Visual Category Learning Through Model-Based Image Selection and Enhancement

Morgan B Talbot, Gabriel Kreiman, James DiCarlo et al.

ICLR 2025oralarXiv:2412.09765
2
citations
#2779

TimeInf: Time Series Data Contribution via Influence Functions

Yizi Zhang, Jingyan Shen, Xiaoxue Xiong et al.

ICLR 2025oralarXiv:2407.15247
2
citations
#2780

Matrix Product Sketching via Coordinated Sampling

Majid Daliri, Juliana Freire, Danrong Li et al.

ICLR 2025posterarXiv:2501.17836
2
citations
#2781

Learning Spatiotemporal Dynamical Systems from Point Process Observations

Valerii Iakovlev, Harri Lähdesmäki

ICLR 2025oralarXiv:2406.00368
2
citations
#2782

A Generalist Hanabi Agent

Arjun V Sudhakar, Hadi Nekoei, Mathieu Reymond et al.

ICLR 2025posterarXiv:2503.14555
2
citations
#2783

Diffusion Sampling with Momentum for Mitigating Divergence Artifacts

Suttisak Wisadwongsa, Worameth Chinchuthakun, Pramook Khungurn et al.

ICLR 2024poster
2
citations
#2784

Principled Architecture-aware Scaling of Hyperparameters

Wuyang Chen, Junru Wu, Zhangyang Wang et al.

ICLR 2024posterarXiv:2402.17440
2
citations
#2785

AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations

Pei Zhou, Ruizhe Liu, Qian Luo et al.

ICLR 2025poster
2
citations
#2786

Multimodal Lego: Model Merging and Fine-Tuning Across Topologies and Modalities in Biomedicine

Konstantin Hemker, Nikola Simidjievski, Mateja Jamnik

ICLR 2025posterarXiv:2405.19950
2
citations
#2787

No Location Left Behind: Measuring and Improving the Fairness of Implicit Representations for Earth Data

Daniel Cai, Randall Balestriero

ICLR 2025posterarXiv:2502.06831
2
citations
#2788

Rethinking the Uniformity Metric in Self-Supervised Learning

Xianghong Fang, Jian Li, Qiang Sun et al.

ICLR 2024posterarXiv:2403.00642
2
citations
#2789

Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions

Sagar Shrestha, Xiao Fu

ICLR 2025posterarXiv:2411.03755
2
citations
#2790

Generalizable Motion Planning via Operator Learning

Sharath Matada, Luke Bhan, Yuanyuan Shi et al.

ICLR 2025posterarXiv:2410.17547
2
citations
#2791

Efficient Sparse PCA via Block-Diagonalization

Alberto Del Pia, Dekun Zhou, Yinglun Zhu

ICLR 2025posterarXiv:2410.14092
2
citations
#2792

Improving Generalization and Robustness in SNNs Through Signed Rate Encoding and Sparse Encoding Attacks

Bhaskar Mukhoty, Hilal AlQuabeh, Bin Gu

ICLR 2025poster
2
citations
#2793

A Conditional Independence Test in the Presence of Discretization

Boyang Sun, Yu Yao, Guang-Yuan Hao et al.

ICLR 2025posterarXiv:2404.17644
2
citations
#2794

ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences

Yuxin Wang, Xiaomeng Zhu, Weimin Lyu et al.

ICLR 2025posterarXiv:2411.05172
2
citations
#2795

Algorithmic Stability Based Generalization Bounds for Adversarial Training

Runzhi Tian, Yongyi Mao

ICLR 2025poster
2
citations
#2796

Towards a learning theory of representation alignment

Francesco Maria Gabriele Insulla, Shuo Huang, Lorenzo Rosasco

ICLR 2025posterarXiv:2502.14047
2
citations
#2797

Do Mice Grok? Glimpses of Hidden Progress in Sensory Cortex

Tanishq Kumar, Blake Bordelon, Cengiz Pehlevan et al.

ICLR 2025poster
1
citations
#2798

RAPPER: Reinforced Rationale-Prompted Paradigm for Natural Language Explanation in Visual Question Answering

Kai-Po Chang, Chi-Pin Huang, Wei-Yuan Cheng et al.

ICLR 2024poster
1
citations
#2799

Associative memory and dead neurons

Vladimir Fanaskov, Ivan Oseledets

ICLR 2025posterarXiv:2410.13866
1
citations
#2800

Dynamic Assortment Selection and Pricing with Censored Preference Feedback

Jung-hun Kim, Min-hwan Oh

ICLR 2025posterarXiv:2504.02324
1
citations