Most Cited ICLR "agent" Papers

6,124 papers found • Page 28 of 31

#5401

Local Composite Saddle Point Optimization

Site Bai, Brian Bullins

ICLR 2024
#5402

ASID: Active Exploration for System Identification in Robotic Manipulation

Marius Memmel, Andrew Wagenmaker, Chuning Zhu et al.

ICLR 2024arXiv:2404.12308
#5403

Simple Hierarchical Planning with Diffusion

Chang Chen, Fei Deng, Kenji Kawaguchi et al.

ICLR 2024oralarXiv:2401.02644
#5404

sRGB Real Noise Modeling via Noise-Aware Sampling with Normalizing Flows

Dongjin Kim, Donggoo Jung, Sungyong Baik et al.

ICLR 2024
#5405

Improving Generalization of Alignment with Human Preferences through Group Invariant Learning

Rui Zheng, Wei Shen, Yuan Hua et al.

ICLR 2024spotlightarXiv:2310.11971
#5406

Dynamic Sparse Training with Structured Sparsity

Mike Lasby, Anna Golubeva, Utku Evci et al.

ICLR 2024arXiv:2305.02299
#5407

DENEVIL: TOWARDS DECIPHERING AND NAVIGATING THE ETHICAL VALUES OF LARGE LANGUAGE MODELS VIA INSTRUCTION LEARNING

Shitong Duan, Xiaoyuan Yi, Peng Zhang et al.

ICLR 2024oralarXiv:2310.11053
#5408

Robustifying State-space Models for Long Sequences via Approximate Diagonalization

Annan Yu, Arnur Nigmetov, Dmitriy Morozov et al.

ICLR 2024spotlightarXiv:2310.01698
#5409

Generative Adversarial Equilibrium Solvers

Denizalp Goktas, David Parkes, Ian Gemp et al.

ICLR 2024arXiv:2302.06607
#5410

Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models

Shuai Zhao, Xiaohan Wang, Linchao Zhu et al.

ICLR 2024arXiv:2305.18010
#5411

FedImpro: Measuring and Improving Client Update in Federated Learning

Zhenheng Tang, Yonggang Zhang, Shaohuai Shi et al.

ICLR 2024arXiv:2402.07011
#5412

Improving LoRA in Privacy-preserving Federated Learning

Youbang Sun, Zitao Li, Yaliang Li et al.

ICLR 2024arXiv:2403.12313
#5413

Efficient Inverse Multiagent Learning

Denizalp Goktas, Amy Greenwald, Sadie Zhao et al.

ICLR 2024spotlightarXiv:2502.14160
#5414

Neural Neighborhood Search for Multi-agent Path Finding

Zhongxia Yan, Cathy Wu

ICLR 2024oral
#5415

Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models

Shuai Fu, Shuai Fu, Xiequn Wang et al.

ICLR 2024spotlightarXiv:2408.13979
#5416

FasterViT: Fast Vision Transformers with Hierarchical Attention

Ali Hatamizadeh, Greg Heinrich, Hongxu Yin et al.

ICLR 2024arXiv:2306.06189
#5417

C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion

Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee et al.

ICLR 2024arXiv:2403.14119
#5418

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Jingxiang Sun, Bo Zhang, Ruizhi Shao et al.

ICLR 2024arXiv:2310.16818
#5419

Plugin estimators for selective classification with out-of-distribution detection

Harikrishna Narasimhan, Aditya Krishna Menon, Wittawat Jitkrittum et al.

ICLR 2024arXiv:2301.12386
#5420

P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering

Chuyu Zhang, Hui Ren, Xuming He

ICLR 2024arXiv:2401.09266
#5421

Adaptive Stochastic Gradient Algorithm for Black-box Multi-Objective Learning

Feiyang YE, YUEMING LYU, Xuehao Wang et al.

ICLR 2024
#5422

Intriguing Properties of Data Attribution on Diffusion Models

Xiaosen Zheng, Tianyu Pang, Chao Du et al.

ICLR 2024arXiv:2311.00500
#5423

How Does Unlabeled Data Provably Help Out-of-Distribution Detection?

Xuefeng Du, Zhen Fang, Ilias Diakonikolas et al.

ICLR 2024arXiv:2402.03502
#5424

GlucoBench: Curated List of Continuous Glucose Monitoring Datasets with Prediction Benchmarks

Renat Sergazinov, Elizabeth Chun, Valeriya Rogovchenko et al.

ICLR 2024arXiv:2410.05780
#5425

Look, Remember and Reason: Grounded Reasoning in Videos with Language Models

Apratim Bhattacharyya, Sunny Panchal, Reza Pourreza et al.

ICLR 2024oralarXiv:2306.17778
#5426

Pushing Boundaries: Mixup's Influence on Neural Collapse

Quinn Fisher, Haoming Meng, Vardan Papyan

ICLR 2024arXiv:2402.06171
#5427

LLCP: Learning Latent Causal Processes for Reasoning-based Video Question Answer

Guangyi Chen, Yuke Li, Xiao Liu et al.

ICLR 2024oral
#5428

Implicit regularization of deep residual networks towards neural ODEs

Pierre Marion, Yu-Han Wu, Michael Sander et al.

ICLR 2024spotlightarXiv:2309.01213
#5429

Provably Efficient UCB-type Algorithms For Learning Predictive State Representations

Ruiquan Huang, Yingbin Liang, Jing Yang

ICLR 2024arXiv:2307.00405
#5430

Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models

Shikun Sun, Longhui Wei, Zhicai Wang et al.

ICLR 2024
#5431

Compressing Latent Space via Least Volume

Qiuyi Chen, Mark Fuge

ICLR 2024
#5432

CoLiDE: Concomitant Linear DAG Estimation

Seyed Saman Saboksayr, Gonzalo Mateos, Mariano Tepper

ICLR 2024arXiv:2310.02895
#5433

Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory

Yiting Chen, Zhanpeng Zhou, Junchi Yan

ICLR 2024arXiv:2310.06756
#5434

A Unified Framework for Bayesian Optimization under Contextual Uncertainty

Sebastian Shenghong Tay, Chuan-Sheng Foo, Daisuke Urano et al.

ICLR 2024
#5435

Learning Large DAGs is Harder than you Think: Many Losses are Minimal for the Wrong DAG

Jonas Seng, Matej Zečević, Devendra Singh Dhami et al.

ICLR 2024
#5436

Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization

Joe Benton, Valentin De Bortoli, Arnaud Doucet et al.

ICLR 2024spotlightarXiv:2308.03686
#5437

Active Retrosynthetic Planning Aware of Route Quality

Luotian Yuan, Yemin Yu, Ying Wei et al.

ICLR 2024
#5438

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency

Bowen Song, Soo Min Kwon, Zecheng Zhang et al.

ICLR 2024spotlightarXiv:2307.08123
#5439

Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model

Yinan Zheng, Jianxiong Li, Dongjie Yu et al.

ICLR 2024arXiv:2401.10700
#5440

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy

Pingzhi Li, Zhenyu Zhang, Prateek Yadav et al.

ICLR 2024spotlightarXiv:2310.01334
#5441

Non-Exchangeable Conformal Risk Control

António Farinhas, Chrysoula Zerva, Dennis Ulmer et al.

ICLR 2024arXiv:2310.01262
#5442

Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM

Eliya Nachmani, Alon Levkovitch, Roy Hirsch et al.

ICLR 2024arXiv:2305.15255
#5443

USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields

Moyang Li, Peng Wang, Lingzhe Zhao et al.

ICLR 2024arXiv:2310.02687
#5444

Are Bert Family Good Instruction Followers? A Study on Their Potential And Limitations

yisheng xiao, Juntao Li, Zechen Sun et al.

ICLR 2024
#5445

Synergistic Patch Pruning for Vision Transformer: Unifying Intra- & Inter-Layer Patch Importance

Yuyao Zhang, Lan Wei, Nikolaos Freris

ICLR 2024
#5446

Early Stopping Against Label Noise Without Validation Data

Suqin Yuan, Lei Feng, Tongliang Liu

ICLR 2024arXiv:2502.07551
#5447

Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning

Joey Hejna, Rafael Rafailov, Harshit Sikchi et al.

ICLR 2024
#5448

Unknown Domain Inconsistency Minimization for Domain Generalization

Seungjae Shin, HeeSun Bae, Byeonghu Na et al.

ICLR 2024arXiv:2403.07329
#5449

Enhancing Transfer Learning with Flexible Nonparametric Posterior Sampling

Hyungi Lee, Giung Nam, Edwin Fong et al.

ICLR 2024arXiv:2403.07282
#5450

Finite Scalar Quantization: VQ-VAE Made Simple

Fabian Mentzer, David Minnen, Eirikur Agustsson et al.

ICLR 2024arXiv:2309.15505
#5451

Fixed-Budget Differentially Private Best Arm Identification

Zhirui Chen, P. N. Karthik, Yeow Meng Chee et al.

ICLR 2024arXiv:2401.09073
#5452

Rethinking Backdoor Attacks on Dataset Distillation: A Kernel Method Perspective

Ming-Yu Chung, Sheng-Yen Chou, Chia-Mu Yu et al.

ICLR 2024arXiv:2311.16646
#5453

Neural Contractive Dynamical Systems

Hadi Beik Mohammadi, Søren Hauberg, Georgios Arvanitidis et al.

ICLR 2024spotlightarXiv:2401.09352
#5454

Energy-based Automated Model Evaluation

Ru Peng, Heming Zou, Haobo Wang et al.

ICLR 2024arXiv:2401.12689
#5455

FreeDyG: Frequency Enhanced Continuous-Time Dynamic Graph Model for Link Prediction

Yuxing Tian, Yiyan Qi, Fan Guo

ICLR 2024oral
#5456

SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution

Wenlong Zhang, Xiaohui Li, Xiangyu Chen et al.

ICLR 2024spotlightarXiv:2309.03020
#5457

Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games

Stephen McAleer, John Banister Lanier, Kevin A. Wang et al.

ICLR 2024
#5458

Towards Robust and Efficient Cloud-Edge Elastic Model Adaptation via Selective Entropy Distillation

Yaofo Chen, Shuaicheng Niu, Yaowei Wang et al.

ICLR 2024arXiv:2402.17316
#5459

Polynormer: Polynomial-Expressive Graph Transformer in Linear Time

Chenhui Deng, Zichao Yue, Zhiru Zhang

ICLR 2024arXiv:2403.01232
#5460

Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning

Mustafa Shukor, Alexandre Rame, Corentin Dancette et al.

ICLR 2024arXiv:2310.00647
#5461

A Differentially Private Clustering Algorithm for Well-Clustered Graphs

Weiqiang He, Hendrik Fichtenberger, Pan Peng

ICLR 2024arXiv:2403.14332
#5462

The Trickle-down Impact of Reward Inconsistency on RLHF

Lingfeng Shen, Lingfeng Shen, Sihao Chen et al.

ICLR 2024
#5463

Contrastive Learning is Spectral Clustering on Similarity Graph

Zhiquan Tan, Yifan Zhang, Jingqin Yang et al.

ICLR 2024arXiv:2303.15103
#5464

Better Neural PDE Solvers Through Data-Free Mesh Movers

Peiyan Hu, Yue Wang, Zhi-Ming Ma

ICLR 2024arXiv:2312.05583
#5465

Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency

Yannis Kalantidis, Mert Bulent SARIYILDIZ, Rafael Rezende et al.

ICLR 2024arXiv:2402.09237
#5466

Memorization Capacity of Multi-Head Attention in Transformers

Sadegh Mahdavi, Renjie Liao, Christos Thrampoulidis

ICLR 2024spotlightarXiv:2306.02010
#5467

LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models

Gunho Park, baeseong park, Minsub Kim et al.

ICLR 2024arXiv:2206.09557
#5468

Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts

Ruipeng Zhang, Ziqing Fan, Jiangchao Yao et al.

ICLR 2024arXiv:2405.18861
#5469

Adaptive Retrieval and Scalable Indexing for k-NN Search with Cross-Encoders

Nishant Yadav, Nicholas Monath, Manzil Zaheer et al.

ICLR 2024arXiv:2405.03651
#5470

Enhancing Group Fairness in Online Settings Using Oblique Decision Forests

Somnath Basu Roy Chowdhury, Nicholas Monath, Ahmad Beirami et al.

ICLR 2024spotlightarXiv:2310.11401
#5471

True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning

Weihao Tan, Wentao Zhang, Shanqi Liu et al.

ICLR 2024
#5472

Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

Jiawei Ge, Shange Tang, Jianqing Fan et al.

ICLR 2024arXiv:2311.15961
#5473

A Sublinear Adversarial Training Algorithm

Yeqi Gao, Lianke Qin, Zhao Song et al.

ICLR 2024arXiv:2208.05395
#5474

PhyloGFN: Phylogenetic inference with generative flow networks

MING YANG ZHOU, Zichao Yan, Elliot Layne et al.

ICLR 2024arXiv:2310.08774
#5475

Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images

Kuofeng Gao, Yang Bai, Jindong Gu et al.

ICLR 2024oralarXiv:2401.11170
#5476

Performance Gaps in Multi-view Clustering under the Nested Matrix-Tensor Model

Hugo Lebeau, Mohamed El Amine Seddik, José Henrique Goulart

ICLR 2024arXiv:2402.10677
#5477

ZeRO++: Extremely Efficient Collective Communication for Large Model Training

Guanhua Wang, Heyang Qin, Sam Jacobs et al.

ICLR 2024
#5478

Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications

Paul Liang, Chun Kai Ling, Yun Cheng et al.

ICLR 2024arXiv:2306.04539
#5479

Dual Associated Encoder for Face Restoration

Yu-Ju Tsai, Yu-Lun Liu, Lu Qi et al.

ICLR 2024arXiv:2308.07314
#5480

CausalLM is not optimal for in-context learning

Nan Ding, Tomer Levinboim, Jialin Wu et al.

ICLR 2024arXiv:2308.06912
#5481

An Unforgeable Publicly Verifiable Watermark for Large Language Models

Aiwei Liu, Leyi Pan, Xuming Hu et al.

ICLR 2024arXiv:2307.16230
#5482

Does Writing with Language Models Reduce Content Diversity?

Vishakh Padmakumar, He He

ICLR 2024arXiv:2309.05196
#5483

Class Probability Matching with Calibrated Networks for Label Shift Adaption

Hongwei Wen, Annika Betken, Hanyuan Hang

ICLR 2024
#5484

Few-shot Hybrid Domain Adaptation of Image Generator

Hengjia Li, Yang Liu, Linxuan Xia et al.

ICLR 2024arXiv:2310.19378
#5485

Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning

Yeda Song, Dongwook Lee, Gunhee Kim

ICLR 2024oralarXiv:2404.04682
#5486

Adaptive Rational Activations to Boost Deep Reinforcement Learning

Quentin Delfosse, Patrick Schramowski, Martin Mundt et al.

ICLR 2024spotlightarXiv:2102.09407
#5487

InterpGNN: Understand and Improve Generalization Ability of Transdutive GNNs through the Lens of Interplay between Train and Test Nodes

Jiawei Sun, Kailai Li, Ruoxin Chen et al.

ICLR 2024
#5488

A Progressive Training Framework for Spiking Neural Networks with Learnable Multi-hierarchical Model

Zecheng Hao, Xinyu Shi, Zihan Huang et al.

ICLR 2024
#5489

Memory-Assisted Sub-Prototype Mining for Universal Domain Adaptation

Yuxiang (YU-HSIANG) LAI, Yi Zhou, Xinghong Liu et al.

ICLR 2024arXiv:2310.05453
#5490

Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments

Yang Yang, Wenhai Wang, Zhe Chen et al.

ICLR 2024spotlightarXiv:2403.13803
#5491

Ito Diffusion Approximation of Universal Ito Chains for Sampling, Optimization and Boosting

Aleksei Ustimenko, Aleksandr Beznosikov

ICLR 2024arXiv:2310.06081
#5492

OWL: A Large Language Model for IT Operations

Hongcheng Guo, Jian Yang, Jiaheng Liu et al.

ICLR 2024arXiv:2309.09298
#5493

Towards Meta-Pruning via Optimal Transport

Alexander Theus, Olin Geimer, Friedrich Wicke et al.

ICLR 2024spotlightarXiv:2402.07839
#5494

Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

Pablo Pernías, Dominic Rampas, Mats L. Richter et al.

ICLR 2024
#5495

REFACTOR: Learning to Extract Theorems from Proofs

Jin Zhou, Yuhuai Wu, Qiyang Li et al.

ICLR 2024arXiv:2402.17032
#5496

From Posterior Sampling to Meaningful Diversity in Image Restoration

Noa Cohen, Hila Manor, Yuval Bahat et al.

ICLR 2024arXiv:2310.16047
#5497

Transformer Fusion with Optimal Transport

Moritz Imfeld, Jacopo Graldi, Marco Giordano et al.

ICLR 2024arXiv:2310.05719
#5498

Dynamic Neighborhood Construction for Structured Large Discrete Action Spaces

Fabian Akkerman, Julius Luy, Wouter van Heeswijk et al.

ICLR 2024arXiv:2305.19891
#5499

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

Lianmin Zheng, Wei-Lin Chiang, Ying Sheng et al.

ICLR 2024spotlightarXiv:2309.11998
#5500

A Recipe for Improved Certifiable Robustness

Kai Hu, Klas Leino, Zifan Wang et al.

ICLR 2024arXiv:2310.02513
#5501

Sparse MoE with Language Guided Routing for Multilingual Machine Translation

Xinyu Zhao, Xuxi Chen, Yu Cheng et al.

ICLR 2024
#5502

Neural Architecture Retrieval

Xiaohuan Pei, Yanxi Li, Minjing Dong et al.

ICLR 2024arXiv:2307.07919
#5503

Neural SDF Flow for 3D Reconstruction of Dynamic Scenes

wei mao, Richard Hartley, Mathieu Salzmann et al.

ICLR 2024
#5504

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

Yifu Yuan, Jianye HAO, Yi Ma et al.

ICLR 2024arXiv:2402.02423
#5505

Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model

Zihan Zhong, Zhiqiang Tang, Tong He et al.

ICLR 2024arXiv:2401.17868
#5506

ADOPD: A Large-Scale Document Page Decomposition Dataset

Jiuxiang Gu, Xiangxi Shi, Jason Kuen et al.

ICLR 2024
#5507

Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity

Emmeran Johnson, Ciara Pike-Burke, Patrick Rebeschini

ICLR 2024arXiv:2310.01616
#5508

Compressing LLMs: The Truth is Rarely Pure and Never Simple

AJAY JAISWAL, Zhe Gan, Xianzhi Du et al.

ICLR 2024arXiv:2310.01382
#5509

To the Cutoff... and Beyond? A Longitudinal Perspective on LLM Data Contamination

Manley Roberts, Himanshu Thakur, Christine Herlihy et al.

ICLR 2024
#5510

ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search

Yuchen Zhuang, Xiang Chen, Tong Yu et al.

ICLR 2024arXiv:2310.13227
#5511

LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models

Zecheng Tang, Zecheng Tang, Chenfei Wu et al.

ICLR 2024arXiv:2309.09506
#5512

Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding

Alizée Pace, Hugo Yèche, Bernhard Schoelkopf et al.

ICLR 2024arXiv:2306.01157
#5513

Learning Mean Field Games on Sparse Graphs: A Hybrid Graphex Approach

Christian Fabian, Kai Cui, Heinz Koeppl

ICLR 2024arXiv:2401.12686
#5514

P2Seg: Pointly-supervised Segmentation via Mutual Distillation

Zipeng Wang, Xuehui Yu, Xumeng Han et al.

ICLR 2024arXiv:2401.09709
#5515

Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning

Mingde Zhao, Safa Alver, Harm Seijen et al.

ICLR 2024oralarXiv:2310.00229
#5516

Label-free Node Classification on Graphs with Large Language Models (LLMs)

Zhikai Chen, Haitao Mao, Hongzhi Wen et al.

ICLR 2024arXiv:2310.04668
#5517

Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions

Satwik Bhattamishra, Arkil Patel, Phil Blunsom et al.

ICLR 2024arXiv:2310.03016
#5518

Function-space Parameterization of Neural Networks for Sequential Learning

Aidan Scannell, Riccardo Mereu, Paul Chang et al.

ICLR 2024arXiv:2403.10929
#5519

One-hot Generalized Linear Model for Switching Brain State Discovery

Chengrui Li, Soon Ho Kim, Chris Rodgers et al.

ICLR 2024oralarXiv:2310.15263
#5520

Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs

Ilan Naiman, N. Benjamin Erichson, Pu Ren et al.

ICLR 2024arXiv:2310.02619
#5521

Annealing Self-Distillation Rectification Improves Adversarial Training

Yu-Yu Wu, Hung-Jui Wang, Shang-Tse Chen

ICLR 2024arXiv:2305.12118
#5522

Boundary Denoising for Video Activity Localization

Mengmeng Xu, Mattia Soldan, Jialin Gao et al.

ICLR 2024oral
#5523

Out-of-Distribution Detection by Leveraging Between-Layer Transformation Smoothness

Fran Jelenić, Josip Jukić, Martin Tutek et al.

ICLR 2024arXiv:2310.02832
#5524

On Trajectory Augmentations for Off-Policy Evaluation

Ge Gao, Qitong Gao, Xi Yang et al.

ICLR 2024
#5525

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

Lorenzo Pacchiardi, Alex Chan, Sören Mindermann et al.

ICLR 2024arXiv:2309.15840
#5526

Alt-Text with Context: Improving Accessibility for Images on Twitter

Nikita Srivatsan, Sofia Samaniego, Omar Florez et al.

ICLR 2024arXiv:2305.14779
#5527

Combinatorial Bandits for Maximum Value Reward Function under Value-Index Feedback

Yiliu Wang, Wei Chen, Milan Vojnovic

ICLR 2024
#5528

Reward-Free Curricula for Training Robust World Models

Marc Rigter, Minqi Jiang, Ingmar Posner

ICLR 2024arXiv:2306.09205
#5529

Yet Another ICU Benchmark: A Flexible Multi-Center Framework for Clinical ML

Robin van de Water, Hendrik Schmidt, Paul Elbers et al.

ICLR 2024oralarXiv:2306.05109
#5530

PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Junsong Chen, Jincheng YU, Chongjian GE et al.

ICLR 2024spotlight
#5531

Convergence of Bayesian Bilevel Optimization

Shi Fu, Fengxiang He, Xinmei Tian et al.

ICLR 2024spotlight
#5532

Functional Interpolation for Relative Positions improves Long Context Transformers

Shanda Li, Chong You, Guru Guruganesh et al.

ICLR 2024arXiv:2310.04418
#5533

Consistency Training with Learnable Data Augmentation for Graph Anomaly Detection with Limited Supervision

Nan Chen, Zemin Liu, Bryan Hooi et al.

ICLR 2024spotlight
#5534

Rotation Has Two Sides: Evaluating Data Augmentation for Deep One-class Classification

Guodong Wang, Yunhong Wang, Xiuguo Bao et al.

ICLR 2024spotlight
#5535

Small-scale proxies for large-scale Transformer training instabilities

Mitchell Wortsman, Peter Liu, Lechao Xiao et al.

ICLR 2024arXiv:2309.14322
#5536

Lion Secretly Solves a Constrained Optimization: As Lyapunov Predicts

Lizhang Chen, Bo Liu, Kaizhao Liang et al.

ICLR 2024spotlight
#5537

Improving equilibrium propagation without weight symmetry through Jacobian homeostasis

Axel Laborieux, Friedemann Zenke

ICLR 2024arXiv:2309.02214
#5538

Symmetric Single Index Learning

Aaron Zweig, Joan Bruna

ICLR 2024arXiv:2310.02117
#5539

Node2ket: Efficient High-Dimensional Network Embedding in Quantum Hilbert Space

Hao Xiong, Yehui Tang, Yunlin He et al.

ICLR 2024
#5540

Towards LLM4QPE: Unsupervised Pretraining of Quantum Property Estimation and A Benchmark

Yehui Tang, Hao Xiong, Nianzu Yang et al.

ICLR 2024spotlight
#5541

Object-Aware Inversion and Reassembly for Image Editing

Zhen Yang, Ganggui Ding, Wen Wang et al.

ICLR 2024arXiv:2310.12149
#5542

Analysis of Learning a Flow-based Generative Model from Limited Sample Complexity

Hugo Cui, Florent Krzakala, Eric Vanden-Eijnden et al.

ICLR 2024arXiv:2310.03575
#5543

What's In My Big Data?

Yanai Elazar, Akshita Bhagia, Ian Magnusson et al.

ICLR 2024spotlightarXiv:2310.20707
#5544

Minimum width for universal approximation using ReLU networks on compact domain

Namjun Kim, Chanho Min, Sejun Park

ICLR 2024arXiv:2309.10402
#5545

Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning

Na Li, Yuchen Jiao, Hangguan Shan et al.

ICLR 2024arXiv:2512.00351
#5546

Retro-fallback: retrosynthetic planning in an uncertain world

Austin Tripp, Krzysztof Maziarz, Sarah Lewis et al.

ICLR 2024arXiv:2310.09270
#5547

Generative Human Motion Stylization in Latent Space

chuan guo, Yuxuan Mu, Xinxin Zuo et al.

ICLR 2024arXiv:2401.13505
#5548

TUVF: Learning Generalizable Texture UV Radiance Fields

An-Chieh Cheng, Xueting Li, Sifei Liu et al.

ICLR 2024arXiv:2305.03040
#5549

Achieving the Pareto Frontier of Regret Minimization and Best Arm Identification in Multi-Armed Bandits

Wang Chi Cheung, Vincent Tan, Zixin Zhong

ICLR 2024arXiv:2110.08627
#5550

Fast Imitation via Behavior Foundation Models

Matteo Pirotta, Andrea Tirinzoni, Ahmed Touati et al.

ICLR 2024oral
#5551

MEND: Meta Demonstration Distillation for Efficient and Effective In-Context Learning

Yichuan Li, Xiyao Ma, Sixing Lu et al.

ICLR 2024arXiv:2403.06914
#5552

Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction

Renjie Pi, Lewei Yao, Jianhua Han et al.

ICLR 2024
#5553

Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling

Aadirupa Saha, Branislav Kveton

ICLR 2024arXiv:2303.09033
#5554

Tool-Augmented Reward Modeling

Lei Li, Yekun Chai, Shuohuan Wang et al.

ICLR 2024spotlightarXiv:2310.01045
#5555

Procedural Fairness Through Decoupling Objectionable Data Generating Components

Zeyu Tang, Jialu Wang, Yang Liu et al.

ICLR 2024spotlightarXiv:2311.14688
#5556

Generalized Policy Iteration using Tensor Approximation for Hybrid Control

Suhan Shetty, Teng Xue, Sylvain Calinon

ICLR 2024spotlight
#5557

Scaling Supervised Local Learning with Augmented Auxiliary Networks

Chenxiang Ma, Jibin Wu, Chenyang Si et al.

ICLR 2024arXiv:2402.17318
#5558

Diffusion Models for Multi-Task Generative Modeling

Changyou Chen, Han Ding, Bunyamin Sisman et al.

ICLR 2024
#5559

Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata- and Data-driven Reasoning

Ahmed Abdulaal, Adamos Hadjivasiliou, Nina Montaña-Brown et al.

ICLR 2024
#5560

FedCompass: Efficient Cross-Silo Federated Learning on Heterogeneous Client Devices Using a Computing Power-Aware Scheduler

Zilinghan Li, Pranshu Chaturvedi, Shilan He et al.

ICLR 2024arXiv:2309.14675
#5561

Bridging State and History Representations: Understanding Self-Predictive RL

Tianwei Ni, Benjamin Eysenbach, Erfan Seyedsalehi et al.

ICLR 2024arXiv:2401.08898
#5562

Latent 3D Graph Diffusion

Yuning You, Ruida Zhou, Jiwoong Park et al.

ICLR 2024
#5563

State Representation Learning Using an Unbalanced Atlas

Li Meng, Morten Goodwin, Anis Yazidi et al.

ICLR 2024oralarXiv:2305.10267
#5564

Scalable Monotonic Neural Networks

Hyunho Kim, Jong-Seok Lee

ICLR 2024
#5565

Towards a statistical theory of data selection under weak supervision

Germain Kolossov, Andrea Montanari, Pulkit Tandon

ICLR 2024arXiv:2309.14563
#5566

BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction

Jiangmeng Li, Fei Song, Yifan Jin et al.

ICLR 2024arXiv:2401.14166
#5567

A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks

Tommaso Salvatori, Yuhang Song, Yordan Yordanov et al.

ICLR 2024oralarXiv:2212.00720
#5568

NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling

Kun Wang, Hao Wu, Yifan Duan et al.

ICLR 2024oral
#5569

Rethinking Information-theoretic Generalization: Loss Entropy Induced PAC Bounds

Yuxin Dong, Tieliang Gong, Hong Chen et al.

ICLR 2024
#5570

Decoupling Weighing and Selecting for Integrating Multiple Graph Pre-training Tasks

Tianyu Fan, Lirong Wu, Yufei Huang et al.

ICLR 2024arXiv:2403.01400
#5571

Democratizing Fine-grained Visual Recognition with Large Language Models

Mingxuan Liu, Subhankar Roy, Wenjing Li et al.

ICLR 2024arXiv:2401.13837
#5572

GAFormer: Enhancing Timeseries Transformers Through Group-Aware Embeddings

Jingyun Xiao, Ran Liu, Eva Dyer

ICLR 2024oral
#5573

DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text

Xianjun Yang, Wei Cheng, Yue Wu et al.

ICLR 2024arXiv:2305.17359
#5574

SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models

Xin Zhang, Dong Zhang, Shimin Li et al.

ICLR 2024
#5575

Prompt Learning with Quaternion Networks

Boya Shi, Zhengqin Xu, Shuai Jia et al.

ICLR 2024
#5576

Prioritized Soft Q-Decomposition for Lexicographic Reinforcement Learning

Finn Rietz, Erik Schaffernicht, Stefan Heinrich et al.

ICLR 2024arXiv:2310.02360
#5577

A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis

DIPANJYOTI PAUL, Arpita Chowdhury, Xinqi Xiong et al.

ICLR 2024arXiv:2311.04157
#5578

UC-NERF: Neural Radiance Field for Under-Calibrated Multi-View Cameras in Autonomous Driving

Kai Cheng, Xiaoxiao Long, Wei Yin et al.

ICLR 2024oralarXiv:2311.16945
#5579

LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition

Lingfeng Liu, Dong Ni, Hangjie Yuan

ICLR 2024arXiv:2403.01412
#5580

Facing the Elephant in the Room: Visual Prompt Tuning or Full finetuning?

Cheng Han, Qifan Wang, Yiming Cui et al.

ICLR 2024arXiv:2401.12902
#5581

Accelerated Sampling with Stacked Restricted Boltzmann Machines

Jorge Fernandez-de-Cossio-Diaz, Clément Roussel, Simona Cocco et al.

ICLR 2024
#5582

BECLR: Batch Enhanced Contrastive Few-Shot Learning

Stylianos Poulakakis-Daktylidis, Hadi Jamali-Rad

ICLR 2024spotlightarXiv:2402.02444
#5583

DyST: Towards Dynamic Neural Scene Representations on Real-World Videos

Maximilian Seitzer, Sjoerd van Steenkiste, Thomas Kipf et al.

ICLR 2024spotlightarXiv:2310.06020
#5584

Meta-Learning Priors Using Unrolled Proximal Networks

Yilang Zhang, Georgios B Giannakis

ICLR 2024
#5585

A Topological Perspective on Demystifying GNN-Based Link Prediction Performance

Yu Wang, Tong Zhao, Yuying Zhao et al.

ICLR 2024arXiv:2310.04612
#5586

Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech

Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao et al.

ICLR 2024arXiv:2402.16321
#5587

Circumventing Concept Erasure Methods For Text-To-Image Generative Models

Minh Pham, Kelly Marshall, Niv Cohen et al.

ICLR 2024arXiv:2308.01508
#5588

Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation

Wenxuan Zhang, Youssef Mohamed, Bernard Ghanem et al.

ICLR 2024arXiv:2404.12766
#5589

On the Posterior Distribution in Denoising: Application to Uncertainty Quantification

Hila Manor, Tomer Michaeli

ICLR 2024arXiv:2309.13598
#5590

Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks

Yanbo Wang, Jian Liang, Ran He

ICLR 2024arXiv:2402.03124
#5591

Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs

Kaixuan Ji, Qingyue Zhao, Jiafan He et al.

ICLR 2024arXiv:2305.08359
#5592

Ensemble Distillation for Unsupervised Constituency Parsing

Behzad Shayegh, Yanshuai Cao, Xiaodan Zhu et al.

ICLR 2024arXiv:2310.01717
#5593

Approximately Piecewise E(3) Equivariant Point Networks

Matan Atzmon, Jiahui Huang, Francis Williams et al.

ICLR 2024arXiv:2402.08529
#5594

Implicit Maximum a Posteriori Filtering via Adaptive Optimization

Gianluca Bencomo, Jake Snell, Thomas L. Griffiths

ICLR 2024arXiv:2311.10580
#5595

Multi-modal Gaussian Process Variational Autoencoders for Neural and Behavioral Data

Rabia Gondur, Usama Bin Sikandar, Evan Schaffer et al.

ICLR 2024oralarXiv:2310.03111
#5596

ContextRef: Evaluating Referenceless Metrics for Image Description Generation

Elisa Kreiss, Elisa Kreiss, Eric Zelikman et al.

ICLR 2024arXiv:2309.11710
#5597

Diffusion Model for Dense Matching

Jisu Nam, Gyuseong Lee, Seonwoo Kim et al.

ICLR 2024arXiv:2305.19094
#5598

Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies

Haanvid Lee, Tri Wahyu Guntara, Jongmin Lee et al.

ICLR 2024oralarXiv:2405.18792
#5599

On Double Descent in Reinforcement Learning with LSTD and Random Features

David Brellmann, Eloïse Berthier, David Filliat et al.

ICLR 2024oralarXiv:2310.05518
#5600

Functional Bayesian Tucker Decomposition for Continuous-indexed Tensor Data

Shikai Fang, Xin Yu, Zheng Wang et al.

ICLR 2024arXiv:2311.04829