Most Cited 2024 "oversensitivity benchmark" Papers

12,324 papers found • Page 61 of 62

#12001

MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning

Ke Wang, Houxing Ren, Aojun Zhou et al.

ICLR 2024posterarXiv:2310.03731
#12002

BadEdit: Backdooring Large Language Models by Model Editing

Yanzhou Li, Tianlin Li, Kangjie Chen et al.

ICLR 2024posterarXiv:2403.13355
#12003

One For All: Towards Training One Graph Model For All Classification Tasks

Hao Liu, Jiarui Feng, Lecheng Kong et al.

ICLR 2024spotlightarXiv:2310.00149
#12004

Neural Monge Map estimation and its applications

Shaojun Ma, Yongxin Chen, Hao-Min Zhou et al.

ICLR 2024posterarXiv:2106.03812
#12005

Rethinking the Power of Graph Canonization in Graph Representation Learning with Stability

Zehao Dong, Muhan Zhang, Philip Payne et al.

ICLR 2024posterarXiv:2309.00738
#12006

$t^3$-Variational Autoencoder: Learning Heavy-tailed Data with Student's t and Power Divergence

Juno Kim, Jaehyuk Kwon, Mincheol Cho et al.

ICLR 2024posterarXiv:2312.01133
#12007

On the Sample Complexity of Lipschitz Constant Estimation

Stephen Roberts, Julien Huang, Jan-Peter Calliess

ICLR 2024poster
#12008

Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning

Haobo Song, Haobo SONG, Hao Zhao et al.

ICLR 2024posterarXiv:2407.01320
#12009

Image Background Serves as Good Proxy for Out-of-distribution Data

Sen Pei

ICLR 2024posterarXiv:2307.00519
#12010

FedTrans: Client-Transparent Utility Estimation for Robust Federated Learning

Mingkun Yang, Ran Zhu, Qing Wang et al.

ICLR 2024poster
#12011

CLEX: Continuous Length Extrapolation for Large Language Models

Guanzheng Chen, Xin Li, Zaiqiao Meng et al.

ICLR 2024posterarXiv:2310.16450
#12012

Combining Axes Preconditioners through Kronecker Approximation for Deep Learning

Venkata Sai Surya Subramanyam Duvvuri, Fnu Devvrit, Rohan Anil et al.

ICLR 2024poster
#12013

The Update-Equivalence Framework for Decision-Time Planning

Samuel Sokota, Gabriele Farina, David Wu et al.

ICLR 2024posterarXiv:2304.13138
#12014

MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

Xiang Yue, Xingwei Qu, Ge Zhang et al.

ICLR 2024spotlightarXiv:2309.05653
#12015

Inverse Approximation Theory for Nonlinear Recurrent Neural Networks

Shida Wang, Zhong Li, Qianxiao Li

ICLR 2024spotlightarXiv:2305.19190
#12016

Less is More: Fewer Interpretable Region via Submodular Subset Selection

Ruoyu Chen, Hua Zhang, Siyuan Liang et al.

ICLR 2024posterarXiv:2402.09164
#12017

LEMON: Lossless model expansion

Yite Wang, Jiahao Su, Hanlin Lu et al.

ICLR 2024posterarXiv:2310.07999
#12018

Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging

Max Zimmer, Christoph Spiegel, Sebastian Pokutta

ICLR 2024posterarXiv:2306.16788
#12019

Understanding Addition in Transformers

Philip Quirke, Fazl Barez

ICLR 2024posterarXiv:2310.13121
#12020

Deceptive Fairness Attacks on Graphs via Meta Learning

Jian Kang, Yinglong Xia, Ross Maciejewski et al.

ICLR 2024posterarXiv:2310.15653
#12021

Federated Text-driven Prompt Generation for Vision-Language Models

Chen Qiu, Xingyu Li, Chaithanya Kumar Mummadi et al.

ICLR 2024poster
#12022

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

Yi Li, Honghao Lin, David Woodruff

ICLR 2024posterarXiv:2408.08494
#12023

Large Language Model Cascades with Mixture of Thought Representations for Cost-Efficient Reasoning

Murong Yue, Jie Zhao, Min Zhang et al.

ICLR 2024posterarXiv:2310.03094
#12024

PAE: Reinforcement Learning from External Knowledge for Efficient Exploration

Zhe Wu, Haofei Lu, Junliang Xing et al.

ICLR 2024poster
#12025

CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules

Hung Le, Hailin Chen, Amrita Saha et al.

ICLR 2024posterarXiv:2310.08992
#12026

CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

Sreyan Ghosh, Ashish Seth, Sonal Kumar et al.

ICLR 2024posterarXiv:2310.08753
#12027

How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization

Nuoya Xiong, Lijun Ding, Simon Du

ICLR 2024spotlightarXiv:2310.01769
#12028

MT-Ranker: Reference-free machine translation evaluation by inter-system ranking

Ibraheem Muhammad Moosa, Rui Zhang, Wenpeng Yin

ICLR 2024spotlightarXiv:2401.17099
#12029

Parametric Augmentation for Time Series Contrastive Learning

Xu Zheng, Tianchun Wang, Wei Cheng et al.

ICLR 2024oralarXiv:2402.10434
#12030

Towards Robust Out-of-Distribution Generalization Bounds via Sharpness

Yingtian Zou, Kenji Kawaguchi, Yingnan Liu et al.

ICLR 2024spotlightarXiv:2403.06392
#12031

A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models

Enshu Liu, Xuefei Ning, Huazhong Yang et al.

ICLR 2024posterarXiv:2312.07243
#12032

On the Effect of Batch Size in Byzantine-Robust Distributed Learning

Yi-Rui Yang, Chang-Wei Shi, Wu-Jun Li

ICLR 2024poster
#12033

Expressivity of ReLU-Networks under Convex Relaxations

Maximilian Baader, Mark N Müller, Yuhao Mao et al.

ICLR 2024posterarXiv:2311.04015
#12034

DiffusionSat: A Generative Foundation Model for Satellite Imagery

Samar Khanna, Patrick Liu, Linqi Zhou et al.

ICLR 2024oralarXiv:2312.03606
#12035

Denoising Diffusion Bridge Models

Linqi Zhou, Aaron Lou, Samar Khanna et al.

ICLR 2024posterarXiv:2309.16948
#12036

Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning

Xiaoxin He, Xavier Bresson, Thomas Laurent et al.

ICLR 2024posterarXiv:2305.19523
#12037

Causal Inference with Conditional Front-Door Adjustment and Identifiable Variational Autoencoder

Ziqi Xu, Debo Cheng, Jiuyong Li et al.

ICLR 2024posterarXiv:2310.01937
#12038

From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

Irene Cannistraci, Luca Moschella, Marco Fumero et al.

ICLR 2024spotlightarXiv:2310.01211
#12039

Batched Low-Rank Adaptation of Foundation Models

Yeming Wen, Swarat Chaudhuri

ICLR 2024posterarXiv:2312.05677
#12040

GRAPH-CONSTRAINED DIFFUSION FOR END-TO-END PATH PLANNING

DINGYUAN SHI, Yongxin Tong, Zimu Zhou et al.

ICLR 2024poster
#12041

Leveraging Hyperbolic Embeddings for Coarse-to-Fine Robot Design

Heng Dong, Junyu Zhang, Chongjie Zhang

ICLR 2024posterarXiv:2311.00462
#12042

Impact of Computation in Integral Reinforcement Learning for Continuous-Time Control

Wenhan Cao, Wei Pan

ICLR 2024spotlightarXiv:2402.17375
#12043

Generalized Schrödinger Bridge Matching

Guan-Horng Liu, Yaron Lipman, Maximilian Nickel et al.

ICLR 2024posterarXiv:2310.02233
#12044

Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning

Patrik Okanovic, Roger Waleffe, Vasilis Mageirakos et al.

ICLR 2024posterarXiv:2305.18424
#12045

Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation

Tuan Le, Julian Cremer, Frank Noe et al.

ICLR 2024posterarXiv:2309.17296
#12046

Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

Yang Liu, Muzhi Zhu, Hengtao Li et al.

ICLR 2024posterarXiv:2305.13310
#12047

Ferret: Refer and Ground Anything Anywhere at Any Granularity

Haoxuan You, Haotian Zhang, Zhe Gan et al.

ICLR 2024spotlightarXiv:2310.07704
#12048

Demonstration-Regularized RL

Daniil Tiapkin, Denis Belomestny, Daniele Calandriello et al.

ICLR 2024posterarXiv:2310.17303
#12049

Manifold Diffusion Fields

Ahmed Elhag, Ahmed Elhag, Yuyang Wang et al.

ICLR 2024posterarXiv:2305.15586
#12050

Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks

David Bell, Yujie Lu, Shinda Huang et al.

ICLR 2024oral
#12051

PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization

Xinyuan Wang, Chenxi Li, Zhen Wang et al.

ICLR 2024posterarXiv:2310.16427
#12052

How do Language Models Bind Entities in Context?

Jiahai Feng, Jacob Steinhardt

ICLR 2024posterarXiv:2310.17191
#12053

Emergent mechanisms for long timescales depend on training curriculum and affect performance in memory tasks

Sina Khajehabdollahi, Roxana Zeraati, Emmanouil Giannakakis et al.

ICLR 2024oralarXiv:2309.12927
#12054

Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian distributions

Frank Cole, Yulong Lu

ICLR 2024poster
#12055

FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

Yu Tian, Min Shi, Yan Luo et al.

ICLR 2024posterarXiv:2311.02189
#12056

Learning Performance-Improving Code Edits

Alexander Shypula, Aman Madaan, Yimeng Zeng et al.

ICLR 2024spotlightarXiv:2302.07867
#12057

Tensor Programs VI: Feature Learning in Infinite Depth Neural Networks

Greg Yang, Dingli Yu, Chen Zhu et al.

ICLR 2024posterarXiv:2310.02244
#12058

Privately Aligning Language Models with Reinforcement Learning

Fan Wu, Huseyin Inan, Arturs Backurs et al.

ICLR 2024posterarXiv:2310.16960
#12059

Let's Verify Step by Step

Hunter Lightman, Vineet Kosaraju, Yuri Burda et al.

ICLR 2024posterarXiv:2305.20050
#12060

Complete and Efficient Graph Transformers for Crystal Material Property Prediction

Keqiang Yan, Cong Fu, Xiaofeng Qian et al.

ICLR 2024posterarXiv:2403.11857
#12061

Uncertainty Quantification via Stable Distribution Propagation

Felix Petersen, Aashwin Mishra, Hilde Kuehne et al.

ICLR 2024posterarXiv:2402.08324
#12062

Understanding Convergence and Generalization in Federated Learning through Feature Learning Theory

Wei Huang, Ye Shi, Zhongyi Cai et al.

ICLR 2024poster
#12063

LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors

Sheng JIn, Xueying Jiang, Jiaxing Huang et al.

ICLR 2024posterarXiv:2402.04630
#12064

Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches

Lingxuan Wu, Xiao Yang, Yinpeng Dong et al.

ICLR 2024posterarXiv:2404.00540
#12065

$\texttt{NAISR}$: A 3D Neural Additive Model for Interpretable Shape Representation

Yining Jiao, Carlton ZDANSKI, Julia Kimbell et al.

ICLR 2024spotlight
#12066

Towards Offline Opponent Modeling with In-context Learning

Yuheng Jing, Kai Li, Bingyun Liu et al.

ICLR 2024poster
#12067

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo

chenjie cao, xinlin ren, Yanwei Fu

ICLR 2024posterarXiv:2401.11673
#12068

SKILL-MIX: a Flexible and Expandable Family of Evaluations for AI Models

Dingli Yu, Simran Kaur, Arushi Gupta et al.

ICLR 2024posterarXiv:2310.17567
#12069

LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents

Jae-Woo Choi, Youngwoo Yoon, Youngwoo Yoon et al.

ICLR 2024posterarXiv:2402.08178
#12070

Hybrid Directional Graph Neural Network for Molecules

Junyi An, Chao Qu, Zhipeng Zhou et al.

ICLR 2024spotlight
#12071

DAFA: Distance-Aware Fair Adversarial Training

Hyungyu Lee, Saehyung Lee, Hyemi Jang et al.

ICLR 2024posterarXiv:2401.12532
#12072

Fast-ELECTRA for Efficient Pre-training

Chengyu Dong, Liyuan Liu, Hao Cheng et al.

ICLR 2024posterarXiv:2310.07347
#12073

Course Correcting Koopman Representations

Mahan Fathi, Clement Gehring, Jonathan Pilault et al.

ICLR 2024posterarXiv:2310.15386
#12074

Generating Pragmatic Examples to Train Neural Program Synthesizers

Saujas Vaduguru, Daniel Fried, Yewen Pu

ICLR 2024posterarXiv:2311.05740
#12075

Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization

Weiyang Liu, Zeju Qiu, Yao Feng et al.

ICLR 2024posterarXiv:2311.06243
#12076

Learning with Language-Guided State Abstractions

Andi Peng, Ilia Sucholutsky, Belinda Li et al.

ICLR 2024posterarXiv:2402.18759
#12077

Successor Heads: Recurring, Interpretable Attention Heads In The Wild

Rhys Gould, Euan Ong, George Ogden et al.

ICLR 2024posterarXiv:2312.09230
#12078

On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning

Rohan Subramani, Marcus Williams, Max Heitmann et al.

ICLR 2024oralarXiv:2310.11840
#12079

Latent Representation and Simulation of Markov Processes via Time-Lagged Information Bottleneck

Marco Federici, Patrick Forré, Ryota Tomioka et al.

ICLR 2024oralarXiv:2309.07200
#12080

DreamClean: Restoring Clean Image Using Deep Diffusion Prior

Jie Xiao, Ruili Feng, Han Zhang et al.

ICLR 2024poster
#12081

Enhancing Neural Subset Selection: Integrating Background Information into Set Representations

Binghui Xie, Yatao Bian, Kaiwen Zhou et al.

ICLR 2024posterarXiv:2402.03139
#12082

Probabilistic Adaptation of Black-Box Text-to-Video Models

Sherry Yang, Yilun Du, Bo Dai et al.

ICLR 2024poster
#12083

Partitioning Message Passing for Graph Fraud Detection

Wei Zhuo, Zemin Liu, Bryan Hooi et al.

ICLR 2024posterarXiv:2412.00020
#12084

Meta-Evolve: Continuous Robot Evolution for One-to-many Policy Transfer

Xingyu Liu, Deepak Pathak, DING ZHAO

ICLR 2024posterarXiv:2405.03534
#12085

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Zilong Wang, Hao Zhang, Chun-Liang Li et al.

ICLR 2024posterarXiv:2401.04398
#12086

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models

Zuxin Liu, Jesse Zhang, Kavosh Asadi et al.

ICLR 2024posterarXiv:2310.05905
#12087

Concept Bottleneck Generative Models

Aya Abdelsalam Ismail, Julius Adebayo, Hector Corrada Bravo et al.

ICLR 2024poster
#12088

Safe Collaborative Filtering

Riku Togashi, Tatsushi Oka, Naoto Ohsaka et al.

ICLR 2024posterarXiv:2306.05292
#12089

Skip-Attention: Improving Vision Transformers by Paying Less Attention

Shashank Venkataramanan, Amir Ghodrati, Yuki Asano et al.

ICLR 2024posterarXiv:2301.02240
#12090

Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning

Linhao Luo, Yuan-Fang Li, Reza Haffari et al.

ICLR 2024posterarXiv:2310.01061
#12091

Robust Model-Based Optimization for Challenging Fitness Landscapes

Saba Ghaffari, Ehsan Saleh, Alex Schwing et al.

ICLR 2024posterarXiv:2305.13650
#12092

Analytically Tractable Hidden-States Inference in Bayesian Neural Networks

Luong-Ha Nguyen, James-A. Goulet

ICLR 2024posterarXiv:2107.03759
#12093

An interpretable error correction method for enhancing code-to-code translation

Min Xue, Artur Andrzejak, Marla Leuther

ICLR 2024poster
#12094

Fiber Monte Carlo

Nick Richardson, Deniz Oktay, Yaniv Ovadia et al.

ICLR 2024poster
#12095

NeRM: Learning Neural Representations for High-Framerate Human Motion Synthesis

Dong Wei, Huaijiang Sun, Bin Li et al.

ICLR 2024oral
#12096

A Unified Experiment Design Approach for Cyclic and Acyclic Causal Models

Ehsan Mokhtarian, Saber Salehkaleybar, AmirEmad Ghassami et al.

ICLR 2024posterarXiv:2205.10083
#12097

A Framework and Benchmark for Deep Batch Active Learning for Regression

David Holzmüller, Viktor Zaverkin, Johannes Kästner et al.

ICLR 2024posterarXiv:2203.09410
#12098

Tackling the Data Heterogeneity in Asynchronous Federated Learning with Cached Update Calibration

Yujia Wang, Yuanpu Cao, Jingcheng Wu et al.

ICLR 2024poster
#12099

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation

Jiaming Liu, Senqiao Yang, Peidong Jia et al.

ICLR 2024posterarXiv:2306.04344
#12100

Automatic Functional Differentiation in JAX

Min Lin

ICLR 2024posterarXiv:2311.18727
#12101

Manipulating dropout reveals an optimal balance of efficiency and robustness in biological and machine visual systems

Jacob Prince, Gabriel Fajardo, George Alvarez et al.

ICLR 2024oral
#12102

$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis

Zishun Yu, Yunzhe Tao, Liyu Chen et al.

ICLR 2024spotlightarXiv:2310.03173
#12103

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE

Zeren Chen, ziqin wang, zhen wang et al.

ICLR 2024posterarXiv:2311.02684
#12104

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Zhibin Gou, Zhihong Shao, Yeyun Gong et al.

ICLR 2024posterarXiv:2309.17452
#12105

Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation

Jianliang He, Han Zhong, Zhuoran Yang

ICLR 2024posterarXiv:2404.12648
#12106

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

Rui Yang, Han Zhong, Jiawei Xu et al.

ICLR 2024spotlightarXiv:2310.12955
#12107

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions

Juncheng Li, Kaihang Pan, Zhiqi Ge et al.

ICLR 2024spotlightarXiv:2308.04152
#12108

Towards domain-invariant Self-Supervised Learning with Batch Styles Standardization

Marin Scalbert, Maria Vakalopoulou, Florent Couzinie-Devy

ICLR 2024posterarXiv:2303.06088
#12109

SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training

Kazem Meidani, Parshin Shojaee, Chandan Reddy et al.

ICLR 2024spotlightarXiv:2310.02227
#12110

Learning from Label Proportions: Bootstrapping Supervised Learners via Belief Propagation

Shreyas Havaldar, Navodita Sharma, Shubhi Sareen et al.

ICLR 2024posterarXiv:2310.08056
#12111

Transformer-Modulated Diffusion Models for Probabilistic Multivariate Time Series Forecasting

Yuxin Li, Wenchao Chen, Xinyue Hu et al.

ICLR 2024poster
#12112

Vanishing Gradients in Reinforcement Finetuning of Language Models

Noam Razin, Hattie Zhou, Omid Saremi et al.

ICLR 2024posterarXiv:2310.20703
#12113

What Algorithms can Transformers Learn? A Study in Length Generalization

Hattie Zhou, Arwen Bradley, Etai Littwin et al.

ICLR 2024posterarXiv:2310.16028
#12114

Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization

Yinbin Han, Meisam Razaviyayn, Renyuan Xu

ICLR 2024posterarXiv:2401.15604
#12115

Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting

xinlu zhang, Shiyang Li, Xianjun Yang et al.

ICLR 2024posterarXiv:2305.12723
#12116

Optimal criterion for feature learning of two-layer linear neural network in high dimensional interpolation regime

Keita Suzuki, Taiji Suzuki

ICLR 2024poster
#12117

On the Scalability and Memory Efficiency of Semidefinite Programs for Lipschitz Constant Estimation of Neural Networks

Zi Wang, Bin Hu, Aaron Havens et al.

ICLR 2024poster
#12118

Intelligent Switching for Reset-Free RL

Darshan Patil, Janarthanan Rajendran, Glen Berseth et al.

ICLR 2024posterarXiv:2405.01684
#12119

Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification

Joar Skalse, Alessandro Abate

ICLR 2024posterarXiv:2403.06854
#12120

Effective and Efficient Federated Tree Learning on Hybrid Data

Qinbin Li, Chulin Xie, Xiaojun Xu et al.

ICLR 2024posterarXiv:2310.11865
#12121

Neural Processing of Tri-Plane Hybrid Neural Fields

Adriano Cardace, Pierluigi Zama Ramirez, Francesco Ballerini et al.

ICLR 2024posterarXiv:2310.01140
#12122

Boosting the Adversarial Robustness of Graph Neural Networks: An OOD Perspective

Kuan Li, YiWen Chen, Yang Liu et al.

ICLR 2024poster
#12123

Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game

Simin Li, Jun Guo, Jingqiao Xiu et al.

ICLR 2024posterarXiv:2305.12872
#12124

SetCSE: Set Operations using Contrastive Learning of Sentence Embeddings

Kang Liu

ICLR 2024posterarXiv:2404.17606
#12125

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models

Keming Lu, Hongyi Yuan, Zheng Yuan et al.

ICLR 2024posterarXiv:2308.07074
#12126

Debiasing Attention Mechanism in Transformer without Demographics

Shenyu Lu, Yipei Wang, Xiaoqian Wang

ICLR 2024poster
#12127

Unsupervised Pretraining for Fact Verification by Language Model Distillation

Adrian Bazaga, Pietro Lio, Gos Micklem

ICLR 2024posterarXiv:2309.16540
#12128

Image Translation as Diffusion Visual Programmers

Cheng Han, James Liang, Qifan Wang et al.

ICLR 2024posterarXiv:2401.09742
#12129

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami et al.

ICLR 2024posterarXiv:2311.18207
#12130

Adversarial Imitation Learning via Boosting

Jonathan Chang, Dhruv Sreenivas, Yingbing Huang et al.

ICLR 2024posterarXiv:2404.08513
#12131

Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information

Linfeng Ye, Shayan Mohajer Hamidi, Renhao Tan et al.

ICLR 2024posterarXiv:2401.08732
#12132

Provable Reward-Agnostic Preference-Based Reinforcement Learning

Wenhao Zhan, Masatoshi Uehara, Wen Sun et al.

ICLR 2024spotlightarXiv:2305.18505
#12133

Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining

Licong Lin, Yu Bai, Song Mei

ICLR 2024posterarXiv:2310.08566
#12134

Improving Convergence and Generalization Using Parameter Symmetries

Bo Zhao, Robert M. Gower, Robin Walters et al.

ICLR 2024posterarXiv:2305.13404
#12135

COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits

Mintong Kang, Nezihe Merve Gürel, Linyi Li et al.

ICLR 2024posterarXiv:2403.11348
#12136

Manifold Preserving Guided Diffusion

Yutong He, Naoki Murata, Chieh-Hsin Lai et al.

ICLR 2024posterarXiv:2311.16424
#12137

Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators

Daniel Geng, Andrew Owens

ICLR 2024posterarXiv:2401.18085
#12138

Threaten Spiking Neural Networks through Combining Rate and Temporal Information

Zecheng Hao, Tong Bu, Xinyu Shi et al.

ICLR 2024oral
#12139

Exploring Target Representations for Masked Autoencoders

xingbin liu, Jinghao Zhou, Tao Kong et al.

ICLR 2024posterarXiv:2209.03917
#12140

Federated Recommendation with Additive Personalization

Zhiwei Li, Guodong Long, Tianyi Zhou

ICLR 2024posterarXiv:2301.09109
#12141

Neural Language of Thought Models

Yi-Fu Wu, Minseung Lee, Sungjin Ahn

ICLR 2024posterarXiv:2402.01203
#12142

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Tianbao Xie, Siheng Zhao, Chen Henry Wu et al.

ICLR 2024spotlightarXiv:2309.11489
#12143

Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion

Alexandru Meterez, Amir Joudaki, Francesco Orabona et al.

ICLR 2024posterarXiv:2310.02012
#12144

Statistical Rejection Sampling Improves Preference Optimization

Tianqi Liu, Yao Zhao, Rishabh Joshi et al.

ICLR 2024posterarXiv:2309.06657
#12145

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Qingru Zhang, Chandan Singh, Liyuan Liu et al.

ICLR 2024posterarXiv:2311.02262
#12146

Privacy Amplification for Matrix Mechanisms

Christopher Choquette-Choo, Arun Ganesh, Thomas Steinke et al.

ICLR 2024spotlightarXiv:2310.15526
#12147

Negative Label Guided OOD Detection with Pretrained Vision-Language Models

Xue JIANG, Feng Liu, Zhen Fang et al.

ICLR 2024spotlightarXiv:2403.20078
#12148

PTaRL: Prototype-based Tabular Representation Learning via Space Calibration

Hangting Ye, Wei Fan, Xiaozhuang Song et al.

ICLR 2024spotlightarXiv:2407.05364
#12149

Constrained Bi-Level Optimization: Proximal Lagrangian Value Function Approach and Hessian-free Algorithm

Wei Yao, Chengming Yu, Shangzhi Zeng et al.

ICLR 2024spotlightarXiv:2401.16164
#12150

Correlated Noise Provably Beats Independent Noise for Differentially Private Learning

Christopher Choquette-Choo, Krishnamurthy Dvijotham, Krishna Pillutla et al.

ICLR 2024posterarXiv:2310.06771
#12151

ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers

Junjie Oscar Yin, Yingheng Wang, Volodymyr Kuleshov et al.

ICLR 2024posterarXiv:2309.16119
#12152

On the Stability of Expressive Positional Encodings for Graphs

Yinan Huang, William Lu, Joshua Robinson et al.

ICLR 2024posterarXiv:2310.02579
#12153

Evaluating Representation Learning on the Protein Structure Universe

Arian Jamasb, Alex Morehead, Chaitanya Joshi et al.

ICLR 2024posterarXiv:2406.13864
#12154

AutoVP: An Automated Visual Prompting Framework and Benchmark

Hsi-Ai Tsao, Lei Hsiung, Pin-Yu Chen et al.

ICLR 2024posterarXiv:2310.08381
#12155

On the Hardness of Constrained Cooperative Multi-Agent Reinforcement Learning

Ziyi Chen, Yi Zhou, Heng Huang

ICLR 2024poster
#12156

Information Retention via Learning Supplemental Features

Zhipeng Xie, Yahe Li

ICLR 2024spotlight
#12157

Geometry-Aware Projective Mapping for Unbounded Neural Radiance Fields

Junoh Lee, Hyunjun Jung, Jinhwi Park et al.

ICLR 2024poster
#12158

Off-Policy Primal-Dual Safe Reinforcement Learning

Zifan Wu, Bo Tang, Qian Lin et al.

ICLR 2024posterarXiv:2401.14758
#12159

When should we prefer Decision Transformers for Offline Reinforcement Learning?

Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard et al.

ICLR 2024posterarXiv:2305.14550
#12160

ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning

Jiecheng Lu, Xu Han, Shihao Yang

ICLR 2024oralarXiv:2310.09488
#12161

SAS: Structured Activation Sparsification

Yusuke Sekikawa, Shingo Yashima

ICLR 2024poster
#12162

Learning Multi-Agent Communication with Contrastive Learning

Yat Long (Richie) Lo, Biswa Sengupta, Jakob Foerster et al.

ICLR 2024posterarXiv:2307.01403
#12163

Xformer: Hybrid X-Shaped Transformer for Image Denoising

Jiale Zhang, Yulun Zhang, Jinjin Gu et al.

ICLR 2024posterarXiv:2303.06440
#12164

Dynamics-Informed Protein Design with Structure Conditioning

Urszula Julia Komorowska, Simon Mathis, Kieran Didi et al.

ICLR 2024poster
#12165

Identifiable Latent Polynomial Causal Models through the Lens of Change

Yuhang Liu, Zhen Zhang, Dong Gong et al.

ICLR 2024posterarXiv:2310.15580
#12166

SYMBOL: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning

Jiacheng Chen, Zeyuan Ma, Hongshu Guo et al.

ICLR 2024posterarXiv:2402.02355
#12167

Graph Lottery Ticket Automated

Guibin Zhang, Kun Wang, Wei Huang et al.

ICLR 2024poster
#12168

Threshold-Consistent Margin Loss for Open-World Deep Metric Learning

Qin ZHANG, Linghan Xu, Jun Fang et al.

ICLR 2024posterarXiv:2307.04047
#12169

Encoding Unitig-level Assembly Graphs with Heterophilous Constraints for Metagenomic Contigs Binning

Hansheng Xue, Vijini Mallawaarachchi, Lexing Xie et al.

ICLR 2024poster
#12170

Adaptive Regret for Bandits Made Possible: Two Queries Suffice

Zhou Lu, Qiuyi (Richard) Zhang, Xinyi Chen et al.

ICLR 2024posterarXiv:2401.09278
#12171

AdaMerging: Adaptive Model Merging for Multi-Task Learning

Enneng Yang, Zhenyi Wang, Li Shen et al.

ICLR 2024posterarXiv:2310.02575
#12172

Statistically Optimal $K$-means Clustering via Nonnegative Low-rank Semidefinite Programming

Yubo Zhuang, Xiaohui Chen, Yun Yang et al.

ICLR 2024posterarXiv:2305.18436
#12173

Improved statistical and computational complexity of the mean-field Langevin dynamics under structured data

Atsushi Nitanda, Kazusato Oko, Taiji Suzuki et al.

ICLR 2024poster
#12174

Bridging Neural and Symbolic Representations with Transitional Dictionary Learning

Junyan Cheng, Peter Chin

ICLR 2024posterarXiv:2308.02000
#12175

Thin-Shell Object Manipulations With Differentiable Physics Simulations

Yian Wang, Juntian Zheng, Zhehuan Chen et al.

ICLR 2024spotlightarXiv:2404.00451
#12176

Bayesian Coreset Optimization for Personalized Federated Learning

Prateek Chanda, Shrey Modi, Ganesh Ramakrishnan

ICLR 2024posterarXiv:2511.01800
#12177

Beyond Spatio-Temporal Representations: Evolving Fourier Transform for Temporal Graphs

Anson Simon Bastos, Kuldeep Singh, Abhishek Nadgeri et al.

ICLR 2024oralarXiv:2402.16078
#12178

Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs

Woomin Song, Seunghyuk Oh, Sangwoo Mo et al.

ICLR 2024posterarXiv:2404.10308
#12179

Towards Best Practices of Activation Patching in Language Models: Metrics and Methods

Fred Zhang, Neel Nanda

ICLR 2024posterarXiv:2309.16042
#12180

Scale-Adaptive Diffusion Model for Complex Sketch Synthesis

Jijin Hu, Ke Li, Yonggang Qi et al.

ICLR 2024poster
#12181

On the Over-Memorization During Natural, Robust and Catastrophic Overfitting

Runqi Lin, Chaojian Yu, Bo Han et al.

ICLR 2024posterarXiv:2310.08847
#12182

Mastering Memory Tasks with World Models

Mohammad Reza Samsami, Artem Zholus, Janarthanan Rajendran et al.

ICLR 2024oralarXiv:2403.04253
#12183

Towards Principled Representation Learning from Videos for Reinforcement Learning

Dipendra Kumar Misra, Akanksha Saran, Tengyang Xie et al.

ICLR 2024oralarXiv:2403.13765
#12184

Expected flow networks in stochastic environments and two-player zero-sum games

Marco Jiralerspong, Bilun Sun, Danilo Vucetic et al.

ICLR 2024posterarXiv:2310.02779
#12185

Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond

Tianxin Wei, Bowen Jin, Ruirui Li et al.

ICLR 2024posterarXiv:2403.10667
#12186

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Yung-Sung Chuang, Yujia Xie, Hongyin Luo et al.

ICLR 2024posterarXiv:2309.03883
#12187

Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials

Ivan Grega, Ilyes Batatia, Gábor Csányi et al.

ICLR 2024posterarXiv:2401.16914
#12188

SALMON: Self-Alignment with Instructable Reward Models

Zhiqing Sun, Yikang Shen, Hongxin Zhang et al.

ICLR 2024posterarXiv:2310.05910
#12189

Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs

Feiyang Kang, Hoang Anh Just, Yifan Sun et al.

ICLR 2024posterarXiv:2405.02774
#12190

Augmenting Transformers with Recursively Composed Multi-grained Representations

Xiang Hu, Qingyang Zhu, Kewei Tu et al.

ICLR 2024posterarXiv:2309.16319
#12191

Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization

Guang Lin, Chao Li, Jianhai Zhang et al.

ICLR 2024posterarXiv:2401.16352
#12192

Large Language Models as Generalizable Policies for Embodied Tasks

Andrew Szot, Max Schwarzer, Harsh Agrawal et al.

ICLR 2024posterarXiv:2310.17722
#12193

The Joint Effect of Task Similarity and Overparameterization on Catastrophic Forgetting — An Analytical Model

Daniel Goldfarb, Itay Evron, Nir Weinberger et al.

ICLR 2024posterarXiv:2401.12617
#12194

Fast Equilibrium of SGD in Generic Situations

Zhiyuan Li, Yi Wang, Zhiren Wang

ICLR 2024poster
#12195

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data

Yuhui Zhang, Elaine Sui, Serena Yeung

ICLR 2024posterarXiv:2401.08567
#12196

Compositional Preference Models for Aligning LMs

DONGYOUNG GO, Tomek Korbak, Germàn Kruszewski et al.

ICLR 2024posterarXiv:2310.13011
#12197

Diffusion Posterior Sampling for Linear Inverse Problem Solving: A Filtering Perspective

Zehao Dou, Yang Song

ICLR 2024poster
#12198

Demystifying Local & Global Fairness Trade-offs in Federated Learning Using Partial Information Decomposition

Faisal Hamman, Sanghamitra Dutta

ICLR 2024poster
#12199

Learning Conditional Invariances through Non-Commutativity

Abhra Chaudhuri, Serban Georgescu, Anjan Dutta

ICLR 2024posterarXiv:2402.11682
#12200

Generative Modeling with Phase Stochastic Bridge

Tianrong Chen, Jiatao Gu, Laurent Dinh et al.

ICLR 2024posterarXiv:2310.07805