Most Cited 2024 "sft" Papers

12,324 papers found • Page 61 of 62

#12001

Robust Model-Based Optimization for Challenging Fitness Landscapes

Saba Ghaffari, Ehsan Saleh, Alex Schwing et al.

ICLR 2024posterarXiv:2305.13650
#12002

Analytically Tractable Hidden-States Inference in Bayesian Neural Networks

Luong-Ha Nguyen, James-A. Goulet

ICLR 2024posterarXiv:2107.03759
#12003

An interpretable error correction method for enhancing code-to-code translation

Min Xue, Artur Andrzejak, Marla Leuther

ICLR 2024poster
#12004

Fiber Monte Carlo

Nick Richardson, Deniz Oktay, Yaniv Ovadia et al.

ICLR 2024poster
#12005

NeRM: Learning Neural Representations for High-Framerate Human Motion Synthesis

Dong Wei, Huaijiang Sun, Bin Li et al.

ICLR 2024oral
#12006

A Unified Experiment Design Approach for Cyclic and Acyclic Causal Models

Ehsan Mokhtarian, Saber Salehkaleybar, AmirEmad Ghassami et al.

ICLR 2024posterarXiv:2205.10083
#12007

A Framework and Benchmark for Deep Batch Active Learning for Regression

David Holzmüller, Viktor Zaverkin, Johannes Kästner et al.

ICLR 2024posterarXiv:2203.09410
#12008

Tackling the Data Heterogeneity in Asynchronous Federated Learning with Cached Update Calibration

Yujia Wang, Yuanpu Cao, Jingcheng Wu et al.

ICLR 2024poster
#12009

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation

Jiaming Liu, Senqiao Yang, Peidong Jia et al.

ICLR 2024posterarXiv:2306.04344
#12010

Automatic Functional Differentiation in JAX

Min Lin

ICLR 2024posterarXiv:2311.18727
#12011

Manipulating dropout reveals an optimal balance of efficiency and robustness in biological and machine visual systems

Jacob Prince, Gabriel Fajardo, George Alvarez et al.

ICLR 2024oral
#12012

$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis

Zishun Yu, Yunzhe Tao, Liyu Chen et al.

ICLR 2024spotlightarXiv:2310.03173
#12013

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE

Zeren Chen, ziqin wang, zhen wang et al.

ICLR 2024posterarXiv:2311.02684
#12014

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Zhibin Gou, Zhihong Shao, Yeyun Gong et al.

ICLR 2024posterarXiv:2309.17452
#12015

Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation

Jianliang He, Han Zhong, Zhuoran Yang

ICLR 2024posterarXiv:2404.12648
#12016

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

Rui Yang, Han Zhong, Jiawei Xu et al.

ICLR 2024spotlightarXiv:2310.12955
#12017

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions

Juncheng Li, Kaihang Pan, Zhiqi Ge et al.

ICLR 2024spotlightarXiv:2308.04152
#12018

Towards domain-invariant Self-Supervised Learning with Batch Styles Standardization

Marin Scalbert, Maria Vakalopoulou, Florent Couzinie-Devy

ICLR 2024posterarXiv:2303.06088
#12019

SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training

Kazem Meidani, Parshin Shojaee, Chandan Reddy et al.

ICLR 2024spotlightarXiv:2310.02227
#12020

Learning from Label Proportions: Bootstrapping Supervised Learners via Belief Propagation

Shreyas Havaldar, Navodita Sharma, Shubhi Sareen et al.

ICLR 2024posterarXiv:2310.08056
#12021

Transformer-Modulated Diffusion Models for Probabilistic Multivariate Time Series Forecasting

Yuxin Li, Wenchao Chen, Xinyue Hu et al.

ICLR 2024poster
#12022

Vanishing Gradients in Reinforcement Finetuning of Language Models

Noam Razin, Hattie Zhou, Omid Saremi et al.

ICLR 2024posterarXiv:2310.20703
#12023

What Algorithms can Transformers Learn? A Study in Length Generalization

Hattie Zhou, Arwen Bradley, Etai Littwin et al.

ICLR 2024posterarXiv:2310.16028
#12024

Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization

Yinbin Han, Meisam Razaviyayn, Renyuan Xu

ICLR 2024posterarXiv:2401.15604
#12025

Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting

xinlu zhang, Shiyang Li, Xianjun Yang et al.

ICLR 2024posterarXiv:2305.12723
#12026

Optimal criterion for feature learning of two-layer linear neural network in high dimensional interpolation regime

Keita Suzuki, Taiji Suzuki

ICLR 2024poster
#12027

On the Scalability and Memory Efficiency of Semidefinite Programs for Lipschitz Constant Estimation of Neural Networks

Zi Wang, Bin Hu, Aaron Havens et al.

ICLR 2024poster
#12028

Intelligent Switching for Reset-Free RL

Darshan Patil, Janarthanan Rajendran, Glen Berseth et al.

ICLR 2024posterarXiv:2405.01684
#12029

Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification

Joar Skalse, Alessandro Abate

ICLR 2024posterarXiv:2403.06854
#12030

Effective and Efficient Federated Tree Learning on Hybrid Data

Qinbin Li, Chulin Xie, Xiaojun Xu et al.

ICLR 2024posterarXiv:2310.11865
#12031

Neural Processing of Tri-Plane Hybrid Neural Fields

Adriano Cardace, Pierluigi Zama Ramirez, Francesco Ballerini et al.

ICLR 2024posterarXiv:2310.01140
#12032

Boosting the Adversarial Robustness of Graph Neural Networks: An OOD Perspective

Kuan Li, YiWen Chen, Yang Liu et al.

ICLR 2024poster
#12033

Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game

Simin Li, Jun Guo, Jingqiao Xiu et al.

ICLR 2024posterarXiv:2305.12872
#12034

SetCSE: Set Operations using Contrastive Learning of Sentence Embeddings

Kang Liu

ICLR 2024posterarXiv:2404.17606
#12035

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models

Keming Lu, Hongyi Yuan, Zheng Yuan et al.

ICLR 2024posterarXiv:2308.07074
#12036

Debiasing Attention Mechanism in Transformer without Demographics

Shenyu Lu, Yipei Wang, Xiaoqian Wang

ICLR 2024poster
#12037

Unsupervised Pretraining for Fact Verification by Language Model Distillation

Adrian Bazaga, Pietro Lio, Gos Micklem

ICLR 2024posterarXiv:2309.16540
#12038

Image Translation as Diffusion Visual Programmers

Cheng Han, James Liang, Qifan Wang et al.

ICLR 2024posterarXiv:2401.09742
#12039

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami et al.

ICLR 2024posterarXiv:2311.18207
#12040

Adversarial Imitation Learning via Boosting

Jonathan Chang, Dhruv Sreenivas, Yingbing Huang et al.

ICLR 2024posterarXiv:2404.08513
#12041

Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information

Linfeng Ye, Shayan Mohajer Hamidi, Renhao Tan et al.

ICLR 2024posterarXiv:2401.08732
#12042

Provable Reward-Agnostic Preference-Based Reinforcement Learning

Wenhao Zhan, Masatoshi Uehara, Wen Sun et al.

ICLR 2024spotlightarXiv:2305.18505
#12043

Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining

Licong Lin, Yu Bai, Song Mei

ICLR 2024posterarXiv:2310.08566
#12044

Improving Convergence and Generalization Using Parameter Symmetries

Bo Zhao, Robert M. Gower, Robin Walters et al.

ICLR 2024posterarXiv:2305.13404
#12045

COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits

Mintong Kang, Nezihe Merve Gürel, Linyi Li et al.

ICLR 2024posterarXiv:2403.11348
#12046

Manifold Preserving Guided Diffusion

Yutong He, Naoki Murata, Chieh-Hsin Lai et al.

ICLR 2024posterarXiv:2311.16424
#12047

Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators

Daniel Geng, Andrew Owens

ICLR 2024posterarXiv:2401.18085
#12048

Threaten Spiking Neural Networks through Combining Rate and Temporal Information

Zecheng Hao, Tong Bu, Xinyu Shi et al.

ICLR 2024oral
#12049

Exploring Target Representations for Masked Autoencoders

xingbin liu, Jinghao Zhou, Tao Kong et al.

ICLR 2024posterarXiv:2209.03917
#12050

Federated Recommendation with Additive Personalization

Zhiwei Li, Guodong Long, Tianyi Zhou

ICLR 2024posterarXiv:2301.09109
#12051

Neural Language of Thought Models

Yi-Fu Wu, Minseung Lee, Sungjin Ahn

ICLR 2024posterarXiv:2402.01203
#12052

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Tianbao Xie, Siheng Zhao, Chen Henry Wu et al.

ICLR 2024spotlightarXiv:2309.11489
#12053

Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion

Alexandru Meterez, Amir Joudaki, Francesco Orabona et al.

ICLR 2024posterarXiv:2310.02012
#12054

Statistical Rejection Sampling Improves Preference Optimization

Tianqi Liu, Yao Zhao, Rishabh Joshi et al.

ICLR 2024posterarXiv:2309.06657
#12055

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Qingru Zhang, Chandan Singh, Liyuan Liu et al.

ICLR 2024posterarXiv:2311.02262
#12056

Privacy Amplification for Matrix Mechanisms

Christopher Choquette-Choo, Arun Ganesh, Thomas Steinke et al.

ICLR 2024spotlightarXiv:2310.15526
#12057

Negative Label Guided OOD Detection with Pretrained Vision-Language Models

Xue JIANG, Feng Liu, Zhen Fang et al.

ICLR 2024spotlightarXiv:2403.20078
#12058

PTaRL: Prototype-based Tabular Representation Learning via Space Calibration

Hangting Ye, Wei Fan, Xiaozhuang Song et al.

ICLR 2024spotlightarXiv:2407.05364
#12059

Constrained Bi-Level Optimization: Proximal Lagrangian Value Function Approach and Hessian-free Algorithm

Wei Yao, Chengming Yu, Shangzhi Zeng et al.

ICLR 2024spotlightarXiv:2401.16164
#12060

Correlated Noise Provably Beats Independent Noise for Differentially Private Learning

Christopher Choquette-Choo, Krishnamurthy Dvijotham, Krishna Pillutla et al.

ICLR 2024posterarXiv:2310.06771
#12061

ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers

Junjie Oscar Yin, Yingheng Wang, Volodymyr Kuleshov et al.

ICLR 2024posterarXiv:2309.16119
#12062

On the Stability of Expressive Positional Encodings for Graphs

Yinan Huang, William Lu, Joshua Robinson et al.

ICLR 2024posterarXiv:2310.02579
#12063

Evaluating Representation Learning on the Protein Structure Universe

Arian Jamasb, Alex Morehead, Chaitanya Joshi et al.

ICLR 2024posterarXiv:2406.13864
#12064

AutoVP: An Automated Visual Prompting Framework and Benchmark

Hsi-Ai Tsao, Lei Hsiung, Pin-Yu Chen et al.

ICLR 2024posterarXiv:2310.08381
#12065

On the Hardness of Constrained Cooperative Multi-Agent Reinforcement Learning

Ziyi Chen, Yi Zhou, Heng Huang

ICLR 2024poster
#12066

Information Retention via Learning Supplemental Features

Zhipeng Xie, Yahe Li

ICLR 2024spotlight
#12067

Geometry-Aware Projective Mapping for Unbounded Neural Radiance Fields

Junoh Lee, Hyunjun Jung, Jinhwi Park et al.

ICLR 2024poster
#12068

Off-Policy Primal-Dual Safe Reinforcement Learning

Zifan Wu, Bo Tang, Qian Lin et al.

ICLR 2024posterarXiv:2401.14758
#12069

When should we prefer Decision Transformers for Offline Reinforcement Learning?

Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard et al.

ICLR 2024posterarXiv:2305.14550
#12070

ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning

Jiecheng Lu, Xu Han, Shihao Yang

ICLR 2024oralarXiv:2310.09488
#12071

SAS: Structured Activation Sparsification

Yusuke Sekikawa, Shingo Yashima

ICLR 2024poster
#12072

Learning Multi-Agent Communication with Contrastive Learning

Yat Long (Richie) Lo, Biswa Sengupta, Jakob Foerster et al.

ICLR 2024posterarXiv:2307.01403
#12073

Xformer: Hybrid X-Shaped Transformer for Image Denoising

Jiale Zhang, Yulun Zhang, Jinjin Gu et al.

ICLR 2024posterarXiv:2303.06440
#12074

Dynamics-Informed Protein Design with Structure Conditioning

Urszula Julia Komorowska, Simon Mathis, Kieran Didi et al.

ICLR 2024poster
#12075

Identifiable Latent Polynomial Causal Models through the Lens of Change

Yuhang Liu, Zhen Zhang, Dong Gong et al.

ICLR 2024posterarXiv:2310.15580
#12076

SYMBOL: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning

Jiacheng Chen, Zeyuan Ma, Hongshu Guo et al.

ICLR 2024posterarXiv:2402.02355
#12077

Graph Lottery Ticket Automated

Guibin Zhang, Kun Wang, Wei Huang et al.

ICLR 2024poster
#12078

Threshold-Consistent Margin Loss for Open-World Deep Metric Learning

Qin ZHANG, Linghan Xu, Jun Fang et al.

ICLR 2024posterarXiv:2307.04047
#12079

Encoding Unitig-level Assembly Graphs with Heterophilous Constraints for Metagenomic Contigs Binning

Hansheng Xue, Vijini Mallawaarachchi, Lexing Xie et al.

ICLR 2024poster
#12080

Adaptive Regret for Bandits Made Possible: Two Queries Suffice

Zhou Lu, Qiuyi (Richard) Zhang, Xinyi Chen et al.

ICLR 2024posterarXiv:2401.09278
#12081

AdaMerging: Adaptive Model Merging for Multi-Task Learning

Enneng Yang, Zhenyi Wang, Li Shen et al.

ICLR 2024posterarXiv:2310.02575
#12082

Statistically Optimal $K$-means Clustering via Nonnegative Low-rank Semidefinite Programming

Yubo Zhuang, Xiaohui Chen, Yun Yang et al.

ICLR 2024posterarXiv:2305.18436
#12083

Improved statistical and computational complexity of the mean-field Langevin dynamics under structured data

Atsushi Nitanda, Kazusato Oko, Taiji Suzuki et al.

ICLR 2024poster
#12084

Bridging Neural and Symbolic Representations with Transitional Dictionary Learning

Junyan Cheng, Peter Chin

ICLR 2024posterarXiv:2308.02000
#12085

Thin-Shell Object Manipulations With Differentiable Physics Simulations

Yian Wang, Juntian Zheng, Zhehuan Chen et al.

ICLR 2024spotlightarXiv:2404.00451
#12086

Bayesian Coreset Optimization for Personalized Federated Learning

Prateek Chanda, Shrey Modi, Ganesh Ramakrishnan

ICLR 2024posterarXiv:2511.01800
#12087

Beyond Spatio-Temporal Representations: Evolving Fourier Transform for Temporal Graphs

Anson Simon Bastos, Kuldeep Singh, Abhishek Nadgeri et al.

ICLR 2024oralarXiv:2402.16078
#12088

Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs

Woomin Song, Seunghyuk Oh, Sangwoo Mo et al.

ICLR 2024posterarXiv:2404.10308
#12089

Towards Best Practices of Activation Patching in Language Models: Metrics and Methods

Fred Zhang, Neel Nanda

ICLR 2024posterarXiv:2309.16042
#12090

Scale-Adaptive Diffusion Model for Complex Sketch Synthesis

Jijin Hu, Ke Li, Yonggang Qi et al.

ICLR 2024poster
#12091

On the Over-Memorization During Natural, Robust and Catastrophic Overfitting

Runqi Lin, Chaojian Yu, Bo Han et al.

ICLR 2024posterarXiv:2310.08847
#12092

Mastering Memory Tasks with World Models

Mohammad Reza Samsami, Artem Zholus, Janarthanan Rajendran et al.

ICLR 2024oralarXiv:2403.04253
#12093

Towards Principled Representation Learning from Videos for Reinforcement Learning

Dipendra Kumar Misra, Akanksha Saran, Tengyang Xie et al.

ICLR 2024oralarXiv:2403.13765
#12094

Expected flow networks in stochastic environments and two-player zero-sum games

Marco Jiralerspong, Bilun Sun, Danilo Vucetic et al.

ICLR 2024posterarXiv:2310.02779
#12095

Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond

Tianxin Wei, Bowen Jin, Ruirui Li et al.

ICLR 2024posterarXiv:2403.10667
#12096

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Yung-Sung Chuang, Yujia Xie, Hongyin Luo et al.

ICLR 2024posterarXiv:2309.03883
#12097

Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials

Ivan Grega, Ilyes Batatia, Gábor Csányi et al.

ICLR 2024posterarXiv:2401.16914
#12098

SALMON: Self-Alignment with Instructable Reward Models

Zhiqing Sun, Yikang Shen, Hongxin Zhang et al.

ICLR 2024posterarXiv:2310.05910
#12099

Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs

Feiyang Kang, Hoang Anh Just, Yifan Sun et al.

ICLR 2024posterarXiv:2405.02774
#12100

Augmenting Transformers with Recursively Composed Multi-grained Representations

Xiang Hu, Qingyang Zhu, Kewei Tu et al.

ICLR 2024posterarXiv:2309.16319
#12101

Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization

Guang Lin, Chao Li, Jianhai Zhang et al.

ICLR 2024posterarXiv:2401.16352
#12102

Large Language Models as Generalizable Policies for Embodied Tasks

Andrew Szot, Max Schwarzer, Harsh Agrawal et al.

ICLR 2024posterarXiv:2310.17722
#12103

The Joint Effect of Task Similarity and Overparameterization on Catastrophic Forgetting — An Analytical Model

Daniel Goldfarb, Itay Evron, Nir Weinberger et al.

ICLR 2024posterarXiv:2401.12617
#12104

Fast Equilibrium of SGD in Generic Situations

Zhiyuan Li, Yi Wang, Zhiren Wang

ICLR 2024poster
#12105

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data

Yuhui Zhang, Elaine Sui, Serena Yeung

ICLR 2024posterarXiv:2401.08567
#12106

Compositional Preference Models for Aligning LMs

DONGYOUNG GO, Tomek Korbak, Germàn Kruszewski et al.

ICLR 2024posterarXiv:2310.13011
#12107

Diffusion Posterior Sampling for Linear Inverse Problem Solving: A Filtering Perspective

Zehao Dou, Yang Song

ICLR 2024poster
#12108

Demystifying Local & Global Fairness Trade-offs in Federated Learning Using Partial Information Decomposition

Faisal Hamman, Sanghamitra Dutta

ICLR 2024poster
#12109

Learning Conditional Invariances through Non-Commutativity

Abhra Chaudhuri, Serban Georgescu, Anjan Dutta

ICLR 2024posterarXiv:2402.11682
#12110

Generative Modeling with Phase Stochastic Bridge

Tianrong Chen, Jiatao Gu, Laurent Dinh et al.

ICLR 2024posterarXiv:2310.07805
#12111

Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation

Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis et al.

ICLR 2024spotlightarXiv:2311.15647
#12112

RobustTSF: Towards Theory and Design of Robust Time Series Forecasting with Anomalies

Hao Cheng, Qingsong Wen, Yang Liu et al.

ICLR 2024posterarXiv:2402.02032
#12113

Tailoring Self-Rationalizers with Multi-Reward Distillation

Sahana Ramnath, Brihi Joshi, Skyler Hallinan et al.

ICLR 2024posterarXiv:2311.02805
#12114

Controlling Vision-Language Models for Multi-Task Image Restoration

Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao et al.

ICLR 2024posterarXiv:2310.01018
#12115

VFLAIR: A Research Library and Benchmark for Vertical Federated Learning

TIANYUAN ZOU, Zixuan GU, Yu He et al.

ICLR 2024posterarXiv:2310.09827
#12116

Measuring Vision-Language STEM Skills of Neural Models

Jianhao Shen, Ye Yuan, Srbuhi Mirzoyan et al.

ICLR 2024posterarXiv:2402.17205
#12117

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Qingyan Guo, Rui Wang, Junliang Guo et al.

ICLR 2024poster
#12118

MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data

Jiaxin Yin, Yuanyuan Qiao, Zitang Zhou et al.

ICLR 2024poster
#12119

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers

Kai Shen, Zeqian Ju, Xu Tan et al.

ICLR 2024spotlightarXiv:2304.09116
#12120

CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding

Qiongyi Zhou, Changde Du, Shengpei Wang et al.

ICLR 2024posterarXiv:2402.08994
#12121

How connectivity structure shapes rich and lazy learning in neural circuits

Yuhan Helena Liu, Aristide Baratin, Jonathan Cornford et al.

ICLR 2024posterarXiv:2310.08513
#12122

ARGS: Alignment as Reward-Guided Search

Maxim Khanov, Jirayu Burapacheep, Yixuan Li

ICLR 2024posterarXiv:2402.01694
#12123

Let Models Speak Ciphers: Multiagent Debate through Embeddings

Chau Pham, Boyi Liu, Yingxiang Yang et al.

ICLR 2024posterarXiv:2310.06272
#12124

NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks

Wenxi Wang, Yang Hu, Mohit Tiwari et al.

ICLR 2024posterarXiv:2110.14053
#12125

Understanding when Dynamics-Invariant Data Augmentations Benefit Model-free Reinforcement Learning Updates

Nicholas Corrado, Josiah Hanna

ICLR 2024posterarXiv:2310.17786
#12126

Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation

Tien Manh Luong, Khai Nguyen, Nhat Ho et al.

ICLR 2024posterarXiv:2405.10084
#12127

Text-to-3D with Classifier Score Distillation

Xin Yu, Yuan-Chen Guo, Yangguang Li et al.

ICLR 2024posterarXiv:2310.19415
#12128

Transformers can optimally learn regression mixture models

Reese Pathak, Rajat Sen, Weihao Kong et al.

ICLR 2024posterarXiv:2311.08362
#12129

Dirichlet-based Per-Sample Weighting by Transition Matrix for Noisy Label Learning

HeeSun Bae, Seungjae Shin, Byeonghu Na et al.

ICLR 2024posterarXiv:2403.02690
#12130

Branch-GAN: Improving Text Generation with (not so) Large Language Models

Fredrik Carlsson, Johan Broberg, Erik Hillbom et al.

ICLR 2024poster
#12131

SocioDojo: Building Lifelong Analytical Agents with Real-world Text and Time Series

Junyan Cheng, Peter Chin

ICLR 2024spotlight
#12132

A unique M-pattern for micro-expression spotting in long videos

Jinxuan Wang, Shiting Xu, Tong Zhang

ICLR 2024poster
#12133

Internal Cross-layer Gradients for Extending Homogeneity to Heterogeneity in Federated Learning

Yun-Hin Chan, Rui Zhou, Running Zhao et al.

ICLR 2024spotlightarXiv:2308.11464
#12134

iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Yong Liu, Tengge Hu, Haoran Zhang et al.

ICLR 2024oralarXiv:2310.06625
#12135

A Mutual Information Perspective on Federated Contrastive Learning

Christos Louizos, Matthias Reisser, Denis Korzhenkov

ICLR 2024spotlightarXiv:2405.02081
#12136

Local Graph Clustering with Noisy Labels

Artur Back de Luca, Kimon Fountoulakis, Shenghao Yang

ICLR 2024posterarXiv:2310.08031
#12137

DistillSpec: Improving Speculative Decoding via Knowledge Distillation

Yongchao Zhou, Kaifeng Lyu, Ankit Singh Rawat et al.

ICLR 2024posterarXiv:2310.08461
#12138

Faithful Vision-Language Interpretation via Concept Bottleneck Models

Songning Lai, Lijie Hu, Junxiao Wang et al.

ICLR 2024poster
#12139

Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets

Yihuan Mao, Chengjie Wu, Xi Chen et al.

ICLR 2024oral
#12140

Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits

Qiwei Di, Tao Jin, Yue Wu et al.

ICLR 2024posterarXiv:2310.00968
#12141

Demystifying Embedding Spaces using Large Language Models

Guy Tennenholtz, Yinlam Chow, ChihWei Hsu et al.

ICLR 2024posterarXiv:2310.04475
#12142

A Newborn Embodied Turing Test for Comparing Object Segmentation Across Animals and Machines

Manju Garimella, Denizhan Pak, Justin Wood et al.

ICLR 2024poster
#12143

DeepZero: Scaling Up Zeroth-Order Optimization for Deep Model Training

AOCHUAN CHEN, Yimeng Zhang, Jinghan Jia et al.

ICLR 2024posterarXiv:2310.02025
#12144

Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.

Raj Ghugare, Matthieu Geist, Glen Berseth et al.

ICLR 2024oralarXiv:2401.11237
#12145

Unveiling the Pitfalls of Knowledge Editing for Large Language Models

Zhoubo Li, Ningyu Zhang, Yunzhi Yao et al.

ICLR 2024posterarXiv:2310.02129
#12146

Learning Thresholds with Latent Values and Censored Feedback

Jiahao Zhang, Tao Lin, Weiqiang Zheng et al.

ICLR 2024posterarXiv:2312.04653
#12147

Extending Power of Nature from Binary to Real-Valued Graph Learning in Real World

Chunshu Wu, Ruibing Song, Chuan Liu et al.

ICLR 2024poster
#12148

Robustifying and Boosting Training-Free Neural Architecture Search

Zhenfeng He, Yao Shu, Zhongxiang Dai et al.

ICLR 2024posterarXiv:2403.07591
#12149

Guess & Sketch: Language Model Guided Transpilation

Celine Lee, Abdulrahman Mahmoud, Michal Kurek et al.

ICLR 2024posterarXiv:2309.14396
#12150

Zero and Few-shot Semantic Parsing with Ambiguous Inputs

Elias Stengel-Eskin, Kyle Rawlins, Benjamin Van Durme

ICLR 2024posterarXiv:2306.00824
#12151

Large-Vocabulary 3D Diffusion Model with Transformer

Ziang Cao, Fangzhou Hong, Tong Wu et al.

ICLR 2024posterarXiv:2309.07920
#12152

An Investigation of Representation and Allocation Harms in Contrastive Learning

Subha Maity, Mayank Agarwal, Mikhail Yurochkin et al.

ICLR 2024posterarXiv:2310.01583
#12153

Channel Vision Transformers: An Image Is Worth 1 x 16 x 16 Words

Yujia Bao, Srinivasan Sivanandan, THEOFANIS KARALETSOS

ICLR 2024posterarXiv:2309.16108
#12154

Solving High Frequency and Multi-Scale PDEs with Gaussian Processes

Shikai Fang, Madison Cooley, Da Long et al.

ICLR 2024posterarXiv:2311.04465
#12155

Adversarial Attacks on Fairness of Graph Neural Networks

Binchi Zhang, Yushun Dong, Chen Chen et al.

ICLR 2024posterarXiv:2310.13822
#12156

Task structure and nonlinearity jointly determine learned representational geometry

Matteo Alleman, Jack Lindsey, Stefano Fusi

ICLR 2024posterarXiv:2401.13558
#12157

ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models

Iman Mirzadeh, Keivan Alizadeh-Vahid, Sachin Mehta et al.

ICLR 2024posterarXiv:2310.04564
#12158

Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph

Jiashuo Sun, Chengjin Xu, Lumingyuan Tang et al.

ICLR 2024posterarXiv:2307.07697
#12159

Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Yingyu Lin, Yian Ma, Yu-Xiang Wang et al.

ICLR 2024posterarXiv:2310.14661
#12160

Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models

Erfan Shayegani, Yue Dong, Nael Abu-Ghazaleh

ICLR 2024spotlightarXiv:2307.14539
#12161

Graph Transformers on EHRs: Better Representation Improves Downstream Performance

Raphael Poulain, Rahmatollah Beheshti

ICLR 2024oral
#12162

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Ning Miao, Yee Whye Teh, Tom Rainforth

ICLR 2024posterarXiv:2308.00436
#12163

Scalable Modular Network: A Framework for Adaptive Learning via Agreement Routing

Minyang Hu, Hong Chang, Bingpeng Ma et al.

ICLR 2024poster
#12164

Improved Regret Bounds for Non-Convex Online-Within-Online Meta Learning

Jiechao GUAN, Hui Xiong

ICLR 2024poster
#12165

Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis

Jonghyun Lee, Hansam Cho, YoungJoon Yoo et al.

ICLR 2024posterarXiv:2401.09048
#12166

Recursive Generalization Transformer for Image Super-Resolution

Zheng Chen, Yulun Zhang, Jinjin Gu et al.

ICLR 2024posterarXiv:2303.06373
#12167

Score Models for Offline Goal-Conditioned Reinforcement Learning

Harshit Sikchi, Rohan Chitnis, Ahmed Touati et al.

ICLR 2024posterarXiv:2311.02013
#12168

Treatment Effects Estimation By Uniform Transformer

Ruoqi Yu, Shulei Wang

ICLR 2024posterarXiv:2008.03738
#12169

Representation Deficiency in Masked Language Modeling

Yu Meng, Jitin Krishnan, Sinong Wang et al.

ICLR 2024posterarXiv:2302.02060
#12170

Sampling Multimodal Distributions with the Vanilla Score: Benefits of Data-Based Initialization

Frederic Koehler, Thuy-Duong Vuong

ICLR 2024posterarXiv:2310.01762
#12171

MaGIC: Multi-modality Guided Image Completion

Hao Wang, Yongsheng Yu, Tiejian Luo et al.

ICLR 2024posterarXiv:2305.11818
#12172

Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation

Jiaxu Wang, Ziyi Zhang, Renjing Xu

ICLR 2024posterarXiv:2401.14354
#12173

HoloNets: Spectral Convolutions do extend to Directed Graphs

Christian Koke, Daniel Cremers

ICLR 2024posterarXiv:2310.02232
#12174

Searching for High-Value Molecules Using Reinforcement Learning and Transformers

Raj Ghugare, Santiago Miret, Adriana Hugessen et al.

ICLR 2024posterarXiv:2310.02902
#12175

Interpretable Meta-Learning of Physical Systems

Matthieu Blanke, marc lelarge

ICLR 2024posterarXiv:2312.00477
#12176

An Image Is Worth 1000 Lies: Transferability of Adversarial Images across Prompts on Vision-Language Models

Haochen Luo, Jindong Gu, Fengyuan Liu et al.

ICLR 2024spotlight
#12177

Fast Value Tracking for Deep Reinforcement Learning

Frank Shih, Faming Liang

ICLR 2024oralarXiv:2403.13178
#12178

Interpretable Sparse System Identification: Beyond Recent Deep Learning Techniques on Time-Series Prediction

Liu Xiaoyi, Duxin Chen, Wenjia Wei et al.

ICLR 2024poster
#12179

FedInverse: Evaluating Privacy Leakage in Federated Learning

DI WU, Jun Bai, Yiliao Song et al.

ICLR 2024poster
#12180

CircuitNet 2.0: An Advanced Dataset for Promoting Machine Learning Innovations in Realistic Chip Design Environment

Xun Jiang, zhuomin chai, Yuxiang Zhao et al.

ICLR 2024poster
#12181

Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?

Tokio Kajitsuka, Issei Sato

ICLR 2024posterarXiv:2307.14023
#12182

Self-Supervised Contrastive Learning for Long-term Forecasting

Junwoo Park, Daehoon Gwak, Jaegul Choo et al.

ICLR 2024posterarXiv:2402.02023
#12183

Federated Orthogonal Training: Mitigating Global Catastrophic Forgetting in Continual Federated Learning

Yavuz Faruk Bakman, Duygu Nur Yaldiz, Yahya Ezzeldin et al.

ICLR 2024posterarXiv:2309.01289
#12184

Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization

Yibing Liu, Chris Xing TIAN, Haoliang Li et al.

ICLR 2024spotlightarXiv:2306.02879
#12185

Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

Xuefei Ning, Zinan Lin, Zixuan Zhou et al.

ICLR 2024posterarXiv:2307.15337
#12186

Rethinking CNN’s Generalization to Backdoor Attack from Frequency Domain

Quanrui Rao, Lin Wang, Wuying Liu

ICLR 2024poster
#12187

Variance Reduced Halpern Iteration for Finite-Sum Monotone Inclusions

Xufeng Cai, Ahmet Alacaoglu, Jelena Diakonikolas

ICLR 2024posterarXiv:2310.02987
#12188

Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation

Zhilong Zhang, Yihao Sun, Junyin Ye et al.

ICLR 2024oral
#12189

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts

Hanan Gani, Shariq Bhat, Muzammal Naseer et al.

ICLR 2024posterarXiv:2310.10640
#12190

VBH-GNN: Variational Bayesian Heterogeneous Graph Neural Networks for Cross-subject Emotion Recognition

Chenyu Liu, XINLIANG ZHOU, Zhengri Zhu et al.

ICLR 2024oral
#12191

Neural Rate Control for Learned Video Compression

yiwei zhang, Guo Lu, Yunuo Chen et al.

ICLR 2024oral
#12192

PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Xuan Ju, Ailing Zeng, Yuxuan Bian et al.

ICLR 2024poster
#12193

Harnessing Density Ratios for Online Reinforcement Learning

Philip Amortila, Dylan Foster, Nan Jiang et al.

ICLR 2024spotlightarXiv:2401.09681
#12194

Sliced Denoising: A Physics-Informed Molecular Pre-Training Method

yuyan ni, Shikun Feng, Wei-Ying Ma et al.

ICLR 2024posterarXiv:2311.02124
#12195

Improved Efficiency Based on Learned Saccade and Continuous Scene Reconstruction From Foveated Visual Sampling

Jiayang Liu, Yiming Bu, Daniel Tso et al.

ICLR 2024spotlight
#12196

Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection

Jiawei Liang, Siyuan Liang, Aishan Liu et al.

ICLR 2024spotlightarXiv:2402.11473
#12197

Negatively Correlated Ensemble Reinforcement Learning for Online Diverse Game Level Generation

Ziqi Wang, Chengpeng Hu, Jialin Liu et al.

ICLR 2024poster
#12198

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

Zhaoyi Zhou, Chuning Zhu, Runlong Zhou et al.

ICLR 2024posterarXiv:2310.19308
#12199

Local Composite Saddle Point Optimization

Site Bai, Brian Bullins

ICLR 2024poster
#12200

ASID: Active Exploration for System Identification in Robotic Manipulation

Marius Memmel, Andrew Wagenmaker, Chuning Zhu et al.

ICLR 2024posterarXiv:2404.12308