Most Cited 2024 "kv cache" Papers

12,324 papers found • Page 62 of 62

#12201

A Framework and Benchmark for Deep Batch Active Learning for Regression

David Holzmüller, Viktor Zaverkin, Johannes Kästner et al.

ICLR 2024 • arXiv:2203.09410
#12202

Tackling the Data Heterogeneity in Asynchronous Federated Learning with Cached Update Calibration

Yujia Wang, Yuanpu Cao, Jingcheng Wu et al.

ICLR 2024
#12203

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation

Jiaming Liu, Senqiao Yang, Peidong Jia et al.

ICLR 2024 • arXiv:2306.04344
#12204

Automatic Functional Differentiation in JAX

Min Lin

ICLR 2024 • arXiv:2311.18727
#12205

Manipulating dropout reveals an optimal balance of efficiency and robustness in biological and machine visual systems

Jacob Prince, Gabriel Fajardo, George Alvarez et al.

ICLR 2024 • oral
#12206

$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis

Zishun Yu, Yunzhe Tao, Liyu Chen et al.

ICLR 2024 • spotlight • arXiv:2310.03173
#12207

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE

Zeren Chen, Ziqin Wang, Zhen Wang et al.

ICLR 2024 • arXiv:2311.02684
#12208

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Zhibin Gou, Zhihong Shao, Yeyun Gong et al.

ICLR 2024 • arXiv:2309.17452
#12209

Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation

Jianliang He, Han Zhong, Zhuoran Yang

ICLR 2024 • arXiv:2404.12648
#12210

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

Rui Yang, Han Zhong, Jiawei Xu et al.

ICLR 2024 • spotlight • arXiv:2310.12955
#12211

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions

Juncheng Li, Kaihang Pan, Zhiqi Ge et al.

ICLR 2024 • spotlight • arXiv:2308.04152
#12212

Towards domain-invariant Self-Supervised Learning with Batch Styles Standardization

Marin Scalbert, Maria Vakalopoulou, Florent Couzinie-Devy

ICLR 2024 • arXiv:2303.06088
#12213

SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training

Kazem Meidani, Parshin Shojaee, Chandan Reddy et al.

ICLR 2024 • spotlight • arXiv:2310.02227
#12214

Learning from Label Proportions: Bootstrapping Supervised Learners via Belief Propagation

Shreyas Havaldar, Navodita Sharma, Shubhi Sareen et al.

ICLR 2024 • arXiv:2310.08056
#12215

Transformer-Modulated Diffusion Models for Probabilistic Multivariate Time Series Forecasting

Yuxin Li, Wenchao Chen, Xinyue Hu et al.

ICLR 2024
#12216

Vanishing Gradients in Reinforcement Finetuning of Language Models

Noam Razin, Hattie Zhou, Omid Saremi et al.

ICLR 2024 • arXiv:2310.20703
#12217

What Algorithms can Transformers Learn? A Study in Length Generalization

Hattie Zhou, Arwen Bradley, Etai Littwin et al.

ICLR 2024 • arXiv:2310.16028
#12218

Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization

Yinbin Han, Meisam Razaviyayn, Renyuan Xu

ICLR 2024 • arXiv:2401.15604
#12219

Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting

Xinlu Zhang, Shiyang Li, Xianjun Yang et al.

ICLR 2024 • arXiv:2305.12723
#12220

Optimal criterion for feature learning of two-layer linear neural network in high dimensional interpolation regime

Keita Suzuki, Taiji Suzuki

ICLR 2024
#12221

On the Scalability and Memory Efficiency of Semidefinite Programs for Lipschitz Constant Estimation of Neural Networks

Zi Wang, Bin Hu, Aaron Havens et al.

ICLR 2024
#12222

Intelligent Switching for Reset-Free RL

Darshan Patil, Janarthanan Rajendran, Glen Berseth et al.

ICLR 2024 • arXiv:2405.01684
#12223

Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification

Joar Skalse, Alessandro Abate

ICLR 2024 • arXiv:2403.06854
#12224

Effective and Efficient Federated Tree Learning on Hybrid Data

Qinbin Li, Chulin Xie, Xiaojun Xu et al.

ICLR 2024 • arXiv:2310.11865
#12225

Neural Processing of Tri-Plane Hybrid Neural Fields

Adriano Cardace, Pierluigi Zama Ramirez, Francesco Ballerini et al.

ICLR 2024 • arXiv:2310.01140
#12226

Boosting the Adversarial Robustness of Graph Neural Networks: An OOD Perspective

Kuan Li, YiWen Chen, Yang Liu et al.

ICLR 2024
#12227

Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game

Simin Li, Jun Guo, Jingqiao Xiu et al.

ICLR 2024 • arXiv:2305.12872
#12228

SetCSE: Set Operations using Contrastive Learning of Sentence Embeddings

Kang Liu

ICLR 2024 • arXiv:2404.17606
#12229

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models

Keming Lu, Hongyi Yuan, Zheng Yuan et al.

ICLR 2024 • arXiv:2308.07074
#12230

Debiasing Attention Mechanism in Transformer without Demographics

Shenyu Lu, Yipei Wang, Xiaoqian Wang

ICLR 2024
#12231

Unsupervised Pretraining for Fact Verification by Language Model Distillation

Adrian Bazaga, Pietro Lio, Gos Micklem

ICLR 2024 • arXiv:2309.16540
#12232

Image Translation as Diffusion Visual Programmers

Cheng Han, James Liang, Qifan Wang et al.

ICLR 2024 • arXiv:2401.09742
#12233

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami et al.

ICLR 2024 • arXiv:2311.18207
#12234

Adversarial Imitation Learning via Boosting

Jonathan Chang, Dhruv Sreenivas, Yingbing Huang et al.

ICLR 2024 • arXiv:2404.08513
#12235

Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information

Linfeng Ye, Shayan Mohajer Hamidi, Renhao Tan et al.

ICLR 2024 • arXiv:2401.08732
#12236

Provable Reward-Agnostic Preference-Based Reinforcement Learning

Wenhao Zhan, Masatoshi Uehara, Wen Sun et al.

ICLR 2024 • spotlight • arXiv:2305.18505
#12237

Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining

Licong Lin, Yu Bai, Song Mei

ICLR 2024 • arXiv:2310.08566
#12238

Improving Convergence and Generalization Using Parameter Symmetries

Bo Zhao, Robert M. Gower, Robin Walters et al.

ICLR 2024 • arXiv:2305.13404
#12239

COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits

Mintong Kang, Nezihe Merve Gürel, Linyi Li et al.

ICLR 2024 • arXiv:2403.11348
#12240

Manifold Preserving Guided Diffusion

Yutong He, Naoki Murata, Chieh-Hsin Lai et al.

ICLR 2024 • arXiv:2311.16424
#12241

Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators

Daniel Geng, Andrew Owens

ICLR 2024 • arXiv:2401.18085
#12242

Threaten Spiking Neural Networks through Combining Rate and Temporal Information

Zecheng Hao, Tong Bu, Xinyu Shi et al.

ICLR 2024 • oral
#12243

Exploring Target Representations for Masked Autoencoders

Xingbin Liu, Jinghao Zhou, Tao Kong et al.

ICLR 2024 • arXiv:2209.03917
#12244

Federated Recommendation with Additive Personalization

Zhiwei Li, Guodong Long, Tianyi Zhou

ICLR 2024 • arXiv:2301.09109
#12245

Neural Language of Thought Models

Yi-Fu Wu, Minseung Lee, Sungjin Ahn

ICLR 2024 • arXiv:2402.01203
#12246

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Tianbao Xie, Siheng Zhao, Chen Henry Wu et al.

ICLR 2024 • spotlight • arXiv:2309.11489
#12247

Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion

Alexandru Meterez, Amir Joudaki, Francesco Orabona et al.

ICLR 2024 • arXiv:2310.02012
#12248

Statistical Rejection Sampling Improves Preference Optimization

Tianqi Liu, Yao Zhao, Rishabh Joshi et al.

ICLR 2024 • arXiv:2309.06657
#12249

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Qingru Zhang, Chandan Singh, Liyuan Liu et al.

ICLR 2024 • arXiv:2311.02262
#12250

Privacy Amplification for Matrix Mechanisms

Christopher Choquette-Choo, Arun Ganesh, Thomas Steinke et al.

ICLR 2024 • spotlight • arXiv:2310.15526
#12251

Negative Label Guided OOD Detection with Pretrained Vision-Language Models

Xue JIANG, Feng Liu, Zhen Fang et al.

ICLR 2024 • spotlight • arXiv:2403.20078
#12252

PTaRL: Prototype-based Tabular Representation Learning via Space Calibration

Hangting Ye, Wei Fan, Xiaozhuang Song et al.

ICLR 2024 • spotlight • arXiv:2407.05364
#12253

Constrained Bi-Level Optimization: Proximal Lagrangian Value Function Approach and Hessian-free Algorithm

Wei Yao, Chengming Yu, Shangzhi Zeng et al.

ICLR 2024 • spotlight • arXiv:2401.16164
#12254

Correlated Noise Provably Beats Independent Noise for Differentially Private Learning

Christopher Choquette-Choo, Krishnamurthy Dvijotham, Krishna Pillutla et al.

ICLR 2024 • arXiv:2310.06771
#12255

ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers

Junjie Oscar Yin, Yingheng Wang, Volodymyr Kuleshov et al.

ICLR 2024 • arXiv:2309.16119
#12256

On the Stability of Expressive Positional Encodings for Graphs

Yinan Huang, William Lu, Joshua Robinson et al.

ICLR 2024 • arXiv:2310.02579
#12257

Evaluating Representation Learning on the Protein Structure Universe

Arian Jamasb, Alex Morehead, Chaitanya Joshi et al.

ICLR 2024 • arXiv:2406.13864
#12258

AutoVP: An Automated Visual Prompting Framework and Benchmark

Hsi-Ai Tsao, Lei Hsiung, Pin-Yu Chen et al.

ICLR 2024 • arXiv:2310.08381
#12259

On the Hardness of Constrained Cooperative Multi-Agent Reinforcement Learning

Ziyi Chen, Yi Zhou, Heng Huang

ICLR 2024
#12260

Information Retention via Learning Supplemental Features

Zhipeng Xie, Yahe Li

ICLR 2024 • spotlight
#12261

Geometry-Aware Projective Mapping for Unbounded Neural Radiance Fields

Junoh Lee, Hyunjun Jung, Jinhwi Park et al.

ICLR 2024
#12262

Off-Policy Primal-Dual Safe Reinforcement Learning

Zifan Wu, Bo Tang, Qian Lin et al.

ICLR 2024 • arXiv:2401.14758
#12263

When should we prefer Decision Transformers for Offline Reinforcement Learning?

Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard et al.

ICLR 2024 • arXiv:2305.14550
#12264

ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning

Jiecheng Lu, Xu Han, Shihao Yang

ICLR 2024 • oral • arXiv:2310.09488
#12265

SAS: Structured Activation Sparsification

Yusuke Sekikawa, Shingo Yashima

ICLR 2024
#12266

Learning Multi-Agent Communication with Contrastive Learning

Yat Long (Richie) Lo, Biswa Sengupta, Jakob Foerster et al.

ICLR 2024 • arXiv:2307.01403
#12267

Xformer: Hybrid X-Shaped Transformer for Image Denoising

Jiale Zhang, Yulun Zhang, Jinjin Gu et al.

ICLR 2024 • arXiv:2303.06440
#12268

Dynamics-Informed Protein Design with Structure Conditioning

Urszula Julia Komorowska, Simon Mathis, Kieran Didi et al.

ICLR 2024
#12269

Identifiable Latent Polynomial Causal Models through the Lens of Change

Yuhang Liu, Zhen Zhang, Dong Gong et al.

ICLR 2024 • arXiv:2310.15580
#12270

SYMBOL: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning

Jiacheng Chen, Zeyuan Ma, Hongshu Guo et al.

ICLR 2024 • arXiv:2402.02355
#12271

Graph Lottery Ticket Automated

Guibin Zhang, Kun Wang, Wei Huang et al.

ICLR 2024
#12272

Threshold-Consistent Margin Loss for Open-World Deep Metric Learning

Qin ZHANG, Linghan Xu, Jun Fang et al.

ICLR 2024 • arXiv:2307.04047
#12273

Encoding Unitig-level Assembly Graphs with Heterophilous Constraints for Metagenomic Contigs Binning

Hansheng Xue, Vijini Mallawaarachchi, Lexing Xie et al.

ICLR 2024
#12274

Adaptive Regret for Bandits Made Possible: Two Queries Suffice

Zhou Lu, Qiuyi (Richard) Zhang, Xinyi Chen et al.

ICLR 2024 • arXiv:2401.09278
#12275

AdaMerging: Adaptive Model Merging for Multi-Task Learning

Enneng Yang, Zhenyi Wang, Li Shen et al.

ICLR 2024 • arXiv:2310.02575
#12276

Statistically Optimal $K$-means Clustering via Nonnegative Low-rank Semidefinite Programming

Yubo Zhuang, Xiaohui Chen, Yun Yang et al.

ICLR 2024 • arXiv:2305.18436
#12277

Improved statistical and computational complexity of the mean-field Langevin dynamics under structured data

Atsushi Nitanda, Kazusato Oko, Taiji Suzuki et al.

ICLR 2024
#12278

Bridging Neural and Symbolic Representations with Transitional Dictionary Learning

Junyan Cheng, Peter Chin

ICLR 2024 • arXiv:2308.02000
#12279

Thin-Shell Object Manipulations With Differentiable Physics Simulations

Yian Wang, Juntian Zheng, Zhehuan Chen et al.

ICLR 2024 • spotlight • arXiv:2404.00451
#12280

Bayesian Coreset Optimization for Personalized Federated Learning

Prateek Chanda, Shrey Modi, Ganesh Ramakrishnan

ICLR 2024 • arXiv:2511.01800
#12281

Beyond Spatio-Temporal Representations: Evolving Fourier Transform for Temporal Graphs

Anson Simon Bastos, Kuldeep Singh, Abhishek Nadgeri et al.

ICLR 2024 • oral • arXiv:2402.16078
#12282

Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs

Woomin Song, Seunghyuk Oh, Sangwoo Mo et al.

ICLR 2024 • arXiv:2404.10308
#12283

Towards Best Practices of Activation Patching in Language Models: Metrics and Methods

Fred Zhang, Neel Nanda

ICLR 2024 • arXiv:2309.16042
#12284

Scale-Adaptive Diffusion Model for Complex Sketch Synthesis

Jijin Hu, Ke Li, Yonggang Qi et al.

ICLR 2024
#12285

On the Over-Memorization During Natural, Robust and Catastrophic Overfitting

Runqi Lin, Chaojian Yu, Bo Han et al.

ICLR 2024 • arXiv:2310.08847
#12286

Mastering Memory Tasks with World Models

Mohammad Reza Samsami, Artem Zholus, Janarthanan Rajendran et al.

ICLR 2024 • oral • arXiv:2403.04253
#12287

Towards Principled Representation Learning from Videos for Reinforcement Learning

Dipendra Kumar Misra, Akanksha Saran, Tengyang Xie et al.

ICLR 2024 • oral • arXiv:2403.13765
#12288

Expected flow networks in stochastic environments and two-player zero-sum games

Marco Jiralerspong, Bilun Sun, Danilo Vucetic et al.

ICLR 2024 • arXiv:2310.02779
#12289

Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond

Tianxin Wei, Bowen Jin, Ruirui Li et al.

ICLR 2024 • arXiv:2403.10667
#12290

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Yung-Sung Chuang, Yujia Xie, Hongyin Luo et al.

ICLR 2024 • arXiv:2309.03883
#12291

Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials

Ivan Grega, Ilyes Batatia, Gábor Csányi et al.

ICLR 2024 • arXiv:2401.16914
#12292

SALMON: Self-Alignment with Instructable Reward Models

Zhiqing Sun, Yikang Shen, Hongxin Zhang et al.

ICLR 2024 • arXiv:2310.05910
#12293

Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs

Feiyang Kang, Hoang Anh Just, Yifan Sun et al.

ICLR 2024 • arXiv:2405.02774
#12294

Augmenting Transformers with Recursively Composed Multi-grained Representations

Xiang Hu, Qingyang Zhu, Kewei Tu et al.

ICLR 2024 • arXiv:2309.16319
#12295

Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization

Guang Lin, Chao Li, Jianhai Zhang et al.

ICLR 2024 • arXiv:2401.16352
#12296

Large Language Models as Generalizable Policies for Embodied Tasks

Andrew Szot, Max Schwarzer, Harsh Agrawal et al.

ICLR 2024 • arXiv:2310.17722
#12297

The Joint Effect of Task Similarity and Overparameterization on Catastrophic Forgetting — An Analytical Model

Daniel Goldfarb, Itay Evron, Nir Weinberger et al.

ICLR 2024 • arXiv:2401.12617
#12298

Fast Equilibrium of SGD in Generic Situations

Zhiyuan Li, Yi Wang, Zhiren Wang

ICLR 2024
#12299

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data

Yuhui Zhang, Elaine Sui, Serena Yeung

ICLR 2024 • arXiv:2401.08567
#12300

Compositional Preference Models for Aligning LMs

DONGYOUNG GO, Tomek Korbak, Germán Kruszewski et al.

ICLR 2024 • arXiv:2310.13011
#12301

Diffusion Posterior Sampling for Linear Inverse Problem Solving: A Filtering Perspective

Zehao Dou, Yang Song

ICLR 2024
#12302

Demystifying Local & Global Fairness Trade-offs in Federated Learning Using Partial Information Decomposition

Faisal Hamman, Sanghamitra Dutta

ICLR 2024
#12303

Learning Conditional Invariances through Non-Commutativity

Abhra Chaudhuri, Serban Georgescu, Anjan Dutta

ICLR 2024 • arXiv:2402.11682
#12304

Generative Modeling with Phase Stochastic Bridge

Tianrong Chen, Jiatao Gu, Laurent Dinh et al.

ICLR 2024 • arXiv:2310.07805
#12305

Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation

Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis et al.

ICLR 2024 • spotlight • arXiv:2311.15647
#12306

RobustTSF: Towards Theory and Design of Robust Time Series Forecasting with Anomalies

Hao Cheng, Qingsong Wen, Yang Liu et al.

ICLR 2024 • arXiv:2402.02032
#12307

Tailoring Self-Rationalizers with Multi-Reward Distillation

Sahana Ramnath, Brihi Joshi, Skyler Hallinan et al.

ICLR 2024 • arXiv:2311.02805
#12308

Controlling Vision-Language Models for Multi-Task Image Restoration

Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao et al.

ICLR 2024 • arXiv:2310.01018
#12309

VFLAIR: A Research Library and Benchmark for Vertical Federated Learning

TIANYUAN ZOU, Zixuan GU, Yu He et al.

ICLR 2024 • arXiv:2310.09827
#12310

Measuring Vision-Language STEM Skills of Neural Models

Jianhao Shen, Ye Yuan, Srbuhi Mirzoyan et al.

ICLR 2024 • arXiv:2402.17205
#12311

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Qingyan Guo, Rui Wang, Junliang Guo et al.

ICLR 2024
#12312

MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data

Jiaxin Yin, Yuanyuan Qiao, Zitang Zhou et al.

ICLR 2024
#12313

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers

Kai Shen, Zeqian Ju, Xu Tan et al.

ICLR 2024 • spotlight • arXiv:2304.09116
#12314

CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding

Qiongyi Zhou, Changde Du, Shengpei Wang et al.

ICLR 2024 • arXiv:2402.08994
#12315

How connectivity structure shapes rich and lazy learning in neural circuits

Yuhan Helena Liu, Aristide Baratin, Jonathan Cornford et al.

ICLR 2024 • arXiv:2310.08513
#12316

ARGS: Alignment as Reward-Guided Search

Maxim Khanov, Jirayu Burapacheep, Yixuan Li

ICLR 2024 • arXiv:2402.01694
#12317

Let Models Speak Ciphers: Multiagent Debate through Embeddings

Chau Pham, Boyi Liu, Yingxiang Yang et al.

ICLR 2024 • arXiv:2310.06272
#12318

NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks

Wenxi Wang, Yang Hu, Mohit Tiwari et al.

ICLR 2024 • arXiv:2110.14053
#12319

Understanding when Dynamics-Invariant Data Augmentations Benefit Model-free Reinforcement Learning Updates

Nicholas Corrado, Josiah Hanna

ICLR 2024 • arXiv:2310.17786
#12320

Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation

Tien Manh Luong, Khai Nguyen, Nhat Ho et al.

ICLR 2024 • arXiv:2405.10084
#12321

Text-to-3D with Classifier Score Distillation

Xin Yu, Yuan-Chen Guo, Yangguang Li et al.

ICLR 2024 • arXiv:2310.19415
#12322

Transformers can optimally learn regression mixture models

Reese Pathak, Rajat Sen, Weihao Kong et al.

ICLR 2024 • arXiv:2311.08362
#12323

Dirichlet-based Per-Sample Weighting by Transition Matrix for Noisy Label Learning

HeeSun Bae, Seungjae Shin, Byeonghu Na et al.

ICLR 2024 • arXiv:2403.02690
#12324

Branch-GAN: Improving Text Generation with (not so) Large Language Models

Fredrik Carlsson, Johan Broberg, Erik Hillbom et al.

ICLR 2024