ICML 2024 Papers

2,635 papers found • Page 53 of 53

Xing Han Lù, Zdeněk Kasner, Siva Reddy

Lee-Ad Gottlieb, Timor Sharabi, Roi Weiss

Billy Franks, Christopher Morris, Ameya Velingker et al.

Snir Hordan, Tal Amir, Nadav Dym

Xingwu Chen, Difan Zou

Hongkang Li, Meng Wang, Tengfei Ma et al.

William Yang, Ye Zhu, Zhiwei Deng et al.

Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.

Aaditya Singh, Ted Moskovitz, Feilx Hill et al.

raghav singhal, Mark Goldstein, Rajesh Ranganath

Xisen Jin, Xiang Ren

Ching-Yun (Irene) Ko, Pin-Yu Chen, Payel Das et al.

Xuefeng Du, Yiyou Sun, Sharon Li

Zhening Li, Gabriel Poesia, Armando Solar-Lezama

My Phan, Kianté Brantley, Stephanie Milani et al.

Haoran You, Yichao Fu, Zheng Wang et al.

Loek van Rossem, Andrew Saxe

Yang Zhao, Hao Zhang, Xiuyuan Hu

Yuxiao Wen, Arthur Jacot

Xavi Suau, Pieter Delobelle, Katherine Metcalf et al.

Jin Hwa Lee, Stefano Mannelli, Andrew Saxe

Pratik Bhowal, Achint Soni, Sirisha Rambhatla

Mohamad Amin Mohamadi, Zhiyuan Li, Lei Wu et al.

Zhenmei Shi, Junyi Wei, Zhuoyan Xu et al.

Kumar Shubham, Aishwarya Jayagopal, Syed Danish et al.

Alexandre Drouin, Maxime Gasse, Massimo Caccia et al.

Buyun Zhang, Liang Luo, Yuxin Chen et al.

Yiwei Ma, Zhekai Lin, Jiayi Ji et al.

Ritwik Gupta, Shufan Li, Tyler Zhu et al.

che liu, Zhongwei Wan, Cheng Ouyang et al.

Tyler Ingebrand, Amy Zhang, Ufuk Topcu

Hila Manor, Tomer Michaeli

Anton Plaksin, Vitaly Kalev

Zhuanghua Liu, Cheng Chen, Luo Luo et al.

ICML 2024 Papers

Conference

Paper Type

WebLINX: Real-World Website Navigation with Multi-Turn Dialogue

Weighted distance nearest neighbor condensing

Weisfeiler-Leman at the margin: When more expressivity matters

Weisfeiler Leman for Euclidean Equivariant Machine Learning

What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks

What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding

What is Dataset Distillation Learning?

What is the Long-Run Distribution of Stochastic Gradient Descent? A Large Deviations Analysis

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

What’s the score? Automated Denoising Score Matching for Nonlinear Diffusions

What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement

What Would Gauss Say About Representations? Probing Pretrained Image Models using Synthetic Gaussian Benchmarks

When and How Does In-Distribution Label Help Out-of-Distribution Detection?

When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions

When is Transfer Learning Possible?

When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

When Representations Align: Universality in Representation Learning Dynamics

When Will Gradient Regularization Be Harmful?

Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature Learning

Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning

Why do Variational Autoencoders Really Promote Disentanglement?

Why Do You Grok? A Theoretical Analysis on Grokking Modular Addition

Why Larger Language Models Do In-context Learning Differently?

Winner-takes-all learners are geometry-aware conditional density estimators

WISER: Weak Supervision and Supervised Representation Learning to Improve Drug Response Prediction in Cancer

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Wukong: Towards a Scaling Law for Large-Scale Recommendation

X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation

xT: Nested Tokenization for Larger Context in Large Images

Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement

Zero-Shot Reinforcement Learning via Function Encoders

Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion

Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach

Zeroth-Order Methods for Constrained Nonconvex Nonsmooth Stochastic Optimization