ICML 2024 Papers
2,635 papers found • Page 53 of 53
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Xing Han Lù, Zdeněk Kasner, Siva Reddy
Weighted distance nearest neighbor condensing
Lee-Ad Gottlieb, Timor Sharabi, Roi Weiss
Weisfeiler-Leman at the margin: When more expressivity matters
Billy Franks, Christopher Morris, Ameya Velingker et al.
Weisfeiler Leman for Euclidean Equivariant Machine Learning
Snir Hordan, Tal Amir, Nadav Dym
What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks
Xingwu Chen, Difan Zou
What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
Hongkang Li, Meng Wang, Tengfei Ma et al.
What is Dataset Distillation Learning?
William Yang, Ye Zhu, Zhiwei Deng et al.
What is the Long-Run Distribution of Stochastic Gradient Descent? A Large Deviations Analysis
Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation
Aaditya Singh, Ted Moskovitz, Feilx Hill et al.
What’s the score? Automated Denoising Score Matching for Nonlinear Diffusions
raghav singhal, Mark Goldstein, Rajesh Ranganath
What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement
Xisen Jin, Xiang Ren
What Would Gauss Say About Representations? Probing Pretrained Image Models using Synthetic Gaussian Benchmarks
Ching-Yun (Irene) Ko, Pin-Yu Chen, Payel Das et al.
When and How Does In-Distribution Label Help Out-of-Distribution Detection?
Xuefeng Du, Yiyou Sun, Sharon Li
When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions
Zhening Li, Gabriel Poesia, Armando Solar-Lezama
When is Transfer Learning Possible?
My Phan, Kianté Brantley, Stephanie Milani et al.
When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
Haoran You, Yichao Fu, Zheng Wang et al.
When Representations Align: Universality in Representation Learning Dynamics
Loek van Rossem, Andrew Saxe
When Will Gradient Regularization Be Harmful?
Yang Zhao, Hao Zhang, Xiuyuan Hu
Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature Learning
Yuxiao Wen, Arthur Jacot
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models
Xavi Suau, Pieter Delobelle, Katherine Metcalf et al.
Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning
Jin Hwa Lee, Stefano Mannelli, Andrew Saxe
Why do Variational Autoencoders Really Promote Disentanglement?
Pratik Bhowal, Achint Soni, Sirisha Rambhatla
Why Do You Grok? A Theoretical Analysis on Grokking Modular Addition
Mohamad Amin Mohamadi, Zhiyuan Li, Lei Wu et al.
Why Larger Language Models Do In-context Learning Differently?
Zhenmei Shi, Junyi Wei, Zhuoyan Xu et al.
Winner-takes-all learners are geometry-aware conditional density estimators
Victor Letzelter, David Perera, Cédric Rommel et al.
WISER: Weak Supervision and Supervised Representation Learning to Improve Drug Response Prediction in Cancer
Kumar Shubham, Aishwarya Jayagopal, Syed Danish et al.
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
Alexandre Drouin, Maxime Gasse, Massimo Caccia et al.
Wukong: Towards a Scaling Law for Large-Scale Recommendation
Buyun Zhang, Liang Luo, Yuxin Chen et al.
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
Yiwei Ma, Zhekai Lin, Jiayi Ji et al.
xT: Nested Tokenization for Larger Context in Large Images
Ritwik Gupta, Shufan Li, Tyler Zhu et al.
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement
che liu, Zhongwei Wan, Cheng Ouyang et al.
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand, Amy Zhang, Ufuk Topcu
Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion
Hila Manor, Tomer Michaeli
Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach
Anton Plaksin, Vitaly Kalev
Zeroth-Order Methods for Constrained Nonconvex Nonsmooth Stochastic Optimization
Zhuanghua Liu, Cheng Chen, Luo Luo et al.