ICML Papers
5,975 papers found
What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement
Xisen Jin, Xiang Ren
What Would Gauss Say About Representations? Probing Pretrained Image Models using Synthetic Gaussian Benchmarks
Ching-Yun (Irene) Ko, Pin-Yu Chen, Payel Das et al.
When and How Does In-Distribution Label Help Out-of-Distribution Detection?
Xuefeng Du, Yiyou Sun, Sharon Li
When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions
Zhening Li, Gabriel Poesia, Armando Solar-Lezama
When is Transfer Learning Possible?
My Phan, Kianté Brantley, Stephanie Milani et al.
When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
Haoran You, Yichao Fu, Zheng Wang et al.
When Representations Align: Universality in Representation Learning Dynamics
Loek van Rossem, Andrew Saxe
When Will Gradient Regularization Be Harmful?
Yang Zhao, Hao Zhang, Xiuyuan Hu
Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature Learning
Yuxiao Wen, Arthur Jacot
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models
Xavi Suau, Pieter Delobelle, Katherine Metcalf et al.
Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning
Jin Hwa Lee, Stefano Mannelli, Andrew Saxe
Why do Variational Autoencoders Really Promote Disentanglement?
Pratik Bhowal, Achint Soni, Sirisha Rambhatla
Why Do You Grok? A Theoretical Analysis on Grokking Modular Addition
Mohamad Amin Mohamadi, Zhiyuan Li, Lei Wu et al.
Why Larger Language Models Do In-context Learning Differently?
Zhenmei Shi, Junyi Wei, Zhuoyan Xu et al.
Winner-takes-all learners are geometry-aware conditional density estimators
Victor Letzelter, David Perera, Cédric Rommel et al.
WISER: Weak Supervision and Supervised Representation Learning to Improve Drug Response Prediction in Cancer
Kumar Shubham, Aishwarya Jayagopal, Syed Danish et al.
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
Alexandre Drouin, Maxime Gasse, Massimo Caccia et al.
Wukong: Towards a Scaling Law for Large-Scale Recommendation
Buyun Zhang, Liang Luo, Yuxin Chen et al.
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
Yiwei Ma, Zhekai Lin, Jiayi Ji et al.
xT: Nested Tokenization for Larger Context in Large Images
Ritwik Gupta, Shufan Li, Tyler Zhu et al.
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement
Che Liu, Zhongwei Wan, Cheng Ouyang et al.
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand, Amy Zhang, Ufuk Topcu
Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion
Hila Manor, Tomer Michaeli
Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach
Anton Plaksin, Vitaly Kalev
Zeroth-Order Methods for Constrained Nonconvex Nonsmooth Stochastic Optimization
Zhuanghua Liu, Cheng Chen, Luo Luo et al.