Most Cited ICML "convolutional layers" Papers
5,975 papers found • Page 21 of 30
Conference
Wukong: Towards a Scaling Law for Large-Scale Recommendation
Buyun Zhang, Liang Luo, Yuxin Chen et al.
Sparse-to-dense Multimodal Image Registration via Multi-Task Learning
Kaining Zhang, Jiayi Ma
In-Context Principle Learning from Mistakes
Tianjun Zhang, Aman Madaan, Luyu Gao et al.
A Federated Stochastic Multi-level Compositional Minimax Algorithm for Deep AUC Maximization
Xinwen Zhang, Ali Payani, Myungjin Lee et al.
Enhancing Storage and Computational Efficiency in Federated Multimodal Learning for Large-Scale Models
Zixin Zhang, Fan Qi, Changsheng Xu
Online Resource Allocation with Non-Stationary Customers
Xiaoyue Zhang, Hanzhang Qin, Mabel Chou
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Xinyu Zhang, Wenjie Qiu, Yi-Chen Li et al.
Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks
Lujing Zhang, Aaron Roth, Linjun Zhang
Online Matching with Stochastic Rewards: Provable Better Bound via Adversarial Reinforcement Learning
Qiankun Zhang, Aocheng Shen, Boyu Zhang et al.
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
Zongmeng Zhang, Yufeng Shi, Jinhua Zhu et al.
Interpreting and Improving Large Language Models in Arithmetic Calculation
Wei Zhang, Wan Chaoqun, Yonggang Zhang et al.
Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data
Jiahan Zhang, Qi Wei, Feng Liu et al.
Exploring the Benefit of Activation Sparsity in Pre-training
Zhengyan Zhang, Chaojun Xiao, Qiujieli Qin et al.
Causal Representation Learning from Multiple Distributions: A General Setting
Kun Zhang, Shaoan Xie, Ignavier Ng et al.
FESSNC: Fast Exponentially Stable and Safe Neural Controller
Jingdong Zhang, Luan Yang, Qunxi Zhu et al.
Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions
Kaihong Zhang, Heqi Yin, Feng Liang et al.
Beyond the ROC Curve: Classification Trees Using Cost-Optimal Curves, with Application to Imbalanced Datasets
Magzhan Gabidolla, Arman Zharmagambetov, Miguel Carreira-Perpinan
Distributionally Robust Data Valuation
Xiaoqiang Lin, Xinyi Xu, Zhaoxuan Wu et al.
Efficient Contextual Bandits with Uninformed Feedback Graphs
Mengxiao Zhang, Yuheng Zhang, Haipeng Luo et al.
Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases
Ziyi Zhang, Sen Zhang, Yibing Zhan et al.
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Junkai Zhang, Weitong Zhang, Dongruo Zhou et al.
Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize
Tianren Zhang, Chujie Zhao, Guanyu Chen et al.
On the Expressive Power of Spectral Invariant Graph Neural Networks
Bohang Zhang, Lingxiao Zhao, Haggai Maron
Position: Measure Dataset Diversity, Don't Just Claim It
Dora Zhao, Jerone Andrews, Orestis Papakyriakopoulos et al.
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
Hao Zhao, Maksym Andriushchenko, Francesco Croce et al.
Spider: A Unified Framework for Context-dependent Concept Segmentation
Xiaoqi Zhao, Youwei Pang, Wei Ji et al.
Quantum Implicit Neural Representations
Jiaming Zhao, Wenbo Qiao, Peng Zhang et al.
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective
Lei Zhao, Mengdi Wang, Yu Bai
A Statistical Theory of Regularization-Based Continual Learning
Xuyang Zhao, Huiyuan Wang, Weiran Huang et al.
Conformal Predictions under Markovian Data
Frédéric Zheng, Alexandre Proutiere
On Prompt-Driven Safeguarding for Large Language Models
Chujie Zheng, Fan Yin, Hao Zhou et al.
On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm
Zhanpeng Zhou, Zijun Chen, Yilan Chen et al.
Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
Tianchen Zhou, Hairi, Haibo Yang et al.
Conformalized Adaptive Forecasting of Heterogeneous Trajectories
Yanfei Zhou, Lars Lindemann, Matteo Sesia
RAUCA: A Novel Physical Adversarial Attack on Vehicle Detectors via Robust and Accurate Camouflage Generation
Jiawei Zhou, Linye Lyu, Daojing He et al.
CurBench: Curriculum Learning Benchmark
Yuwei Zhou, Zirui Pan, Xin Wang et al.
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
Yifei Zhou, Andrea Zanette, Jiayi Pan et al.
Generative Active Learning for Long-tailed Instance Segmentation
Muzhi Zhu, Chengxiang Fan, Hao Chen et al.
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Lianghui Zhu, Bencheng Liao, Qian Zhang et al.
Switched Flow Matching: Eliminating Singularities via Switching ODEs
Qunxi Zhu, Wei Lin
Online Learning in Betting Markets: Profit versus Prediction
Haiqing Zhu, Alexander Soen, Yun Kuen Cheung et al.
Translation Equivariant Transformer Neural Processes
Matthew Ashman, Cristiana Diaconu, Junhyuck Kim et al.
Language Models Represent Beliefs of Self and Others
Wentao Zhu, Zhining Zhang, Yizhou Wang
Converting Transformers to Polynomial Form for Secure Inference Over Homomorphic Encryption
Itamar Zimerman, Moran Baruch, Nir Drucker et al.
Viewing Transformers Through the Lens of Long Convolutions Layers
Itamar Zimerman, Lior Wolf
Compositional Few-Shot Class-Incremental Learning
Yixiong Zou, Shanghang Zhang, haichen zhou et al.
Improving Equivariant Graph Neural Networks on Large Geometric Graphs via Virtual Nodes Learning
Yuelin Zhang, Jiacheng Cen, Jiaqi Han et al.
REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates
Arshia Afzal, Grigorios Chrysos, Volkan Cevher et al.
Amend to Alignment: Decoupled Prompt Tuning for Mitigating Spurious Correlation in Vision-Language Models
Jie ZHANG, Xiaosong Ma, Song Guo et al.
Sample Complexity Bounds for Estimating Probability Divergences under Invariances
Behrooz Tahmasebi, Stefanie Jegelka
On the Embedding Collapse when Scaling up Recommendation Models
Xingzhuo Guo, Junwei Pan, Ximei Wang et al.
Position: Understanding LLMs Requires More Than Statistical Generalization
Patrik Reizinger, Szilvia Ujváry, Anna Mészáros et al.
Differentially Private Sum-Product Networks
Xenia Heilmann, Mattia Cerrato, Ernst Althaus
Log Neural Controlled Differential Equations: The Lie Brackets Make A Difference
Benjamin Walker, Andrew McLeod, Tiexin QIN et al.
Improved Differentially Private and Lazy Online Convex Optimization: Lower Regret without Smoothness Requirements
Naman Agarwal, Satyen Kale, Karan Singh et al.
Expand-and-Cluster: Parameter Recovery of Neural Networks
Flavio Martinelli, Berfin Simsek, Wulfram Gerstner et al.
REMEDI: Corrective Transformations for Improved Neural Entropy Estimation
Viktor Nilsson, Anirban Samaddar, Sandeep Madireddy et al.
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
Han Shen, Zhuoran Yang, Tianyi Chen
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning
Long Qian, Juncheng Li, Yu Wu et al.
StackSight: Unveiling WebAssembly through Large Language Models and Neurosymbolic Chain-of-Thought Decompilation
Weike Fang, Zhejian Zhou, Junzhou He et al.
Disentangled Continual Graph Neural Architecture Search with Invariant Modular Supernet
Zeyang Zhang, Xin Wang, Yijian Qin et al.
Disentangled Graph Self-supervised Learning for Out-of-Distribution Generalization
Haoyang Li, Xin Wang, Zeyang Zhang et al.
On the Effectiveness of Supervision in Asymmetric Non-Contrastive Learning
Jeongheon Oh, Kibok Lee
DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency
Zalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi
Position: Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?
Shayne Longpre, Robert Mahari, Naana Obeng-Marnu et al.
Lookbehind-SAM: k steps back, 1 step forward
Gonçalo Mordido, Pranshu Malviya, Aristide Baratin et al.
From Neurons to Neutrons: A Case Study in Interpretability
Ouail Kitouni, Niklas Nolte, Víctor Samuel Pérez-Díaz et al.
Do Transformer World Models Give Better Policy Gradients?
Michel Ma, Tianwei Ni, Clement Gehring et al.
Conformal prediction for multi-dimensional time series by ellipsoidal sets
Chen Xu, Hanyang Jiang, Yao Xie
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi et al.
MC-GTA: Metric-Constrained Model-Based Clustering using Goodness-of-fit Tests with Autocorrelations
Zhangyu Wang, Gengchen Mai, Krzysztof Janowicz et al.
Enforcing Constraints in RNA Secondary Structure Predictions: A Post-Processing Framework Based on the Assignment Problem
Geewon Suh, Gyeongjo Hwang, SeokjunKang et al.
Reshape and Adapt for Output Quantization (RAOQ): Quantization-aware Training for In-memory Computing Systems
Bonan Zhang, Chia-Yu Chen, Naveen Verma
What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
Hongkang Li, Meng Wang, Tengfei Ma et al.
Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning
Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan et al.
ProtoGate: Prototype-based Neural Networks with Global-to-local Feature Selection for Tabular Biomedical Data
Xiangjian Jiang, Andrei Margeloiu, Nikola Simidjievski et al.
Controllable Prompt Tuning For Balancing Group Distributional Robustness
Hoang Phan, Andrew Wilson, Qi Lei
Position: Scarce Resource Allocations That Rely On Machine Learning Should Be Randomized
Shomik Jain, Kathleen A. Creel, Ashia Wilson
Deep Functional Factor Models: Forecasting High-Dimensional Functional Time Series via Bayesian Nonparametric Factorization
Yirui Liu, Xinghao Qiao, Yulong Pei et al.
Diffusion Posterior Sampling is Computationally Intractable
Shivam Gupta, Ajil Jalal, Aditya Parulekar et al.
PerceptAnon: Exploring the Human Perception of Image Anonymization Beyond Pseudonymization for GDPR
Kartik Patwari, Chen-Nee Chuah, Lingjuan Lyu et al.
Stochastic Optimization with Arbitrary Recurrent Data Sampling
William Powell, Hanbaek Lyu
Partially Stochastic Infinitely Deep Bayesian Neural Networks
Sergio Calvo Ordoñez, Matthieu Meunier, Francesco Piatti et al.
Neuro-Visualizer: A Novel Auto-Encoder-Based Loss Landscape Visualization Method With an Application in Knowledge-Guided Machine Learning
Mohannad Elhamod, Anuj Karpatne
Discovering Bias in Latent Space: An Unsupervised Debiasing Approach
Dyah Adila, Shuai Zhang, Boran Han et al.
Centralized Selection with Preferences in the Presence of Biases
L. Elisa Celis, Amit Kumar, Nisheeth K. Vishnoi et al.
Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization
Rui Li, Chaozhuo Li, Yanming Shen et al.
Rethinking Transformers in Solving POMDPs
Chenhao Lu, Ruizhe Shi, Yuyao Liu et al.
Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization
Hyeonah Kim, Minsu Kim, Sungsoo Ahn et al.
Embodied CoT Distillation From LLM To Off-the-shelf Agents
Wonje Choi, Woo Kyung Kim, Minjong Yoo et al.
A General Framework for Sequential Decision-Making under Adaptivity Constraints
Nuoya Xiong, Zhaoran Wang, Zhuoran Yang
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
Yao Mu, Junting Chen, Qing-Long Zhang et al.
Layerwise Change of Knowledge in Neural Networks
Xu Cheng, Lei Cheng, Zhaoran Peng et al.
Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers
Xiaoqiang Lin, Zhaoxuan Wu, Zhongxiang Dai et al.
Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers
Johann Schmidt, Sebastian Stober
WISER: Weak Supervision and Supervised Representation Learning to Improve Drug Response Prediction in Cancer
Kumar Shubham, Aishwarya Jayagopal, Syed Danish et al.
Recurrent Early Exits for Federated Learning with Heterogeneous Clients
Royson Lee, Javier Fernandez-Marques, Xu Hu et al.
Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning
Xinran Li, Zifan LIU, Shibo Chen et al.
Revitalizing Multivariate Time Series Forecasting: Learnable Decomposition with Inter-Series Dependencies and Intra-Series Variations Modeling
Guoqi Yu, Jing Zou, Xiaowei Hu et al.
On the Calibration of Human Pose Estimation
Kerui Gu, Rongyu Chen, Xuanlong Yu et al.
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Kaining Ying, Fanqing Meng, Jin Wang et al.
The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling
Jiajun Ma, Shuchen Xue, Tianyang Hu et al.
DFD: Distilling the Feature Disparity Differently for Detectors
Kang Liu, Yingyi Zhang, Jingyun Zhang et al.
Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model
Fei Liu, Tong Xialiang, Mingxuan Yuan et al.
Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences
Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban et al.
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs
Stelios Triantafyllou, Aleksa Sukovic, Debmalya Mandal et al.
Auto-Linear Phenomenon in Subsurface Imaging
Yinan Feng, Yinpeng Chen, Peng Jin et al.
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
Zongxin Yang, Guikun Chen, Xiaodi Li et al.
Distributed Bilevel Optimization with Communication Compression
Yutong He, Jie Hu, Xinmeng Huang et al.
On the Weight Dynamics of Deep Normalized Networks
Christian H.X. Ali Mehmeti-Göpel, Michael Wand
Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations
Jonas Beck, Nathanael Bosch, Michael Deistler et al.
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
Kuang-Huei Lee, Xinyun Chen, Hiroki Furuta et al.
Don’t Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a Budget
Florian Dorner, Moritz Hardt
Membership Inference Attacks on Diffusion Models via Quantile Regression
Shuai Tang, Steven Wu, Sergul Aydore et al.
Premise Order Matters in Reasoning with Large Language Models
Xinyun Chen, Ryan Chi, Xuezhi Wang et al.
Graph As Point Set
Xiyuan Wang, Pan Li, Muhan Zhang
An Analysis of Linear Time Series Forecasting Models
William Toner, Luke Darlow
Accelerating Legacy Numerical Solvers by Non-intrusive Gradient-based Meta-solving
Sohei Arisaka, Qianxiao Li
Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning
Jiachen Li, Qiaozi Gao, Michael Johnston et al.
Optimizing Watermarks for Large Language Models
Bram Wouters
A Probabilistic Approach to Learning the Degree of Equivariance in Steerable CNNs
Lars Veefkind, Gabriele Cesa
Hieros: Hierarchical Imagination on Structured State Space Sequence World Models
Paul Mattes, Rainer Schlosser, Ralf Herbrich
Improved Stability and Generalization Guarantees of the Decentralized SGD Algorithm
Batiste Le Bars, Aurélien Bellet, Marc Tommasi et al.
Equivariant Diffusion for Crystal Structure Prediction
Peijia Lin, Pin Chen, Rui Jiao et al.
Online Variational Sequential Monte Carlo
Alessandro Mastrototaro, Jimmy Olsson
Generalized Preference Optimization: A Unified Approach to Offline Alignment
Yunhao Tang, Zhaohan Guo, Zeyu Zheng et al.
Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach
Bin Zhang, Hangyu Mao, Lijuan Li et al.
MS$^3$D: A RG Flow-Based Regularization for GAN Training with Limited Data
Jian Wang, Xin Lan, Yuxin Tian et al.
DiffDA: a Diffusion model for weather-scale Data Assimilation
Langwen Huang, Lukas Gianinazzi, Yuejiang Yu et al.
The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright BreachesWithout Adjusting Finetuning Pipeline
Haonan Wang, Qianli Shen, Yao Tong et al.
Dynamic Spectral Clustering with Provable Approximation Guarantee
Steinar Laenen, He Sun
Adversarially Robust Deep Multi-View Clustering: A Novel Attack and Defense Framework
Haonan Huang, Guoxu Zhou, Yanghang Zheng et al.
Rich-Observation Reinforcement Learning with Continuous Latent Dynamics
Yuda Song, Lili Wu, Dylan Foster et al.
Sparse Dimensionality Reduction Revisited
Mikael Møller Høgsgaard, Lior Kamma, Kasper Green Larsen et al.
A New Theoretical Perspective on Data Heterogeneity in Federated Optimization
Jiayi Wang, Shiqiang Wang, Rong-Rong Chen et al.
Speech Self-Supervised Learning Using Diffusion Model Synthetic Data
Heting Gao, Kaizhi Qian, Junrui Ni et al.
ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
Zhicheng Zheng, Xin Yan, Zhenfang Chen et al.
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation
Yufei Wang, Zhou Xian, Feng Chen et al.
Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models
Didi Zhu, Zhongyi Sun, Zexi Li et al.
Diffusion Rejection Sampling
Byeonghu Na, Yeongmin Kim, Minsang Park et al.
Information-Directed Pessimism for Offline Reinforcement Learning
Alec Koppel, Sujay Bhatt, Jiacheng Guo et al.
Exploring the Enigma of Neural Dynamics Through A Scattering-Transform Mixer Landscape for Riemannian Manifold
Tingting Dan, Ziquan Wei, Won Hwa Kim et al.
Partial Optimality in the Linear Ordering Problem
David Stein, Bjoern Andres
Differentially Private Domain Adaptation with Theoretical Guarantees
Raef Bassily, Corinna Cortes, Anqi Mao et al.
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion
Zehan Wang, Ziang Zhang, xize cheng et al.
Box Facets and Cut Facets of Lifted Multicut Polytopes
Lucas Fabian Naumann, Jannik Irmai, Shengxian Zhao et al.
Improving Computational Complexity in Statistical Models with Local Curvature Information
Pedram Akbarian, Tongzheng Ren, Jiacheng Zhuo et al.
Online Algorithms with Uncertainty-Quantified Predictions
Bo Sun, Jerry Huang, Nicolas Christianson et al.
A Fresh Take on Stale Embeddings: Improving Dense Retriever Training with Corrector Networks
Nicholas Monath, Will Grathwohl, Michael Boratko et al.
R2E: Turning any Github Repository into a Programming Agent Environment
Naman Jain, Manish Shetty Molahalli, Tianjun Zhang et al.
Enhancing Value Function Estimation through First-Order State-Action Dynamics in Offline Reinforcement Learning
Yun-Hsuan Lien, Ping-Chun Hsieh, Tzu-Mao Li et al.
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
Yilun Xu, Gabriele Corso, Tommi Jaakkola et al.
Reward Shaping for Reinforcement Learning with An Assistant Reward Agent
Haozhe Ma, Kuankuan Sima, Thanh Vinh Vo et al.
Human Alignment of Large Language Models through Online Preference Optimisation
Daniele Calandriello, Zhaohan Guo, REMI MUNOS et al.
Expressivity and Generalization: Fragment-Biases for Molecular GNNs
Tom Wollschläger, Niklas Kemper, Leon Hetzel et al.
SelfIE: Self-Interpretation of Large Language Model Embeddings
Haozhe Chen, Carl Vondrick, Chengzhi Mao
QBMK: Quantum-based Matching Kernels for Un-attributed Graphs
Lu Bai, Lixin Cui, Ming Li et al.
Sparsest Models Elude Pruning: An Exposé of Pruning’s Current Capabilities
Stephen Zhang, Vardan Papyan
Efficient Precision and Recall Metrics for Assessing Generative Models using Hubness-aware Sampling
Yuanbang Liang, Jing Wu, Yu-Kun Lai et al.
An Empirical Study Into What Matters for Calibrating Vision-Language Models
Weijie Tu, Weijian Deng, Dylan Campbell et al.
A Computational Framework for Solving Wasserstein Lagrangian Flows
Kirill Neklyudov, Rob Brekelmans, Alexander Tong et al.
Contextual Feature Selection with Conditional Stochastic Gates
Ram Dyuthi Sristi, Ofir Lindenbaum, Shira Lifshitz et al.
Weisfeiler Leman for Euclidean Equivariant Machine Learning
Snir Hordan, Tal Amir, Nadav Dym
Learning to Scale Logits for Temperature-Conditional GFlowNets
Minsu Kim, Joohwan Ko, Taeyoung Yun et al.
Reinforcement Learning and Regret Bounds for Admission Control
Lucas Weber, Ana Busic, Jiamin ZHU
QuIP$\#$: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks
Albert Tseng, Jerry Chee, Qingyao Sun et al.
Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis
Stefan Horoi, Albert Manuel Orozco Camacho, Eugene Belilovsky et al.
Revealing Vision-Language Integration in the Brain with Multimodal Networks
Vighnesh Subramaniam, Colin Conwell, Christopher Wang et al.
Learning Optimal Projection for Forecast Reconciliation of Hierarchical Time Series
Asterios Tsiourvas, Wei Sun, Georgia Perakis et al.
Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Alex Ayoub, Kaiwen Wang, Vincent Liu et al.
Sampling-based Multi-dimensional Recalibration
Youngseog Chung, Ian Char, Jeff Schneider
NExT: Teaching Large Language Models to Reason about Code Execution
Ansong Ni, Miltiadis Allamanis, Arman Cohan et al.
PANDA: Expanded Width-Aware Message Passing Beyond Rewiring
Jeongwhan Choi, Sumin Parksumin, Hyowon Wi et al.
Position: AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research
Riley Simmons-Edler, Ryan Badman, Shayne Longpre et al.
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Yake Wei, Di Hu
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Yihan Du, Anna Winnicki, Gal Dalal et al.
Model-Based Minimum Bayes Risk Decoding for Text Generation
Yuu Jinnai, Tetsuro Morimura, Ukyo Honda et al.
Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models
Tanmay Gautam, Youngsuk Park, Hao Zhou et al.
Collage: Light-Weight Low-Precision Strategy for LLM Training
Tao Yu, Gaurav Gupta, KARTHICK GOPALSWAMY et al.
SelfVC: Voice Conversion With Iterative Refinement using Self Transformations
Paarth Neekhara, Shehzeen Hussain, Rafael Valle et al.
Evaluating Quantized Large Language Models
Shiyao Li, Xuefei Ning, Luning Wang et al.
First-Order Manifold Data Augmentation for Regression Learning
Ilya Kaufman, Omri Azencot
Causal Discovery with Fewer Conditional Independence Tests
Kirankumar Shiragur, Jiaqi Zhang, Caroline Uhler
What is the Long-Run Distribution of Stochastic Gradient Descent? A Large Deviations Analysis
Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.
Gradient Compressed Sensing: A Query-Efficient Gradient Estimator for High-Dimensional Zeroth-Order Optimization
Ruizhong Qiu, Hanghang Tong
Fundamental Limitations of Alignment in Large Language Models
Yotam Wolf, Noam Wies, Oshri Avnery et al.
Predicting Dose-Response Curves with Deep Neural Networks
Pedro A. Campana, Paul Prasse, Tobias Scheffer
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization
Liam Schramm, Abdeslam Boularias
Causal Effect Identification in LiNGAM Models with Latent Confounders
Daniele Tramontano, Yaroslav Kivva, Saber Salehkaleybar et al.
Extending Adversarial Attacks to Produce Adversarial Class Probability Distributions
Jon Vadillo, Roberto Santana, Jose A Lozano
Tilt and Average : Geometric Adjustment of the Last Layer for Recalibration
Gyusang Cho, Chan-Hyun Youn
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Dake Zhang, Boxiang Lyu, Shuang Qiu et al.
Implicit Representations for Constrained Image Segmentation
Jan Philipp Schneider, Mishal Fatima, Jovita Lukasik et al.
Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
Wei Li, Hehe Fan, Yongkang Wong et al.
InferCept: Efficient Intercept Support for Augmented Large Language Model Inference
Reyna Abhyankar, Zijian He, Vikranth Srivatsa et al.
Scalable Safe Policy Improvement for Factored Multi-Agent MDPs
Federico Bianchi, Edoardo Zorzi, Alberto Castellini et al.
Local Feature Selection without Label or Feature Leakage for Interpretable Machine Learning Predictions
Harrie Oosterhuis, Lijun Lyu, Avishek Anand
Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
Kyoungseok Jang, Chicheng Zhang, Kwang-Sung Jun
Lightweight Image Super-Resolution via Flexible Meta Pruning
Yulun Zhang, Kai Zhang, Luc Van Gool et al.
KnowFormer: Revisiting Transformers for Knowledge Graph Reasoning
Junnan Liu, Qianren Mao, Weifeng Jiang et al.