Most Cited ICML "negation feature learning" Papers
5,975 papers found • Page 5 of 30
Conference
Geometry Informed Tokenization of Molecules for Language Model Generation
Xiner Li, Limei Wang, Youzhi Luo et al.
Testing the Limits of Fine-Tuning for Improving Visual Cognition in Vision Language Models
Luca M. Schulze Buschoff, Konstantinos Voudouris, Elif Akata et al.
Graph4MM: Weaving Multimodal Learning with Structural Information
Xuying Ning, Dongqi Fu, Tianxin Wei et al.
UltraTWD: Optimizing Ultrametric Trees for Tree-Wasserstein Distance
Fangchen Yu, Yanzhen Chen, Jiaxing Wei et al.
DynaMind: Reasoning over Abstract Video Dynamics for Embodied Decision-Making
Ziru Wang, Mengmeng Wang, Jade Dai et al.
Do Not Mimic My Voice : Speaker Identity Unlearning for Zero-Shot Text-to-Speech
Taesoo Kim, Jinju Kim, Dongchan Kim et al.
LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
Zhuoling Li, Xiaogang Xu, Zhenhua Xu et al.
Generative Social Choice: The Next Generation
Niclas Boehmer, Sara Fish, Ariel Procaccia
Tilted Sharpness-Aware Minimization
Tian Li, Tianyi Zhou, Jeff Bilmes
Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning
Hongyao Chen, Tianyang Xu, Xiaojun Wu et al.
LensLLM: Unveiling Fine-Tuning Dynamics for LLM Selection
Xinyue Zeng, Haohui Wang, Junhong Lin et al.
Better to Teach than to Give: Domain Generalized Semantic Segmentation via Agent Queries with Diffusion Model Guidance
Fan Li, Xuan Wang, Min Qi et al.
Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model
Kaito Ariu, Alexandre Proutiere, Se-Young Yun
On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization
Undral Byambadalai, Tomu Hirata, Tatsushi Oka et al.
Density Ratio Estimation with Conditional Probability Paths
Hanlin Yu, Arto Klami, Aapo Hyvarinen et al.
Categorical Distributional Reinforcement Learning with Kullback-Leibler Divergence: Convergence and Asymptotics
Tyler Kastner, Mark Rowland, Yunhao Tang et al.
How to Evaluate and Mitigate IP Infringement in Visual Generative AI?
Zhenting Wang, Chen Chen, Vikash Sehwag et al.
Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search
Boyan Li, Jiayi Zhang, Ju Fan et al.
A Closer Look at Transformers for Time Series Forecasting: Understanding Why They Work and Where They Struggle
Yu Chen, Nathalia Céspedes, Payam Barnaghi
Survival Analysis via Density Estimation
Hiroki Yanagisawa, Shunta Akiyama
What Limits Bidirectional Model's Generative Capabilities? A Uni-Bi-Directional Mixture-of-Expert Method For Bidirectional Fine-tuning
Zuchao Li, Yonghua Hei, Qiwei Li et al.
Efficient Molecular Conformer Generation with SO(3)-Averaged Flow Matching and Reflow
Zhonglin Cao, Mario Geiger, Allan Costa et al.
Towards Understanding Gradient Dynamics of the Sliced-Wasserstein Distance via Critical Point Analysis
Christophe Vauthier, Anna Korba, Quentin Mérigot
Diversity By Design: Leveraging Distribution Matching for Offline Model-Based Optimization
Michael S Yao, James Gee, Osbert Bastani
Hardware and Software Platform Inference
Cheng Zhang, Hanna Foerster, Robert Mullins et al.
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
Samir Khaki, Xiuyu Li, Junxian Guo et al.
Exponential Family Variational Flow Matching for Tabular Data Generation
Andres Guzman Cordero, Floor Eijkelboom, Jan-Willem van de Meent
LIMEFLDL: A Local Interpretable Model-Agnostic Explanations Approach for Label Distribution Learning
Xiuyi Jia, Jinchi Li, Yunan Lu et al.
Explicit Discovery of Nonlinear Symmetries from Dynamic Data
Lexiang Hu, Yikang Li, Zhouchen Lin
Towards Global-level Mechanistic Interpretability: A Perspective of Modular Circuits of Large Language Models
Yinhan He, Wendy Zheng, Yushun Dong et al.
Provable Policy Gradient for Robust Average-Reward MDPs Beyond Rectangularity
Qiuhao Wang, Yuqi Zha, Chin Pang Ho et al.
GRAIL: Graph Edit Distance and Node Alignment using LLM-Generated Code
Samidha Verma, Arushi Goyal, Ananya Mathur et al.
Otter: Generating Tests from Issues to Validate SWE Patches
Toufique Ahmed, Jatin Ganhotra, Rangeet Pan et al.
RULEBREAKERS: Challenging LLMs at the Crossroads between Formal Logic and Human-like Reasoning
Jason Chan, Robert Gaizauskas, Zhixue Zhao
Limitations of measure-first protocols in quantum machine learning
Casper Gyurik, Riccardo Molteni, Vedran Dunjko
Goal-Space Planning with Subgoal Models
Chunlok Lo, Kevin Roice, Parham Mohammad Panahi et al.
Training High Performance Spiking Neural Network by Temporal Model Calibration
Jiaqi Yan, Changping Wang, De Ma et al.
Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning
Zeyu Gan, Yun Liao, Yong Liu
Adaptive Elicitation of Latent Information Using Natural Language
Jimmy Wang, Tom Zollo, Richard Zemel et al.
A Generic Family of Graphical Models: Diversity, Efficiency, and Heterogeneity
Yufei Huang, Changhu Wang, Junjie Tang et al.
Explaining, Fast and Slow: Abstraction and Refinement of Provable Explanations
Shahaf Bassan, Yizhak Elboher, Tobias Ladner et al.
Differential Privacy Guarantees of Markov Chain Monte Carlo Algorithms
Andrea Bertazzi, Tim Johnston, Gareth Roberts et al.
SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders
Bartosz Cywiński, Kamil Deja
Double Machine Learning for Causal Inference under Shared-State Interference
Chris Hays, Manish Raghavan
Cooperation of Experts: Fusing Heterogeneous Information with Large Margin
Shuo Wang, Shunyang Huang, Jinghui Yuan et al.
SGD Jittering: A Training Strategy for Robust and Accurate Model-Based Architectures
Peimeng Guan, Mark Davenport
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach
Xu Zhang, Kaidi Xu, Ziqing Hu et al.
Improved Learning via k-DTW: A Novel Dissimilarity Measure for Curves
Amer Krivosija, Alexander Munteanu, André Nusser et al.
Disentangling and Integrating Relational and Sensory Information in Transformer Architectures
Awni Altabaa, John Lafferty
Policy Gradient with Tree Expansion
Gal Dalal, Assaf Hallak, Gugan Chandrashekhar Mallika Thoppe et al.
LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces
Rashid Mushkani, Perampalli Shravan Nayak, Hugo Berard et al.
LBI-FL: Low-Bit Integerized Federated Learning with Temporally Dynamic Bit-Width Allocation
Li Ding, Hao Zhang, Wenrui Dai et al.
On the Clean Generalization and Robust Overfitting in Adversarial Training from Two Theoretical Views: Representation Complexity and Training Dynamics
Binghui Li, Yuanzhi Li
Solving Zero-Sum Convex Markov Games
Fivos Kalogiannis, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Ian Gemp et al.
Where is the Truth? The Risk of Getting Confounded in a Continual World
Florian Peter Busch, Roshni Ramanna Kamath, Rupert Mitchell et al.
Eliciting Language Model Behaviors with Investigator Agents
Xiang Li, Neil Chowdhury, Daniel Johnson et al.
Learning Multi-Level Features with Matryoshka Sparse Autoencoders
Bart Bussmann, Noa Nabeshima, Adam Karvonen et al.
Improved Regret Analysis in Gaussian Process Bandits: Optimality for Noiseless Reward, RKHS norm, and Non-Stationary Variance
Shogo Iwazaki, Shion Takeno
The Berkeley Function Calling Leaderboard (BFCL): From Tool Use to Agentic Evaluation of Large Language Models
Shishir G. Patil, Huanzhi Mao, Fanjia Yan et al.
Functional Alignment Can Mislead: Examining Model Stitching
Damian Smith, Harvey Mannering, Antonia Marcu
Dynamical phases of short-term memory mechanisms in RNNs
Bariscan Kurtkaya, Fatih Dinc, Mert Yuksekgonul et al.
AutoAL: Automated Active Learning with Differentiable Query Strategy Search
Yifeng Wang, Xueying Zhan, Siyu Huang
Inverse Optimization via Learning Feasible Regions
Ke Ren, Peyman Mohajerin Esfahani, Angelos Georghiou
Boosting Adversarial Robustness with CLAT: Criticality Leveraged Adversarial Training
Bhavna Gopal, Huanrui Yang, Jingyang Zhang et al.
The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes
Pedro Santos, Alberto Sardinha, Francisco S. Melo
Hierarchical Equivariant Policy via Frame Transfer
Haibo Zhao, Dian Wang, Yizhe Zhu et al.
AEQA-NAT : Adaptive End-to-end Quantization Alignment Training Framework for Non-autoregressive Machine Translation
Xiangyu Qu, Guojing Liu, Liang Li
HYGMA: Hypergraph Coordination Networks with Dynamic Grouping for Multi-Agent Reinforcement Learning
Chiqiang Liu, Dazi Li
DTZO: Distributed Trilevel Zeroth Order Learning with Provable Non-Asymptotic Convergence
Yang Jiao, Kai Yang, Chengtao Jian
Alberta Wells Dataset: Pinpointing Oil and Gas Wells from Satellite Imagery
Pratinav Seth, Michelle Lin, BREFO YAW et al.
Improving the Statistical Efficiency of Cross-Conformal Prediction
Invariant Deep Uplift Modeling for Incentive Assignment in Online Marketing via Probability of Necessity and Sufficiency
Zexu Sun, Qiyu Han, Hao Yang et al.
A General Graph Spectral Wavelet Convolution via Chebyshev Order Decomposition
Nian Liu, Xiaoxin He, Thomas Laurent et al.
Tensorized Multi-View Multi-Label Classification via Laplace Tensor Rank
Qiyu Zhong, Yi Shan, Haobo Wang et al.
Diffusion Models are Secretly Exchangeable: Parallelizing DDPMs via Auto Speculation
Hengyuan Hu, Aniket Das, Dorsa Sadigh et al.
Conformal Prediction as Bayesian Quadrature
Jake Snell, Thomas Griffiths
MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention
Yucheng Li, Huiqiang Jiang, Chengruidong Zhang et al.
EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations
Haotian Zhai, Connor Lawless, Ellen Vitercik et al.
EasyInv: Toward Fast and Better DDIM Inversion
Ziyue Zhang, Mingbao Lin, Shuicheng YAN et al.
NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits
Tushar Aggarwal, Swayam Singh, Abhijeet Awasthi et al.
Private Federated Learning using Preference-Optimized Synthetic Data
Charlie Hou, Mei-Yu Wang, Yige Zhu et al.
Breaking Barriers: Combinatorial Algorithms for Non-Monotone Submodular Maximization with Sublinear Adaptivity and $1/e$ Approximation
Yixin Chen, Wenjing Chen, Alan Kuhnle
Rethink GraphODE Generalization within Coupled Dynamical System
Guancheng Wan, Zijie Huang, Wanjia Zhao et al.
Interaction-Aware Gaussian Weighting for Clustered Federated Learning
Alessandro Licciardi, Davide Leo, Eros Fanì et al.
AutoCATE: End-to-End, Automated Treatment Effect Estimation
Toon Vanderschueren, Tim Verdonck, Mihaela van der Schaar et al.
Robust Noise Attenuation via Adaptive Pooling of Transformer Outputs
Greyson Brothers
Learning Adversarial MDPs with Stochastic Hard Constraints
Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.
Lean and Mean Adaptive Optimization via Subset-Norm and Subspace-Momentum with Convergence Guarantees
Thien Nguyen, Huy Nguyen
Covered Forest: Fine-grained generalization analysis of graph neural networks
Antonis Vasileiou, Ben Finkelshtein, Floris Geerts et al.
Protein Structure Tokenization: Benchmarking and New Recipe
Xinyu Yuan, Zichen Wang, Marcus Collins et al.
LoRA Training Provably Converges to a Low-Rank Global Minimum Or It Fails Loudly (But it Probably Won't Fail)
Junsu Kim, Jaeyeon Kim, Ernest Ryu
Fluctuations of the largest eigenvalues of transformed spiked Wigner matrices
Aro Lee, Ji Oon Lee
NeuralCohort: Cohort-aware Neural Representation Learning for Healthcare Analytics
Changshuo Liu, Lingze Zeng, Kaiping Zheng et al.
Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction Uncertainty
Yeseul Cho, Baekrok Shin, Changmin Kang et al.
Adaptive Exploration for Multi-Reward Multi-Policy Evaluation
Alessio Russo, Aldo Pacchiano
Power Mean Estimation in Stochastic Continuous Monte-Carlo Tree Search
Tuan Dam
How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects
Wonkwang Lee, Jongwon Jeong, Taehong Moon et al.
Near Optimal Non-asymptotic Sample Complexity of 1-Identification
Zitian Li, Wang Chi Cheung
Online Learning in Risk Sensitive constrained MDP
Arnob Ghosh, Mehrdad Moharrami
TINED: GNNs-to-MLPs by Teacher Injection and Dirichlet Energy Distillation
Ziang Zhou, Zhihao DING, Jieming Shi et al.
Larger or Smaller Reward Margins to Select Preferences for LLM Alignment?
Kexin Huang, Junkang Wu, Ziqian Chen et al.
Causal Logistic Bandits with Counterfactual Fairness Constraints
Jiajun Chen, Jin Tian, Chris Quinn
Rethinking Point Cloud Data Augmentation: Topologically Consistent Deformation
Jian Bi, Qianliang Wu, Xiang Li et al.
EFDTR: Learnable Elliptical Fourier Descriptor Transformer for Instance Segmentation
Jiawei Cao, Chaochen Gu, Hao Cheng et al.
AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models
Zheng Lian, Haoyu Chen, Lan Chen et al.
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
Jintao Zhang, Haofeng Huang, Pengle Zhang et al.
Identifying Neural Dynamics Using Interventional State Space Models
Amin Nejatbakhsh, Yixin Wang
Efficiently Serving Large Multimodal Models Using EPD Disaggregation
Gursimran Singh, Xinglu Wang, Yifan Hu et al.
Data-driven Design of Randomized Control Trials with Guaranteed Treatment Effects
Santiago Cortes-Gomez, Naveen Raman, Aarti Singh et al.
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Yiran Wang, Chenshu Liu, Yunfan Li et al.
On the Private Estimation of Smooth Transport Maps
Clément Lalanne, Franck Iutzeler, Loubes Jean-Michel et al.
EditLord: Learning Code Transformation Rules for Code Editing
Weichen Li, Albert Jan, Baishakhi Ray et al.
LoRA-Gen: Specializing Large Language Model via Online LoRA Generation
Yicheng Xiao, Lin Song, Rui Yang et al.
Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding
Tian Jin, Ellie Cheng, Zachary Ankner et al.
Empowering World Models with Reflection for Embodied Video Prediction
Xiaowei Chi, Chun-Kai Fan, Hengyuan Zhang et al.
Equivariant Neural Tangent Kernels
Philipp Misof, Pan Kessel, Jan Gerken
An Improved Clique-Picking Algorithm for Counting Markov Equivalent DAGs via Super Cliques Transfer
Lifu Liu, Shiyuan He, Jianhua Guo
An Online Learning Approach to Prompt-based Selection of Generative Models and LLMs
Xiaoyan Hu, Ho-fung Leung, Farzan Farnia
Exploring Criteria of Loss Reweighting to Enhance LLM Unlearning
Puning Yang, Qizhou Wang, Zhuo Huang et al.
Continual Reinforcement Learning by Planning with Online World Models
Zichen Liu, Guoji Fu, Chao Du et al.
Minerva: A Programmable Memory Test Benchmark for Language Models
Menglin Xia, Victor Ruehle, Saravanakumar Rajmohan et al.
Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners
Hung Manh Pham, Aaqib Saeed, Dong Ma
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts
Zhongyang Li, Ziyue Li, Tianyi Zhou
Zero-Shot Adaptation of Parameter-Efficient Fine-Tuning in Diffusion Models
Farzad Farhadzadeh, Debasmit Das, Shubhankar Borse et al.
Learning Distribution-wise Control in Representation Space for Language Models
Deng, Ruidi Chang, Hanjie Chen
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
Wanyun Xie, Francesco Tonin, Volkan Cevher
Optimal and Practical Batched Linear Bandit Algorithm
Sanghoon Yu, Min-hwan Oh
FicGCN: Unveiling the Homomorphic Encryption Efficiency from Irregular Graph Convolutional Networks
Zhaoxuan Kan, Husheng Han, shangyi shi et al.
An Instrumental Value for Data Production and its Application to Data Pricing
Rui Ai, Boxiang Lyu, Zhaoran Wang et al.
Fishers for Free? Approximating the Fisher Information Matrix by Recycling the Squared Gradient Accumulator
YuXin Li, Felix Dangel, Derek Tam et al.
Scalable Model Merging with Progressive Layer-wise Distillation
Jing Xu, Jiazheng Li, Jingzhao Zhang
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
Roman Abramov, Felix Steinbauer, Gjergji Kasneci
Code-Generated Graph Representations Using Multiple LLM Agents for Material Properties Prediction
Jiao Huang, Qianli Xing, Jinglong Ji et al.
CAN: Leveraging Clients As Navigators for Generative Replay in Federated Continual Learning
Xuankun Rong, Jianshu Zhang, Kun He et al.
ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks
Saurabh Jha, Rohan Arora, Yuji Watanabe et al.
FeatSharp: Your Vision Model Features, Sharper
Mike Ranzinger, Greg Heinrich, Pavlo Molchanov et al.
A Tale of Two Structures: Do LLMs Capture the Fractal Complexity of Language?
Ibrahim Alabdulmohsin, Andreas Steiner
WILTing Trees: Interpreting the Distance Between MPNN Embeddings
Masahiro Negishi, Thomas Gärtner, Pascal Welke
Private Lossless Multiple Release
Joel Daniel Andersson, Lukas Retschmeier, Boel Nelson et al.
Making Hard Problems Easier with Custom Data Distributions and Loss Regularization: A Case Study in Modular Arithmetic
Eshika Saxena, Alberto Alfarano, Emily Wenger et al.
An Adaptive Orthogonal Convolution Scheme for Efficient and Flexible CNN Architectures
Thibaut Boissin, Franck Mamalet, Thomas Fel et al.
Directed Graph Grammars for Sequence-based Learning
Michael Sun, Orion Foo, Gang Liu et al.
Grammar-Forced Translation of Natural Language to Temporal Logic using LLMs
William English, Dominic Simon, Sumit Jha et al.
WOMD-Reasoning: A Large-Scale Dataset for Interaction Reasoning in Driving
Yiheng Li, Cunxin Fan, Chongjian GE et al.
Branches: Efficiently Seeking Optimal Sparse Decision Trees via AO*
Ayman Chaouki, Jesse Read, Albert Bifet
SDMG: Smoothing Your Diffusion Models for Powerful Graph Representation Learning
Junyou Zhu, Langzhou He, Chao Gao et al.
Unlocking the Power of Rehearsal in Continual Learning: A Theoretical Perspective
Junze Deng, Qinhang Wu, Peizhong Ju et al.
Best of Both Worlds: Regret Minimization versus Minimax Play
Adrian Müller, Jon Schneider, EFSTRATIOS PANTELEIMON SKOULAKIS et al.
Low-Rank Adapting Models for Sparse Autoencoders
Matthew Chen, Josh Engels, Max Tegmark
Mixed-curvature decision trees and random forests
Philippe Chlenski, Quentin Chu, Raiyan Khan et al.
Provable Benefit of Random Permutations over Uniform Sampling in Stochastic Coordinate Descent
Donghwa Kim, Jaewook Lee, Chulhee Yun
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
Samira Abnar, Harshay Shah, Dan Busbridge et al.
StealthInk: A Multi-bit and Stealthy Watermark for Large Language Models
Ya Jiang, Chuxiong Wu, Massieh Kordi Boroujeny et al.
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
Oliver Sieberling, Denis Kuznedelev, Eldar Kurtic et al.
Fast and Provable Algorithms for Sparse PCA with Improved Sample Complexity
Jian-Feng Cai, Zhuozhi XIAN, Jiaxi Ying
LDMol: A Text-to-Molecule Diffusion Model with Structurally Informative Latent Space Surpasses AR Models
Jinho Chang, Jong Chul YE
Scalable Non-Equivariant 3D Molecule Generation via Rotational Alignment
Yuhui Ding, Thomas Hofmann
A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized Certainty Equivalents
Kaiwen Wang, Dawen Liang, Nathan Kallus et al.
Preconditioned Riemannian Gradient Descent Algorithm for Low-Multilinear-Rank Tensor Completion
Yuanwei Zhang, Fengmiao Bian, Xiaoqun Zhang et al.
OrcaLoca: An LLM Agent Framework for Software Issue Localization
Zhongming Yu, Hejia Zhang, Yujie Zhao et al.
Reflect-then-Plan: Offline Model-Based Planning through a Doubly Bayesian Lens
Jihwan Jeong, Xiaoyu Wang, Jingmin Wang et al.
Efficient Heterogeneity-Aware Federated Active Data Selection
Yingpeng Tang, Chao Ren, Xiaoli Tang et al.
Gradient Descent Converges Arbitrarily Fast for Logistic Regression via Large and Adaptive Stepsizes
Ruiqi Zhang, Jingfeng Wu, Peter Bartlett
Nonlinearly Preconditioned Gradient Methods under Generalized Smoothness
Konstantinos Oikonomidis, Jan Quan, Emanuel Laude et al.
The Elicitation Game: Evaluating Capability Elicitation Techniques
Felix Hofstätter, Teun van der Weij, Jayden Teoh et al.
Learning Efficient Robotic Garment Manipulation with Standardization
zhou changshi, Feng Luan, hujiarui et al.
Consensus Is All You Get: The Role of Attention in Transformers
Alvaro Rodriguez Abella, João Pedro Silvestre, Paulo Tabuada
Causality Inspired Federated Learning for OOD Generalization
Jiayuan Zhang, Xuefeng Liu, Jianwei Niu et al.
Falsification of Unconfoundedness by Testing Independence of Causal Mechanisms
Rickard K.A. Karlsson, Jesse H. Krijthe
Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training
Mozhi Zhang, Howe Tissue, Lu Wang et al.
Revisiting Convergence: Shuffling Complexity Beyond Lipschitz Smoothness
Qi He, Peiran Yu, Ziyi Chen et al.
Optimal Decision Tree Pruning Revisited: Algorithms and Complexity
Juha Harviainen, Frank Sommer, Manuel Sorge et al.
Understanding Sharpness Dynamics in NN Training with a Minimalist Example: The Effects of Dataset Difficulty, Depth, Stochasticity, and More
Geonhui Yoo, Minhak Song, Chulhee Yun
Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing
Zhuoran Zhang, Yongxiang Li, Zijian Kan et al.
Inductive Gradient Adjustment for Spectral Bias in Implicit Neural Representations
Kexuan Shi, Hai Chen, Leheng Zhang et al.
Generalized Category Discovery via Reciprocal Learning and Class-Wise Distribution Regularization
Duo Liu, Zhiquan Tan, Linglan Zhao et al.
Clipped SGD Algorithms for Performative Prediction: Tight Bounds for Stochastic Bias and Remedies
Qiang Li, Michal Yemini, Hoi To Wai
Instruction-Following Pruning for Large Language Models
Bairu Hou, Qibin Chen, Jianyu Wang et al.
One Stone, Two Birds: Enhancing Adversarial Defense Through the Lens of Distributional Discrepancy
Jiacheng Zhang, Benjamin Rubinstein, Jingfeng Zhang et al.
Domain-Adapted Diffusion Model for PROTAC Linker Design Through the Lens of Density Ratio in Chemical Space
Zixing Song, Ziqiao Meng, Jose Miguel Hernandez-Lobato
Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset
Hao Zhou, Xu Yang, Mingyu Fan et al.
Continuously Updating Digital Twins using Large Language Models
Harry Amad, Nicolás Astorga, Mihaela van der Schaar
Random Policy Evaluation Uncovers Policies of Generative Flow Networks
Haoran He, Emmanuel Bengio, Qingpeng Cai et al.
HGOT: Self-supervised Heterogeneous Graph Neural Network with Optimal Transport
Yanbei Liu, Chongxu Wang, Zhitao Xiao et al.
PAC-Bayes Analysis for Recalibration in Classification
Masahiro Fujisawa, Futoshi Futami
Global Context-aware Representation Learning for Spatially Resolved Transcriptomics
Yunhak Oh, Junseok Lee, Yeongmin Kim et al.
Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition
Zichen Wang, Chuanhao Li, Huazheng Wang
Aggregation of Dependent Expert Distributions in Multimodal Variational Autoencoders
Rogelio A. Mancisidor, Robert Jenssen, Shujian Yu et al.
Benefits of Early Stopping in Gradient Descent for Overparameterized Logistic Regression
Jingfeng Wu, Peter Bartlett, Matus Telgarsky et al.
Black-Box Adversarial Attacks on LLM-Based Code Completion
Slobodan Jenko, Niels Mündler, Jingxuan He et al.
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning
Zican Hu, Wei Liu, Xiaoye Qu et al.
Training Diffusion-based Generative Models with Limited Data
Zhaoyu Zhang, Yang Hua, Guanxiong Sun et al.
Rethinking Time Encoding via Learnable Transformation Functions
Xi Chen, Yateng Tang, Jiarong Xu et al.
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou, Aleksa Sukovic, Yasaman Zolfimoselo et al.
Deep Reinforcement Learning from Hierarchical Preference Design
Alexander Bukharin, Yixiao Li, Pengcheng He et al.
Optimizing Noise Distributions for Differential Privacy
Atefeh Gilani, Felipe Gomez, Shahab Asoodeh et al.
The Batch Complexity of Bandit Pure Exploration
Adrienne Tuynman, Rémy Degenne
iN2V: Bringing Transductive Node Embeddings to Inductive Graphs
Nicolas Lell, Ansgar Scherp
SecEmb: Sparsity-Aware Secure Federated Learning of On-Device Recommender System with Large Embedding
Peihua Mai, Youlong Ding, Ziyan Lyu et al.
GrokFormer: Graph Fourier Kolmogorov-Arnold Transformers
GUOGUO AI, Guansong Pang, Hezhe Qiao et al.