Most Cited ICML "monotonic interpolation" Papers
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost
Sen Xing, Muyan Zhong, Zeqiang Lai et al.
Direct Prediction Set Minimization via Bilevel Conformal Classifier Training
Yuanjie Shi, Hooman Shahrokhi, Xuesong Jia et al.
Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization
Taeyoung Yun, Kiyoung Om, Jaewoo Lee et al.
Sharpness-Aware Data Generation for Zero-shot Quantization
Hoang Dung, Cuong Pham, Trung Le et al.
Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers
Brian Chen, Tianyang Hu, Hui Jin et al.
Keep the Momentum: Conservation Laws beyond Euclidean Gradient Flows
Sibylle Marcotte, Rémi Gribonval, Gabriel Peyré
Generalization Bounds for Heavy-Tailed SDEs through the Fractional Fokker-Planck Equation
Benjamin Dupuis, Umut Simsekli
Rethinking Score Distilling Sampling for 3D Editing and Generation
Xingyu Miao, Haoran Duan, Yang Long et al.
Neuro-Symbolic Temporal Point Processes
Yang Yang, Chao Yang, Boyang Li et al.
General framework for online-to-nonconvex conversion: Schedule-free SGD is also effective for nonconvex optimization
Kwangjun Ahn, Gagik Magakyan, Ashok Cutkosky
Representation Shattering in Transformers: A Synthetic Study with Knowledge Editing
Kento Nishi, Rahul Ramesh, Maya Okawa et al.
Improved Online Confidence Bounds for Multinomial Logistic Bandits
Joongkyu Lee, Min-hwan Oh
Energy-Based Preference Model Offers Better Offline Alignment than the Bradley-Terry Preference Model
Yuzhong Hong, Hanshan Zhang, Junwei Bao et al.
EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization
Mujin Cheon, Jay Lee, Dong-Yeun Koh et al.
Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition
Michael Valancius, Maxwell Lennon, Junier Oliva
Neighboring Perturbations of Knowledge Editing on Large Language Models
Jun-Yu Ma, Zhen-Hua Ling, Ningyu Zhang et al.
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs
Xin Su, Man Luo, Kris Pan et al.
RZ-NAS: Enhancing LLM-guided Neural Architecture Search via Reflective Zero-Cost Strategy
Zipeng Ji, Guanghui Zhu, Chunfeng Yuan et al.
Impact of Decentralized Learning on Player Utilities in Stackelberg Games
Kate Donahue, Nicole Immorlica, Meena Jagadeesan et al.
Exploring the LLM Journey from Cognition to Expression with Linear Representations
Yuzi Yan, Jialian Li, Yipin Zhang et al.
BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models
Susan Liang, Dejan Markovic, Israel D. Gebru et al.
Efficient Robotic Policy Learning via Latent Space Backward Planning
Dongxiu Liu, Haoyi Niu, Zhihao Wang et al.
Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation
Randall Balestriero, Romain Cosentino, Sarath Shekkizhar
TruthFlow: Truthful LLM Generation via Representation Flow Correction
Hanyu Wang, Bochuan Cao, Yuanpu Cao et al.
Autonomy-of-Experts Models
Ang Lv, Ruobing Xie, Yining Qian et al.
Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion
Xingpei Ma, Jiaran Cai, Yuansheng Guan et al.
Error Feedback Can Accurately Compress Preconditioners
Ionut-Vlad Modoranu, Aleksei Kalinov, Eldar Kurtic et al.
AttNS: Attention-Inspired Numerical Solving For Limited Data Scenarios
Zhongzhan Huang, Mingfu Liang, Shanshan Zhong et al.
GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning
Zhun Mou, Bin Xia, Zhengchao Huang et al.
MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment
Tianze Wang, Dongnan Gui, Yifan Hu et al.
Annealing Flow Generative Models Towards Sampling High-Dimensional and Multi-Modal Distributions
Dongze Wu, Yao Xie
SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering
Xuehang Guo, Xingyao Wang, Yangyi Chen et al.
Intersectional Unfairness Discovery
Gezheng Xu, Qi Chen, Charles X. Ling et al.
Online bipartite matching with imperfect advice
Davin Choo, Themis Gouleakis, Chun Kai Ling et al.
Federated Neuro-Symbolic Learning
Pengwei Xing, Songtao Lu, Han Yu
Dimension-Independent Rates for Structured Neural Density Estimation
Vandermeulen, Wai Ming Tai, Bryon Aragam
Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models
Shizhan Gong, Yankai Jiang, Dou Qi et al.
Dynamic Byzantine-Robust Learning: Adapting to Switching Byzantine Workers
Ron Dorfman, Naseem Yehya, Kfir Levy
Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search
Kejing Lu, Chuan Xiao, Yoshiharu Ishikawa
Focus On This, Not That! Steering LLMs with Adaptive Feature Specification
Tom A. Lamb, Adam Davies, Alasdair J Paren et al.
Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting
Anthony Chen, Huanrui Yang, Yulu Gan et al.
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang, Qiuchen Qian, David Boyle
Geometry-Calibrated DRO: Combating Over-Pessimism with Free Energy Implications
Jiashuo Liu, Jiayun Wu, Tianyu Wang et al.
FlowDrag: 3D-aware Drag-based Image Editing with Mesh-guided Deformation Vector Flow Fields
Gwanhyeong Koo, Sunjae Yoon, Younghwan Lee et al.
Tree-Sliced Wasserstein Distance with Nonlinear Projection
Thanh Tran, Viet Hoang Tran, Thanh Chu et al.
Symmetric Matrix Completion with ReLU Sampling
Huikang Liu, Peng Wang, Longxiu Huang et al.
Compressed and distributed least-squares regression: convergence rates with applications to federated learning
Constantin Philippenko, Aymeric Dieuleveut
Density Ratio Estimation with Conditional Probability Paths
Hanlin Yu, Arto Klami, Aapo Hyvarinen et al.
Unraveling the Impact of Heterophilic Structures on Graph Positive-Unlabeled Learning
Yuhao Wu, Jiangchao Yao, Bo Han et al.
BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback
Gaurav Pandey, Yatin Nandwani, Tahira Naseem et al.
Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data
Guan Zhong, Likang Wu, Hongke Zhao et al.
What makes an Ensemble (Un) Interpretable?
Shahaf Bassan, Guy Amir, Meirav Zehavi et al.
Learning Exceptional Subgroups by End-to-End Maximizing KL-Divergence
Sascha Xu, Nils Philipp Walter, Janis Kalofolias et al.
Nonlinearly Preconditioned Gradient Methods under Generalized Smoothness
Konstantinos Oikonomidis, Jan Quan, Emanuel Laude et al.
Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation
Xuexin Chen, Ruichu Cai, Zhengting Huang et al.
Confidence-aware Contrastive Learning for Selective Classification
Yu-Chang Wu, Shen-Huan Lyu, Haopu Shang et al.
LDMol: A Text-to-Molecule Diffusion Model with Structurally Informative Latent Space Surpasses AR Models
Jinho Chang, Jong Chul Ye
KnowFormer: Revisiting Transformers for Knowledge Graph Reasoning
Junnan Liu, Qianren Mao, Weifeng Jiang et al.
WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction
Fanmeng Wang, Minjie Cheng, Hongteng Xu
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach
Xu Zhang, Kaidi Xu, Ziqing Hu et al.
MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
Zhicheng Zhang, Wuyou Xia, Chenxi Zhao et al.
FeatSharp: Your Vision Model Features, Sharper
Mike Ranzinger, Greg Heinrich, Pavlo Molchanov et al.
Where is the Truth? The Risk of Getting Confounded in a Continual World
Florian Peter Busch, Roshni Ramanna Kamath, Rupert Mitchell et al.
Improved Regret Analysis in Gaussian Process Bandits: Optimality for Noiseless Reward, RKHS norm, and Non-Stationary Variance
Shogo Iwazaki, Shion Takeno
ELITE: Enhanced Language-Image Toxicity Evaluation for Safety
Wonjun Lee, Doehyeon Lee, Eugene Choi et al.
Logarithmic Regret for Online KL-Regularized Reinforcement Learning
Heyang Zhao, Chenlu Ye, Wei Xiong et al.
TTFSFormer: A TTFS-based Lossless Conversion of Spiking Transformer
Lusen Zhao, Zihan Huang, Ding Jianhao et al.
Importance Corrected Neural JKO Sampling
Johannes Hertrich, Robert Gruhlke
Lie Neurons: Adjoint-Equivariant Neural Networks for Semisimple Lie Algebras
Tzu-Yuan Lin, Minghan Zhu, Maani Ghaffari
Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages
Hilal Asi, Vitaly Feldman, Jelani Nelson et al.
Homophily Enhanced Graph Domain Adaptation
Ruiyi Fang, Bingheng Li, Jingyu Zhao et al.
$K^2$VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series Forecasting
Xingjian Wu, Xiangfei Qiu, Hongfan Gao et al.
Revisiting Inexact Fixed-Point Iterations for Min-Max Problems: Stochasticity and Structured Nonconvexity
Ahmet Alacaoglu, Donghwan Kim, Stephen Wright
Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes
Dongjae Jeon, Dueun Kim, Albert No
Nonlinear Filtering with Brenier Optimal Transport Maps
Mohammad Al-Jarrah, Niyizhen Jin, Bamdad Hosseini et al.
Scaling Laws for Floating-Point Quantization Training
Xingwu Sun, Shuaipeng Li, Ruobing Xie et al.
PARQ: Piecewise-Affine Regularized Quantization
Lisa Jin, Jianhao Ma, Zechun Liu et al.
Learning Soft Sparse Shapes for Efficient Time-Series Classification
Zhen Liu, Yicheng Luo, Boyuan Li et al.
Minerva: A Programmable Memory Test Benchmark for Language Models
Menglin Xia, Victor Ruehle, Saravanakumar Rajmohan et al.
Parsimonious Learning-Augmented Approximations for Dense Instances of $\mathcal{NP}$-hard Problems
Evripidis Bampis, Bruno Escoffier, Michalis Xefteris
Open Ad Hoc Teamwork with Cooperative Game Theory
Jianhong Wang, Yang Li, Yuan Zhang et al.
Improving Adversarial Energy-Based Model via Diffusion Process
Cong Geng, Tian Han, Peng-Tao Jiang et al.
Pi-DUAL: Using privileged information to distinguish clean from noisy labels
Ke Wang, Guillermo Ortiz-Jimenez, Rodolphe Jenatton et al.
Synonymous Variational Inference for Perceptual Image Compression
Zijian Liang, Kai Niu, Changshuo Wang et al.
Averaging $n$-step Returns Reduces Variance in Reinforcement Learning
Brett Daley, Martha White, Marlos C. Machado
The Global Convergence Time of Stochastic Gradient Descent in Non-Convex Landscapes: Sharp Estimates via Large Deviations
Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.
Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Jin Wang, Shichao Dong, Yapeng Zhu et al.
Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models
Rafał Karczewski, Markus Heinonen, Vikas Garg
TIC-TAC: A Framework For Improved Covariance Estimation In Deep Heteroscedastic Regression
Megh Shukla, Mathieu Salzmann, Alexandre Alahi
Contextual Feature Selection with Conditional Stochastic Gates
Ram Dyuthi Sristi, Ofir Lindenbaum, Shira Lifshitz et al.
any4: Learned 4-bit Numeric Representation for LLMs
Mostafa Elhoushi, Jeff Johnson
On The Fairness Impacts of Hardware Selection in Machine Learning
Sree Harsha Nelaturu, Nishaanth Kanna, Cuong Tran et al.
Implicit Compressibility of Overparametrized Neural Networks Trained with Heavy-Tailed SGD
Yijun Wan, Melih Barsbey, Abdellatif Zaidi et al.
Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation
Cheng Jin, Zhenyu Xiao, Chutao Liu et al.
Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents
Karina Zainullina, Aleksandr Golubev, Maria Trofimova et al.
Major-Minor Mean Field Multi-Agent Reinforcement Learning
Kai Cui, Christian Fabian, Anam Tahir et al.
On Teacher Hacking in Language Model Distillation
Daniil Tiapkin, Daniele Calandriello, Johan Ferret et al.
PID: Prompt-Independent Data Protection Against Latent Diffusion Models
Ang Li, Yichuan Mo, Mingjie Li et al.
Universal Approximation Theorem of Deep Q-Networks
Qian Qi
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
Asaf Cassel, Haipeng Luo, Aviv Rosenberg et al.
Multi-Region Markovian Gaussian Process: An Efficient Method to Discover Directional Communications Across Multiple Brain Regions
Weihan Li, Chengrui Li, Yule Wang et al.
How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation
Yining Pan, Qiongjie Cui, Xulei Yang et al.
Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines
Yuchen Li, Alexandre Kirchmeyer, Aashay Mehta et al.
Avoiding spurious sharpness minimization broadens applicability of SAM
Sidak Pal Singh, Hossein Mobahi, Atish Agarwala et al.
LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits
Zikai Zhou, Qizheng Zhang, Hermann Kumbong et al.
Parameter Estimation in DAGs from Incomplete Data via Optimal Transport
Vy Vo, Trung Le, Tung-Long Vuong et al.
Accelerated Policy Gradient: On the Convergence Rates of the Nesterov Momentum for Reinforcement Learning
Yen-Ju Chen, Nai-Chieh Huang, Ching-pei Lee et al.
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Boning Li, Zhixuan Fang, Longbo Huang
Reflection-Window Decoding: Text Generation with Selective Refinement
Zeyu Tang, Zhenhao Chen, Xiangchen Song et al.
CommVQ: Commutative Vector Quantization for KV Cache Compression
Junyan Li, Yang Zhang, Muhammad Yusuf Hassan et al.
Survival Kernets: Scalable and Interpretable Deep Kernel Survival Analysis with an Accuracy Guarantee
George Chen
On the Maximal Local Disparity of Fairness-Aware Classifiers
Jinqiu Jin, Haoxuan Li, Fuli Feng
Multi-group Learning for Hierarchical Groups
Samuel Deng, Daniel Hsu
The Relationship Between No-Regret Learning and Online Conformal Prediction
Ramya Ramalingam, Shayan Kiyani, Aaron Roth
Neural Genetic Search in Discrete Spaces
Hyeonah Kim, Sanghyeok Choi, Jiwoo Son et al.
Chasing Convex Functions with Long-term Constraints
Adam Lechowicz, Nicolas Christianson, Bo Sun et al.
Fundamental Limits of Visual Autoregressive Transformers: Universal Approximation Abilities
Yifang Chen, Xiaoyu Li, Yingyu Liang et al.
Model Immunization from a Condition Number Perspective
Amber Yijia Zheng, Cedar Site Bai, Brian Bullins et al.
RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models
Quan Wei, Chung-Yiu Yau, Hoi To Wai et al.
Outlier Gradient Analysis: Efficiently Identifying Detrimental Training Samples for Deep Learning Models
Anshuman Chhabra, Bo Li, Jian Chen et al.
Robust Multi-bit Text Watermark with LLM-based Paraphrasers
Xiaojun Xu, Jinghan Jia, Yuanshun Yao et al.
KV Shifting Attention Enhances Language Modeling
Mingyu Xu, Bingning Wang, Weipeng Chen
Doubly Protected Estimation for Survival Outcomes Utilizing External Controls for Randomized Clinical Trials
Chenyin Gao, Shu Yang, Mingyang Shan et al.
Does Data Scaling Lead to Visual Compositional Generalization?
Arnas Uselis, Andrea Dittadi, Seong Joon Oh
Robust Multimodal Large Language Models Against Modality Conflict
Zongmeng Zhang, Wengang Zhou, Jie Zhao et al.
Core Context Aware Transformers for Long Context Language Modeling
Yaofo Chen, Zeng You, Shuhai Zhang et al.
On a Combinatorial Problem Arising in Machine Teaching
Joakim Sunde, Brigt Håvardstun, Jan Kratochvíl et al.
Position: Embracing Negative Results in Machine Learning
Florian Karl, Malte Kemeter, Gabriel Dax et al.
One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual Learning
Doyoung Kim, Susik Yoon, Dongmin Park et al.
Position: Build Agent Advocates, Not Platform Agents
Sayash Kapoor, Noam Kolt, Seth Lazar
Refining Adaptive Zeroth-Order Optimization at Ease
Yao Shu, Qixin Zhang, Kun He et al.
Vintix: Action Model via In-Context Reinforcement Learning
Andrei Polubarov, Nikita Lyubaykin, Alexander Derevyagin et al.
Scalable Gaussian Processes with Latent Kronecker Structure
Jihao Andreas Lin, Sebastian Ament, Maximilian Balandat et al.
AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation
Prashanth Vijayaraghavan, Luyao Shi, Ehsan Degan et al.
Learning Dynamics in Continual Pre-Training for Large Language Models
Xingjin Wang, Howe Tissue, Lu Wang et al.
Generalists vs. Specialists: Evaluating LLMs on Highly-Constrained Biophysical Sequence Optimization Tasks
Angelica Chen, Samuel Stanton, Frances Ding et al.
Sample Efficient Demonstration Selection for In-Context Learning
Kiran Purohit, Venktesh V, Sourangshu Bhattacharya et al.
Weak-to-Strong Generalization Even in Random Feature Networks, Provably
Marko Medvedev, Kaifeng Lyu, Dingli Yu et al.
Improving Robustness to Multiple Spurious Correlations by Multi-Objective Optimization
Nayeong Kim, Juwon Kang, Sungsoo Ahn et al.
How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias
Ruiquan Huang, Yingbin Liang, Jing Yang
Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion
Anle Ke, Xu Zhang, Tong Chen et al.
SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics
Suyuan Zhao, Yizhen Luo, Ganbo Yang et al.
Reference Neural Operators: Learning the Smooth Dependence of Solutions of PDEs on Geometric Deformations
Ze Cheng, Zhongkai Hao, Wang Xiaoqiang et al.
Active Label Correction for Semantic Segmentation with Foundation Models
Hoyoung Kim, Sehyun Hwang, Suha Kwak et al.
Conditional Language Learning with Context
Xiao Zhang, Miao Li, Ji Wu
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Utsav Singh, Wesley A. Suttle, Brian Sadler et al.
Inverse Bridge Matching Distillation
Nikita Gushchin, David Li, Daniil Selikhanovych et al.
The Lock-in Hypothesis: Stagnation by Algorithm
Tianyi Qiu, Zhonghao He, Tejasveer Chugh et al.
Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations
Jonas Beck, Nathanael Bosch, Michael Deistler et al.
From Thousands to Billions: 3D Visual Language Grounding via Render-Supervised Distillation from 2D VLMs
Ang Cao, Sergio Arnaud, Oleksandr Maksymets et al.
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song, J. Bagnell, Aarti Singh
Blink of an eye: a simple theory for feature localization in generative models
Marvin Li, Aayush Karan, Sitan Chen
CodeSync: Synchronizing Large Language Models with Dynamic Code Evolution at Scale
Chenlong Wang, Zhaoyang Chu, Zhengxiang Cheng et al.
From Individual Experience to Collective Evidence: A Reporting-Based Framework for Identifying Systemic Harms
Jessica Dai, Paula Gradu, Inioluwa Raji et al.
LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging
Jinuk Kim, Marwa El Halabi, Mingi Ji et al.
Breaking through the learning plateaus of in-context learning in Transformer
Jingwen Fu, Tao Yang, Yuwang Wang et al.
Latent Noise Segmentation: How Neural Noise Leads to the Emergence of Segmentation and Grouping
Ben Lonnqvist, Zhengqing Wu, Michael Herzog
Clone-Robust AI Alignment
Ariel Procaccia, Benjamin Schiffer, Shirley Zhang
Certifiably Byzantine-Robust Federated Conformal Prediction
Mintong Kang, Zhen Lin, Jimeng Sun et al.
Hierarchical Integral Probability Metrics: A distance on random probability measures with low sample complexity
Marta Catalano, Hugo Lavenant
Efficient Robust Conformal Prediction via Lipschitz-Bounded Networks
Thomas Massena, Léo Andéol, Thibaut Boissin et al.
Provable Efficiency of Guidance in Diffusion Models for General Data Distribution
Gen Li, Yuchen Jiao
WISER: Weak Supervision and Supervised Representation Learning to Improve Drug Response Prediction in Cancer
Kumar Shubham, Aishwarya Jayagopal, Syed Danish et al.
On Interpolating Experts and Multi-Armed Bandits
Houshuang Chen, Yuchen He, Chihao Zhang
M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Embedding Predictive Architecture
Hongyang Lei, Xiaolong Cheng, Qi Qin et al.
In-Context Learning and Occam's Razor
Eric Elmoznino, Tom Marty, Tejas Kasetty et al.
Tree-Sliced Wasserstein Distance: A Geometric Perspective
Viet Hoang Tran, Trang Pham, Tho Tran Huu et al.
Field Matching: an Electrostatic Paradigm to Generate and Transfer Data
Alexander Kolesov, S. Manukhov, Vladimir Palyulin et al.
Unlocking the Power of SAM 2 for Few-Shot Segmentation
Qianxiong Xu, Lanyun Zhu, Xuanyi Liu et al.
Parametric Scaling Law of Tuning Bias in Conformal Prediction
Hao Zeng, Kangdao Liu, Bingyi Jing et al.
Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers
Johann Schmidt, Sebastian Stober
Minimalist Concept Erasure in Generative Models
Yang Zhang, Er Jin, Yanfei Dong et al.
Fast and Low-Cost Genomic Foundation Models via Outlier Removal
Haozheng Luo, Chenghao Qiu, Maojiang Su et al.
Dueling Convex Optimization with General Preferences
Aadirupa Saha, Tomer Koren, Yishay Mansour
An Independence-promoting Loss for Music Generation with Language Models
Jean-Marie Lemercier, Simon Rouard, Jade Copet et al.
GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model
Zixiang Ai, Zichen Liu, Yuanhang Lei et al.
Aligning with Logic: Measuring, Evaluating and Improving Logical Preference Consistency in Large Language Models
Yinhong Liu, Zhijiang Guo, Tianya Liang et al.
GaussMarker: Robust Dual-Domain Watermark for Diffusion Models
Kecen Li, Zhicong Huang, Xinwen Hou et al.
Differentiable Combinatorial Scheduling at Scale
Mingju Liu, Yingjie Li, Jiaqi Yin et al.
Automatically Identify and Rectify: Robust Deep Contrastive Multi-view Clustering in Noisy Scenarios
Xihong Yang, Siwei Wang, Fangdi Wang et al.
Connect Later: Improving Fine-tuning for Robustness with Targeted Augmentations
Helen Qu, Sang Michael Xie
BalancEdit: Dynamically Balancing the Generality-Locality Trade-off in Multi-modal Model Editing
Dongliang Guo, Mengxuan Hu, Zihan Guan et al.
Optimal Batched Linear Bandits
Xuanfei Ren, Tianyuan Jin, Pan Xu
DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy
Kaixuan Xu, Jiajun Chai, Sicheng Li et al.
CaPS: Collaborative and Private Synthetic Data Generation from Distributed Sources
Sikha Pentyala, Mayana Pereira, Martine De Cock
Depth Degeneracy in Neural Networks: Vanishing Angles in Fully Connected ReLU Networks on Initialization
Cameron Jakub, Mihai Nica
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
Lang Feng, Weihao Tan, Zhiyi Lyu et al.
Optimal Recurrent Network Topologies for Dynamical Systems Reconstruction
Christoph Jürgen Hemmer, Manuel Brenner, Florian Hess et al.
Contextual Online Decision Making with Infinite-Dimensional Functional Regression
Haichen Hu, Rui Ai, Stephen Bates et al.
Direct Motion Models for Assessing Generated Videos
Kelsey Allen, Carl Doersch, Guangyao Zhou et al.
Incremental Topological Ordering and Cycle Detection with Predictions
Samuel McCauley, Benjamin Moseley, Aidin Niaparast et al.
MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs
Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi et al.
BiMark: Unbiased Multilayer Watermarking for Large Language Models
Xiaoyan Feng, He Zhang, Yanjun Zhang et al.
On Volume Minimization in Conformal Regression
Batiste Le Bars, Pierre Humbert
Spatial Reasoning with Denoising Models
Christopher Wewer, Bartlomiej Pogodzinski, Bernt Schiele et al.
Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism
Aviv Bick, Eric Xing, Albert Gu
Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human Input
Andi Peng, Yuying Sun, Tianmin Shu et al.
Grokking at the Edge of Linear Separability
Alon Beck, Noam Levi, Yohai Bar-Sinai
Diffusion-based Adversarial Purification from the Perspective of the Frequency Domain
Gaozheng Pei, Ke Ma, Yingfei Sun et al.
Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting
Jiecheng Lu, Shihao Yang