Most Cited 2025 "compliance control" Papers
22,274 papers found • Page 110 of 112
Conference
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
Jintao Zhang, Haofeng Huang, Pengle Zhang et al.
Identifying Neural Dynamics Using Interventional State Space Models
Amin Nejatbakhsh, Yixin Wang
Efficiently Serving Large Multimodal Models Using EPD Disaggregation
Gursimran Singh, Xinglu Wang, Yifan Hu et al.
EasyInv: Toward Fast and Better DDIM Inversion
Ziyue Zhang, Mingbao Lin, Shuicheng YAN et al.
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Yiran Wang, Chenshu Liu, Yunfan Li et al.
Diffusion Models are Secretly Exchangeable: Parallelizing DDPMs via Auto Speculation
Hengyuan Hu, Aniket Das, Dorsa Sadigh et al.
Tensorized Multi-View Multi-Label Classification via Laplace Tensor Rank
Qiyu Zhong, Yi Shan, Haobo Wang et al.
Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding
Tian Jin, Ellie Cheng, Zachary Ankner et al.
Invariant Deep Uplift Modeling for Incentive Assignment in Online Marketing via Probability of Necessity and Sufficiency
Zexu Sun, Qiyu Han, Hao Yang et al.
An Improved Clique-Picking Algorithm for Counting Markov Equivalent DAGs via Super Cliques Transfer
Lifu Liu, Shiyuan He, Jianhua Guo
An Online Learning Approach to Prompt-based Selection of Generative Models and LLMs
Xiaoyan Hu, Ho-fung Leung, Farzan Farnia
HYGMA: Hypergraph Coordination Networks with Dynamic Grouping for Multi-Agent Reinforcement Learning
Chiqiang Liu, Dazi Li
Continual Reinforcement Learning by Planning with Online World Models
Zichen Liu, Guoji Fu, Chao Du et al.
AEQA-NAT : Adaptive End-to-end Quantization Alignment Training Framework for Non-autoregressive Machine Translation
Xiangyu Qu, Guojing Liu, Liang Li
Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners
Hung Manh Pham, Aaqib Saeed, Dong Ma
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
Wanyun Xie, Francesco Tonin, Volkan Cevher
Functional Alignment Can Mislead: Examining Model Stitching
Damian Smith, Harvey Mannering, Antonia Marcu
FicGCN: Unveiling the Homomorphic Encryption Efficiency from Irregular Graph Convolutional Networks
Zhaoxuan Kan, Husheng Han, shangyi shi et al.
An Instrumental Value for Data Production and its Application to Data Pricing
Rui Ai, Boxiang Lyu, Zhaoran Wang et al.
Fishers for Free? Approximating the Fisher Information Matrix by Recycling the Squared Gradient Accumulator
YuXin Li, Felix Dangel, Derek Tam et al.
Learning Multi-Level Features with Matryoshka Sparse Autoencoders
Bart Bussmann, Noa Nabeshima, Adam Karvonen et al.
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
Roman Abramov, Felix Steinbauer, Gjergji Kasneci
Code-Generated Graph Representations Using Multiple LLM Agents for Material Properties Prediction
Jiao Huang, Qianli Xing, Jinglong Ji et al.
CAN: Leveraging Clients As Navigators for Generative Replay in Federated Continual Learning
Xuankun Rong, Jianshu Zhang, Kun He et al.
On the Clean Generalization and Robust Overfitting in Adversarial Training from Two Theoretical Views: Representation Complexity and Training Dynamics
Binghui Li, Yuanzhi Li
FeatSharp: Your Vision Model Features, Sharper
Mike Ranzinger, Greg Heinrich, Pavlo Molchanov et al.
LBI-FL: Low-Bit Integerized Federated Learning with Temporally Dynamic Bit-Width Allocation
Li Ding, Hao Zhang, Wenrui Dai et al.
WILTing Trees: Interpreting the Distance Between MPNN Embeddings
Masahiro Negishi, Thomas Gärtner, Pascal Welke
Private Lossless Multiple Release
Joel Daniel Andersson, Lukas Retschmeier, Boel Nelson et al.
Making Hard Problems Easier with Custom Data Distributions and Loss Regularization: A Case Study in Modular Arithmetic
Eshika Saxena, Alberto Alfarano, Emily Wenger et al.
Disentangling and Integrating Relational and Sensory Information in Transformer Architectures
Awni Altabaa, John Lafferty
Directed Graph Grammars for Sequence-based Learning
Michael Sun, Orion Foo, Gang Liu et al.
Cooperation of Experts: Fusing Heterogeneous Information with Large Margin
Shuo Wang, Shunyang Huang, Jinghui Yuan et al.
WOMD-Reasoning: A Large-Scale Dataset for Interaction Reasoning in Driving
Yiheng Li, Cunxin Fan, Chongjian GE et al.
Branches: Efficiently Seeking Optimal Sparse Decision Trees via AO*
Ayman Chaouki, Jesse Read, Albert Bifet
SDMG: Smoothing Your Diffusion Models for Powerful Graph Representation Learning
Junyou Zhu, Langzhou He, Chao Gao et al.
Differential Privacy Guarantees of Markov Chain Monte Carlo Algorithms
Andrea Bertazzi, Tim Johnston, Gareth Roberts et al.
Best of Both Worlds: Regret Minimization versus Minimax Play
Adrian Müller, Jon Schneider, EFSTRATIOS PANTELEIMON SKOULAKIS et al.
Low-Rank Adapting Models for Sparse Autoencoders
Matthew Chen, Josh Engels, Max Tegmark
Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning
Zeyu Gan, Yun Liao, Yong Liu
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
Samira Abnar, Harshay Shah, Dan Busbridge et al.
Training High Performance Spiking Neural Network by Temporal Model Calibration
Jiaqi Yan, Changping Wang, De Ma et al.
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
Oliver Sieberling, Denis Kuznedelev, Eldar Kurtic et al.
Limitations of measure-first protocols in quantum machine learning
Casper Gyurik, Riccardo Molteni, Vedran Dunjko
LDMol: A Text-to-Molecule Diffusion Model with Structurally Informative Latent Space Surpasses AR Models
Jinho Chang, Jong Chul YE
Scalable Non-Equivariant 3D Molecule Generation via Rotational Alignment
Yuhui Ding, Thomas Hofmann
A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized Certainty Equivalents
Kaiwen Wang, Dawen Liang, Nathan Kallus et al.
Explicit Discovery of Nonlinear Symmetries from Dynamic Data
Lexiang Hu, Yikang Li, Zhouchen Lin
LIMEFLDL: A Local Interpretable Model-Agnostic Explanations Approach for Label Distribution Learning
Xiuyi Jia, Jinchi Li, Yunan Lu et al.
Exponential Family Variational Flow Matching for Tabular Data Generation
Andres Guzman Cordero, Floor Eijkelboom, Jan-Willem van de Meent
Hardware and Software Platform Inference
Cheng Zhang, Hanna Foerster, Robert Mullins et al.
Nonlinearly Preconditioned Gradient Methods under Generalized Smoothness
Konstantinos Oikonomidis, Jan Quan, Emanuel Laude et al.
The Elicitation Game: Evaluating Capability Elicitation Techniques
Felix Hofstätter, Teun van der Weij, Jayden Teoh et al.
What Limits Bidirectional Model's Generative Capabilities? A Uni-Bi-Directional Mixture-of-Expert Method For Bidirectional Fine-tuning
Zuchao Li, Yonghua Hei, Qiwei Li et al.
Falsification of Unconfoundedness by Testing Independence of Causal Mechanisms
Rickard K.A. Karlsson, Jesse H. Krijthe
Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training
Mozhi Zhang, Howe Tissue, Lu Wang et al.
A Closer Look at Transformers for Time Series Forecasting: Understanding Why They Work and Where They Struggle
Yu Chen, Nathalia Céspedes, Payam Barnaghi
Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search
Boyan Li, Jiayi Zhang, Ju Fan et al.
How to Evaluate and Mitigate IP Infringement in Visual Generative AI?
Zhenting Wang, Chen Chen, Vikash Sehwag et al.
Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing
Zhuoran Zhang, Yongxiang Li, Zijian Kan et al.
On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization
Undral Byambadalai, Tomu Hirata, Tatsushi Oka et al.
Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model
Kaito Ariu, Alexandre Proutiere, Se-Young Yun
Better to Teach than to Give: Domain Generalized Semantic Segmentation via Agent Queries with Diffusion Model Guidance
Fan Li, Xuan Wang, Min Qi et al.
Instruction-Following Pruning for Large Language Models
Bairu Hou, Qibin Chen, Jianyu Wang et al.
Domain-Adapted Diffusion Model for PROTAC Linker Design Through the Lens of Density Ratio in Chemical Space
Zixing Song, Ziqiao Meng, Jose Miguel Hernandez-Lobato
Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset
Hao Zhou, Xu Yang, Mingyu Fan et al.
Continuously Updating Digital Twins using Large Language Models
Harry Amad, Nicolás Astorga, Mihaela van der Schaar
Do Not Mimic My Voice : Speaker Identity Unlearning for Zero-Shot Text-to-Speech
Taesoo Kim, Jinju Kim, Dongchan Kim et al.
HGOT: Self-supervised Heterogeneous Graph Neural Network with Optimal Transport
Yanbei Liu, Chongxu Wang, Zhitao Xiao et al.
PAC-Bayes Analysis for Recalibration in Classification
Masahiro Fujisawa, Futoshi Futami
Global Context-aware Representation Learning for Spatially Resolved Transcriptomics
Yunhak Oh, Junseok Lee, Yeongmin Kim et al.
Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition
Zichen Wang, Chuanhao Li, Huazheng Wang
Aggregation of Dependent Expert Distributions in Multimodal Variational Autoencoders
Rogelio A. Mancisidor, Robert Jenssen, Shujian Yu et al.
Testing the Limits of Fine-Tuning for Improving Visual Cognition in Vision Language Models
Luca M. Schulze Buschoff, Konstantinos Voudouris, Elif Akata et al.
Black-Box Adversarial Attacks on LLM-Based Code Completion
Slobodan Jenko, Niels Mündler, Jingxuan He et al.
Behavior-agnostic Task Inference for Robust Offline In-context Reinforcement Learning
Long Ma, Fangwei Zhong, Yizhou Wang
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou, Aleksa Sukovic, Yasaman Zolfimoselo et al.
Distillation of Discrete Diffusion through Dimensional Correlations
Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi et al.
Clustering Properties of Self-Supervised Learning
Xi Weng, Jianing An, Xudong Ma et al.
SecEmb: Sparsity-Aware Secure Federated Learning of On-Device Recommender System with Large Embedding
Peihua Mai, Youlong Ding, Ziyan Lyu et al.
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Maohao Shen, Guangtao Zeng, Zhenting Qi et al.
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Nayoung Lee, Jack Cai, Avi Schwarzschild et al.
DeepCrossAttention: Supercharging Transformer Residual Connections
Mike Heddes, Adel Javanmard, Kyriakos Axiotis et al.
ML$^2$-GCL: Manifold Learning Inspired Lightweight Graph Contrastive Learning
Jianqing Liang, Zhiqiang Li, Xinkai Wei et al.
Multi-agent Architecture Search via Agentic Supernet
Guibin Zhang, Luyang Niu, Junfeng Fang et al.
BaxBench: Can LLMs Generate Correct and Secure Backends?
Mark Vero, Niels Mündler, Viktor Chibotaru et al.
Ensemble Learned Bloom Filters: Two Oracles are Better than One
Ming Lin, Lin CHEN
Learn Beneficial Noise as Graph Augmentation
Siqi Huang, Yanchen Xu, Hongyuan Zhang et al.
Radio: Rate–Distortion Optimization for Large Language Model Compression
Sean I. Young
An Analysis of Quantile Temporal-Difference Learning
Mark Rowland, Remi Munos, Mohammad Gheshlaghi Azar et al.
Explicit Preference Optimization: No Need for an Implicit Reward Model
Xiangkun Hu, Lemin Kong, Tong He et al.
Automated Hypothesis Validation with Agentic Sequential Falsifications
Kexin Huang, Ying Jin, Ryan Li et al.
Annealing Flow Generative Models Towards Sampling High-Dimensional and Multi-Modal Distributions
Dongze Wu, Yao Xie
ZeroFlow: Overcoming Catastrophic Forgetting is Easier than You Think
Tao Feng, Wei Li, Didi Zhu et al.
MixBridge: Heterogeneous Image-to-Image Backdoor Attack through Mixture of Schrödinger Bridges
Shixi Qin, Zhiyong Yang, Shilong Bao et al.
RePaViT: Scalable Vision Transformer Acceleration via Structural Reparameterization on Feedforward Network Layers
Xuwei Xu, Yang Li, Yudong Chen et al.
TransPL: VQ-Code Transition Matrices for Pseudo-Labeling of Time Series Unsupervised Domain Adaptation
Jaeho Kim, Seulki Lee
Extracting Rare Dependence Patterns via Adaptive Sample Reweighting
Yiqing Li, Yewei Xia, Xiaofei Wang et al.
Strengthen Out-of-Distribution Detection Capability with Progressive Self-Knowledge Distillation
Yang Yang, Haonan Xu
On Mitigating Affinity Bias through Bandits with Evolving Biased Feedback
Matthew Faw, Constantine Caramanis, Jessica Hoffmann
Learning Joint Interventional Effects from Single-Variable Interventions in Additive Models
Armin Kekić, Sergio Hernan Garrido Mejia, Bernhard Schölkopf
Positional Attention: Expressivity and Learnability of Algorithmic Computation
Artur Back de Luca, George Giapitzakis, Shenghao Yang et al.
Weakly Supervised Anomaly Detection via Dual-Tailed Kernel
Walid Durani, Tobias Nitzl, Claudia Plant et al.
Q-Supervised Contrastive Representation: A State Decoupling Framework for Safe Offline Reinforcement Learning
Zhihe Yang, Yunjian Xu, Yang Zhang
ExtPose: Robust and Coherent Pose Estimation by Extending ViTs
Glory Rongyu CHEN, Li'an Zhuo, Linlin Yang et al.
C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation
Guoxin Chen, Minpeng Liao, Peiying Yu et al.
Bipartite Ranking From Multiple Labels: On Loss Versus Label Aggregation
Michal Lukasik, Lin Chen, Harikrishna Narasimhan et al.
Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding
Jinze Li, Yixing Xu, Haiduo Huang et al.
ELoRA: Low-Rank Adaptation for Equivariant GNNs
Chen Wang, Siyu Hu, Guangming Tan et al.
iDPA: Instance Decoupled Prompt Attention for Incremental Medical Object Detection
Huahui Yi, Wei Xu, Ziyuan Qin et al.
PTTA: Purifying Malicious Samples for Test-Time Model Adaptation
Jing Ma, Hanlin Li, Xiang Xiang
Stay Hungry, Keep Learning: Sustainable Plasticity for Deep Reinforcement Learning
Huaicheng Zhou, Zifeng Zhuang, Donglin Wang
MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models
Mahir Labib Dihan, Tanvir Hassan, Md Tanvir Parvez et al.
Fully Heteroscedastic Count Regression with Deep Double Poisson Networks
Spencer Young, Porter Jenkins, Longchao Da et al.
Fully Dynamic Embedding into $\ell_p$ Spaces
Kiarash Banihashem, Xiang Chen, MohammadTaghi Hajiaghayi et al.
CFPT: Empowering Time Series Forecasting through Cross-Frequency Interaction and Periodic-Aware Timestamp Modeling
Feifei Kou, Jiahao Wang, Lei Shi et al.
Redundancy Undermines the Trustworthiness of Self-Interpretable GNNs
Wenxin Tai, Ting Zhong, Goce Trajcevski et al.
Modular Duality in Deep Learning
Jeremy Bernstein, Laker Newhouse
Gap-Dependent Bounds for Federated $Q$-Learning
Haochen Zhang, Zhong Zheng, Lingzhou Xue
Masked Generative Nested Transformers with Decode Time Scaling
Sahil Goyal, Debapriya Tula, Gagan Jain et al.
CoastalBench: A Decade-Long High-Resolution Dataset to Emulate Complex Coastal Processes
Zelin Xu, Yupu Zhang, Tingsong Xiao et al.
PiD: Generalized AI-Generated Images Detection with Pixelwise Decomposition Residuals
Xinghe Fu, Zhiyuan Yan, Zheng Yang et al.
Slimming the Fat-Tail: Morphing-Flow for Adaptive Time Series Modeling
Tianyu Liu, kai sun, Fuchun Sun et al.
Learning from Sample Stability for Deep Clustering
Zhixin Li, Yuheng Jia, Hui LIU et al.
Optimal Task Order for Continual Learning of Multiple Tasks
Ziyan Li, Naoki Hiratani
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
Kaijie Zhu, Xianjun Yang, Jindong Wang et al.
Flow Matching for Denoised Social Recommendation
Yinxuan Huang, KE LIANG, Zhuofan Dong et al.
Enhancing Ligand Validity and Affinity in Structure-Based Drug Design with Multi-Reward Optimization
Seungbeom Lee, Munsun Jo, Jungseul Ok et al.
GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models
Zhaohong Huang, Yuxin Zhang, JingJing Xie et al.
Auditing Prompt Caching in Language Model APIs
Chenchen Gu, Xiang Li, Rohith Kuditipudi et al.
BECAME: Bayesian Continual Learning with Adaptive Model Merging
Mei Li, Yuxiang Lu, Qinyan Dai et al.
Learning to Steer Learners in Games
Yizhou Zhang, Yian Ma, Eric Mazumdar
Structure-Guided Large Language Models for Text-to-SQL Generation
Qinggang Zhang, Hao Chen, Junnan Dong et al.
Reinforcement Learning with Segment Feedback
Yihan Du, Anna Winnicki, Gal Dalal et al.
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
Chi-Ning Chou, Hang Le, Yichen Wang et al.
Implicit degree bias in the link prediction task
Rachith Aiyappa, Xin Wang, Munjung Kim et al.
Rethinking Score Distilling Sampling for 3D Editing and Generation
Xingyu Miao, Haoran Duan, Yang Long et al.
Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG
Xinxu Wei, kanhao zhao, Yong Jiao et al.
WildChat-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training
Benjamin Feuer, Chinmay Hegde
HEAP: Hyper Extended A-PDHG Operator for Constrained High-dim PDEs
Mingquan Feng, Weixin Liao, Yixin Huang et al.
Contour Integration Underlies Human-Like Vision
Ben Lonnqvist, Elsa Scialom, Abdulkadir Gokce et al.
MathConstruct: Challenging LLM Reasoning with Constructive Proofs
Mislav Balunovic, Jasper Dekoninck, Nikola Jovanović et al.
BSemiFL: Semi-supervised Federated Learning via a Bayesian Approach
Haozhao Wang, Shengyu Wang, Jiaming Li et al.
Falcon: Fast Visuomotor Policies via Partial Denoising
Haojun Chen, Minghao Liu, Chengdong Ma et al.
Zebra: In-Context Generative Pretraining for Solving Parametric PDEs
Louis Serrano, Armand Kassaï Koupaï, Thomas Wang et al.
CodeIO: Condensing Reasoning Patterns via Code Input-Output Prediction
Junlong Li, Daya Guo, Dejian Yang et al.
Reducing Confounding Bias without Data Splitting for Causal Inference via Optimal Transport
Yuguang Yan, Zongyu Li, Haolin Yang et al.
Linear Contextual Bandits With Interference
Yang Xu, Wenbin Lu, Rui Song
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
Dianwen Ng, Kun Zhou, Yi-Wen Chao et al.
Commute Graph Neural Networks
Wei Zhuo, Han Yu, Guang Tan et al.
Textural or Textual: How Vision-Language Models Read Text in Images
Hanzhang Wang, Qingyuan Ma
Supervised Contrastive Learning from Weakly-Labeled Audio Segments for Musical Version Matching
Joan Serrà, Recep Oguz Araz, Dmitry Bogdanov et al.
Censor Dependent Variational Inference
Chuanhui Liu, Xiao Wang
Reward Modeling with Ordinal Feedback: Wisdom of the Crowd
Shang Liu, Yu Pan, Guanting Chen et al.
MATS: An Audio Language Model under Text-only Supervision
Wen Wang, Ruibing Hou, Hong Chang et al.
QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
Zongyu Lin, Yao Tang, Xingcheng Yao et al.
Understanding the Kronecker Matrix-Vector Complexity of Linear Algebra
Raphael Meyer, William Swartworth, David Woodruff
Offline Model-based Optimization for Real-World Molecular Discovery
Dong-Hee Shin, Young-Han Son, Hyun Jung Lee et al.
On the Statistical Mechanisms of Distributional Compositional Generalization
Jingwen Fu, Nanning Zheng
FG-CLIP: Fine-Grained Visual and Textual Alignment
Chunyu Xie, Bin Wang, Fanjing Kong et al.
Instance-Optimal Pure Exploration for Linear Bandits on Continuous Arms
Sho Takemori, Yuhei Umeda, Aditya Gopalan
Flat-LoRA: Low-Rank Adaptation over a Flat Loss Landscape
Tao Li, Zhengbao He, Yujun Li et al.
Global Convergence and Rich Feature Learning in $L$-Layer Infinite-Width Neural Networks under $\mu$ Parametrization
Zixiang Chen, Greg Yang, Qingyue Zhao et al.
Multi-Armed Bandits with Interference: Bridging Causal Inference and Adversarial Bandits
Su Jia, Peter Frazier, Nathan Kallus
Square$\chi$PO: Differentially Private and Robust $\chi^2$-Preference Optimization in Offline Direct Alignment
Xingyu Zhou, Yulian Wu, Wenqian Weng et al.
Equivariant Polynomial Functional Networks
Thieu Vo, Viet Hoang Tran, Tho Tran Huu et al.
Certified Unlearning for Neural Networks
Anastasiia Koloskova, Youssef Allouah, Animesh Jha et al.
An Effective and Secure Federated Multi-View Clustering Method with Information-Theoretic Perspective
Xinyue Chen, Jinfeng Peng, Yuhao Li et al.
QMamba: On First Exploration of Vision Mamba for Image Quality Assessment
Fengbin Guan, Xin Li, Zihao Yu et al.
ExpProof : Operationalizing Explanations for Confidential Models with ZKPs
Chhavi Yadav, Evan Laufer, Dan Boneh et al.
Deep Neural Cellular Potts Models
Koen Minartz, Tim d'Hondt, Leon Hillmann et al.
Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
Yik Siu Chan, Narutatsu Ri, Yuxin Xiao et al.
NestQuant: nested lattice quantization for matrix products and LLMs
Semyon Savkin, Eitan Porat, Or Ordentlich et al.
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
Jiachen Hu, Rui Ai, Han Zhong et al.
Understanding the Logic of Direct Preference Alignment through Logic
Kyle Richardson, Vivek Srikumar, Ashish Sabharwal
Banyan: Improved Representation Learning with Explicit Structure
Mattia Opper, Siddharth N
Learning with Exact Invariances in Polynomial Time
Ashkan Soleymani, Behrooz Tahmasebi, Stefanie Jegelka et al.
Contextual Optimization Under Model Misspecification: A Tractable and Generalizable Approach
Omar Bennouna, Jiawei Zhang, Saurabh Amin et al.
FedClean: A General Robust Label Noise Correction for Federated Learning
Xiaoqian Jiang, Jing Zhang
Neural Solver Selection for Combinatorial Optimization
Chengrui Gao, Haopu Shang, Ke Xue et al.
FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain
Rohan Deb, Kiran Thekumparampil, Kousha Kalantari et al.
Distillation Scaling Laws
Dan Busbridge, Amitis Shidani, Floris Weers et al.
How Distributed Collaboration Influences the Diffusion Model Training? A Theoretical Perspective
Jing Qiao, Yu Liu, YUAN YUAN et al.
Modularized Self-Reflected Video Reasoner for Multimodal LLM with Application to Video Question Answering
Zihan Song, Xin Wang, Zi Qian et al.
Product of Experts with LLMs: Boosting Performance on ARC Is a Matter of Perspective
Daniel Franzen, Jan Disselhoff, David Hartmann
Physics-Informed Weakly Supervised Learning For Interatomic Potentials
Makoto Takamoto, Viktor Zaverkin, Mathias Niepert
General agents need world models
Jonathan Richens, Tom Everitt, David Abel
Testing Conditional Mean Independence Using Generative Neural Networks
Yi Zhang, Linjun Huang, Yun Yang et al.
WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models
Chinmay Savadikar, Xi Song, Tianfu Wu
A Theoretical Study of (Hyper) Self-Attention through the Lens of Interactions: Representation, Training, Generalization
Muhammed Ustaomeroglu, Guannan Qu
Implicit Language Models are RNNs: Balancing Parallelization and Expressivity
Mark Schoene, Babak Rahmani, Heiner Kremer et al.
Discovering Symbolic Cognitive Models from Human and Animal Behavior
Pablo Samuel Castro, Nenad Tomasev, Ankit Anand et al.
FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification
Zhen Sun, Lei Tan, Yunhang Shen et al.
Towards flexible perception with visual memory
Robert Geirhos, Priyank Jaini, Austin Stone et al.
Scalable Approximation Algorithms for $p$-Wasserstein Distance and Its Variants
Nathaniel Lahn, Sharath Raghvendra, Emma Saarinen et al.
Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM
Penghao Wu, Lewei Lu, Ziwei Liu
Sorbet: A Neuromorphic Hardware-Compatible Transformer-Based Spiking Language Model
Kaiwen Tang, Zhanglu Yan, Weng-Fai Wong
Online Episodic Convex Reinforcement Learning
Bianca Marin Moreno, Khaled Eldowa, Pierre Gaillard et al.
SkipGPT: Each Token is One of a Kind
Anhao Zhao, Fanghua Ye, Yingqi Fan et al.
Conservative Offline Goal-Conditioned Implicit V-Learning
Ke Kaiqiang, qian lin, Zongkai Liu et al.