Most Cited ICML "communication-computation overlap" Papers
Ad Hoc Teamwork via Offline Goal-Based Decision Transformers
Xinzhi Zhang, Hoehi Chan, Deheng Ye et al.
Noisy SIGNSGD Is More Differentially Private Than You (Might) Think
Richeng Jin, Huaiyu (David) Dai
Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent
Santhosh Karnik, Anna Veselovska, Mark Iwen et al.
AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement
Pranjal Aggarwal, Bryan Parno, Sean Welleck
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
Suning Huang, Zheyu Zhang, Tianhai Liang et al.
Efficient Federated Incomplete Multi-View Clustering
Suyuan Liu, Hao Yu, Hao Tan et al.
The Ripple Effect: On Unforeseen Complications of Backdoor Attacks
Rui Zhang, Yun Shen, Hongwei Li et al.
A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach
Swetha Ganesh, Washim Mondal, Vaneet Aggarwal
Simplifying DINO via Coding Rate Regularization
Ziyang Wu, Jingyuan Zhang, Druv Pai et al.
Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving
Yuxuan Zhou, Xien Liu, Chenwei Yan et al.
TraceGrad: a Framework Learning Expressive SO(3)-equivariant Non-linear Representations for Electronic-Structure Hamiltonian Prediction
Shi Yin, Xinyang Pan, Fengyan Wang et al.
CommVQ: Commutative Vector Quantization for KV Cache Compression
Junyan Li, Yang Zhang, Muhammad Yusuf Hassan et al.
From Language Models over Tokens to Language Models over Characters
Tim Vieira, Benjamin LeBrun, Mario Giulianelli et al.
Addressing Concept Mislabeling in Concept Bottleneck Models Through Preference Optimization
Emiliano Penaloza, Tianyue Zhang, Laurent Charlin et al.
SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph Retrieval
Nikolaos Chaidos, Angeliki Dimitriou, Maria Lymperaiou et al.
Contrastive Localized Language-Image Pre-Training
Hong-You Chen, Zhengfeng Lai, Haotian Zhang et al.
Robust Conformal Outlier Detection under Contaminated Reference Data
Meshi Bashari, Matteo Sesia, Yaniv Romano
Generalized Interpolating Discrete Diffusion
Dimitri von Rütte, Janis Fluri, Yuhui Ding et al.
Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks
Yuhang Cai, Kangjie Zhou, Jingfeng Wu et al.
Learning Attribute-Aware Hash Codes for Fine-Grained Image Retrieval via Query Optimization
Peng Wang, Yong Li, Lin Zhao et al.
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
Peijie Dong, Zhenheng Tang, Xiang Liu et al.
Observation Interference in Partially Observable Assistance Games
Scott Emmons, Caspar Oesterheld, Vincent Conitzer et al.
Improved and Oracle-Efficient Online $\ell_1$-Multicalibration
Rohan Ghuge, Vidya Muthukumar, Sahil Singla
One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation
Jianze Li, Jiezhang Cao, Yong Guo et al.
Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation
Jan Pauls, Max Zimmer, Berkant Turan et al.
Towards the Efficient Inference by Incorporating Automated Computational Phenotypes under Covariate Shift
Chao Ying, Jun Jin, Yi Guo et al.
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
Jen-Tse Huang, Jiaxu Zhou, Tailin Jin et al.
Bayesian Weight Enhancement with Steady-State Adaptation for Test-time Adaptation in Dynamic Environments
Jae-Hong Lee
A Sample Efficient Conditional Independence Test in the Presence of Discretization
Boyang Sun, Yu Yao, Xinshuai Dong et al.
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing
Subhash Kantamneni, Josh Engels, Senthooran Rajamanoharan et al.
Large Displacement Motion Transfer with Unsupervised Anytime Interpolation
Guixiang Wang, Jianjun Li
Securing Equal Share: A Principled Approach for Learning Multiplayer Symmetric Games
Jiawei Ge, Yuanhao Wang, Wenzhe Li et al.
"Why Is There a Tumor?": Tell Me the Reason, Show Me the Evidence
Mengmeng Ma, Tang Li, Yunxiang Peng et al.
On Teacher Hacking in Language Model Distillation
Daniil Tiapkin, Daniele Calandriello, Johan Ferret et al.
A Two-Stage Learning-to-Defer Approach for Multi-Task Learning
Yannis Montreuil, Shu Heng Yeo, Axel Carlier et al.
Exactly Tight Information-theoretic Generalization Bounds via Binary Jensen-Shannon Divergence
Yuxin Dong, Haoran Guo, Tieliang Gong et al.
Diverging Preferences: When do Annotators Disagree and do Models Know?
Michael Zhang, Zhilin Wang, Jena Hwang et al.
All-Purpose Mean Estimation over R: Optimal Sub-Gaussianity with Outlier Robustness and Low Moments Performance
Jasper Lee, Walter McKelvie, Maoyuan Song et al.
INRFlow: Flow Matching for INRs in Ambient Space
Yuyang Wang, Anurag Ranjan, Joshua M Susskind et al.
STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings
Saksham Rastogi, Pratyush Maini, Danish Pruthi
Self-cross Feature based Spiking Neural Networks for Efficient Few-shot Learning
Qi Xu, Junyang Zhu, Dongdong Zhou et al.
Active Treatment Effect Estimation via Limited Samples
Zhiheng Zhang, Haoxiang Wang, Haoxuan Li et al.
A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment
Raanan Yehezkel Rohekar, Yaniv Gurwicz, Sungduk Yu et al.
Self-Play $Q$-Learners Can Provably Collude in the Iterated Prisoner's Dilemma
Quentin Bertrand, Juan Duque, Emilio Calvano et al.
OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation Balance
Yongqiang Yao, Jingru Tan, Feizhao Zhang et al.
Simple Randomized Rounding for Max-Min Eigenvalue Augmentation
Jourdain Lamperski, Haeseong Yang, Oleg Prokopyev
DiffAdvMAP: Flexible Diffusion-Based Framework for Generating Natural Unrestricted Adversarial Examples
Zhengzhao Pan, Hua Chen, Xiaogang Zhang
Beyond Self-Interest: How Group Strategies Reshape Content Creation in Recommendation Platforms?
Yaolong Yu, Fan Yao, Sinno Jialin Pan
Enhancing Visual Localization with Cross-Domain Image Generation
Yuanze Wang, Yichao Yan, Shiming Song et al.
Deep Reinforcement Learning from Hierarchical Preference Design
Alexander Bukharin, Yixiao Li, Pengcheng He et al.
Rethinking Time Encoding via Learnable Transformation Functions
Xi Chen, Yateng Tang, Jiarong Xu et al.
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning
Zican Hu, Wei Liu, Xiaoye Qu et al.
Random Policy Evaluation Uncovers Policies of Generative Flow Networks
Haoran He, Emmanuel Bengio, Qingpeng Cai et al.
One Stone, Two Birds: Enhancing Adversarial Defense Through the Lens of Distributional Discrepancy
Jiacheng Zhang, Benjamin Rubinstein, Jingfeng Zhang et al.
Generalized Category Discovery via Reciprocal Learning and Class-Wise Distribution Regularization
Duo Liu, Zhiquan Tan, Linglan Zhao et al.
Inductive Gradient Adjustment for Spectral Bias in Implicit Neural Representations
Kexuan Shi, Hai Chen, Leheng Zhang et al.
Revisiting Convergence: Shuffling Complexity Beyond Lipschitz Smoothness
Qi He, Peiran Yu, Ziyi Chen et al.
Causality Inspired Federated Learning for OOD Generalization
Jiayuan Zhang, Xuefeng Liu, Jianwei Niu et al.
Learning Efficient Robotic Garment Manipulation with Standardization
zhou changshi, Feng Luan, hujiarui et al.
Efficient Heterogeneity-Aware Federated Active Data Selection
Yingpeng Tang, Chao Ren, Xiaoli Tang et al.
Reflect-then-Plan: Offline Model-Based Planning through a Doubly Bayesian Lens
Jihwan Jeong, Xiaoyu Wang, Jingmin Wang et al.
Preconditioned Riemannian Gradient Descent Algorithm for Low-Multilinear-Rank Tensor Completion
Yuanwei Zhang, Fengmiao Bian, Xiaoqun Zhang et al.
Fast and Provable Algorithms for Sparse PCA with Improved Sample Complexity
Jian-Feng Cai, Zhuozhi XIAN, Jiaxi Ying
Unlocking the Power of Rehearsal in Continual Learning: A Theoretical Perspective
Junze Deng, Qinhang Wu, Peizhong Ju et al.
Grammar-Forced Translation of Natural Language to Temporal Logic using LLMs
William English, Dominic Simon, Sumit Jha et al.
A Tale of Two Structures: Do LLMs Capture the Fractal Complexity of Language?
Ibrahim Alabdulmohsin, Andreas Steiner
Optimal and Practical Batched Linear Bandit Algorithm
Sanghoon Yu, Min-hwan Oh
Zero-Shot Adaptation of Parameter-Efficient Fine-Tuning in Diffusion Models
Farzad Farhadzadeh, Debasmit Das, Shubhankar Borse et al.
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts
Zhongyang Li, Ziyue Li, Tianyi Zhou
Equivariant Neural Tangent Kernels
Philipp Misof, Pan Kessel, Jan Gerken
Empowering World Models with Reflection for Embodied Video Prediction
Xiaowei Chi, Chun-Kai Fan, Hengyuan Zhang et al.
LoRA-Gen: Specializing Large Language Model via Online LoRA Generation
Yicheng Xiao, Lin Song, Rui Yang et al.
On the Private Estimation of Smooth Transport Maps
Clément Lalanne, Franck Iutzeler, Jean-Michel Loubes et al.
Data-driven Design of Randomized Control Trials with Guaranteed Treatment Effects
Santiago Cortes-Gomez, Naveen Raman, Aarti Singh et al.
Rethinking Point Cloud Data Augmentation: Topologically Consistent Deformation
Jian Bi, Qianliang Wu, Xiang Li et al.
LoRA Training Provably Converges to a Low-Rank Global Minimum Or It Fails Loudly (But it Probably Won't Fail)
Junsu Kim, Jaeyeon Kim, Ernest Ryu
Protein Structure Tokenization: Benchmarking and New Recipe
Xinyu Yuan, Zichen Wang, Marcus Collins et al.
How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects
Wonkwang Lee, Jongwon Jeong, Taehong Moon et al.
Near Optimal Non-asymptotic Sample Complexity of 1-Identification
Zitian Li, Wang Chi Cheung
Learning Adversarial MDPs with Stochastic Hard Constraints
Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.
TINED: GNNs-to-MLPs by Teacher Injection and Dirichlet Energy Distillation
Ziang Zhou, Zhihao DING, Jieming Shi et al.
Larger or Smaller Reward Margins to Select Preferences for LLM Alignment?
Kexin Huang, Junkang Wu, Ziqian Chen et al.
Causal Logistic Bandits with Counterfactual Fairness Constraints
Jiajun Chen, Jin Tian, Chris Quinn
Rethink GraphODE Generalization within Coupled Dynamical System
Guancheng Wan, Zijie Huang, Wanjia Zhao et al.
EFDTR: Learnable Elliptical Fourier Descriptor Transformer for Instance Segmentation
Jiawei Cao, Chaochen Gu, Hao Cheng et al.
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
Jintao Zhang, Haofeng Huang, Pengle Zhang et al.
Identifying Neural Dynamics Using Interventional State Space Models
Amin Nejatbakhsh, Yixin Wang
Efficiently Serving Large Multimodal Models Using EPD Disaggregation
Gursimran Singh, Xinglu Wang, Yifan Hu et al.
EasyInv: Toward Fast and Better DDIM Inversion
Ziyue Zhang, Mingbao Lin, Shuicheng YAN et al.
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Yiran Wang, Chenshu Liu, Yunfan Li et al.
Diffusion Models are Secretly Exchangeable: Parallelizing DDPMs via Auto Speculation
Hengyuan Hu, Aniket Das, Dorsa Sadigh et al.
Tensorized Multi-View Multi-Label Classification via Laplace Tensor Rank
Qiyu Zhong, Yi Shan, Haobo Wang et al.
Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding
Tian Jin, Ellie Cheng, Zachary Ankner et al.
Invariant Deep Uplift Modeling for Incentive Assignment in Online Marketing via Probability of Necessity and Sufficiency
Zexu Sun, Qiyu Han, Hao Yang et al.
An Improved Clique-Picking Algorithm for Counting Markov Equivalent DAGs via Super Cliques Transfer
Lifu Liu, Shiyuan He, Jianhua Guo
An Online Learning Approach to Prompt-based Selection of Generative Models and LLMs
Xiaoyan Hu, Ho-fung Leung, Farzan Farnia
HYGMA: Hypergraph Coordination Networks with Dynamic Grouping for Multi-Agent Reinforcement Learning
Chiqiang Liu, Dazi Li
Continual Reinforcement Learning by Planning with Online World Models
Zichen Liu, Guoji Fu, Chao Du et al.
AEQA-NAT : Adaptive End-to-end Quantization Alignment Training Framework for Non-autoregressive Machine Translation
Xiangyu Qu, Guojing Liu, Liang Li
Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners
Hung Manh Pham, Aaqib Saeed, Dong Ma
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
Wanyun Xie, Francesco Tonin, Volkan Cevher
Functional Alignment Can Mislead: Examining Model Stitching
Damian Smith, Harvey Mannering, Antonia Marcu
FicGCN: Unveiling the Homomorphic Encryption Efficiency from Irregular Graph Convolutional Networks
Zhaoxuan Kan, Husheng Han, Shangyi Shi et al.
An Instrumental Value for Data Production and its Application to Data Pricing
Rui Ai, Boxiang Lyu, Zhaoran Wang et al.
Fishers for Free? Approximating the Fisher Information Matrix by Recycling the Squared Gradient Accumulator
YuXin Li, Felix Dangel, Derek Tam et al.
Learning Multi-Level Features with Matryoshka Sparse Autoencoders
Bart Bussmann, Noa Nabeshima, Adam Karvonen et al.
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
Roman Abramov, Felix Steinbauer, Gjergji Kasneci
Code-Generated Graph Representations Using Multiple LLM Agents for Material Properties Prediction
Jiao Huang, Qianli Xing, Jinglong Ji et al.
CAN: Leveraging Clients As Navigators for Generative Replay in Federated Continual Learning
Xuankun Rong, Jianshu Zhang, Kun He et al.
On the Clean Generalization and Robust Overfitting in Adversarial Training from Two Theoretical Views: Representation Complexity and Training Dynamics
Binghui Li, Yuanzhi Li
FeatSharp: Your Vision Model Features, Sharper
Mike Ranzinger, Greg Heinrich, Pavlo Molchanov et al.
LBI-FL: Low-Bit Integerized Federated Learning with Temporally Dynamic Bit-Width Allocation
Li Ding, Hao Zhang, Wenrui Dai et al.
WILTing Trees: Interpreting the Distance Between MPNN Embeddings
Masahiro Negishi, Thomas Gärtner, Pascal Welke
Private Lossless Multiple Release
Joel Daniel Andersson, Lukas Retschmeier, Boel Nelson et al.
Making Hard Problems Easier with Custom Data Distributions and Loss Regularization: A Case Study in Modular Arithmetic
Eshika Saxena, Alberto Alfarano, Emily Wenger et al.
Disentangling and Integrating Relational and Sensory Information in Transformer Architectures
Awni Altabaa, John Lafferty
Directed Graph Grammars for Sequence-based Learning
Michael Sun, Orion Foo, Gang Liu et al.
Cooperation of Experts: Fusing Heterogeneous Information with Large Margin
Shuo Wang, Shunyang Huang, Jinghui Yuan et al.
WOMD-Reasoning: A Large-Scale Dataset for Interaction Reasoning in Driving
Yiheng Li, Cunxin Fan, Chongjian GE et al.
Branches: Efficiently Seeking Optimal Sparse Decision Trees via AO*
Ayman Chaouki, Jesse Read, Albert Bifet
SDMG: Smoothing Your Diffusion Models for Powerful Graph Representation Learning
Junyou Zhu, Langzhou He, Chao Gao et al.
Differential Privacy Guarantees of Markov Chain Monte Carlo Algorithms
Andrea Bertazzi, Tim Johnston, Gareth Roberts et al.
Best of Both Worlds: Regret Minimization versus Minimax Play
Adrian Müller, Jon Schneider, Efstratios Panteleimon Skoulakis et al.
Low-Rank Adapting Models for Sparse Autoencoders
Matthew Chen, Josh Engels, Max Tegmark
Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning
Zeyu Gan, Yun Liao, Yong Liu
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
Samira Abnar, Harshay Shah, Dan Busbridge et al.
Training High Performance Spiking Neural Network by Temporal Model Calibration
Jiaqi Yan, Changping Wang, De Ma et al.
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
Oliver Sieberling, Denis Kuznedelev, Eldar Kurtic et al.
Limitations of measure-first protocols in quantum machine learning
Casper Gyurik, Riccardo Molteni, Vedran Dunjko
LDMol: A Text-to-Molecule Diffusion Model with Structurally Informative Latent Space Surpasses AR Models
Jinho Chang, Jong Chul YE
Scalable Non-Equivariant 3D Molecule Generation via Rotational Alignment
Yuhui Ding, Thomas Hofmann
A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized Certainty Equivalents
Kaiwen Wang, Dawen Liang, Nathan Kallus et al.
Explicit Discovery of Nonlinear Symmetries from Dynamic Data
Lexiang Hu, Yikang Li, Zhouchen Lin
LIMEFLDL: A Local Interpretable Model-Agnostic Explanations Approach for Label Distribution Learning
Xiuyi Jia, Jinchi Li, Yunan Lu et al.
Exponential Family Variational Flow Matching for Tabular Data Generation
Andres Guzman Cordero, Floor Eijkelboom, Jan-Willem van de Meent
Hardware and Software Platform Inference
Cheng Zhang, Hanna Foerster, Robert Mullins et al.
Nonlinearly Preconditioned Gradient Methods under Generalized Smoothness
Konstantinos Oikonomidis, Jan Quan, Emanuel Laude et al.
The Elicitation Game: Evaluating Capability Elicitation Techniques
Felix Hofstätter, Teun van der Weij, Jayden Teoh et al.
What Limits Bidirectional Model's Generative Capabilities? A Uni-Bi-Directional Mixture-of-Expert Method For Bidirectional Fine-tuning
Zuchao Li, Yonghua Hei, Qiwei Li et al.
Falsification of Unconfoundedness by Testing Independence of Causal Mechanisms
Rickard K.A. Karlsson, Jesse H. Krijthe
Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training
Mozhi Zhang, Howe Tissue, Lu Wang et al.
A Closer Look at Transformers for Time Series Forecasting: Understanding Why They Work and Where They Struggle
Yu Chen, Nathalia Céspedes, Payam Barnaghi
Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search
Boyan Li, Jiayi Zhang, Ju Fan et al.
How to Evaluate and Mitigate IP Infringement in Visual Generative AI?
Zhenting Wang, Chen Chen, Vikash Sehwag et al.
Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing
Zhuoran Zhang, Yongxiang Li, Zijian Kan et al.
On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization
Undral Byambadalai, Tomu Hirata, Tatsushi Oka et al.
Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model
Kaito Ariu, Alexandre Proutiere, Se-Young Yun
Better to Teach than to Give: Domain Generalized Semantic Segmentation via Agent Queries with Diffusion Model Guidance
Fan Li, Xuan Wang, Min Qi et al.
Instruction-Following Pruning for Large Language Models
Bairu Hou, Qibin Chen, Jianyu Wang et al.
Domain-Adapted Diffusion Model for PROTAC Linker Design Through the Lens of Density Ratio in Chemical Space
Zixing Song, Ziqiao Meng, Jose Miguel Hernandez-Lobato
Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset
Hao Zhou, Xu Yang, Mingyu Fan et al.
Continuously Updating Digital Twins using Large Language Models
Harry Amad, Nicolás Astorga, Mihaela van der Schaar
Do Not Mimic My Voice : Speaker Identity Unlearning for Zero-Shot Text-to-Speech
Taesoo Kim, Jinju Kim, Dongchan Kim et al.
HGOT: Self-supervised Heterogeneous Graph Neural Network with Optimal Transport
Yanbei Liu, Chongxu Wang, Zhitao Xiao et al.
PAC-Bayes Analysis for Recalibration in Classification
Masahiro Fujisawa, Futoshi Futami
Global Context-aware Representation Learning for Spatially Resolved Transcriptomics
Yunhak Oh, Junseok Lee, Yeongmin Kim et al.
Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition
Zichen Wang, Chuanhao Li, Huazheng Wang
Aggregation of Dependent Expert Distributions in Multimodal Variational Autoencoders
Rogelio A. Mancisidor, Robert Jenssen, Shujian Yu et al.
Testing the Limits of Fine-Tuning for Improving Visual Cognition in Vision Language Models
Luca M. Schulze Buschoff, Konstantinos Voudouris, Elif Akata et al.
Black-Box Adversarial Attacks on LLM-Based Code Completion
Slobodan Jenko, Niels Mündler, Jingxuan He et al.
Behavior-agnostic Task Inference for Robust Offline In-context Reinforcement Learning
Long Ma, Fangwei Zhong, Yizhou Wang
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou, Aleksa Sukovic, Yasaman Zolfimoselo et al.
Distillation of Discrete Diffusion through Dimensional Correlations
Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi et al.
Clustering Properties of Self-Supervised Learning
Xi Weng, Jianing An, Xudong Ma et al.
SecEmb: Sparsity-Aware Secure Federated Learning of On-Device Recommender System with Large Embedding
Peihua Mai, Youlong Ding, Ziyan Lyu et al.
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Maohao Shen, Guangtao Zeng, Zhenting Qi et al.
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Nayoung Lee, Jack Cai, Avi Schwarzschild et al.
DeepCrossAttention: Supercharging Transformer Residual Connections
Mike Heddes, Adel Javanmard, Kyriakos Axiotis et al.
ML$^2$-GCL: Manifold Learning Inspired Lightweight Graph Contrastive Learning
Jianqing Liang, Zhiqiang Li, Xinkai Wei et al.
Multi-agent Architecture Search via Agentic Supernet
Guibin Zhang, Luyang Niu, Junfeng Fang et al.
BaxBench: Can LLMs Generate Correct and Secure Backends?
Mark Vero, Niels Mündler, Viktor Chibotaru et al.
Ensemble Learned Bloom Filters: Two Oracles are Better than One
Ming Lin, Lin CHEN
Learn Beneficial Noise as Graph Augmentation
Siqi Huang, Yanchen Xu, Hongyuan Zhang et al.
Radio: Rate–Distortion Optimization for Large Language Model Compression
Sean I. Young
An Analysis of Quantile Temporal-Difference Learning
Mark Rowland, Remi Munos, Mohammad Gheshlaghi Azar et al.