Most Cited ICML "causal continual pre-training" Papers
5,975 papers found • Page 5 of 30
Conference
Safety Certificate against Latent Variables with Partially Unidentifiable Dynamics
Haoming Jing, Yorie Nakahira
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou, Aleksa Sukovic, Yasaman Zolfimoselo et al.
Hyper-Transforming Latent Diffusion Models
Ignacio Peis, Batuhan Koyuncu, Isabel Valera et al.
REG: Rectified Gradient Guidance for Conditional Diffusion Models
Zhengqi Gao, Kaiwen Zha, Tianyuan Zhang et al.
Active feature acquisition via explainability-driven ranking
Osman Berke Guney, Ketan Saichandran, Karim Elzokm et al.
Behavior-agnostic Task Inference for Robust Offline In-context Reinforcement Learning
Long Ma, Fangwei Zhong, Yizhou Wang
Investigating the Overlooked Hessian Structure: From CNNs to LLMs
Qian-Yuan Tang, Yufei Gu, Yunfeng Cai et al.
The Polynomial Stein Discrepancy for Assessing Moment Convergence
Narayan Srinivasan, Matthew Sutton, Christopher Drovandi et al.
Reward-Guided Prompt Evolving in Reinforcement Learning for LLMs
Ziyu Ye, Rishabh Agarwal, Tianqi Liu et al.
Black-Box Adversarial Attacks on LLM-Based Code Completion
Slobodan Jenko, Niels Mündler, Jingxuan He et al.
Position: Human Baselines in Model Evaluations Need Rigor and Transparency (With Recommendations & Reporting Checklist)
Kevin Wei, Patricia Paskov, Sunishchal Dev et al.
FairICP: Encouraging Equalized Odds via Inverse Conditional Permutation
Yuheng Lai, Leying Guan
Testing the Limits of Fine-Tuning for Improving Visual Cognition in Vision Language Models
Luca M. Schulze Buschoff, Konstantinos Voudouris, Elif Akata et al.
Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition
Zichen Wang, Chuanhao Li, Huazheng Wang
Global Context-aware Representation Learning for Spatially Resolved Transcriptomics
Yunhak Oh, Junseok Lee, Yeongmin Kim et al.
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Xu Liu, Juncheng Liu, Gerald Woo et al.
Better to Teach than to Give: Domain Generalized Semantic Segmentation via Agent Queries with Diffusion Model Guidance
Fan Li, Xuan Wang, Min Qi et al.
Geometric Median (GM) Matching for Robust k-Subset Selection from Noisy Data
Anish Acharya, Sujay Sanghavi, Alex Dimakis et al.
Empirical Design in Reinforcement Learning
Andrew Patterson, Samuel F Neumann, Martha White et al.
Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model
Kaito Ariu, Alexandre Proutiere, Se-Young Yun
A Closer Look at Transformers for Time Series Forecasting: Understanding Why They Work and Where They Struggle
Yu Chen, Nathalia Céspedes, Payam Barnaghi
LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
Zhuoling Li, Xiaogang Xu, Zhenhua Xu et al.
Provable Benefit of Random Permutations over Uniform Sampling in Stochastic Coordinate Descent
Donghwa Kim, Jaewook Lee, Chulhee Yun
Learning Vision and Language Concepts for Controllable Image Generation
Shaoan Xie, Lingjing Kong, Yujia Zheng et al.
MindCustomer: Multi-Context Image Generation Blended with Brain Signal
Muzhou Yu, Shuyun Lin, Lei Ma et al.
SDE Matching: Scalable and Simulation-Free Training of Latent Stochastic Differential Equations
Grigory Bartosh, Dmitry Vetrov, Christian Andersson Naesseth
Improved Off-policy Reinforcement Learning in Biological Sequence Design
Hyeonah Kim, Minsu Kim, Taeyoung Yun et al.
Joint Metric Space Embedding by Unbalanced Optimal Transport with Gromov–Wasserstein Marginal Penalization
Florian Beier, Moritz Piening, Robert Beinert et al.
Exploring Vision Semantic Prompt for Efficient Point Cloud Understanding
Yixin Zha, Chuxin Wang, Wenfei Yang et al.
Linear convergence of Sinkhorn's algorithm for generalized static Schrödinger bridge
Rahul Choudhary, Hanbaek Lyu
Falsification of Unconfoundedness by Testing Independence of Causal Mechanisms
Rickard K.A. Karlsson, Jesse H. Krijthe
Explicit Discovery of Nonlinear Symmetries from Dynamic Data
Lexiang Hu, Yikang Li, Zhouchen Lin
Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data
Guan Zhong, Likang Wu, Hongke Zhao et al.
Scalable Non-Equivariant 3D Molecule Generation via Rotational Alignment
Yuhui Ding, Thomas Hofmann
LDMol: A Text-to-Molecule Diffusion Model with Structurally Informative Latent Space Surpasses AR Models
Jinho Chang, Jong Chul YE
DiLQR: Differentiable Iterative Linear Quadratic Regulator via Implicit Differentiation
Shuyuan Wang, Philip D. Loewen, Michael Forbes et al.
Limitations of measure-first protocols in quantum machine learning
Casper Gyurik, Riccardo Molteni, Vedran Dunjko
RULEBREAKERS: Challenging LLMs at the Crossroads between Formal Logic and Human-like Reasoning
Jason Chan, Robert Gaizauskas, Zhixue Zhao
iN2V: Bringing Transductive Node Embeddings to Inductive Graphs
Nicolas Lell, Ansgar Scherp
Generalized Random Forests Using Fixed-Point Trees
David Fleischer, David A Stephens, Archer Yang
Training High Performance Spiking Neural Network by Temporal Model Calibration
Jiaqi Yan, Changping Wang, De Ma et al.
Best of Both Worlds: Regret Minimization versus Minimax Play
Adrian Müller, Jon Schneider, EFSTRATIOS PANTELEIMON SKOULAKIS et al.
SDMG: Smoothing Your Diffusion Models for Powerful Graph Representation Learning
Junyou Zhu, Langzhou He, Chao Gao et al.
Explaining, Fast and Slow: Abstraction and Refinement of Provable Explanations
Shahaf Bassan, Yizhak Elboher, Tobias Ladner et al.
Primal-Dual Neural Algorithmic Reasoning
Yu He, Ellen Vitercik
Directed Graph Grammars for Sequence-based Learning
Michael Sun, Orion Foo, Gang Liu et al.
Improving Out-of-Distribution Detection with Markov Logic Networks
Konstantin Kirchheim, Frank Ortmeier
Making Hard Problems Easier with Custom Data Distributions and Loss Regularization: A Case Study in Modular Arithmetic
Eshika Saxena, Alberto Alfarano, Emily Wenger et al.
LBI-FL: Low-Bit Integerized Federated Learning with Temporally Dynamic Bit-Width Allocation
Li Ding, Hao Zhang, Wenrui Dai et al.
Constrained Pareto Set Identification with Bandit Feedback
Cyrille Kone, Emilie Kaufmann, Laura Richert
Fishers for Free? Approximating the Fisher Information Matrix by Recycling the Squared Gradient Accumulator
YuXin Li, Felix Dangel, Derek Tam et al.
Double Machine Learning for Causal Inference under Shared-State Interference
Chris Hays, Manish Raghavan
A Cross Modal Knowledge Distillation & Data Augmentation Recipe for Improving Transcriptomics Representations through Morphological Features
Ihab Bendidi, Yassir El Mesbahi, Alisandra Denton et al.
Reflection-Window Decoding: Text Generation with Selective Refinement
Zeyu Tang, Zhenhao Chen, Xiangchen Song et al.
FicGCN: Unveiling the Homomorphic Encryption Efficiency from Irregular Graph Convolutional Networks
Zhaoxuan Kan, Husheng Han, shangyi shi et al.
The Missing Alignment Link of In-context Learning on Sequences
Harshvardhan Agarwal, Sunita Sarawagi
Functional Alignment Can Mislead: Examining Model Stitching
Damian Smith, Harvey Mannering, Antonia Marcu
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
Wanyun Xie, Francesco Tonin, Volkan Cevher
Probabilistic Factorial Experimental Design for Combinatorial Interventions
Divya Shyamal, Jiaqi Zhang, Caroline Uhler
Eliciting Language Model Behaviors with Investigator Agents
Xiang Li, Neil Chowdhury, Daniel Johnson et al.
Graph Minimum Factor Distance and Its Application to Large-Scale Graph Data Clustering
Jicong Fan
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Rui Yang, Hanyang(Jeremy) Chen, Junyu Zhang et al.
The Berkeley Function Calling Leaderboard (BFCL): From Tool Use to Agentic Evaluation of Large Language Models
Shishir G. Patil, Huanzhi Mao, Fanjia Yan et al.
AEQA-NAT : Adaptive End-to-end Quantization Alignment Training Framework for Non-autoregressive Machine Translation
Xiangyu Qu, Guojing Liu, Liang Li
Continual Reinforcement Learning by Planning with Online World Models
Zichen Liu, Guoji Fu, Chao Du et al.
Dynamical phases of short-term memory mechanisms in RNNs
Bariscan Kurtkaya, Fatih Dinc, Mert Yuksekgonul et al.
Instance Correlation Graph-based Naive Bayes
Chengyuan Li, Liangxiao Jiang, Wenjun Zhang et al.
Semantic Shift Estimation via Dual-Projection and Classifier Reconstruction for Exemplar-Free Class-Incremental Learning
Run He, Di Fang, Yicheng Xu et al.
HYGMA: Hypergraph Coordination Networks with Dynamic Grouping for Multi-Agent Reinforcement Learning
Chiqiang Liu, Dazi Li
An Online Learning Approach to Prompt-based Selection of Generative Models and LLMs
Xiaoyan Hu, Ho-fung Leung, Farzan Farnia
DTZO: Distributed Trilevel Zeroth Order Learning with Provable Non-Asymptotic Convergence
Yang Jiao, Kai Yang, Chengtao Jian
An Improved Clique-Picking Algorithm for Counting Markov Equivalent DAGs via Super Cliques Transfer
Lifu Liu, Shiyuan He, Jianhua Guo
Efficient Multi-modal Long Context Learning for Training-free Adaptation
Zehong Ma, Shiliang Zhang, Longhui Wei et al.
Invariant Deep Uplift Modeling for Incentive Assignment in Online Marketing via Probability of Necessity and Sufficiency
Zexu Sun, Qiyu Han, Hao Yang et al.
Adapting While Learning: Grounding LLMs for Scientific Problems with Tool Usage Adaptation
Bohan Lyu, Yadi Cao, Duncan Watson-Parris et al.
Sample-specific Noise Injection for Diffusion-based Adversarial Purification
Yuhao Sun, Jiacheng Zhang, Zesheng Ye et al.
Exploiting Presentative Feature Distributions for Parameter-Efficient Continual Learning of Large Language Models
Xin Cheng, Jiabo Ye, Haiyang Xu et al.
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Yiran Wang, Chenshu Liu, Yunfan Li et al.
Identifying Neural Dynamics Using Interventional State Space Models
Amin Nejatbakhsh, Yixin Wang
Diversified Flow Matching with Translation Identifiability
Sagar Shrestha, Xiao Fu
DataDecide: How to Predict Best Pretraining Data with Small Experiments
Ian Magnusson, Tai Nguyen, Ben Bogin et al.
Gradient Aligned Regression via Pairwise Losses
Dixian Zhu, Tianbao Yang, Livnat Jerby
NeuroTree: Hierarchical Functional Brain Pathway Decoding for Mental Health Disorders
Jun-En Ding, Dongsheng Luo, Chenwei Wu et al.
RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding
Guanzheng Chen, Qilong Feng, Jinjie Ni et al.
Geometric Representation Condition Improves Equivariant Molecule Generation
Zian Li, Cai Zhou, Xiyuan Wang et al.
A New Approach to Backtracking Counterfactual Explanations: A Unified Causal Framework for Efficient Model Interpretability
Pouria Fatemi, Ehsan Sharifian, Mohammad Hossein Yassaee
Rethink GraphODE Generalization within Coupled Dynamical System
Guancheng Wan, Zijie Huang, Wanjia Zhao et al.
Weak-to-Strong Jailbreaking on Large Language Models
Xuandong Zhao, Xianjun Yang, Tianyu Pang et al.
Optimal Algorithm for Max-Min Fair Bandit
Zilong Wang, Zhiyao Zhang, Shuai Li
Causal Logistic Bandits with Counterfactual Fairness Constraints
Jiajun Chen, Jin Tian, Chris Quinn
GPEN: Global Position Encoding Network for Enhanced Subgraph Representation Learning
Nannan Wu, Yuming Huang, Yiming Zhao et al.
SafeArena: Evaluating the Safety of Autonomous Web Agents
Ada Tur, Nicholas Meade, Xing Han Lù et al.
Larger or Smaller Reward Margins to Select Preferences for LLM Alignment?
Kexin Huang, Junkang Wu, Ziqian Chen et al.
Robust Noise Attenuation via Adaptive Pooling of Transformer Outputs
Greyson Brothers
LineFlow: A Framework to Learn Active Control of Production Lines
Kai Müller, Martin Wenzel, Tobias Windisch
Revisiting Chain-of-Thought in Code Generation: Do Language Models Need to Learn Reasoning before Coding?
Ren-Biao Liu, Anqi Li, ChaodingYang et al.
LoRA Training Provably Converges to a Low-Rank Global Minimum Or It Fails Loudly (But it Probably Won't Fail)
Junsu Kim, Jaeyeon Kim, Ernest Ryu
Homophily Enhanced Graph Domain Adaptation
Ruiyi Fang, Bingheng Li, Jingyu Zhao et al.
Continuous-Time Analysis of Heavy Ball Momentum in Min-Max Games
Yi Feng, Kaito Fujii, EFSTRATIOS PANTELEIMON SKOULAKIS et al.
Fluctuations of the largest eigenvalues of transformed spiked Wigner matrices
Aro Lee, Ji Oon Lee
Beyond Zero Initialization: Investigating the Impact of Non-Zero Initialization on LoRA Fine-Tuning Dynamics
Shiwei Li, Xiandi Luo, Xing Tang et al.
Data-driven Design of Randomized Control Trials with Guaranteed Treatment Effects
Santiago Cortes-Gomez, Naveen Raman, Aarti Singh et al.
Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders
Charles O'Neill, Alim Gumran, David Klindt
All-atom inverse protein folding through discrete flow matching
Kai Yi, Kiarash Jamali, Sjors Scheres
On the Private Estimation of Smooth Transport Maps
Clément Lalanne, Franck Iutzeler, Loubes Jean-Michel et al.
EditLord: Learning Code Transformation Rules for Code Editing
Weichen Li, Albert Jan, Baishakhi Ray et al.
Approximately Correct Label Distribution Learning
Weiwei Li, Haitao Wu, Yunan Lu et al.
Value-Based Deep RL Scales Predictably
Oleh Rybkin, Michal Nauman, Preston Fu et al.
R.I.P.: Better Models by Survival of the Fittest Prompts
Ping Yu, Weizhe Yuan, Olga Golovneva et al.
Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization
Phillip Guo, Aaquib Syed, Abhay Sheshadri et al.
C2IQL: Constraint-Conditioned Implicit Q-learning for Safe Offline Reinforcement Learning
Zifan LIU, Xinran Li, Jun Zhang
AKRMap: Adaptive Kernel Regression for Trustworthy Visualization of Cross-Modal Embeddings
Yilin Ye, Junchao Huang, Xingchen ZENG et al.
Prior Knowledge Guided Neural Architecture Generation
Jingrong Xie, Han Ji, Yanan Sun
Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean Field Games
Antonio Ocello, Daniil Tiapkin, Lorenzo Mancini et al.
Convergence of Policy Mirror Descent Beyond Compatible Function Approximation
Uri Sherman, Tomer Koren, Yishay Mansour
OptMATH: A Scalable Bidirectional Data Synthesis Framework for Optimization Modeling
Hongliang Lu, Zhonglin Xie, Yaoyu Wu et al.
A Tale of Two Structures: Do LLMs Capture the Fractal Complexity of Language?
Ibrahim Alabdulmohsin, Andreas Steiner
Consensus Is All You Get: The Role of Attention in Transformers
Alvaro Rodriguez Abella, João Pedro Silvestre, Paulo Tabuada
Unlocking the Power of Rehearsal in Continual Learning: A Theoretical Perspective
Junze Deng, Qinhang Wu, Peizhong Ju et al.
Reflect-then-Plan: Offline Model-Based Planning through a Doubly Bayesian Lens
Jihwan Jeong, Xiaoyu Wang, Jingmin Wang et al.
A Bregman Proximal Viewpoint on Neural Operators
Abdel-Rahim Mezidi, Jordan Patracone, Saverio Salzo et al.
Residual TPP: A Unified Lightweight Approach for Event Stream Data Analysis
Ruoxin Yuan, Guanhua Fang
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Baohao Liao, Yuhui Xu, Hanze Dong et al.
A First-order Generative Bilevel Optimization Framework for Diffusion Models
Quan Xiao, Hui Yuan, A F M Saif et al.
Optimal Survey Design for Private Mean Estimation
Yu-Wei Chen, Raghu Pasupathy, Jordan A Awan
Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback
Canzhe Zhao, Yutian Cheng, Jing Dong et al.
One Stone, Two Birds: Enhancing Adversarial Defense Through the Lens of Distributional Discrepancy
Jiacheng Zhang, Benjamin Rubinstein, Jingfeng Zhang et al.
Benefits of Early Stopping in Gradient Descent for Overparameterized Logistic Regression
Jingfeng Wu, Peter Bartlett, Matus Telgarsky et al.
Geometric Generative Modeling with Noise-Conditioned Graph Networks
Peter Pao-Huang, Mitchell Black, Xiaojie Qiu
NICE Data Selection for Instruction Tuning in LLMs with Non-differentiable Evaluation Metric
Jingtan Wang, Xiaoqiang Lin, Rui Qiao et al.
Multi-Timescale Dynamics Model Bayesian Optimization for Plasma Stabilization in Tokamaks
Rohit Sonker, Alexandre Capone, Andrew Rothstein et al.
VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data
Thomas Zeng, Shuibai Zhang, Shutong Wu et al.
The Generalized Skew Spectrum of Graphs
Armando Bellante, Martin Plávala, Alessandro Luongo
NMA-tune: Generating Highly Designable and Dynamics Aware Protein Backbones
Urszula Julia Komorowska, Francisco Vargas, Alessandro Rondina et al.
TRUST-VLM: Thorough Red-Teaming for Uncovering Safety Threats in Vision-Language Models
Kangjie Chen, Muyang Li, Guanlin Li et al.
3D-LMVIC: Learning-based Multi-View Image Compression with 3D Gaussian Geometric Priors
Yujun Huang, Bin Chen, Niu Lian et al.
Generalization of noisy SGD in unbounded non-convex settings
Leello Dadi, Volkan Cevher
Distributed Nonparametric Estimation: from Sparse to Dense Samples per Terminal
Deheng Yuan, Tao Guo, Zhongyi Huang
Deep Reinforcement Learning from Hierarchical Preference Design
Alexander Bukharin, Yixiao Li, Pengcheng He et al.
Training Diffusion-based Generative Models with Limited Data
Zhaoyu Zhang, Yang Hua, Guanxiong Sun et al.
A Selective Learning Method for Temporal Graph Continual Learning
Hanmo Liu, Shimin Di, Haoyang LI et al.
Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling
Haebin Shin, Lei Ji, Xiao Liu et al.
Enhancing Visual Localization with Cross-Domain Image Generation
Yuanze Wang, Yichao Yan, Shiming Song et al.
Fast Min-$\epsilon$ Segmented Regression using Constant-Time Segment Merging
Ansgar Lößer, Max Schlecht, Florian Schintke et al.
A General Representation-Based Approach to Multi-Source Domain Adaptation
Ignavier Ng, Yan Li, Zijian Li et al.
DiffAdvMAP: Flexible Diffusion-Based Framework for Generating Natural Unrestricted Adversarial Examples
Zhengzhao Pan, Hua Chen, Xiaogang Zhang
FAB-PPI: Frequentist, Assisted by Bayes, Prediction-Powered Inference
Stefano Cortinovis, Francois Caron
Empower Structure-Based Molecule Optimization with Gradient Guided Bayesian Flow Networks
Keyue Qiu, Yuxuan Song, Jie Yu et al.
Emergence and Effectiveness of Task Vectors in In-Context Learning: An Encoder Decoder Perspective
Seungwook Han, Jinyeop Song, Jeff Gore et al.
Overcoming Spurious Solutions in Semi-Dual Neural Optimal Transport: A Smoothing Approach for Learning the Optimal Transport Plan
Jaemoo Choi, Jaewoong Choi, Dohyun Kwon
Prompt-based Depth Pruning of Large Language Models
Juyun Wee, Minjae Park, Jaeho Lee
Self-Organizing Visual Prototypes for Non-Parametric Representation Learning
Thalles Silva, Helio Pedrini, Adín Ramírez Rivera
Finding Wasserstein Ball Center: Efficient Algorithm and The Applications in Fairness
Yuntao Wang, Yuxuan Li, Qingyuan Yang et al.
Self-Play $Q$-Learners Can Provably Collude in the Iterated Prisoner's Dilemma
Quentin Bertrand, Juan Duque, Emilio Calvano et al.
Emergent Response Planning in LLMs
Zhichen Dong, Zhanhui Zhou, Zhixuan Liu et al.
EARTH: Epidemiology-Aware Neural ODE with Continuous Disease Transmission Graph
Guancheng Wan, Zewen Liu, Xiaojun Shan et al.
All-Purpose Mean Estimation over R: Optimal Sub-Gaussianity with Outlier Robustness and Low Moments Performance
Jasper Lee, Walter McKelvie, Maoyuan Song et al.
Adapting to Linear Separable Subsets with Large-Margin in Differentially Private Learning
Erchi Wang, Yuqing Zhu, Yu-Xiang Wang
Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks
Lutfi Erdogan, Hiroki Furuta, Sehoon Kim et al.
Compressed and distributed least-squares regression: convergence rates with applications to federated learning
Constantin Philippenko, Aymeric Dieuleveut
Not All Tokens Matter All The Time: Dynamic Token Aggregation Towards Efficient Detection Transformers
Jiacheng Cheng, Xiwen Yao, Xiang Yuan et al.
Geometric Feature Embedding for Effective 3D Few-Shot Class Incremental Learning
Xiangqi Li, Libo Huang, Zhulin An et al.
Exactly Tight Information-theoretic Generalization Bounds via Binary Jensen-Shannon Divergence
Yuxin Dong, Haoran Guo, Tieliang Gong et al.
Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval
Guofeng Ding, Yiding Lu, Peng Hu et al.
Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance
Linxi Zhao, Yihe Deng, Weitong Zhang et al.
The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training
Jinbo Wang, Mingze Wang, Zhanpeng Zhou et al.
Ladder-Residual: Parallelism-Aware Architecture for Accelerating Large Model Inference with Communication Overlapping
Muru Zhang, Mayank Mishra, Zhongzhu Zhou et al.
Multilayer Matrix Factorization via Dimension-Reducing Diffusion Variational Inference
Junbin Liu, Farzan Farnia, Wing-Kin Ma
Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design
Zhi Zheng, Zhuoliang Xie, Zhenkun Wang et al.
Modified K-means Algorithm with Local Optimality Guarantees
Mingyi Li, Michael R. Metel, Akiko Takeda
Securing Equal Share: A Principled Approach for Learning Multiplayer Symmetric Games
Jiawei Ge, Yuanhao Wang, Wenzhe Li et al.
Beyond Sensor Data: Foundation Models of Behavioral Data from Wearables Improve Health Predictions
Eray Erturk, Fahad Kamran, Salar Abbaspourazad et al.
The Logical Implication Steering Method for Conditional Interventions on Transformer Generation
Damjan Kalajdzievski
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing
Subhash Kantamneni, Josh Engels, Senthooran Rajamanoharan et al.
VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians
Pengchong Hu, Zhizhong Han
Reidentify: Context-Aware Identity Generation for Contextual Multi-Agent Reinforcement Learning
Zhiwei XU, Kun Hu, Xin Xin et al.
Janus: Dual-Server Multi-Round Secure Aggregation with Verifiability for Federated Learning
Lang Pu, Jingjing Gu, Chao Lin et al.
SpikF: Spiking Fourier Network for Efficient Long-term Prediction
Wenjie Wu, Dexuan Huo, Hong Chen
Faster Stochastic Optimization with Arbitrary Delays via Adaptive Asynchronous Mini-Batching
Amit Attia, Ofir Gaash, Tomer Koren
Introducing 3D Representation for Dense Volume-to-Volume Translation via Score Fusion
Xiyue Zhu, Dou Kwark, Ruike Zhu et al.
Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling
Xiangxin Zhou, Mingyu Li, xiao yi et al.
SAFE: Finding Sparse and Flat Minima to Improve Pruning
Dongyeop Lee, Kwanhee Lee, Jinseok Chung et al.
Bayesian Weight Enhancement with Steady-State Adaptation for Test-time Adaptation in Dynamic Environments
Jae-Hong Lee
Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos
Tianyi Zhang, Yu Cao, Dianbo Liu
Universal Approximation Theorem of Deep Q-Networks
Qian Qi
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
Jen-Tse Huang, Jiaxu Zhou, Tailin Jin et al.
Online Clustering of Dueling Bandits
Zhiyong Wang, Jiahang Sun, Mingze Kong et al.
Identifying Metric Structures of Deep Latent Variable Models
Stas Syrota, Yevgen Zainchkovskyy, Johnny Xi et al.
Learning Single Index Models with Diffusion Priors
Anqi Tang, Youming Chen, Shuchen Xue et al.
Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling
Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin et al.
Regression for the Mean: Auto-Evaluation and Inference with Few Labels through Post-hoc Regression
Benjamin Eyre, David Madras
DVI:A Derivative-based Vision Network for INR
RUNZHAO YANG, Xiaolong Wu, Zhihong Zhang et al.
Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance
Marta Gentiloni Silveri, Antonio Ocello
Adaptive Partitioning Schemes for Optimistic Optimization
Raja Sunkara, Ardhendu Tripathy
Average Certified Radius is a Poor Metric for Randomized Smoothing
Chenhao Sun, Yuhao Mao, Mark Müller et al.
CTBench: A Library and Benchmark for Certified Training
Yuhao Mao, Stefan Balauca, Martin Vechev
Addressing Concept Mislabeling in Concept Bottleneck Models Through Preference Optimization
Emiliano Penaloza, Tianyue Zhang, Laurent Charlin et al.
Sounding that Object: Interactive Object-Aware Image to Audio Generation
Tingle Li, Baihe Huang, Xiaobin Zhuang et al.
Cross-regularization: Adaptive Model Complexity through Validation Gradients
Carlos Stein Naves de Brito
Inverse problems with experiment-guided AlphaFold
Sai Advaith Maddipatla, Nadav Bojan, Meital Bojan et al.