Most Cited ICML "deep gnns" Papers
Minimalist Concept Erasure in Generative Models
Yang Zhang, Er Jin, Yanfei Dong et al.
Quantifying Treatment Effects: Estimating Risk Ratios via Observational Studies
Ahmed Boughdiri, Julie Josse, Erwan Scornet
Communicating Activations Between Language Model Agents
Vignav Ramesh, Kenneth Li
MedRAX: Medical Reasoning Agent for Chest X-ray
Adibvafa Fallahpour, Jun Ma, Alif Munim et al.
On the Power of Learning-Augmented Search Trees
Jingbang Chen, Xinyuan Cao, Alicia Stepin et al.
Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing
Zijie Qiu, Jiaqi Wei, Xiang Zhang et al.
Latent Variable Causal Discovery under Selection Bias
Haoyue Dai, Yiwen Qiu, Ignavier Ng et al.
WMarkGPT: Watermarked Image Understanding via Multimodal Large Language Models
Tan Songbai, Xuerui Qiu, Yao Shu et al.
LEVIS: Large Exact Verifiable Input Spaces for Neural Networks
Mohamad Chehade, Wenting Li, Brian Bell et al.
Diversifying Robot Locomotion Behaviors with Extrinsic Behavioral Curiosity
Zhenglin Wan, Xingrui Yu, David Bossens et al.
Overcoming Non-monotonicity in Transducer-based Streaming Generation
Zhengrui Ma, Yang Feng, Min Zhang
Efficient Bisection Projection to Ensure Neural-Network Solution Feasibility for Optimization over General Set
Enming Liang, Minghua Chen
Time-Aware World Model for Adaptive Prediction and Control
Anh Nhu, Sanghyun Son, Ming Lin
A Mixture-Based Framework for Guiding Diffusion Models
Yazid Janati, Badr Moufad, Mehdi Qassime et al.
CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities
Yuxuan Zhu, Antony Kellermann, Dylan Bowman et al.
Mixture of Hidden-Dimensions: Not All Hidden-States’ Dimensions are Needed in Transformer
Yilong Chen, Junyuan Shang, Zhenyu Zhang et al.
Conformal Tail Risk Control for Large Language Model Alignment
Catherine Chen, Jingyan Shen, Xinyu Yang et al.
Aligning LLMs by Predicting Preferences from User Writing Samples
Stéphane Aroca-Ouellette, Natalie Mackraz, Barry-John Theobald et al.
MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters
Arsalan Sharifnassab, Saber Salehkaleybar, Rich Sutton
In-Context Deep Learning via Transformer Models
Weimin Wu, Maojiang Su, Jerry Yao-Chieh Hu et al.
Learning dynamics in linear recurrent neural networks
Alexandra Proca, Clémentine Dominé, Murray Shanahan et al.
Benign Overfitting in Token Selection of Attention Mechanism
Keitaro Sakamoto, Issei Sato
On Expressive Power of Looped Transformers: Theoretical Analysis and Enhancement via Timestep Encoding
Kevin Xu, Issei Sato
Structured Preconditioners in Adaptive Optimization: A Unified Analysis
Shuo Xie, Tianhao Wang, Sashank J. Reddi et al.
KoNODE: Koopman-Driven Neural Ordinary Differential Equations with Evolving Parameters for Time Series Analysis
Hanru Bai, Weiyang Ding
Structure-informed Risk Minimization for Robust Ensemble Learning
Fengchun Qiao, Yanlin Chen, Xi Peng
Feasible Action Search for Bandit Linear Programs via Thompson Sampling
Aditya Gangrade, Aldo Pacchiano, Clay Scott et al.
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue, Lang Feng, Jiacheng Xu et al.
AuPair: Golden Example Pairs for Code Repair
Aditi Mavalankar, Hassan Mansoor, Zita Marinho et al.
Deep Streaming View Clustering
Honglin Yuan, Xingfeng Li, Jian Dai et al.
Janus: Dual-Server Multi-Round Secure Aggregation with Verifiability for Federated Learning
Lang Pu, Jingjing Gu, Chao Lin et al.
Lego Sketch: A Scalable Memory-augmented Neural Network for Sketching Data Streams
Yuan Feng, Yukun Cao, Hairu Wang et al.
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM
Zhuofan Zong, Dongzhi Jiang, Bingqi Ma et al.
Risk and cross validation in ridge regression with correlated samples
Alexander Atanasov, Jacob A Zavatone-Veth, Cengiz Pehlevan
Identification of Latent Confounders via Investigating the Tensor Ranks of the Nonlinear Observations
Zhengming Chen, Yewei Xia, Feng Xie et al.
Online Learning in the Random-Order Model
Martino Bernasconi, Andrea Celli, Riccardo Colini Baldeschi et al.
On the Similarities of Embeddings in Contrastive Learning
Chungpa Lee, Sehee Lim, Kibok Lee et al.
VerbalTS: Generating Time Series from Texts
Shuqi Gu, Chuyue Li, Baoyu Jing et al.
Private Model Personalization Revisited
Conor Snedeker, Xinyu Zhou, Raef Bassily
Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle
Hui Dai, Ryan Teehan, Mengye Ren
Balancing Preservation and Modification: A Region and Semantic Aware Metric for Instruction-Based Image Editing
Zhuoying Li, Zhu Xu, Yuxin Peng et al.
Diss-l-ECT: Dissecting Graph Data with Local Euler Characteristic Transforms
Julius Von Rohrscheidt, Bastian Rieck
Voronoi-grid-based Pareto Front Learning and Its Application to Collaborative Federated Learning
Mengmeng Chen, Xiaohu Wu, Qiqi Liu et al.
Beyond Cropped Regions: New Benchmark and Corresponding Baseline for Chinese Scene Text Retrieval in Diverse Layouts
Li Gengluo, Huawen Shen, Yu Zhou
Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres
Muskan Dosi, Chiranjeev Chiranjeev, Kartik Thakral et al.
Towards Rationale-Answer Alignment of LVLMs via Self-Rationale Calibration
Yuanchen Wu, Ke Yan, Shouhong Ding et al.
Unveiling Markov heads in Pretrained Language Models for Offline Reinforcement Learning
Wenhao Zhao, Qiushui Xu, Linjie Xu et al.
Stability and Generalization Capability of Subgraph Reasoning Models for Inductive Knowledge Graph Completion
Minsung Hwang, Jaejun Lee, Joyce Whang
Improving Zero-Shot Adversarial Robustness in Vision-Language Models by Closed-form Alignment of Adversarial Path Simplices
Junhao Dong, Piotr Koniusz, Yifei Zhang et al.
Inverse problems with experiment-guided AlphaFold
Sai Advaith Maddipatla, Nadav Bojan, Meital Bojan et al.
Cross-regularization: Adaptive Model Complexity through Validation Gradients
Carlos Stein Naves de Brito
CTBench: A Library and Benchmark for Certified Training
Yuhao Mao, Stefan Balauca, Martin Vechev
Average Certified Radius is a Poor Metric for Randomized Smoothing
Chenhao Sun, Yuhao Mao, Mark Müller et al.
Generalization Principles for Inference over Text-Attributed Graphs with Large Language Models
Haoyu Wang, Shikun Liu, Rongzhe Wei et al.
Identifying and Understanding Cross-Class Features in Adversarial Training
Zeming Wei, Yiwen Guo, Yisen Wang
Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups
Weiqiu You, Helen Qu, Marco Gatti et al.
Ad-Hoc Human-AI Coordination Challenge
Tin Dizdarevic, Ravi Hammond, Tobias Gessler et al.
Revisiting Unbiased Implicit Variational Inference
Tobias Pielok, Bernd Bischl, David Rügamer
MVA: Linear Attention with High-order Query-Keys Integration and Multi-level Vocabulary Decomposition
Ning Wang, Zekun Li, Tongxin Bai et al.
Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling
Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin et al.
Stochastic Encodings for Active Feature Acquisition
Alexander Norcliffe, Changhee Lee, Fergus Imrie et al.
Predicting High-precision Depth on Low-Precision Devices Using 2D Hilbert Curves
Mykhailo Uss, Ruslan Yermolenko, Oleksii Shashko et al.
D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples
Zijing Hu, Fengda Zhang, Kun Kuang
Approximating Latent Manifolds in Neural Networks via Vanishing Ideals
Nico Pelleriti, Max Zimmer, Elias Wirth et al.
Models of Heavy-Tailed Mechanistic Universality
Liam Hodgkinson, Zhichao Wang, Michael Mahoney
Discovering Latent Causal Graphs from Spatiotemporal Data
Kun Wang, Sumanth Varambally, Duncan Watson-Parris et al.
Adjusting Model Size in Continual Gaussian Processes: How Big is Big Enough?
Guiomar Pescador-Barrios, Sarah Filippi, Mark van der Wilk
SBGD: Improving Graph Diffusion Generative Model via Stochastic Block Diffusion
Junwei Su, Shan Wu
BOPO: Neural Combinatorial Optimization via Best-anchored and Objective-guided Preference Optimization
Zijun Liao, Jinbiao Chen, Debing Wang et al.
Identifying Metric Structures of Deep Latent Variable Models
Stas Syrota, Yevgen Zainchkovskyy, Johnny Xi et al.
CoDy: Counterfactual Explainers for Dynamic Graphs
Zhan Qu, Daniel Gomm, Michael Färber
How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation
Yining Pan, Qiongjie Cui, Xulei Yang et al.
Eigen Analysis of Conjugate Kernel and Neural Tangent Kernel
Xiangchao Li, Xiao Han, Qing Yang
Causal Invariance-aware Augmentation for Brain Graph Contrastive Learning
Minqi Yu, Jinduo Liu, Junzhong Ji
Online Clustering of Dueling Bandits
Zhiyong Wang, Jiahang Sun, Mingze Kong et al.
Learnable Spatial-Temporal Positional Encoding for Link Prediction
Katherine Tieu, Dongqi Fu, Zihao Li et al.
Elucidating the Design Space of Multimodal Protein Language Models
Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang et al.
Curriculum Learning for Biological Sequence Prediction: The Case of De Novo Peptide Sequencing
Xiang Zhang, Jiaqi Wei, Zijie Qiu et al.
Understanding Model Reprogramming for CLIP via Decoupling Visual Prompts
Chengyi Cai, Zesheng Ye, Lei Feng et al.
Automatically Interpreting Millions of Features in Large Language Models
Gonçalo Paulo, Alex Mallen, Caden Juang et al.
Phase and Amplitude-aware Prompting for Enhancing Adversarial Robustness
Yibo Xu, Dawei Zhou, Decheng Liu et al.
On Differential Privacy for Adaptively Solving Search Problems via Sketching
Shiyuan Feng, Ying Feng, George Li et al.
Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos
Tianyi Zhang, Yu Cao, Dianbo Liu
Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling
Xiangxin Zhou, Mingyu Li, Xiao Yi et al.
Can Diffusion Models Learn Hidden Inter-Feature Rules Behind Images?
Yujin Han, Andi Han, Wei Huang et al.
Towards Escaping from Class Dependency Modeling for Multi-Dimensional Classification
Teng Huang, Bin-Bin Jia, Min-Ling Zhang
Skip the Equations: Learning Behavior of Personalized Dynamical Systems Directly From Data
Krzysztof Kacprzyk, Julianna Piskorz, Mihaela van der Schaar
Provably Near-Optimal Federated Ensemble Distillation with Negligible Overhead
Won-Jun Jang, Hyeon-Seo Park, Si-Hyeon Lee
Prediction-Powered Adaptive Shrinkage Estimation
Sida Li, Nikolaos Ignatiadis
Understanding Mode Connectivity via Parameter Space Symmetry
Bo Zhao, Nima Dehmamy, Robin Walters et al.
Introducing 3D Representation for Dense Volume-to-Volume Translation via Score Fusion
Xiyue Zhu, Dou Kwark, Ruike Zhu et al.
The Logical Implication Steering Method for Conditional Interventions on Transformer Generation
Damjan Kalajdzievski
ProDiff: Prototype-Guided Diffusion for Minimal Information Trajectory Imputation
Tianci Bu, Le Zhou, Wenchuan Yang et al.
Flopping for FLOPs: Leveraging Equivariance for Computational Efficiency
Georg Bökman, David Nordström, Fredrik Kahl
Deterministic Sparse Fourier Transform for Continuous Signals with Frequency Gap
Xiaoyu Li, Zhao Song, Shenghao Xie
Beyond Sensor Data: Foundation Models of Behavioral Data from Wearables Improve Health Predictions
Eray Erturk, Fahad Kamran, Salar Abbaspourazad et al.
Learning Initial Basis Selection for Linear Programming via Duality-Inspired Tripartite Graph Representation and Comprehensive Supervision
Anqi Lu, Junchi Yan
Unified Screening for Multiple Diseases
Yiğit Narter, Alihan Hüyük, Mihaela van der Schaar et al.
Generalization and Robustness of the Tilted Empirical Risk
Gholamali Aminian, Amir R. Asadi, Tian Li et al.
TIMING: Temporality-Aware Integrated Gradients for Time Series Explanation
Hyeongwon Jang, Changhun Kim, Eunho Yang
Diving into Self-Evolving Training for Multimodal Reasoning
Wei Liu, Junlong Li, Xiwen Zhang et al.
Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation
Cheng Jin, Zhenyu Xiao, Chutao Liu et al.
Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation
Zhihua Liu, Amrutha Saseendran, Lei Tong et al.
Multilayer Matrix Factorization via Dimension-Reducing Diffusion Variational Inference
Junbin Liu, Farzan Farnia, Wing-Kin Ma
Attention-Level Speculation
Jack Cai, Ammar Vora, Randolph Zhang et al.
GIVE: Structured Reasoning of Large Language Models with Knowledge Graph Inspired Veracity Extrapolation
Jiashu He, Mingyu Ma, Jinxuan Fan et al.
any4: Learned 4-bit Numeric Representation for LLMs
Mostafa Elhoushi, Jeff Johnson
Feature Shift Localization Network
Míriam Barrabés, Daniel Mas Montserrat, Kapal Dev et al.
The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training
Jinbo Wang, Mingze Wang, Zhanpeng Zhou et al.
Contract Design Under Approximate Best Responses
Francesco Bacchiocchi, Jiarui Gan, Matteo Castiglioni et al.
A Closer Look at Backdoor Attacks on CLIP
Shuo He, Zhifang Zhang, Feng Liu et al.
Aligning with Logic: Measuring, Evaluating and Improving Logical Preference Consistency in Large Language Models
Yinhong Liu, Zhijiang Guo, Tianya Liang et al.
Toward Data-centric Directed Graph Learning: An Entropy-driven Approach
Xunkai Li, Zhengyu Wu, Kaichi Yu et al.
PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation
Albert Gong, Kamilė Stankevičiūtė, Chao Wan et al.
Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval
Guofeng Ding, Yiding Lu, Peng Hu et al.
Geometric Feature Embedding for Effective 3D Few-Shot Class Incremental Learning
Xiangqi Li, Libo Huang, Zhulin An et al.
Not All Tokens Matter All The Time: Dynamic Token Aggregation Towards Efficient Detection Transformers
Jiacheng Cheng, Xiwen Yao, Xiang Yuan et al.
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts
Yike Yuan, Ziyu Wang, Zihao Huang et al.
Conformal Anomaly Detection in Event Sequences
Shuai Zhang, Chuan Zhou, Yang Liu et al.
When to retrain a machine learning model
Florence Regol, Leo Schwinn, Kyle Sprague et al.
Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention
Dejia Xu, Yifan Jiang, Chen Huang et al.
EARTH: Epidemiology-Aware Neural ODE with Continuous Disease Transmission Graph
Guancheng Wan, Zewen Liu, Xiaojun Shan et al.
Pareto-frontier Entropy Search with Variational Lower Bound Maximization
Masanori Ishikura, Masayuki Karasuyama
SAN: Hypothesizing Long-Term Synaptic Development and Neural Engram Mechanism in Scalable Model's Parameter-Efficient Fine-Tuning
Gaole Dai, Chun-Kai Fan, Yiming Tang et al.
Learning Invariant Causal Mechanism from Vision-Language Models
Zeen Song, Siyu Zhao, Xingyu Zhang et al.
Emergent Response Planning in LLMs
Zhichen Dong, Zhanhui Zhou, Zhixuan Liu et al.
Non-Asymptotic and Non-Lipschitzian Bounds on Optimal Values in Stochastic Optimization Under Heavy Tails
Jindong Tong, Hongcheng Liu, Johannes Royset
Finding Wasserstein Ball Center: Efficient Algorithm and The Applications in Fairness
Yuntao Wang, Yuxuan Li, Qingyuan Yang et al.
Self-Organizing Visual Prototypes for Non-Parametric Representation Learning
Thalles Silva, Helio Pedrini, Adín Ramírez Rivera
CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention
Han Li, Fei Liu, Zhi Zheng et al.
BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training
Chenyi Yang, Wenjie Nie, Yuxin Zhang et al.
VCT: Training Consistency Models with Variational Noise Coupling
Gianluigi Silvestri, Luca Ambrogioni, Chieh-Hsin Lai et al.
Learning from True-False Labels via Multi-modal Prompt Retrieving
Zhongnian Li, Jinghao Xu, Peng Ying et al.
Empower Structure-Based Molecule Optimization with Gradient Guided Bayesian Flow Networks
Keyue Qiu, Yuxuan Song, Jie Yu et al.
Automated Benchmark Generation for Repository-Level Coding Tasks
Konstantinos Vergopoulos, Mark Müller, Martin Vechev
RAGGED: Towards Informed Design of Scalable and Stable RAG Systems
Jennifer Hsia, Afreen Shaikh, Zhiruo Wang et al.
A General Representation-Based Approach to Multi-Source Domain Adaptation
Ignavier Ng, Yan Li, Zijian Li et al.
Fast Min-$\epsilon$ Segmented Regression using Constant-Time Segment Merging
Ansgar Lößer, Max Schlecht, Florian Schintke et al.
ParallelComp: Parallel Long-Context Compressor for Length Extrapolation
Jing Xiong, Jianghan Shen, Chuanyang Zheng et al.
Improved Last-Iterate Convergence of Shuffling Gradient Methods for Nonsmooth Convex Optimization
Zijian Liu, Zhengyuan Zhou
Local Pan-privacy for Federated Analytics
Vitaly Feldman, Audra McMillan, Guy Rothblum et al.
Topological Signatures of Adversaries in Multimodal Alignments
Minh Vu, Geigh Zollicoffer, Huy Mai et al.
DeepLayout: Learning Neural Representations of Circuit Placement Layout
Yuxiang Zhao, Zhuomin Chai, Xun Jiang et al.
Accelerated Diffusion Models via Speculative Sampling
Valentin De Bortoli, Alexandre Galashov, Arthur Gretton et al.
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning
Zhenni Bi, Kai Han, Chuanjian Liu et al.
A Selective Learning Method for Temporal Graph Continual Learning
Hanmo Liu, Shimin Di, Haoyang Li et al.
Predictive Performance of Deep Quantum Data Re-uploading Models
Xin Wang, Hanxiao Tao, Re-Bing Wu
Scalable First-order Method for Certifying Optimal k-Sparse GLMs
Jiachang Liu, Soroosh Shafiee, Andrea Lodi
Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding
Ziyao Wang, Muneeza Azmat, Ang Li et al.
Visual and Domain Knowledge for Professional-level Graph-of-Thought Medical Reasoning
Rina Bao, Shilong Dong, Zhenfang Chen et al.
Reinforcement Learning for Quantum Control under Physical Constraints
Jan Ole Ernst, Aniket Chatterjee, Tim Franzmeyer et al.
Simplicity Bias and Optimization Threshold in Two-Layer ReLU Networks
Etienne Boursier, Nicolas Flammarion
Knowledge Swapping via Learning and Unlearning
Mingyu Xing, Lechao Cheng, Shengeng Tang et al.
Optimization over Sparse Support-Preserving Sets: Two-Step Projection with Global Optimality Guarantees
William de Vazelhes, Xiaotong Yuan, Bin Gu
Teaching Physical Awareness to LLMs through Sounds
Weiguo Wang, Andy Nie, Wenrui Zhou et al.
Distributed Nonparametric Estimation: from Sparse to Dense Samples per Terminal
Deheng Yuan, Tao Guo, Zhongyi Huang
Explicit Exploration for High-Welfare Equilibria in Game-Theoretic Multiagent Reinforcement Learning
Austin Nguyen, Anri Gu, Michael Wellman
Generalization of noisy SGD in unbounded non-convex settings
Leello Dadi, Volkan Cevher
Compute or Load KV Cache? Why Not Both?
Shuowei Jin, Xueshen Liu, Qingzhao Zhang et al.
MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition
Yuhuan Yang, Chaofan Ma, Zhenjie Mao et al.
An All-Atom Generative Model for Designing Protein Complexes
Ruizhe Chen, Dongyu Xue, Xiangxin Zhou et al.
3D-LMVIC: Learning-based Multi-View Image Compression with 3D Gaussian Geometric Priors
Yujun Huang, Bin Chen, Niu Lian et al.
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Baohao Liao, Yuhui Xu, Hanze Dong et al.
Online Differentially Private Conformal Prediction for Uncertainty Quantification
Gradient Aligned Regression via Pairwise Losses
Dixian Zhu, Tianbao Yang, Livnat Jerby
A Bregman Proximal Viewpoint on Neural Operators
Abdel-Rahim Mezidi, Jordan Patracone, Saverio Salzo et al.
Fine-Grained Captioning of Long Videos through Scene Graph Consolidation
Sanghyeok Chu, Seonguk Seo, Bohyung Han
Stacey: Promoting Stochastic Steepest Descent via Accelerated $\ell_p$-Smooth Nonconvex Optimization
Xinyu Luo, Cedar Site Bai, Bolian Li et al.
FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models
Xinting Liao, Weiming Liu, Jiaming Qian et al.
Convergence Analysis of Policy Gradient Methods with Dynamic Stochasticity
Alessandro Montenegro, Marco Mussi, Matteo Papini et al.
ADIOS: Antibody Development via Opponent Shaping
Sebastian Towers, Aleksandra Kalisz, Philippe Robert et al.
WikiBigEdit: Understanding the Limits of Lifelong Knowledge Editing in LLMs
Lukas Thede, Karsten Roth, Matthias Bethge et al.
Autoencoder-Based Hybrid Replay for Class-Incremental Learning
Milad Khademi Nori, Il-Min Kim, Guanghui Wang
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Jijia Liu, Feng Gao, Qingmin Liao et al.
Habitizing Diffusion Planning for Efficient and Effective Decision Making
Haofei Lu, Yifei Shen, Dongsheng Li et al.
Fast Tensor Completion via Approximate Richardson Iteration
Mehrdad Ghadiri, Matthew Fahrbach, Yunbum Kook et al.
Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors
Jing Huang, Junyi Tao, Thomas Icard et al.
Improving Memory Efficiency for Training KANs via Meta Learning
Zhangchi Zhao, Jun Shu, Deyu Meng et al.
No Metric to Rule Them All: Toward Principled Evaluations of Graph-Learning Datasets
Corinna Coupette, Jeremy Wayland, Emily Simons et al.
MARS: Unleashing the Power of Variance Reduction for Training Large Models
Huizhuo Yuan, Yifeng Liu, Shuang Wu et al.
BalancEdit: Dynamically Balancing the Generality-Locality Trade-off in Multi-modal Model Editing
Dongliang Guo, Mengxuan Hu, Zihan Guan et al.
SERENA: A Unified Stochastic Recursive Variance Reduced Gradient Framework for Riemannian Non-Convex Optimization
Yan Liu, Mingjie Chen, Chaojie Ji et al.
Quadratic Upper Bound for Boosting Robustness
Euijin You, Hyang-Won Lee
C2IQL: Constraint-Conditioned Implicit Q-learning for Safe Offline Reinforcement Learning
Zifan Liu, Xinran Li, Jun Zhang
Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization
Phillip Guo, Aaquib Syed, Abhay Sheshadri et al.
Flow-based Domain Randomization for Learning and Sequencing Robotic Skills
Aidan Curtis, Eric Li, Michael S Noseworthy et al.
Quadruple Attention in Many-body Systems for Accurate Molecular Property Predictions
Jiahua Rao, Dahao Xu, Wentao Wei et al.
All-atom inverse protein folding through discrete flow matching
Kai Yi, Kiarash Jamali, Sjors Scheres
Progressively Label Enhancement for Large Language Model Alignment
Biao Liu, Ning Xu, Xin Geng
Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders
Charles O'Neill, Alim Gumran, David Klindt
Depth Degeneracy in Neural Networks: Vanishing Angles in Fully Connected ReLU Networks on Initialization
Cameron Jakub, Mihai Nica
Topology-aware Neural Flux Prediction Guided by Physics
Haoyang Jiang, Jindong Wang, Xingquan Zhu et al.
KernelBench: Can LLMs Write Efficient GPU Kernels?
Anne Ouyang, Simon Guo, Simran Arora et al.
DIS-CO: Discovering Copyrighted Content in VLMs Training Data
André Duarte, Xuandong Zhao, Arlindo Oliveira et al.
Continuous-Time Analysis of Heavy Ball Momentum in Min-Max Games
Yi Feng, Kaito Fujii, Efstratios Panteleimon Skoulakis et al.
DPO Meets PPO: Reinforced Token Optimization for RLHF
Han Zhong, Zikang Shan, Guhao Feng et al.
Revisiting Chain-of-Thought in Code Generation: Do Language Models Need to Learn Reasoning before Coding?
Ren-Biao Liu, Anqi Li, Chaoding Yang et al.
Byzantine-Resilient Federated Alternating Gradient Descent and Minimization for Partly-Decoupled Low Rank Matrix Learning
Ankit Pratap Singh, Ahmed Abbasi, Namrata Vaswani
Safe-EF: Error Feedback for Non-smooth Constrained Optimization
Rustem Islamov, Yarden As, Ilyas Fatkhullin
Understanding Synthetic Context Extension via Retrieval Heads
Xinyu Zhao, Fangcong Yin, Greg Durrett