Most Cited 2025 "forward matrix deduction" Papers
22,274 papers found • Page 90 of 112
Conference
Dependency Matters: Enhancing LLM Reasoning with Explicit Knowledge Grounding
Xiangyu Wen, Min Li, Junhua Huang et al.
Computation and Memory-Efficient Model Compression with Gradient Reweighting
Zhiwei Li, Yuesen Liao, Binrui Wu et al.
Learnable Sampler Distillation for Discrete Diffusion Models
Feiyang Fu, Tongxian Guo, Zhaoqiang Liu
Dynamic Siamese Expansion Framework for Improving Robustness in Online Continual Learning
Fei Ye, Yulong Zhao, Qihe Liu et al.
Sparse Optimistic Information Directed Sampling
Ludovic Schwartz, Hamish Flynn, Gergely Neu
PlanU: Large Language Model Reasoning through Planning under Uncertainty
Ziwei Deng, Mian Deng, Chenjing Liang et al.
Automated Model Discovery via Multi-modal & Multi-step Pipeline
Lee Jung-Mok, Nam Hyeon-Woo, Moon Ye-Bin et al.
Rethinking Hebbian Principle: Low-Dimensional Structural Projection for Unsupervised Learning
Shikuang Deng, Jiayuan Zhang, Yuhang Wu et al.
Mitigating Occlusions in Virtual Try-On via A Simple-Yet-Effective Mask-Free Framework
Chenghu Du, Shengwu Xiong, junyin Wang et al.
Quantifying Uncertainty in Error Consistency: Towards Reliable Behavioral Comparison of Classifiers
Thomas Klein, Sascha Meyen, Wieland Brendel et al.
Topology-Aware Learning of Tubular Manifolds via SE(3)-Equivariant Network on Ball B-Spline Curve
Jingxuan Wang, Zhongke Wu, Wang et al.
Uncertainty-Calibrated Prediction of Randomly-Timed Biomarker Trajectories with Conformal Bands
Vasiliki Tassopoulou, Charis Stamouli, Haochang Shou et al.
Knee-Deep in C-RASP: A Transformer Depth Hierarchy
Andy J Yang, Michaël Cadilhac, David Chiang
Accelerating Model-Free Optimization via Averaging of Cost Samples
Guido Carnevale, Giuseppe Notarstefano
LaViDa: A Large Diffusion Model for Vision-Language Understanding
Shufan Li, Konstantinos Kallidromitis, Hritik Bansal et al.
DEFT: Decompositional Efficient Fine-Tuning for Text-to-Image Models
Komal Kumar, Rao Anwer, Fahad Shahbaz Khan et al.
EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining
Boshen Xu, Yuting Mei, liu xinbi et al.
Neural Hamiltonian Diffusions for Modeling Structured Geometric Dynamics
Sungwoo Park
Metritocracy: Representative Metrics for Lite Benchmarks
Ariel Procaccia, Ben Schiffer, Serena Wang et al.
Adversarial Graph Fusion for Incomplete Multi-view Semi-supervised Learning with Tensorial Imputation
Zhangqi Jiang, Tingjin Luo, Xu Yang et al.
ComRank: Ranking Loss for Multi-Label Complementary Label Learning
Jing-Yi Zhu, Yi Gao, Miao Xu et al.
DynaNav: Dynamic Feature and Layer Selection for Efficient Visual Navigation
Jiahui Wang, Changhao Chen
$\Delta \mathrm{Energy}$: Optimizing Energy Change During Vision-Language Alignment Improves both OOD Detection and OOD Generalization
Lin Zhu, Yifeng Yang, Xinbing Wang et al.
FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation
Fan Yang, Yousong Zhu, Xin Li et al.
What Makes a Reward Model a Good Teacher? An Optimization Perspective
Noam Razin, Zixuan Wang, Hubert Strauss et al.
Learning 3D Anisotropic Noise Distributions Improves Molecular Force Fields
Xixian Liu, Rui Jiao, ZHIYUAN LIU et al.
DSCS: Fast CPDAG-Based Verification of Collapsible Submodels in High-Dimensional Bayesian Networks
Wentao Wu, Shiyuan He, Jianhua Guo
Large Language Models as End-to-end Combinatorial Optimization Solvers
Xia Jiang, Yaoxin Wu, Minshuo Li et al.
Hypergraph-Enhanced Contrastive Learning for Multi-View Clustering with Hyper-Laplacian Regularization
Zhibin Gu, weili wang
Asymptotics of SGD in Sequence-Single Index Models and Single-Layer Attention Networks
Luca Arnaboldi, Bruno Loureiro, Ludovic Stephan et al.
Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing
Yilmazcan Ozyurt, Tunaberk Almaci, Stefan Feuerriegel et al.
On the Sample Complexity of Differentially Private Policy Optimization
Yi He, Xingyu Zhou
Ascent Fails to Forget
Ioannis Mavrothalassitis, Pol Puigdemont, Noam Levi et al.
Generalizing Single-Frame Supervision to Event-Level Understanding for Video Anomaly Detection
Junxi Chen, Liang Li, Yunbin Tu et al.
Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object Detection
Yu Guo, Shengfeng He, Yuxu Lu et al.
NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses
Jing Wen, Alex Schwing, Shenlong Wang
Breaking the Compression Ceiling: Data-Free Pipeline for Ultra-Efficient Delta Compression
Xiaohui Wang, Peng Ye, Chenyu Huang et al.
AdvEDM: Fine-grained Adversarial Attack against VLM-based Embodied Agents
Yichen Wang, Hangtao Zhang, Hewen Pan et al.
GeGS-PCR: Fast and Robust Color 3D Point Cloud Registration with Two-Stage Geometric-3DGS Fusion
Jiayi Tian, Haiduo Huang, Tian Xia et al.
Elastic Robust Unlearning of Specific Knowledge in Large Language Models
Yize Sui, Jing Ren, Wenjing Yang et al.
From Pose to Muscle: Multimodal Learning for Piano Hand Muscle Electromyography
RUOFAN LIU, YICHEN PENG, Takanori Oku et al.
End-to-End Low-Light Enhancement for Object Detection with Learned Metadata from RAWs
Xuelin Shen, Haifeng Jiao, Yitong Wang et al.
Technical Debt in In-Context Learning: Diminishing Efficiency in Long Context
Taejong Joo, Diego Klabjan
ShoeFit: A New Dataset and Dual-image-stream DiT Framework for Virtual Footwear Try-On
Yuhan Li, Zhiyu Jin, Yifan Tong et al.
A Gradient Guidance Perspective on Stepwise Preference Optimization for Diffusion Models
Joshua Tian Jin Tee, Hee Suk Yoon, Abu Hanif Muhammad Syarubany et al.
Retrieval is Not Enough: Enhancing RAG through Test-Time Critique and Optimization
Jiaqi Wei, Hao Zhou, Xiang Zhang et al.
Relieving the Over-Aggregating Effect in Graph Transformers
Junshu Sun, Wanxing Chang, Chenxue Yang et al.
Statistical Inference for Decentralized Federated Learning
Jia Gu, Songxi Chen
GMV: A Unified and Efficient Graph Multi-View Learning Framework
Qipeng zhu, Jie Chen, Jian Pu et al.
Versatile differentially private learning for general loss functions
Qilong Lu, Songxi Chen, Yumou Qiu
Constrained Linear Thompson Sampling
Aditya Gangrade, Venkatesh Saligrama
Geometric Learning with Positively Decomposable Kernels
Nathael Da Costa, Cyrus Mostajeran, Juan-Pablo Ortega et al.
RePIC: Reinforced Post-Training for Personalizing Multi-Modal Language Models
Yeongtak Oh, Dohyun Chung, Juhyeon Shin et al.
RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control
Jonas Eschmann, Dario Albani, Giuseppe Loianno
On the Stability and Generalization of Meta-Learning: the Impact of Inner-Levels
Wenjun Ding, Jingling Liu, Lixing Chen et al.
Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads
Zhoutong Wu, Yuan Zhang, Yiming Dong et al.
ICLScan: Detecting Backdoors in Black-Box Large Language Models via Targeted In-context Illumination
Xiaoyi Pang, Xuanyi Hao, Song Guo et al.
Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition
Fan LIU, Jindong Han, Tengfei Lyu et al.
Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems
Gordon Dai, Yunze Xiao
NeurIPS should lead scientific consensus on AI policy
Rishi Bommasani
World Models Should Prioritize the Unification of Physical and Social Dynamics
Xiaoyuan Zhang, Chengdong Ma, Yizhe Huang et al.
Sample-Conditional Coverage in Split-Conformal Prediction
John Duchi
Noise-Robustness Through Noise: A Framework combining Asymmetric LoRA with Poisoning MoE
Zhaokun Wang, Jinyu Guo, Jingwen Pu et al.
Setting $\varepsilon$ is not the Issue in Differential Privacy
Edwige Cyffers
Diffusion-Classifier Synergy: Reward-Aligned Learning via Mutual Boosting Loop for FSCIL
Ruitao Wu, Yifan Zhao, Guangyao Chen et al.
S$^2$M-Former: Spiking Symmetric Mixing Branchformer for Brain Auditory Attention Detection
Jiaqi Wang, Zhengyu Ma, Xiongri Shen et al.
Prompting as Scientific Inquiry
Ari Holtzman, Chenhao Tan
DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer
Haiduo Huang, Jiangcheng Song, Yadong Zhang et al.
The Adaptive Complexity of Minimizing Relative Fisher Information
Huanjian Zhou, Masashi Sugiyama
HPSERec: A Hierarchical Partitioning and Stepwise Enhancement Framework for Long-tailed Sequential Recommendation
Xiaolong Xu, Xudong Zhao, Haolong Xiang et al.
Accurate KV Cache Eviction via Anchor Direction Projection for Efficient LLM Inference
Zijie Geng, Jie Wang, Ziqi Liu et al.
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning
Senqiao Yang, Junyi Li, Xin Lai et al.
EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting
Bohao Liao, Wei Zhai, Zengyu Wan et al.
A Unified Reasoning Framework for Holistic Zero-Shot Video Anomaly Analysis
Dongheng Lin, Mengxue Qu, Kunyang Han et al.
EPA: Boosting Event-based Video Frame Interpolation with Perceptually Aligned Learning
Yuhan Liu, LingHui Fu, Zhen Yang et al.
Satellites Reveal Mobility: A Commuting Origin-destination Flow Generator for Global Cities
Can Rong, Xin Zhang, Yanxin Xi et al.
Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation
Yuyang Li, Wenxin Du, Chang Yu et al.
Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning
Félix Lefebvre, Gael Varoquaux
How Far Are We from Optimal Reasoning Efficiency?
Jiaxuan Gao, Shu Yan, Qixin Tan et al.
Asymptotically exact variational flows via involutive MCMC kernels
Zuheng (David) Xu, Trevor Campbell
Simultaneous Statistical Inference for Off-Policy Evaluation in Reinforcement Learning
Tianpai Luo, Xinyuan Fan, Weichi Wu
Causal Discovery and Inference through Next-Token Prediction
Eivinas Butkus, Nikolaus Kriegeskorte
On Efficiency-Effectiveness Trade-off of Diffusion-based Recommenders
Wenyu Mao, Jiancan Wu, Guoqing Hu et al.
Covering Multiple Objectives with a Small Set of Solutions Using Bayesian Optimization
Natalie Maus, Kyurae Kim, Yimeng Zeng et al.
Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
Guiyao Tie, Zenghui Yuan, Zeli Zhao et al.
NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective
Xiaohan Qin, Xiaoxing Wang, Ning Liao et al.
UniMRSeg: Unified Modality-Relax Segmentation via Hierarchical Self-Supervised Compensation
Xiaoqi Zhao, Youwei Pang, Chenyang Yu et al.
Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism
Junfei Zhou, Penglin Dai, Quanmin Wei et al.
SALMONN-omni: A Standalone Speech LLM without Codec Injection for Full-duplex Conversation
Wenyi Yu, Siyin Wang, Xiaoyu Yang et al.
FRN: Fractal-Based Recursive Spectral Reconstruction Network
Ge Meng, Zhongnan Cai, Ruizhe Chen et al.
On the SAC-BL Algorithm for Anomaly Detection
Xinsong Ma, Jie Wu, Weiwei Liu
Analog Foundation Models
Julian Büchel, Iason Chalas, Giovanni Acampa et al.
L2RSI: Cross-view LiDAR-based Place Recognition for Large-scale Urban Scenes via Remote Sensing Imagery
Ziwei Shi, Xiaoran Zhang, Wenjing Xu et al.
Steering Information Utility in Key-Value Memory for Language Model Post-Training
Chunyuan Deng, Ruidi Chang, Hanjie Chen
Resounding Acoustic Fields with Reciprocity
Zitong Lan, Yiduo Hao, Mingmin Zhao
LithoSim: A Large, Holistic Lithography Simulation Benchmark for AI-Driven Semiconductor Manufacturing
Hongquan He, Zhen Wang, Jingya Wang et al.
Unleashing the Power of One-Step Diffusion based Image Super-Resolution via a Large-Scale Diffusion Discriminator
Jianze Li, Jiezhang Cao, Zichen Zou et al.
MMPB: It’s Time for Multi-Modal Personalization
Jaeik Kim, Woojin Kim, Woohyeon Park et al.
STAR-Bets: Sequential TArget-Recalculating Bets for Tighter Confidence Intervals
Vaclav Voracek, Francesco Orabona
D$^2$GS: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction
Kejing Xia, Jidong Jia, Ke Jin et al.
Flattening Hierarchies with Policy Bootstrapping
John Zhou, Jonathan Kao
Learning to Watermark: A Selective Watermarking Framework for Large Language Models via Multi-Objective Optimization
Chenrui Wang, Junyi Shu, Billy Chiu et al.
PC-Net: Weakly Supervised Compositional Moment Retrieval via Proposal-Centric Network
Mingyao Zhou, Hao Sun, Wei Xie et al.
Interaction-Centric Knowledge Infusion and Transfer for Open Vocabulary Scene Graph Generation
Lin Li, Chuhan ZHANG, Dong Zhang et al.
Local Learning for Covariate Selection in Nonparametric Causal Effect Estimation with Latent Variables
Zheng Li, Xichen Guo, Feng Xie et al.
MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Chenglong Wang, Yang Gan, Hang Zhou et al.
End-to-End Vision Tokenizer Tuning
Wenxuan Wang, Fan Zhang, Yufeng Cui et al.
Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules
John Vastola, Samuel J Gershman, Kanaka Rajan
Compress & Cache: Vision token compression for efficient generation and retrieval
Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos
A Regularized Newton Method for Nonconvex Optimization with Global and Local Complexity Guarantees
Yuhao Zhou, Jintao Xu, Bingrui Li et al.
Adam Reduces a Unique Form of Sharpness: Theoretical Insights Near the Minimizer Manifold
Xinghan Li, Haodong Wen, Kaifeng Lyu
Active Test-time Vision-Language Navigation
Heeju Ko, Sung June Kim, Gyeongrok Oh et al.
MoodAngels: A Retrieval-augmented Multi-agent Framework for Psychiatry Diagnosis
Mengxi Xiao, Ben Liu, He Li et al.
High Dynamic Range Imaging with Time-Encoding Spike Camera
Zhenkun Zhu, Ruiqin Xiong, Jiyu Xie et al.
OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-time Emotional Speech Synthesis
Run Luo, Ting-En Lin, Haonan Zhang et al.
Adaptive Fission: Post-training Encoding for Low-latency Spike Neural Networks
Yizhou Jiang, Feng Chen, Yihan Li et al.
PAID: Pairwise Angular-Invariant Decomposition for Continual Test-Time Adaptation
Kunyu Wang, Xueyang Fu, Yuanfei Bao et al.
HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation
Ling Yang, Xinchen Zhang, Ye Tian et al.
Lookahead Routing for Large Language Models
Canbin Huang, Tianyuan Shi, Yuhua Zhu et al.
Accelerating Block Coordinate Descent for LLM Finetuning via Landscape Expansion
Qijun Luo, Yifei Shen, Liangzu Peng et al.
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
Yan Shu, Hangui Lin, Yexin Liu et al.
Point-MaDi: Masked Autoencoding with Diffusion for Point Cloud Pre-training
Xiaoyang Xiao, Runzhao Yao, Zhiqiang Tian et al.
Generation as Search Operator for Test-Time Scaling of Diffusion-based Combinatorial Optimization
Yang Li, Lvda Chen, Haonan Wang et al.
Feature-aware Modulation for Learning from Temporal Tabular Data
Haorun Cai, Han-Jia Ye
NavBench: Probing Multimodal Large Language Models for Embodied Navigation
Yanyuan Qiao, Haodong Hong, Wenqi Lyu et al.
STAIR: Addressing Stage Misalignment through Temporal-Aligned Preference Reinforcement Learning
Yao Luan, Ni Mu, Yiqin Yang et al.
Zero-Shot Detection of LLM-Generated Text via Implicit Reward Model
Runheng Liu, Heyan Huang, Xingchen Xiao et al.
MTRec: Learning to Align with User Preferences via Mental Reward Models
Mengchen Zhao, Yifan Gao, Yaqing Hou et al.
TEMPO: Temporal Multi-scale Autoregressive Generation of Protein Conformational Ensembles
Yaoyao Xu, Di Wang, Zihan Zhou et al.
Enhancing Contrastive Learning with Variable Similarity
Haowen Cui, Shuo Chen, Jun Li et al.
Unifying Reconstruction and Density Estimation via Invertible Contraction Mapping in One-Class Classification
Xiaolei Wang, Tianhong Dai, Huihui Bai et al.
Enhancing LLM Watermark Resilience Against Both Scrubbing and Spoofing Attacks
Huanming Shen, Baizhou Huang, Xiaojun Wan
Purity Law for Neural Routing Problem Solvers with Enhanced Generalizability
Wenzhao Liu, Haoran Li, Congying Han et al.
Reasoning is Periodicity? Improving Large Language Models Through Effective Periodicity Modeling
Yihong Dong, Ge Li, Xue Jiang et al.
Multi-Modal Interactive Agent Layer for Few-Shot Universal Cross-Domain Retrieval and Beyond
Kaixiang Chen, Pengfei Fang, hui xue
Price of Parsimony: Complexity of Fourier Sparsity Testing
Arijit Ghosh, Manmatha Roy
CrypticBio: A Large Multimodal Dataset for Visually Confusing Species
Georgiana Manolache, Gerard Schouten, Joaquin Vanschoren
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models
Yige Li, Hanxun Huang, Yunhan Zhao et al.
SolidGeo: Measuring Multimodal Spatial Math Reasoning in Solid Geometry
Peijie Wang, Chao Yang, Zhong-Zhi Li et al.
InfoChartQA: A Benchmark for Multimodal Question Answering on Infographic Charts
Tianchi Xie, Minzhi Lin, Mengchen Liu et al.
Listening to the Brain: Multi-Band sEEG Auditory Reconstruction via Dynamic Spatio-Temporal Hypergraphs
Xueyi Zhang, Ruicong Wang, Jialu Sun et al.
Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms
Zhihai Wang, Zijie Geng, Zhaojie Tu et al.
GenSpace: Benchmarking Spatially-Aware Image Generation
Zehan Wang, Jiayang Xu, Ziang Zhang et al.
Rethinking Evaluation of Infrared Small Target Detection
Youwei Pang, Xiaoqi Zhao, Lihe Zhang et al.
OrthoLoC: UAV 6-DoF Localization and Calibration Using Orthographic Geodata
Oussema Dhaouadi, Riccardo Marin, Johannes Meier et al.
AnomalyCoT: A Multi-Scenario Chain-of-Thought Dataset for Multimodal Large Language Models
Jiaxi Cheng, Yuliang Xu, Shoupeng Wang et al.
MedChain: Bridging the Gap Between LLM Agents and Clinical Practice with Interactive Sequence
Jie Liu, Wenxuan Wang, Zizhan Ma et al.
Universal Image Restoration Pre-training via Degradation Classification
Jiakui Hu, Lujia Jin, Zhengjian Yao et al.
On the Benefits of Attribute-Driven Graph Domain Adaptation
Ruiyi Fang, Bingheng Li, zhao kang et al.
Port-Hamiltonian Architectural Bias for Long-Range Propagation in Deep Graph Networks
Simon Heilig, Alessio Gravina, Alessandro Trenta et al.
From an LLM Swarm to a PDDL-empowered Hive: Planning Self-executed Instructions in a Multi-modal Jungle
Kaustubh Vyas, Damien Graux, Yijun Yang et al.
Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding
Akash Kumar, Zsolt Kira, Yogesh S Rawat
Repurposing in AI: A Distinct Approach or an Extension of Creative Problem Solving?
Aissatou Diallo, Antonis Bikakis, Luke Dickens et al.
Compute-Optimal LLMs Provably Generalize Better with Scale
Marc Finzi, Sanyam Kapoor, Diego Granziol et al.
CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding & Reasoning Capabilities of CodeLLMs
Dung Nguyen, Thang Phan, Nam Le Hai et al.
Decoupled Subgraph Federated Learning
Javad Aliakbari, Johan Östman, Alexandre Graell i Amat
Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?
Maxime Méloux, Silviu Maniu, François Portet et al.
Diffusion Bridge Implicit Models
Kaiwen Zheng, Guande He, Jianfei Chen et al.
Beyond Worst-Case Dimensionality Reduction for Sparse Vectors
Sandeep Silwal, David Woodruff, Qiuyi (Richard) Zhang
Elucidating the Preconditioning in Consistency Distillation
Kaiwen Zheng, Guande He, Jianfei Chen et al.
Improving Data Efficiency via Curating LLM-Driven Rating Systems
Jinlong Pang, Jiaheng Wei, Ankit Parag Shah et al.
Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization
Zhe Li, Bicheng Ying, Zidong Liu et al.
Chain-of-Thought Provably Enables Learning the (Otherwise) Unlearnable
Chenxiao Yang, Zhiyuan Li, David Wipf
Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape View
Kaiyue Wen, Zhiyuan Li, Jason Wang et al.
nGPT: Normalized Transformer with Representation Learning on the Hypersphere
Ilya Loshchilov, Cheng-Ping Hsieh, Simeng Sun et al.
A Coefficient Makes SVRG Effective
Yida Yin, Zhiqiu Xu, Zhiyuan Li et al.
Homomorphism Counts as Structural Encodings for Graph Learning
Linus Bao, Emily Jin, Michael Bronstein et al.
PhysPDE: Rethinking PDE Discovery and a Physical HYpothesis Selection Benchmark
Mingquan Feng, Yixin Huang, Yizhou Liu et al.
Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity
Shuo Xie, Mohamad Amin Mohamadi, Zhiyuan Li
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Toshinori Kitamura, Tadashi Kozuno, Wataru Kumagai et al.
Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression
Juno Kim, Dimitri Meunier, Arthur Gretton et al.
TC-MoE: Augmenting Mixture of Experts with Ternary Expert Choice
Shen Yan, Xingyan Bin, Sijun Zhang et al.
Remove Symmetries to Control Model Expressivity and Improve Optimization
Liu Ziyin, Yizhou Xu, Isaac Chuang
JPEG Inspired Deep Learning
Ahmed Hussien Salamah, Kaixiang Zheng, Yiwen Liu et al.
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Peng Xu, Wei Ping, Xianchao Wu et al.
gRNAde: Geometric Deep Learning for 3D RNA inverse design
Chaitanya Joshi, Arian Jamasb, Ramon Viñas et al.
Boltzmann priors for Implicit Transfer Operators
Juan Viguera Diez, Mathias Schreiner, Ola Engkvist et al.
Biologically Constrained Barrel Cortex Model Integrates Whisker Inputs and Replicates Key Brain Network Dynamics
Tianfang Zhu, Dongli Hu, Jiandong Zhou et al.
Pairwise Elimination with Instance-Dependent Guarantees for Bandits with Cost Subsidy
Ishank Juneja, Carlee Joe-Wong, Osman Yagan
In-Context Editing: Learning Knowledge from Self-Induced Distributions
Siyuan Qi, Bangcheng Yang, Kailin Jiang et al.
Towards Understanding the Universality of Transformers for Next-Token Prediction
Michael Sander, Gabriel Peyré
Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning
Menglong Zhang, Fuyuan Qian, Quanying Liu
CryoGEN: Generative Energy-based Models for Cryogenic Electron Tomography Reconstruction
Yunfei Teng, Yuxuan Ren, Kai Chen et al.
KAN: Kolmogorov–Arnold Networks
Ziming Liu, Yixuan Wang, Sachin Vaidya et al.
Online Clustering with Nearly Optimal Consistency
T-H. Hubert Chan, Shaofeng Jiang, Tianyi Wu et al.
Simplifying, Stabilizing and Scaling Continuous-time Consistency Models
Cheng Lu, Yang Song
The Geometry of Categorical and Hierarchical Concepts in Large Language Models
Kiho Park, Yo Joong Choe, Yibo Jiang et al.
Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data
Binghui Li, Yuanzhi Li
TRENDy: Temporal Regression of Effective Nonlinear Dynamics
Matthew Ricci, Guy Pelc, Zoe Piran et al.
Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
Sujay Bhatt, Alec Koppel, Sumitra Ganesh et al.
Dynamic Neural Fortresses: An Adaptive Shield for Model Extraction Defense
Siyu Luan, Zhenyi Wang, Li Shen et al.
Protein Language Model Fitness is a Matter of Preference
Cade Gordon, Amy Lu, Pieter Abbeel
Watch Less, Do More: Implicit Skill Discovery for Video-Conditioned Policy
Wang, Zongqing Lu
Learning and aligning single-neuron invariance manifolds in visual cortex
Mohammad Bashiri, Luca Baroni, Ján Antolík et al.
From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities
Wanpeng Zhang, Zilong Xie, Yicheng Feng et al.
Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint
Jiafei Lyu, Mengbei Yan, Zhongjian Qiao et al.
Robustness Inspired Graph Backdoor Defense
Zhiwei Zhang, Minhua Lin, Junjie Xu et al.
Do You Keep an Eye on What I Ask? Mitigating Multimodal Hallucination via Attention-Guided Ensemble Decoding
Yeongjae Cho, Keonwoo Kim, Taebaek Hwang et al.
Lost in Prediction: Why Social Media Narratives Don't Help Macroeconomic Forecasting?
Almog Gueta, Roi Reichart, Amir Feder et al.
Improving Probabilistic Diffusion Models With Optimal Diagonal Covariance Matching
Zijing Ou, Mingtian Zhang, Andi Zhang et al.