NeurIPS Papers
5,858 papers found • Page 9 of 118
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
Yuzheng Hu, Fan Wu, Haotian Ye et al.
A solvable model of learning generative diffusion: theory and insights
Hugo Cui, Cengiz Pehlevan, Yue Lu
Assessing the quality of denoising diffusion models in Wasserstein distance: noisy score and optimal bounds
Vahan Arsenyan, Elen Vardanyan, Arnak Dalalyan
Assignments for Congestion-Averse Agents: Seeking Competitive and Envy-Free Solutions
Jiehua Chen, Jiong Guo, Yinghui Wen
Association-Focused Path Aggregation for Graph Fraud Detection
Tian Qiu, Wenda Li, Zunlei Feng et al.
A Stable Whitening Optimizer for Efficient Neural Network Training
Kevin Frans, Sergey Levine, Pieter Abbeel
A Standardized Benchmark for Multilabel Antimicrobial Peptide Classification
Sebastian Ojeda, Rafael Velasquez, Nicolás Aparicio et al.
A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules
Xiang Li, Feng Ruan, Huiyuan Wang et al.
A Statistical Theory of Contrastive Learning via Approximate Sufficient Statistics
Licong Lin, Song Mei
AstroVisBench: A Code Benchmark for Scientific Computing and Visualization in Astronomy
Sebastian Joseph, Syed M. Husain, Stella Offner et al.
A Sustainable AI Economy Needs Data Deals That Work for Generators
Ruoxi Jia, Luis Oala, Wenjie Xiong et al.
Asymmetric Dual-Lens Video Deblurring
Zeyu Xiao, Xinchao Wang
Asymmetric Dual Self-Distillation for 3D Self-Supervised Representation Learning
Remco Leijenaar, Hamidreza Kasaei
Asymmetric Duos: Sidekicks Improve Uncertainty
Tim G. Zhou, Evan Shelhamer, Geoff Pleiss
Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards
Charles Arnal, Gaëtan Narozniak, Vivien Cabannes et al.
Asymptotically exact variational flows via involutive MCMC kernels
Zuheng (David) Xu, Trevor Campbell
Asymptotically Stable Quaternion-valued Hopfield-structured Neural Network with Periodic Projection-based Supervised Learning Rules
Tianwei Wang, Xinhui Ma, Wei Pang
Asymptotics of SGD in Sequence-Single Index Models and Single-Layer Attention Networks
Luca Arnaboldi, Bruno Loureiro, Ludovic Stephan et al.
Asymptotic Theory of Geometric and Adaptive $k$-Means Clustering
Adam Quinn Jaffe
Asymptotic theory of SGD with a general learning-rate
Or Goldreich, Ziyang Wei, SOHAM BONNERJEE et al.
A Tale of Two Symmetries: Exploring the Loss Landscape of Equivariant Models
YuQing Xie, Tess Smidt
A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks
Mucong Ding, Bang An, Tahseen Rabbani et al.
A Temporal Difference Method for Stochastic Continuous Dynamics
Haruki Settai, Naoya Takeishi, Takehisa Yairi
A Theoretical Framework for Grokking: Interpolation followed by Riemannian Norm Minimisation
Etienne Boursier, Scott Pesme, Radu-Alexandru Dragomir
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning
Zhi Zhou, Tan Yuhao, Zenan Li et al.
A Theory for Worst-Case vs. Average-Case Guarantees for LLMs
Noga Amit, Shafi Goldwasser, Orr Paradise et al.
A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings
Xiaoang Xu, Shuo Wang, Xu Han et al.
ATLAS: Autoformalizing Theorems through Lifting, Augmentation, and Synthesis of Data
Xiaoyang Liu, Kangjie Bao, Jiashuo Zhang et al.
AtlasGS: Atlanta-world Guided Surface Reconstruction with Implicit Structured Gaussians
Xiyu Zhang, Chong Bao, YiPeng Chen et al.
AtmosSci-Bench: Evaluating the Recent Advance of Large Language Model for Atmospheric Science
Chenyue Li, Wen Deng, Mengqian Lu et al.
A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone
Jitai Hao, Qiang Huang, Hao Liu et al.
Atomic Diffusion Models for Small Molecule Structure Elucidation from NMR Spectra
Ziyu Xiong, Yichi Zhang, Foyez Alauddin et al.
Atomic Thinking of LLMs: Decoupling and Exploring Mathematical Reasoning Abilities
Jiayi Kuang, Haojing Huang, Yinghui Li et al.
Atom of Thoughts for Markov LLM Test-Time Scaling
Fengwei Teng, Quan Shi, Zhaoyang Yu et al.
A TRIANGLE Enables Multimodal Alignment Beyond Cosine Similarity
Giordano Cicchetti, Eleonora Grassucci, Danilo Comminiello
Attack by Yourself: Effective and Unnoticeable Multi-Category Graph Backdoor Attacks with Subgraph Triggers Pool
Jiangtong Li, Dongyi Liu, Kun Zhu et al.
Attack via Overfitting: 10-shot Benign Fine-tuning to Jailbreak LLMs
Zhixin Xie, Xurui Song, Jun Luo
Attention (as Discrete-Time Markov) Chains
Yotam Erel, Olaf Dünkel, Rishabh Dabral et al.
Attention-based clustering
Rodrigo Maulen Soto, Pierre Marion, Claire Boyer
Attention Mechanism, Max-Affine Partition, and Universal Approximation
Hude Liu, Jerry Yao-Chieh Hu, Zhao Song et al.
Attention on the Sphere
Boris Bonev, Max Rietmann, Andrea Paris et al.
AttentionPredictor: Temporal Patterns Matter for KV Cache Compression
Qingyue Yang, Jie Wang, Xing Li et al.
Attention Sinks: A 'Catch, Tag, Release' Mechanism for Embeddings
Stephen Zhang, Mustafa Khan, Vardan Papyan
Attention with Trained Embeddings Provably Selects Important Tokens
Diyuan Wu, Aleksandr Shevchenko, Samet Oymak et al.
Attention! Your Vision Language Model Could Be Maliciously Manipulated
Xiaosen Wang, Shaokang Wang, Zhijin Ge et al.
Attractive Metadata Attack: Inducing LLM Agents to Invoke Malicious Tools
Kanghua Mo, Li Hu, Yucheng Long et al.
Attribution-Driven Adaptive Token Pruning for Transformers
YAOYAO YAN, Hui Yu, Weizhi Xu
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
Sreyan Ghosh, Arushi Goel, Jaehyeon Kim et al.
Audio Super-Resolution with Latent Bridge Models
Chang Li, Zehua Chen, Liyuan Wang et al.
Audio-Sync Video Generation with Multi-Stream Temporal Control
Shuchen Weng, Haojie Zheng, zheng chang et al.