Most Cited NEURIPS "distortion-aware features" Papers
5,858 papers found • Page 8 of 30
Conference
Temporal Logic-Based Multi-Vehicle Backdoor Attacks against Offline RL Agents in End-to-end Autonomous Driving
Xuan Chen, Shiwei Feng, Zikang Xiong et al.
Neural-Driven Image Editing
Pengfei Zhou, Jie Xia, Xiaopeng Peng et al.
Practical Bayes-Optimal Membership Inference Attacks
Marcus Lassila, Johan Oestman, Khac-Hoang Ngo et al.
PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs
Xinzhe Zheng, Hao Du, Fanding Xu et al.
Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos
Junyi Wu, Jiachen Tao, Haoxuan Wang et al.
Tail-Optimized Caching for LLM Inference
Wenxin Zhang, Yueying Li, Ciamac C Moallemi et al.
SynBrain: Enhancing Visual-to-fMRI Synthesis via Probabilistic Representation Learning
Weijian Mai, Jiamin Wu, Yu Zhu et al.
Exact and Linear Convergence for Federated Learning under Arbitrary Client Participation is Attainable
Bicheng Ying, Zhe Li, Haibo Yang
AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners
Reiss Koh, Wonbeen Oh, Jaein Jang et al.
Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling
Bryan Wong, Jongwoo Kim, Huazhu Fu et al.
NOBLE - Neural Operator with Biologically-informed Latent Embeddings to Capture Experimental Variability in Biological Neuron Models
Luca Ghafourpour, Valentin Duruisseaux, Bahareh Tolooshams et al.
Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control
Danfeng Li, Hui Zhang, Sheng Wang et al.
Orthogonal Survival Learners for Estimating Heterogeneous Treatment Effects from Time-to-Event Data
Dennis Frauen, Maresa Schröder, Konstantin Hess et al.
ROGR: Relightable 3D Objects using Generative Relighting
Jiapeng Tang, Matthew Levine, Dor Verbin et al.
RADAR: Benchmarking Language Models on Imperfect Tabular Data
Ken Gu, Zhihan Zhang, Kate Lin et al.
Uni-LoRA: One Vector is All You Need
Kaiyang Li, Shaobo Han, Qing Su et al.
On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization
Shaocong Ma, Heng Huang
CALM-PDE: Continuous and Adaptive Convolutions for Latent Space Modeling of Time-dependent PDEs
Jan Hagnberger, Daniel Musekamp, Mathias Niepert
Tackling Feature-Classifier Mismatch in Federated Learning via Prompt-Driven Feature Transformation
Xinghao Wu, Xuefeng Liu, Jianwei Niu et al.
Scaling Image Geo-Localization to Continent Level
Philipp Lindenberger, Paul-Edouard Sarlin, Jan Hosang et al.
VisDiff: SDF-Guided Polygon Generation for Visibility Reconstruction, Characterization and Recognition
Rahul Moorthy Mahesh, Jun-Jee Chao, Volkan Isler
Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
Yisong Xiao, Aishan Liu, Siyuan Liang et al.
PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors
Yimeng Chen, Piotr Piękos, Mateusz Ostaszewski et al.
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging
Sajad Khodadadian, Martin Zubeldia
What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models
Keyon Vafa, Sarah Bentley, Jon Kleinberg et al.
Generative Graph Pattern Machine
Zehong Wang, Zheyuan Zhang, Tianyi Ma et al.
High-dimensional neuronal activity from low-dimensional latent dynamics: a solvable model
Valentin Schmutz, Ali Haydaroğlu, Shuqi Wang et al.
C-SEO Bench: Does Conversational SEO Work?
Haritz Puerto, Martin Gubri, Tommaso Green et al.
SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem
Ahmed Heakl, Yahia Salaheldin Shaaban, Salem Lahlou et al.
MoE-Gyro: Self-Supervised Over-Range Reconstruction and Denoising for MEMS Gyroscopes
Feiyang Pan, Shenghe Zheng, Chunyan Yin et al.
GradMetaNet: An Equivariant Architecture for Learning on Gradients
Yoav Gelberg, Yam Eitan, Aviv Navon et al.
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
Fitsum Gaim, Hoyun Song, Huije Lee et al.
BenchmarkCards: Standardized Documentation for Large Language Model Benchmarks
Anna Sokol, Elizabeth Daly, Michael Hind et al.
LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents
Rui Li, Zixuan Hu, Wenxi Qu et al.
Stable Matching with Ties: Approximation Ratios and Learning
Shiyun Lin, Simon Mauras, Nadav Merlis et al.
AgMMU: A Comprehensive Agricultural Multimodal Understanding Benchmark
Aruna Gauba, Irene Pi, Yunze Man et al.
Common Task Framework For a Critical Evaluation of Scientific Machine Learning Algorithms
Philippe Wyder, Judah Goldfeder, Alexey Yermakov et al.
SkyLadder: Better and Faster Pretraining via Context Window Scheduling
Tongyao Zhu, Qian Liu, Haonan Wang et al.
Evaluating LLM-contaminated Crowdsourcing Data Without Ground Truth
Yichi Zhang, Jinlong Pang, Zhaowei Zhu et al.
FFN Fusion: Rethinking Sequential Computation in Large Language Models
Akhiad Bercovich, Mohammed Dabbah, Omri Puny et al.
scMRDR: A scalable and flexible framework for unpaired single-cell multi-omics data integration
Jianle Sun, Chaoqi Liang, Ran Wei et al.
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration
Quan Shi, Carlos Jimenez, Shunyu Yao et al.
Model-Based Policy Adaptation for Closed-Loop End-to-end Autonomous Driving
Haohong Lin, Yunzhi Zhang, Wenhao Ding et al.
Imagined Autocurricula
Ahmet Hamdi Güzel, Matthew T Jackson, Jarek Liesen et al.
CGS-GAN: 3D Consistent Gaussian Splatting GANs for High Resolution Human Head Synthesis
Florian Barthel, Wieland Morgenstern, Paul Hinzer et al.
Towards Predicting Any Human Trajectory In Context
Ryo Fujii, Hideo Saito, Ryo Hachiuma
Fair Deepfake Detectors Can Generalize
Harry Cheng, Ming-Hui Liu, Yangyang Guo et al.
SRHand: Super-Resolving Hand Images and 3D Shapes via View/Pose-aware Neural Image Representations and Explicit Meshes
Minje Kim, Tae-Kyun Kim
Demystifying Network Foundation Models
Roman Beltiukov, Satyandra Guthula, Wenbo Guo et al.
Advancing Compositional Awareness in CLIP with Efficient Fine-Tuning
Amit Peleg, Naman Deep Singh, Matthias Hein
Measuring and Guiding Monosemanticity
Ruben Härle, Felix Friedrich, Manuel Brack et al.
ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training
Adel Nabli, Louis Fournier, Pierre ERBACHER et al.
TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine
Jiacheng Xie, Yang Yu, Ziyang Zhang et al.
TRAP: Targeted Redirecting of Agentic Preferences
Hangoo Kang, Jehyeok Yeon, Gagandeep Singh
Compute-Optimal Scaling for Value-Based Deep RL
Preston Fu, Oleh Rybkin, Zhiyuan (Paul) Zhou et al.
A geometric framework for momentum-based optimizers for low-rank training
Steffen Schotthöfer, Timon Klein, Jonas Kusch
Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization
Subhojyoti Mukherjee, Viet Lai, Raghavendra Addanki et al.
Logical Expressiveness of Graph Neural Networks with Hierarchical Node Individualization
Arie Soeteman, Balder ten Cate
Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards
Artin Tajdini, Jonathan Scarlett, Kevin Jamieson
Turbocharging Gaussian Process Inference with Approximate Sketch-and-Project
Pratik Rathore, Zachary Frangella, Sachin Garg et al.
Hankel Singular Value Regularization for Highly Compressible State Space Models
Paul Schwerdtner, Jules Berman, Benjamin Peherstorfer
Spark Transformer: Reactivating Sparsity in Transformer FFN and Attention
Chong You, Kan Wu, Zhipeng Jia et al.
From Linear to Nonlinear: Provable Weak-to-Strong Generalization through Feature Learning
Junsoo Oh, Jerry Song, Chulhee Yun
Solver-Free Decision-Focused Learning for Linear Optimization Problems
Senne Berden, Ali Mahmutoğulları, Dimos Tsouros et al.
Unlearned but Not Forgotten: Data Extraction after Exact Unlearning in LLM
Xiaoyu Wu, Yifei Pang, Terrance Liu et al.
Composition and Alignment of Diffusion Models using Constrained Learning
Shervin Khalafi, Ignacio Hounie, Dongsheng Ding et al.
Rao-Blackwell Gradient Estimators for Equivariant Denoising Diffusion
Vinh Tong, Trung-Dung Hoang, Anji Liu et al.
TRIDENT: Tri-Modal Molecular Representation Learning with Taxonomic Annotations and Local Correspondence
Feng Jiang, Mangal Prakash, Hehuan Ma et al.
MAP Estimation with Denoisers: Convergence Rates and Guarantees
Scott Pesme, Giacomo Meanti, Michael Arbel et al.
SAS: Simulated Attention Score
Chuanyang Zheng, Jiankai Sun, Yihang Gao et al.
Many LLMs Are More Utilitarian Than One
Anita Keshmirian, Razan Baltaji, Babak Hemmatian et al.
Distance Adaptive Beam Search for Provably Accurate Graph-Based Nearest Neighbor Search
Yousef Al-Jazzazi, Haya Diwan, Jinrui Gou et al.
MPCache: MPC-Friendly KV Cache Eviction for Efficient Private LLM Inference
Wenxuan Zeng, Ye Dong, Jinjin Zhou et al.
SHAP values via sparse Fourier representation
Ali Gorji, Andisheh Amrollahi, Andreas Krause
Second-Order Convergence in Private Stochastic Non-Convex Optimization
Youming Tao, Zuyuan Zhang, Dongxiao Yu et al.
Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness
Longwei Wang, Ifrat Ikhtear Uddin, Prof. KC Santosh (PhD) et al.
Optimism Without Regularization: Constant Regret in Zero-Sum Games
John Lazarsfeld, Georgios Piliouras, Ryann Sim et al.
Better Training Data Attribution via Better Inverse Hessian-Vector Products
Andrew Wang, Elisa Nguyen, Runshi Yang et al.
Scaling Diffusion Transformers Efficiently via $\mu$P
Chenyu Zheng, Xinyu Zhang, Rongzhen Wang et al.
Learning to cluster neuronal function
Nina Nellen, Polina Turishcheva, Michaela Vystrčilová et al.
Mixture-of-Experts Meets In-Context Reinforcement Learning
Wenhao Wu, Fuhong Liu, Haoru Li et al.
Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes
Kaiqing Lin, Zhiyuan Yan, Ke-Yue Zhang et al.
Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games
Runyu Lu, Peng Zhang, Ruochuan Shi et al.
Scalable In-context Ranking with Generative Models
Nilesh Gupta, Chong You, Srinadh Bhojanapalli et al.
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning
Tianyi Bai, Yuxuan Fan, Qiu Jiantao et al.
Reward Reasoning Models
Jiaxin Guo, Zewen Chi, Li Dong et al.
AlphaFold Database Debiasing for Robust Inverse Folding
Cheng Tan, Zhenxiao Cao, Zhangyang Gao et al.
Convolution Goes Higher-Order: A Biologically Inspired Mechanism Empowers Image Classification
Simone Azeglio, Olivier Marre, Peter Neri et al.
AutoSciDACT: Automated Scientific Discovery through Contrastive Embedding and Hypothesis Testing
Sam Bright-Thonney, Christina Reissel, Gaia Grosso et al.
Inference-Time Reward Hacking in Large Language Models
Hadi Khalaf, Claudio Mayrink Verdun, Alex Oesterling et al.
Monitoring Risks in Test-Time Adaptation
Mona Schirmer, Metod Jazbec, Christian Andersson Naesseth et al.
Spectral Graph Neural Networks are Incomplete on Graphs with a Simple Spectrum
Snir Hordan, Maya Bechler-Speicher, Gur Lifshitz et al.
ORIGAMISPACE: Benchmarking Multimodal LLMs in Multi-Step Spatial Reasoning with Mathematical Constraints
Rui Xu, Dakuan Lu, Zicheng Zhao et al.
Representation Consistency for Accurate and Coherent LLM Answer Aggregation
Junqi Jiang, Tom Bewley, Salim I. Amoukou et al.
Riemannian Flow Matching for Brain Connectivity Matrices via Pullback Geometry
Antoine Collas, Ce Ju, Nicolas Salvy et al.
Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks
Debargha Ganguly, Vikash Singh, Sreehari Sankar et al.
Latent Mixture of Symmetries for Sample-Efficient Dynamic Learning
Haoran Li, CHENHAN XIAO, Muhao Guo et al.
V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation
Hanyue Lou, Jinxiu Liang, Minggui Teng et al.
From Black-box to Causal-box: Towards Building More Interpretable Models
Inwoo Hwang, Yushu Pan, Elias Bareinboim
Brain Harmony: A Multimodal Foundation Model Unifying Morphology and Function into 1D Tokens
Zijian Dong, Ruilin Li, Joanna Chong et al.
On Feasible Rewards in Multi-Agent Inverse Reinforcement Learning
Till Freihaut, Giorgia Ramponi
Emergent Risk Awareness in Rational Agents under Resource Constraints
Daniel Jarne Ornia, Nicholas Bishop, Joel Dyer et al.
Generalized Contrastive Learning for Universal Multimodal Retrieval
Jungsoo Lee, Janghoon Cho, Hyojin Park et al.
Text-Aware Real-World Image Super-Resolution via Diffusion Model with Joint Segmentation Decoders
Qiming Hu, Linlong Fan, Yiyan Luo et al.
An Analysis of Concept Bottleneck Models: Measuring, Understanding, and Mitigating the Impact of Noisy Annotations
Seonghwan Park, Jueun Mun, Donghyun Oh et al.
Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in Language Models
Chantal Shaib, Vinith Suriyakumar, Byron Wallace et al.
Limitations of Normalization in Attention
Timur Mudarisov, Mikhail Burtsev, Tatiana Petrova et al.
Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing
XianJun, Davin Choo, Yuqi Pan, Tonghan Wang et al.
Efficient Parametric SVD of Koopman Operator for Stochastic Dynamical Systems
Minchan Jeong, Jongha (Jon) Ryu, Se-Young Yun et al.
Data Fusion for Partial Identification of Causal Effects
Quinn Lanners, Cynthia Rudin, Alexander Volfovsky et al.
The Effect of Optimal Self-Distillation in Noisy Gaussian Mixture Model
Kaito Takanami, Takashi Takahashi, Ayaka Sakata
Optimal Rates in Continual Linear Regression via Increasing Regularization
Ran Levinstein, Amit Attia, Matan Schliserman et al.
UGoDIT: Unsupervised Group Deep Image Prior Via Transferable Weights
Shijun Liang, Ismail Alkhouri, Siddhant Gautam et al.
Hadamax Encoding: Elevating Performance in Model-Free Atari
Jacob Eeuwe Kooi, Zhao Yang, Vincent Francois-Lavet
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
Yuezhou Hu, Jiaxin Guo, Xinyu Feng et al.
Modeling the Economic Impacts of AI Openness Regulation
Tori Qiu, Benjamin Laufer, Jon Kleinberg et al.
High-order Equivariant Flow Matching for Density Functional Theory Hamiltonian Prediction
Seongsu Kim, Nayoung Kim, Dongwoo Kim et al.
HYPRL: Reinforcement Learning of Control Policies for Hyperproperties
Tzu-Han Hsu, Arshia Rafieioskouei, Borzoo Bonakdarpour
Streaming Attention Approximation via Discrepancy Theory
Ekaterina Kochetkova, Kshiteej Jitesh Sheth, Insu Han et al.
MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives
Wisdom Ikezogwo, Kevin M. Zhang, Saygin Seyfioglu
Towards Evaluating Proactive Risk Awareness of Multimodal Language Models
Youliang Yuan, Wenxiang Jiao, Yuejin Xie et al.
Constrained Posterior Sampling: Time Series Generation with Hard Constraints
Sai Shankar Narasimhan, Shubhankar Agarwal, Litu Rout et al.
An Analytical Theory of Spectral Bias in the Learning Dynamics of Diffusion Models
Binxu Wang, Cengiz Pehlevan
FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing
Shoutao Guo, Shaolei Zhang, Qingkai Fang et al.
Instance-Level Composed Image Retrieval
Bill Psomas, George Retsinas, Nikos Efthymiadis et al.
Adaptive Prediction-Powered AutoEval with Reliability and Efficiency Guarantees
Sangwoo Park, Matteo Zecchin, Osvaldo Simeone
Visual Instruction Bottleneck Tuning
Changdae Oh, Jiatong Li, Shawn Im et al.
On the creation of narrow AI: hierarchy and nonlocality of neural network skills
Eric Michaud, Asher Parker-Sartori, Max Tegmark
HoloScene: Simulation‑Ready Interactive 3D Worlds from a Single Video
Hongchi Xia, Chih-Hao Lin, Hao-Yu Hsu et al.
Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization
Pritam Sarkar, Ali Etemad
Dynamic Bundling with Large Language Models for Zero-Shot Inference on Text-Attributed Graphs
Yusheng Zhao, Qixin Zhang, Xiao Luo et al.
Linear Attention for Efficient Bidirectional Sequence Modeling
Arshia Afzal, Elias Abad Rocamora, Leyla Candogan et al.
ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive
Xinhao Luo, Zihan Liu, Yangjie Zhou et al.
Reinforced Context Order Recovery for Adaptive Reasoning and Planning
Long Ma, Fangwei Zhong, Yizhou Wang
OmniGaze: Reward-inspired Generalizable Gaze Estimation in the Wild
Hongyu Qu, Jianan Wei, Xiangbo Shu et al.
SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought
Guanghao Li, Wenhao Jiang, Mingfeng Chen et al.
Sparse Polyak: an adaptive step size rule for high-dimensional M-estimation
Tianqi Qiao, Marie Maros
The Generative Leap: Tight Sample Complexity for Efficiently Learning Gaussian Multi-Index Models
Alex Damian, Jason Lee, Joan Bruna
Vision‑Language‑Vision Auto‑Encoder: Scalable Knowledge Distillation from Diffusion Models
Tiezheng Zhang, Yitong Li, Yu-Cheng Chou et al.
Differentially Private Quantiles with Smaller Error
Jacob Imola, Fabrizio Boninsegna, Hannah Keller et al.
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis
Junting Chen, Haotian Liang, Lingxiao Du et al.
NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables
Lanrui Wang, Mingyu Zheng, Hongyin Tang et al.
Continuous Simplicial Neural Networks
Aref Einizade, Dorina Thanou, Fragkiskos Malliaros et al.
ALINE: Joint Amortization for Bayesian Inference and Active Data Acquisition
Daolang Huang, Xinyi Wen, Ayush Bharti et al.
OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts
Shiting (Ginny) Xiao, Rishabh Kabra, Yuhang Li et al.
Universal Causal Inference in a Topos
Sridhar Mahadevan
FLOWING: Implicit Neural Flows for Structure-Preserving Morphing
Arthur Bizzi, Matias Grynberg Portnoy, Vitor Pereira Matias et al.
COOPERA: Continual Open-Ended Human-Robot Assistance
Chenyang Ma, Kai Lu, Ruta Desai et al.
Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training
Hong Wang, Haiyang Xin, Jie Wang et al.
Beyond Scores: Proximal Diffusion Models
Zhenghan Fang, Mateo Diaz, Sam Buchanan et al.
Alignment of Large Language Models with Constrained Learning
Botong Zhang, Shuo Li, Ignacio Hounie et al.
Hamiltonian Descent Algorithms for Optimization: Accelerated Rates via Randomized Integration Time
Qiang Fu, Andre Wibisono
Head Pursuit: Probing Attention Specialization in Multimodal Transformers
Lorenzo Basile, Valentino Maiorca, Diego Doimo et al.
Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences
Jing-An Sun, Hang Fan, Junchao Gong et al.
Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision
Shilin Zhang, Zican Hu, Wenhao Wu et al.
Deep Tree Tensor Networks
Chang Nie
GeoDynamics: A Geometric State‑Space Neural Network for Understanding Brain Dynamics on Riemannian Manifolds
Tingting Dan, Jiaqi Ding, Guorong Wu
DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
Chieh Lin, Zhaoyang Lv, Songyin Wu et al.
A Circular Argument: Does RoPE need to be Equivariant for Vision?
Chase van de Geijn, Timo Lüddecke, Polina Turishcheva et al.
Language Modeling by Language Models
Junyan Cheng, Peter Clark, Kyle Richardson
PocketSR: The Super-Resolution Expert in Your Pocket Mobiles
Haoze Sun, Linfeng Jiang, Fan Li et al.
Measuring Scientific Capabilities of Language Models with a Systems Biology Dry Lab
Haonan Duan, Stephen Lu, Caitlin F Harrigan et al.
Factorio Learning Environment
Jack Hopkins, Mart Bakler, Akbir Khan
A Generalized Bisimulation Metric of State Similarity between Markov Decision Processes: From Theoretical Propositions to Applications
Zhenyu Tao, Wei Xu, Xiaohu You
Transformers for Mixed-type Event Sequences
Felix Draxler, Yang Meng, Kai Nelson et al.
Contrastive Representations for Temporal Reasoning
Alicja Ziarko, Michał Bortkiewicz, Michał Zawalski et al.
No-Regret Learning Under Adversarial Resource Constraints: A Spending Plan Is All You Need!
Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.
Feedback-Aware MCTS for Goal-Oriented Information Seeking
Harshita Chopra, Chirag Shah
Provably Efficient Online RLHF with One-Pass Reward Modeling
Long-Fei Li, Yu-Yang Qian, Peng Zhao et al.
Learning Dense Hand Contact Estimation from Imbalanced Data
Daniel Jung, Kyoung Mu Lee
Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning
Sanghyun Ahn, Wonje Choi, Junyong Lee et al.
Set Smoothness Unlocks Clarke Hyper-stationarity in Bilevel Optimization
He Chen, Jiajin Li, Anthony Man-Cho So
A Reliable Cryptographic Framework for Empirical Machine Unlearning Evaluation
Yiwen Tu, Pingbang Hu, Jiaqi Ma
DuoGPT: Training-free Dual Sparsity through Activation-aware Pruning in LLMs
Ruokai Yin, Yuhang Li, Donghyun Lee et al.
Noise Matters: Optimizing Matching Noise for Diffusion Classifiers
Yanghao Wang, Long Chen
On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding
Haoyuan Wu, Rui Ming, Jilong Gao et al.
Attention Sinks: A 'Catch, Tag, Release' Mechanism for Embeddings
Stephen Zhang, Mustafa Khan, Vardan Papyan
Robo2VLM: Improving Visual Question Answering using Large-Scale Robot Manipulation Data
Kaiyuan Eric Chen, Shuangyu Xie, Zehan Ma et al.
Exploring the Translation Mechanism of Large Language Models
Hongbin Zhang, Kehai Chen, Xuefeng Bai et al.
Image Editing As Programs with Diffusion Models
Yujia Hu, Songhua Liu, Zhenxiong Tan et al.
Elucidated Rolling Diffusion Models for Probabilistic Forecasting of Complex Dynamics
Salva Rühling Cachay, Miika Aittala, Karsten Kreis et al.
Scale-invariant attention
Ben Anson, Xi Wang, Laurence Aitchison
What are you sinking? A geometric approach on attention sink
Valeria Ruscio, Umberto Nanni, Fabrizio Silvestri
Rethinking Losses for Diffusion Bridge Samplers
Sebastian Sanokowski, Lukas Gruber, Christoph Bartmann et al.
Open-World Drone Active Tracking with Goal-Centered Rewards
Haowei Sun, Jinwu Hu, Zhirui Zhang et al.
Unified Scaling Laws for Compressed Representations
Andrei Panferov, Alexandra Volkova, Ionut-Vlad Modoranu et al.
The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
Yonatan Slutzky, Yotam Alexander, Noam Razin et al.
Copresheaf Topological Neural Networks: A Generalized Deep Learning Framework
Mustafa Hajij, Lennart Bastian, Sarah Osentoski et al.
Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction
Jeffrey Willette, Heejun Lee, Sung Ju Hwang
Rethinking Residual Distribution in Locate-then-Edit Model Editing
Xiaopeng Li, Shangwen Wang, Shasha Li et al.
A Clean Slate for Offline Reinforcement Learning
Matthew T Jackson, Uljad Berdica, Jarek Liesen et al.
Solving Continuous Mean Field Games: Deep Reinforcement Learning for Non-Stationary Dynamics
Lorenzo Magnino, Kai Shao, Zida Wu et al.
S'MoRE: Structural Mixture of Residual Experts for Parameter-Efficient LLM Fine-tuning
Hanqing Zeng, Yinglong Xia, Zhuokai Zhao et al.
LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades
Yanan Li, Fanxu Meng, Muhan Zhang et al.
GRIP: A Graph-Based Reasoning Instruction Producer
Jiankang Wang, Jianjun Xu, Xiaorui Wang et al.
Fixed-Point RNNs: Interpolating from Diagonal to Dense
Sajad Movahedi, Felix Sarnthein, Nicola Muca Cirone et al.
Doctor Approved: Generating Medically Accurate Skin Disease Images through AI-Expert Feedback
Janet Wang, Yunbei Zhang, Zhengming Ding et al.
On Transferring Transferability: Towards a Theory for Size Generalization
Eitan Levin, Yuxin Ma, Mateo Diaz et al.
Local-Global Associative Frames for Symmetry-Preserving Crystal Structure Modeling
haowei hua, Wanyu Lin
TAI3: Testing Agent Integrity in Interpreting User Intent
Shiwei Feng, Xiangzhe Xu, Xuan Chen et al.