Most Cited ICLR Poster Papers
Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct
Christopher Ackerman, Nina Panickssery
Support is All You Need for Certified VAE Training
Changming Xu, Debangshu Banerjee, Deepak Vasisht et al.
Neuron based Personality Trait Induction in Large Language Models
Jia Deng, Tianyi Tang, Yanbin Yin et al.
The Alignment Problem from a Deep Learning Perspective
Richard Ngo, Lawrence Chan, Sören Mindermann
Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
xueru wen, Jie Lou, Yaojie Lu et al.
Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting
Peng Chen, Yingying ZHANG, Yunyao Cheng et al.
Factual Context Validation and Simplification: A Scalable Method to Enhance GPT Trustworthiness and Efficiency
Tianyi Huang
GraphBridge: Towards Arbitrary Transfer Learning in GNNs
Li Ju, Xingyi Yang, Qi Li et al.
PFGuard: A Generative Framework with Privacy and Fairness Safeguards
Soyeon Kim, Yuji Roh, Geon Heo et al.
Score-based Self-supervised MRI Denoising
Jiachen Tu, Yaokun Shi, Fan Lam
Local Patterns Generalize Better for Novel Anomalies
Yalong Jiang
Offline RL in Regular Decision Processes: Sample Efficiency via Language Metrics
Ahana Deb, Roberto Cipollone, Anders Jonsson et al.
TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
Leqi Shen, Tianxiang Hao, Tao He et al.
How much of my dataset did you use? Quantitative Data Usage Inference in Machine Learning
Yao Tong, Jiayuan Ye, Sajjad Zarifzadeh et al.
Causal Representation Learning from Multimodal Biomedical Observations
Yuewen Sun, Lingjing Kong, Guangyi Chen et al.
FIRING-Net: A filtered feature recycling network for speech enhancement
Xinmeng Xu, Yiqun Zhang, Jizhen Li et al.
Autoregressive Pretraining with Mamba in Vision
Sucheng Ren, Xianhang Li, Haoqin Tu et al.
ZooProbe: A Data Engine for Evaluating, Exploring, and Evolving Large-scale Training Data for Multimodal LLMs
Yi-Kai Zhang, Shiyin Lu, Qing-Guo Chen et al.
A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts
Suyu Ge, Xihui Lin, Yunan Zhang et al.
Mechanistic Interpretability Meets Vision Language Models: Insights and Limitations
Yiming Liu, Yuhui Zhang, Serena Yeung
Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs
Yuheng Zhang, Nan Jiang
GDrag: Towards General-Purpose Interactive Editing with Anti-ambiguity Point Diffusion
Xiaojian Lin, Hanhui Li, Yuhao Cheng et al.
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Sihang Li, Jin Huang, Jiaxi Zhuang et al.
Offline Hierarchical Reinforcement Learning via Inverse Optimization
Carolin Schmidt, Daniele Gammelli, James Harrison et al.
GraphArena: Evaluating and Exploring Large Language Models on Graph Computation
Jianheng Tang, Qifan Zhang, Yuhan Li et al.
Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis
Hongkang Li, Songtao Lu, Pin-Yu Chen et al.
Underdamped Diffusion Bridges with Applications to Sampling
Denis Blessing, Julius Berner, Lorenz Richter et al.
T2V2: A Unified Non-Autoregressive Model for Speech Recognition and Synthesis via Multitask Learning
Nabarun Goswami, Hanqin Wang, Tatsuya Harada
One Hundred Neural Networks and Brains Watching Videos: Lessons from Alignment
Christina Sartzetaki, Gemma Roig, Cees G Snoek et al.
TD-Paint: Faster Diffusion Inpainting Through Time-Aware Pixel Conditioning
Tsiry MAYET, Pourya Shamsolmoali, Simon Bernard et al.
XAIguiFormer: explainable artificial intelligence guided transformer for brain disorder identification
Hanning Guo, Farah Abdellatif, Yu Fu et al.
One for all and all for one: Efficient computation of partial Wasserstein distances on the line
Laetitia Chapel, Romain Tavenard
Learning Dynamics of Deep Matrix Factorization Beyond the Edge of Stability
Avrajit Ghosh, Soo Min Kwon, Rongrong Wang et al.
TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies
Ruijie Zheng, Yongyuan Liang, Shuaiyi Huang et al.
Taming Overconfidence in LLMs: Reward Calibration in RLHF
Jixuan Leng, Chengsong Huang, Banghua Zhu et al.
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning
Haoxin Lin, Yu-Yan Xu, Yihao Sun et al.
Scaling FP8 training to trillion-token LLMs
Maxim Fishman, Brian Chmiel, Ron Banner et al.
Causal Identification for Complex Functional Longitudinal Studies
Andrew Ying
RESuM: A Rare Event Surrogate Model for Physics Detector Design
Ann-Kathrin Schuetz, Alan Poon, Aobo Li
On Stochastic Contextual Bandits with Knapsacks in Small Budget Regime
Hengquan Guo, Xin Liu
SIMPL: Scalable and hassle-free optimisation of neural representations from behaviour
Tom George, Pierre Glaser, Kimberly Stachenfeld et al.
For Better or For Worse? Learning Minimum Variance Features With Label Augmentation
Muthu Chidambaram, Rong Ge
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Junyu Chen, Han Cai, Junsong Chen et al.
Residual-MPPI: Online Policy Customization for Continuous Control
Pengcheng Wang, Chenran Li, Catherine Weaver et al.
ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination
Xinxin Zhao, Wenzhe Cai, Likun Tang et al.
Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
Guy Ohayon, Tomer Michaeli, Michael Elad
Restructuring Vector Quantization with the Rotation Trick
Christopher Fifty, Ronald Junkins, Dennis Duan et al.
Biologically Plausible Brain Graph Transformer
Ciyuan Peng, Yuelong Huang, Qichao Dong et al.
EasyTPP: Towards Open Benchmarking Temporal Point Processes
Siqiao Xue, Xiaoming Shi, Zhixuan Chu et al.
The Belief State Transformer
Edward Hu, Kwangjun Ahn, Qinghua Liu et al.
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
Xiang Yue, Yueqi Song, Akari Asai et al.
Generalization, Expressivity, and Universality of Graph Neural Networks on Attributed Graphs
Levi Rauchwerger, Stefanie Jegelka, Ron Levie
GeoLoRA: Geometric integration for parameter efficient fine-tuning
Steffen Schotthöfer, Emanuele Zangrando, Gianluca Ceruti et al.
Continual Slow-and-Fast Adaptation of Latent Neural Dynamics (CoSFan): Meta-Learning What-How & When to Adapt
Ryan Missel, Linwei Wang
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
Shicong Cen, Jincheng Mei, Katayoon Goshvadi et al.
Operator Deep Smoothing for Implied Volatility
Ruben Wiedemann, Antoine (Jack) Jacquier, Lukas Gonon
SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection
Jingjie Zhang, Hanqun Cao, Zijun Gao et al.
Select before Act: Spatially Decoupled Action Repetition for Continuous Control
Buqing Nie, Yangqing Fu, Yue Gao
EmbedLLM: Learning Compact Representations of Large Language Models
Richard Zhuang, Tianhao Wu, Zhaojin Wen et al.
Connectome Mapping: Shape-Memory Network via Interpretation of Contextual Semantic Information
Kyungsu Lee, Haeyun Lee, Jae Youn Hwang
TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees
Weibin Liao, Xu Chu, Yasha Wang
Round and Round We Go! What makes Rotary Positional Encodings useful?
Federico Barbero, Alex Vitvitskyi, Christos Perivolaropoulos et al.
GOLD: Graph Out-of-Distribution Detection via Implicit Adversarial Latent Generation
Danny Wang, Ruihong Qiu, Guangdong Bai et al.
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Tong Wu, Shujian Zhang, Kaiqiang Song et al.
Global Identifiability of Overcomplete Dictionary Learning via L1 and Volume Minimization
Yuchen Sun, Kejun Huang
Data Pruning by Information Maximization
Haoru Tan, Sitong Wu, Wei Huang et al.
An Evolved Universal Transformer Memory
Edoardo Cetin, Qi Sun, Tianyu Zhao et al.
Multilevel Generative Samplers for Investigating Critical Phenomena
Ankur Singha, Elia Cellini, Kim A. Nicoli et al.
Memory Mosaics
Jianyu Zhang, Niklas Nolte, Ranajoy Sadhukhan et al.
Partially Observed Trajectory Inference using Optimal Transport and a Dynamics Prior
Anming Gu, Edward Chien, Kristjan Greenewald
Examining Alignment of Large Language Models through Representative Heuristics: the case of political stereotypes
Sullam Jeoung, Yubin Ge, Haohan Wang et al.
URLOST: Unsupervised Representation Learning without Stationarity or Topology
Zeyu Yun, Juexiao Zhang, Yann LeCun et al.
Enhancing End-to-End Autonomous Driving with Latent World Model
Yingyan Li, Lue Fan, Jiawei He et al.
CREAM: Consistency Regularized Self-Rewarding Language Models
Zhaoyang Wang, Weilei He, Zhiyuan Liang et al.
Estimation of single-cell and tissue perturbation effect in spatial transcriptomics via Spatial Causal Disentanglement
Stathis Megas, Daniel Chen, Krzysztof Polanski et al.
TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Deqing Fu, Tong Xiao, Rui Wang et al.
SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget
Zihao Wang, Bin CUI, Shaoduo Gan
Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron
Yiran Zhao, Wenxuan Zhang, Yuxi Xie et al.
Towards Domain Adaptive Neural Contextual Bandits
Ziyan Wang, Xiaoming Huo, Hao Wang
Should VLMs be Pre-trained with Image Data?
Sedrick Keh, Jean Mercat, Samir Yitzhak Gadre et al.
SEMDICE: Off-policy State Entropy Maximization via Stationary Distribution Correction Estimation
Jongmin Lee, Meiqi Sun, Pieter Abbeel
Wayward Concepts In Multimodal Models
Brandon Trabucco, Max Gurinas, Kyle Doherty et al.
Learning Diagrams: A Graphical Language for Compositional Training Regimes
Mason Lary, Richard Samuelson, Alexander Wilentz et al.
Detecting Pretraining Data from Large Language Models
Weijia Shi, Anirudh Ajith, Mengzhou Xia et al.
Pacmann: Efficient Private Approximate Nearest Neighbor Search
Mingxun Zhou, Elaine Shi, Giulia Fanti
What's New in My Data? Novelty Exploration via Contrastive Generation
Masaru Isonuma, Ivan Titov
Risk-Sensitive Diffusion: Robustly Optimizing Diffusion Models with Noisy Samples
Yangming Li, Max Ruiz Luyten, Mihaela van der Schaar
Towards Automated Knowledge Integration From Human-Interpretable Representations
Katarzyna Kobalczyk, Mihaela van der Schaar
Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards
Xiaoyu Yang, Jie Lu, En Yu
LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
Feng Li, Renrui Zhang, Hao Zhang et al.
Root Cause Analysis of Anomalies in Multivariate Time Series through Granger Causal Discovery
Xiao Han, Saima Absar, Lu Zhang et al.
Bridging the Gap between Variational Inference and Stochastic Gradient MCMC in Function Space
Mengjing Wu, Junyu Xuan, Jie Lu
BatchPrompt: Accomplish more with less
Jianzhe Lin, Maurice Diesendruck, Liang Du et al.
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning
Hai Zhang, Boyuan Zheng, Tianying Ji et al.
Forward Learning of Graph Neural Networks
Namyong Park, Xing Wang, Antoine Simoulin et al.
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani, Matthew E Taylor
BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
Yu Feng, Ben Zhou, Weidong Lin et al.
Youku Dense Caption: A Large-scale Chinese Video Dense Caption Dataset and Benchmarks
Zixuan Xiong, Guangwei Xu, wenkai zhang et al.
Scaling Laws for Adversarial Attacks on Language Model Activations and Tokens
Stanislav Fort
Mentored Learning: Improving Generalization and Convergence of Student Learner
Xiaofeng Cao, Yaming Guo, Heng Tao Shen et al.
Large Language Models Often Say One Thing and Do Another
Ruoxi Xu, Hongyu Lin, Xianpei Han et al.
CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning
Fanxu Meng, Muhan Zhang
Enhancing Vision-Language Model with Unmasked Token Alignment
Hongsheng Li, Jihao Liu, Boxiao Liu et al.
Making Transformer Decoders Better Differentiable Indexers
Wuchao Li, Kai Zheng, Defu Lian et al.
ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding
Zhengzhuo Xu, Bowen Qu, Yiyan Qi et al.
The KoLMogorov Test: Compression by Code Generation
Ori Yoran, Kunhao Zheng, Fabian Gloeckle et al.
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
Iman Mirzadeh, Keivan Alizadeh-Vahid, Hooman Shahrokhi et al.
MaestroMotif: Skill Design from Artificial Intelligence Feedback
Martin Klissarov, Mikael Henaff, Roberta Raileanu et al.
Feedback Favors the Generalization of Neural ODEs
Jindou Jia, Zihan Yang, Meng Wang et al.
AutoUAD: Hyper-parameter Optimization for Unsupervised Anomaly Detection
Wei Dai, Jicong Fan
FlashMask: Efficient and Rich Mask Extension of FlashAttention
Guoxia Wang, Jinle Zeng, Xiyuan Xiao et al.
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
David Grangier, Simin Fan, Skyler Seto et al.
Disentangling 3D Animal Pose Dynamics with Scrubbed Conditional Latent Variables
Joshua Wu, Hari Koneru, James Ravenel et al.
Do as I do (Safely): Mitigating Task-Specific Fine-tuning Risks in Large Language Models
Francisco Eiras, Aleksandar Petrov, Philip Torr et al.
FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling
zhengqiang ZHANG, Ruihuang Li, Lei Zhang
Non-Equilibrium Dynamics of Hybrid Continuous-Discrete Ground-State Sampling
Timothee Leleu, Sam Reifenstein
Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion
Chaodong Xiao, Minghan Li, zhengqiang ZHANG et al.
TeaserGen: Generating Teasers for Long Documentaries
Weihan Xu, Paul Pu Liang, Haven Kim et al.
A Theory of Initialisation's Impact on Specialisation
Devon Jarvis, Sebastian Lee, Clementine Domine et al.
Searching for Optimal Solutions with LLMs via Bayesian Optimization
Dhruv Agarwal, Manoj Ghuhan Arivazhagan, Rajarshi Das et al.
Ranking-aware adapter for text-driven image ordering with CLIP
Wei-Hsiang Yu, Yen-Yu Lin, Ming-Hsuan Yang et al.
Exploiting Hankel-Toeplitz Structures for Fast Computation of Kernel Precision Matrices
Frida Viset, Frederiek Wesel, Arno Solin et al.
On Calibration of LLM-based Guard Models for Reliable Content Moderation
Hongfu Liu, Hengguan Huang, Xiangming Gu et al.
Rethinking Artistic Copyright Infringements In the Era Of Text-to-Image Generative Models
Mazda Moayeri, Sriram Balasubramanian, Samyadeep Basu et al.
How many samples are needed to train a deep neural network?
Pegah Golestaneh, Mahsa Taheri, Johannes Lederer
PolyNet: Learning Diverse Solution Strategies for Neural Combinatorial Optimization
André Hottung, Mridul Mahajan, Kevin Tierney
LASER: A Neuro-Symbolic Framework for Learning Spatio-Temporal Scene Graphs with Weak Supervision
Jiani Huang, Ziyang Li, Mayur Naik et al.
Backdooring Vision-Language Models with Out-Of-Distribution Data
Weimin Lyu, Michael Yao, Saumya Gupta et al.
POTEC: Off-Policy Contextual Bandits for Large Action Spaces via Policy Decomposition
Yuta Saito, Jihan Yao, Thorsten Joachims
MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs
Andreas Opedal, Haruki Shirakami, Bernhard Schölkopf et al.
Adversarial Attacks on Data Attribution
Xinhe Wang, Pingbang Hu, Junwei Deng et al.
Conformalized Survival Analysis for General Right-Censored Data
Hen Davidov, Shai Feldman, Gil Shamai et al.
CellPLM: Pre-training of Cell Language Model Beyond Single Cells
Hongzhi Wen, Wenzhuo Tang, Xinnan Dai et al.
A Sanity Check for AI-generated Image Detection
Shilin Yan, Ouxiang Li, Jiayin Cai et al.
Grounding Multimodal Large Language Model in GUI World
Weixian Lei, Difei Gao, Mike Zheng Shou
Holistically Evaluating the Environmental Impact of Creating Language Models
Jacob Morrison, Clara Na, Jared Fernandez et al.
GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation
Tao Feng, Yihang Sun, Jiaxuan You
Steering LLMs' Behavior with Concept Activation Vectors
Ruixuan HUANG, Shuai Wang
Learning vector fields of differential equations on manifolds with geometrically constrained operator-valued kernels
Daning Huang, Hanyang He, John Harlim et al.
QPM: Discrete Optimization for Globally Interpretable Image Classification
Thomas Norrenbrock, Timo Kaiser, Sovan Biswas et al.
Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos
Merey Ramazanova, Alejandro Pardo, Bernard Ghanem et al.
Mitigating Spurious Correlations in Zero-Shot Multimodal Models
Shenyu Lu, Junyi Chai, Xiaoqian Wang
MambaExtend: A Training-Free Approach to Improve Long Context Extension of Mamba
Seyedarmin Azizi, Souvik Kundu, Mohammad Sadeghi et al.
An Effective Manifold-based Optimization Method for Distributionally Robust Classification
Jiawei Huang, Hu Ding
Restating the Proof of Linear Convergence for Linear GNNs
Huayi Tang, Yuhe Guo, Yong Liu et al.
Adaptive Gradient Clipping for Robust Federated Learning
Youssef Allouah, Rachid Guerraoui, Nirupam Gupta et al.
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He, Can Chang, Huazhe Xu et al.
The Utility and Complexity of In- and Out-of-Distribution Machine Unlearning
Youssef Allouah, Joshua Kazdan, Rachid Guerraoui et al.
One-for-All Few-Shot Anomaly Detection via Instance-Induced Prompt Learning
Wenxi Lv, Qinliang Su, Wenchao Xu
Parameter Expanded Stochastic Gradient Markov Chain Monte Carlo
Hyunsu Kim, Giung Nam, Chulhee Yun et al.
TSC-Net: Prediction of Pedestrian Trajectories by Trajectory-Scene-Cell Classification
BO HU, Tat-Jen Cham
A Deep Generative Learning Approach for Two-stage Adaptive Robust Optimization
Aron Brenner, Rahman Khorramfar, Jennifer Sun et al.
UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation
Huimin LU, Masaru Isonuma, Junichiro Mori et al.
MCNC: Manifold-Constrained Reparameterization for Neural Compression
Chayne Thrash, Reed Andreas, Ali Abbasi et al.
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang, Ruoxue Liu, Jing Zhang et al.
TIGeR: Unifying Text-to-Image Generation and Retrieval with Large Multimodal Models
Leigang Qu, Haochuan Li, Tan Wang et al.
Neural Multi-Objective Combinatorial Optimization via Graph-Image Multimodal Fusion
Jinbiao Chen, Jiahai Wang, Zhiguang Cao et al.
Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats
Jiaxin Wen, Vivek Hebbar, Caleb Larson et al.
Interpretable Causal Representation Learning for Biological Data in the Pathway Space
Jesus de la Fuente Cedeño, Robert Lehmann, Carlos Ruiz-Arenas et al.
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
Jiyeon Kim, Hyunji Lee, Hyowon Cho et al.
Is uniform expressivity too restrictive? Towards efficient expressivity of GNNs
Sammy Khalife, Josué Tonelli-Cueto
$\sigma$-zero: Gradient-based Optimization of $\ell_0$-norm Adversarial Examples
Antonio Emanuele Cinà, Francesco Villani, Maura Pintor et al.
Understanding Model Calibration - A gentle introduction and visual exploration of calibration and the expected calibration error (ECE)
Maja Pavlovic
CREIMBO: Cross-Regional Ensemble Interactions in Multi-view Brain Observations
Noga Mudrik, Ryan Ly, Oliver Ruebel et al.
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent
Taiyi Wang, Zhihao Wu, Jianheng Liu et al.
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Yuchen Duan, Weiyun Wang, Zhe Chen et al.
Conditional Diffusion with Ordinal Regression: Longitudinal Data Generation for Neurodegenerative Disease Studies
Hyuna Cho, Ziquan Wei, Seungjoo Lee et al.
Aligning Language Models with Demonstrated Feedback
Omar Shaikh, Michelle Lam, Joey Hejna et al.
Contextual Document Embeddings
John X. Morris, Alexander Rush
Locality Sensitive Avatars From Video
Chunjin Song, Zhijie Wu, Shih-Yang Su et al.
R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
Zhenyu Zhang, Zechun Liu, Yuandong Tian et al.
Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control
Xianghui Ze, Zhenbo Song, Qiwei Wang et al.
SaMer: A Scenario-aware Multi-dimensional Evaluator for Large Language Models
Kehua Feng, Keyan Ding, Jing Yu et al.
Hessian Free Efficient Single Loop Iterative Differentiation Methods for Bi-Level Optimization Problems
Peiran Yu, Junyi Li, Heng Huang
Bootstrapping Language Models with DPO Implicit Rewards
Changyu Chen, Zichen Liu, Chao Du et al.
Meta-VBO: Utilizing Prior Tasks in Optimizing Risk Measures with Gaussian Processes
Quoc Phong Nguyen, Bryan Kian Hsiang Low, Patrick Jaillet
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq, Guangyuan Wang, Sami Islam et al.
Chemistry-Inspired Diffusion with Non-Differentiable Guidance
Yuchen Shen, Chenhao Zhang, Sijie Fu et al.
U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models
Song Mei
Linear Transformer Topological Masking with Graph Random Features
Isaac Reid, Kumar Dubey, Deepali Jain et al.
Temporal Generalization Estimation in Evolving Graphs
Bin Lu, Tingyan Ma, Xiaoying Gan et al.
Robustness of Quantum Algorithms for Nonconvex Optimization
Weiyuan Gong, Chenyi Zhang, Tongyang Li
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma, Zhongxiang Dai, Xiaoqiang Lin et al.
CAKE: Cascading and Adaptive KV Cache Eviction with Layer Preferences
Ziran Qin, Yuchen Cao, Mingbao Lin et al.
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim, Mineui Hong, Jeongho Park et al.
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar, Johan S Obando Ceron, Aaron Courville et al.
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Xiang Li, Cristina Mata, Jongwoo Park et al.
Active Learning for Neural PDE Solvers
Daniel Musekamp, Marimuthu Kalimuthu, David Holzmüller et al.
Large Language Models are Interpretable Learners
Ruochen Wang, Si Si, Felix Yu et al.
PICASO: Permutation-Invariant Context Composition with State Space Models
Tian Yu Liu, Alessandro Achille, Matthew Trager et al.
Quality Measures for Dynamic Graph Generative Models
Ryien Hosseini, Filippo Simini, Venkatram Vishwanath et al.
Scalable Mechanistic Neural Networks
Jiale Chen, Dingling Yao, Adeel Pervez et al.
On Bits and Bandits: Quantifying the Regret-Information Trade-off
Itai Shufaro, Nadav Merlis, Nir Weinberger et al.
Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning
Shijie Liu, Andrew Cullen, Paul Montague et al.
A Differentiable Rank-Based Objective for Better Feature Learning
Krunoslav Lehman Pavasovic, Giulio Biroli, Levent Sagun
Tractable Multi-Agent Reinforcement Learning through Behavioral Economics
Eric Mazumdar, Kishan Panaganti, Laixi Shi
Single-agent Poisoning Attacks Suffice to Ruin Multi-Agent Learning
Fan Yao, Yuwei Cheng, Ermin Wei et al.
Look Before You Leap: Universal Emergent Mechanism for Retrieval in Language Models
Alexandre Variengien, Eric Winsor
EgoSim: Egocentric Exploration in Virtual Worlds with Multi-modal Conditioning
Wei Yu, Songheng Yin, Steve Easterbrook et al.
Recovering Manifold Structure Using Ollivier Ricci Curvature
Tristan L. Saidi, Abigail Hickok, Andrew J Blumberg