Most Cited ICLR "image-based steering" Papers
6,124 papers found • Page 29 of 31
Conference
Investigating the Benefits of Projection Head for Representation Learning
Yihao Xue, Eric Gan, Jiayi Ni et al.
LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation
Suhyeon Lee, Won Jun Kim, Jinho Chang et al.
Continual Momentum Filtering on Parameter Space for Online Test-time Adaptation
Jae-Hong Lee, Joon-Hyuk Chang
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Zihan Ding, Chi Jin
FedP3: Federated Personalized and Privacy-friendly Network Pruning under Model Heterogeneity
Kai Yi, Nidham Gazagnadou, Peter Richtarik et al.
Revisit and Outstrip Entity Alignment: A Perspective of Generative Models
Lingbing Guo, Zhuo Chen, Jiaoyan Chen et al.
OMNI: Open-endedness via Models of human Notions of Interestingness
Jenny Zhang, Joel Lehman, Kenneth Stanley et al.
Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence
Sunghwan Hong, Seokju Cho, Seungryong Kim et al.
Causality-Inspired Spatial-Temporal Explanations for Dynamic Graph Neural Networks
Kesen Zhao, Liang Zhang
LLM Augmented LLMs: Expanding Capabilities through Composition
Rachit Bansal, Bidisha Samanta, Siddharth Dalmia et al.
Bayesian Optimization through Gaussian Cox Process Models for Spatio-temporal Data
Yongsheng Mei, Mahdi Imani, Tian Lan
TOSS: High-quality Text-guided Novel View Synthesis from a Single Image
Yukai Shi, Jianan Wang, He CAO et al.
Initializing Models with Larger Ones
Zhiqiu Xu, Yanjie Chen, Kirill Vishniakov et al.
Language-Interfaced Tabular Oversampling via Progressive Imputation and Self-Authentication
June Yong Yang, Geondo Park, Joowon Kim et al.
Counterfactual Density Estimation using Kernel Stein Discrepancies
Diego Martinez-Taboada, Edward Kennedy
Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
Qiang HE, Tianyi Zhou, Meng Fang et al.
PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts
Bang An, Sicheng Zhu, Michael-Andrei Panaitescu-Liess et al.
Removing Biases from Molecular Representations via Information Maximization
Chenyu Wang, Sharut Gupta, Caroline Uhler et al.
Online Continual Learning for Interactive Instruction Following Agents
Byeonghwi Kim, Minhyuk Seo, Jonghyun Choi
A Framework for Inference Inspired by Human Memory Mechanisms
Xiangyu Zeng, Jie Lin, Piao Hu et al.
PBADet: A One-Stage Anchor-Free Approach for Part-Body Association
Zhongpai Gao, Huayi Zhou, Abhishek Sharma et al.
Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors
Jonghyun Lee, Dahuin Jung, Saehyung Lee et al.
Accelerating Sinkhorn algorithm with sparse Newton iterations
Xun Tang, Michael Shavlovsky, Holakou Rahmanian et al.
Circuit Component Reuse Across Tasks in Transformer Language Models
Jack Merullo, Carsten Eickhoff, Ellie Pavlick
Learning to solve Class-Constrained Bin Packing Problems via Encoder-Decoder Model
Hanni Cheng, Ya Cong, Weihao Jiang et al.
EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models
Koichi Namekata, Amirmojtaba Sabour, Sanja Fidler et al.
RetroBridge: Modeling Retrosynthesis with Markov Bridges
Ilia Igashov, Arne Schneuing, Marwin Segler et al.
Improved Analysis of Sparse Linear Regression in Local Differential Privacy Model
Liyang Zhu, Meng Ding, Vaneet Aggarwal et al.
Visual Data-Type Understanding does not emerge from scaling Vision-Language Models
Vishaal Udandarao, Max F. Burg, Samuel Albanie et al.
Enhancing Tail Performance in Extreme Classifiers by Label Variance Reduction
Anirudh Buvanesh, Rahul Chand, Jatin Prakash et al.
Generative Pre-training for Speech with Flow Matching
Alexander Liu, Matthew Le, Apoorv Vyas et al.
Enhancing Neural Training via a Correlated Dynamics Model
Jonathan Brokman, Roy Betser, Rotem Turjeman et al.
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding
Yuanhao Xiong, Long Zhao, Boqing Gong et al.
Graph Metanetworks for Processing Diverse Neural Architectures
Derek Lim, Haggai Maron, Marc T Law et al.
A differentiable brain simulator bridging brain simulation and brain-inspired computing
Chaoming Wang, Tianqiu Zhang, Sichao He et al.
Neural Field Classifiers via Target Encoding and Classification Loss
Xindi Yang, Zeke Xie, Xiong Zhou et al.
Numerical Accounting in the Shuffle Model of Differential Privacy
Antti Koskela, Antti Honkela, Mikko Heikkilä
Language Model Decoding as Direct Metrics Optimization
Haozhe Ji, Pei Ke, Hongning Wang et al.
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
Rishabh Agarwal, Nino Vieillard, Yongchao Zhou et al.
Generative Learning for Solving Non-Convex Problem with Multi-Valued Input-Solution Mapping
Enming Liang, Minghua Chen
Overthinking the Truth: Understanding how Language Models Process False Demonstrations
Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt
The mechanistic basis of data dependence and abrupt learning in an in-context classification task
Gautam Reddy Nallamala
Universal Backdoor Attacks
Benjamin Schneider, Nils Lukas, Florian Kerschbaum
Inversion by Direct Iteration: An Alternative to Denoising Diffusion for Image Restoration
Peyman Milanfar, Mauricio Delbracio
INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection
Chao Chen, Kai Liu, Ze Chen et al.
GOAt: Explaining Graph Neural Networks via Graph Output Attribution
Shengyao Lu, Keith G Mills, Jiao He et al.
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
Francis Engelmann, Fabian Manhardt, Michael Niemeyer et al.
Efficiently Computing Similarities to Private Datasets
Arturs Backurs, Zinan Lin, Sepideh Mahabadi et al.
Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning
Zhuoyan Xu, Zhenmei Shi, Junyi Wei et al.
LEAP: Liberate Sparse-View 3D Modeling from Camera Poses
Hanwen Jiang, Zhenyu Jiang, Yue Zhao et al.
Skill or Luck? Return Decomposition via Advantage Functions
Hsiao-Ru Pan, Bernhard Schoelkopf
Unsupervised Order Learning
Seon-Ho Lee, Nyeong-Ho Shin, Chang-Su Kim
KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval
Marah I Abdin, Suriya Gunasekar, Varun Chandrasekaran et al.
Compressed Context Memory for Online Language Model Interaction
Jang-Hyun Kim, Junyoung Yeom, Sangdoo Yun et al.
MINDE: Mutual Information Neural Diffusion Estimation
Giulio Franzese, Mustapha BOUNOUA, Pietro Michiardi
Provably Efficient CVaR RL in Low-rank MDPs
Yulai Zhao, Wenhao Zhan, Xiaoyan Hu et al.
Rethinking Channel Dependence for Multivariate Time Series Forecasting: Learning from Leading Indicators
Lifan Zhao, Yanyan Shen
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
Yue Huang, Jiawen Shi, Yuan Li et al.
Towards Category Unification of 3D Single Object Tracking on Point Clouds
Jiahao Nie, Zhiwei He, Xudong Lv et al.
Self-Consuming Generative Models Go MAD
Sina Alemohammad, Josue Casco-Rodriguez, Lorenzo Luzi et al.
Out-of-Distribution Detection with Negative Prompts
Jun Nie, Yonggang Zhang, Zhen Fang et al.
STARC: A General Framework For Quantifying Differences Between Reward Functions
Joar Skalse, Lucy Farnik, Sumeet Motwani et al.
Attacking Perceptual Similarity Metrics
Abhijay Ghildyal, Feng Liu
A ROBUST DIFFERENTIAL NEURAL ODE OPTIMIZER
Panagiotis Theodoropoulos, Guan-Horng Liu, Tianrong Chen et al.
StructComp: Substituting propagation with Structural Compression in Training Graph Contrastive Learning
Shengzhong Zhang, Wenjie Yang, Xinyuan Cao et al.
TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning
Dongming Wu, Jiahao Chang, Fan Jia et al.
$\infty$-Diff: Infinite Resolution Diffusion with Subsampled Mollified States
Sam Bond-Taylor, Chris G Willcocks
Clifford Group Equivariant Simplicial Message Passing Networks
Cong Liu, David Ruhe, Floor Eijkelboom et al.
ED-NeRF: Efficient Text-Guided Editing of 3D Scene With Latent Space NeRF
Jangho Park, Gihyun Kwon, Jong Chul YE
AgentBench: Evaluating LLMs as Agents
Xiao Liu, Hao Yu, Hanchen Zhang et al.
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints
Jian Chen, Ruiyi Zhang, Yufan Zhou et al.
VQ-TR: Vector Quantized Attention for Time Series Forecasting
Kashif Rasul, Andrew Bennett, Pablo Vicente et al.
Aligning Relational Learning with Lipschitz Fairness
Yaning Jia, Chunhui Zhang, Soroush Vosoughi
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation
Jiaxiang Tang, Jiawei Ren, Hang Zhou et al.
Separate and Diffuse: Using a Pretrained Diffusion Model for Better Source Separation
Shahar Lutati, Eliya Nachmani, Lior Wolf
A Precise Characterization of SGD Stability Using Loss Surface Geometry
Gregory Dexter, Borja Ocejo, Sathiya Keerthi et al.
The Expressive Power of Transformers with Chain of Thought
William Merrill, Ashish Sabharwal
Posterior Sampling Based on Gradient Flows of the MMD with Negative Distance Kernel
Paul Hagemann, Johannes Hertrich, Fabian Altekrüger et al.
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models
Kyuyoung Kim, Jongheon Jeong, Minyong An et al.
Large Language Models as Automated Aligners for benchmarking Vision-Language Models
Yuanfeng Ji, Chongjian GE, Weikai Kong et al.
Hypergraph Dynamic System
Jielong Yan, Yifan Feng, Shihui Ying et al.
SPDER: Semiperiodic Damping-Enabled Object Representation
Kathan Shah, Chawin Sitawarin
MagicDrive: Street View Generation with Diverse 3D Geometry Control
Ruiyuan Gao, Kai Chen, Enze Xie et al.
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation
Kai Chen, Enze Xie, Zhe Chen et al.
DATS: Difficulty-Aware Task Sampler for Meta-Learning Physics-Informed Neural Networks
Maryam Toloubidokhti, Yubo Ye, Ryan Missel et al.
Fast Ensembling with Diffusion Schrödinger Bridge
Hyunsu Kim, Jongmin Yoon, Juho Lee
Neurosymbolic Grounding for Compositional World Models
Atharva Sehgal, Arya Grayeli, Jennifer Sun et al.
Making Pre-trained Language Models Great on Tabular Prediction
Jiahuan Yan, Bo Zheng, Hongxia Xu et al.
Feature-aligned N-BEATS with Sinkhorn divergence
Joonhun Lee, Myeongho Jeon, Myungjoo Kang et al.
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Mirco Mutti, Riccardo De Santi, Marcello Restelli et al.
More is Better: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
James Simon, Dhruva Karkada, Nikhil Ghosh et al.
End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon
Guillaume Bono, Leonid Antsfeld, Boris Chidlovskii et al.
Don't Judge by the Look: Towards Motion Coherent Video Representation
Yitian Zhang, Yue Bai, Huan Wang et al.
Submodular Reinforcement Learning
Manish Prajapat, Mojmir Mutny, Melanie Zeilinger et al.
Inherently Interpretable Time Series Classification via Multiple Instance Learning
Joseph Early, Gavin Cheung, Kurt Cutajar et al.
Towards Robust Multi-Modal Reasoning via Model Selection
Xiangyan Liu, Rongxue LI, Wei Ji et al.
The optimality of kernel classifiers in Sobolev space
Jianfa Lai, zhifan Li, Dongming Huang et al.
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
Tim Dettmers, Ruslan Svirschevski, Vage Egiazarian et al.
A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation
Zhengbo Wang, Jian Liang, Lijun Sheng et al.
Piecewise Linear Parametrization of Policies: Towards Interpretable Deep Reinforcement Learning
Maxime Wabartha, Joelle Pineau
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization
Yidong Wang, Zhuohao Yu, Wenjin Yao et al.
Designing Skill-Compatible AI: Methodologies and Frameworks in Chess
KARIM HAMADE, Reid McIlroy-Young, Siddhartha Sen et al.
Knowledge Distillation Based on Transformed Teacher Matching
Kaixiang Zheng, EN-HUI YANG
Learning Polynomial Problems with $SL(2, \mathbb{R})$-Equivariance
Hannah Lawrence, Mitchell Harris
PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural Networks
Zhiyuan Zhao, Xueying Ding, B. Aditya Prakash
NEFTune: Noisy Embeddings Improve Instruction Finetuning
Neel Jain, Ping-yeh Chiang, Yuxin Wen et al.
Entropy Coding of Unordered Data Structures
Julius Kunze, Daniel Severo, giulio zani et al.
A Semantic Invariant Robust Watermark for Large Language Models
Aiwei Liu, Leyi Pan, Xuming Hu et al.
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Yuhui Xu, Lingxi Xie, Xiaotao Gu et al.
Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response
Junfeng Long, ZiRui Wang, Quanyi Li et al.
An Analytical Solution to Gauss-Newton Loss for Direct Image Alignment
Sergei Solonets, Daniil Sinitsyn, Lukas Von Stumberg et al.
De novo Protein Design Using Geometric Vector Field Networks
weian mao, Muzhi Zhu, Zheng Sun et al.
Weakly-supervised Audio Separation via Bi-modal Semantic Similarity
Tanvir Mahmud, Saeed Amizadeh, Kazuhito Koishida et al.
DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization
Xiangxin Zhou, Xiwei Cheng, Yuwei Yang et al.
Toward Student-oriented Teacher Network Training for Knowledge Distillation
Chengyu Dong, Liyuan Liu, Jingbo Shang
Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula
Aryaman Reddi, Maximilian Tölle, Jan Peters et al.
BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs
Zifeng Wang, Zichen Wang, Balasubramaniam Srinivasan et al.
Doubly Robust Instance-Reweighted Adversarial Training
Daouda Sow, Sen Lin, Zhangyang Wang et al.
An Extensible Framework for Open Heterogeneous Collaborative Perception
Yifan Lu, Yue Hu, Yiqi Zhong et al.
Accurate and Scalable Estimation of Epistemic Uncertainty for Graph Neural Networks
Puja Trivedi, Mark Heimann, Rushil Anirudh et al.
Replay across Experiments: A Natural Extension of Off-Policy RL
Dhruva Tirumala, Thomas Lampe, Jose Enrique Chen et al.
Conditional Variational Diffusion Models
Gabriel della Maggiora, Luis A. Croquevielle, Nikita Deshpande et al.
Can LLM-Generated Misinformation Be Detected?
Canyu Chen, Kai Shu
Less is More: One-shot Subgraph Reasoning on Large-scale Knowledge Graphs
Zhanke Zhou, Yongqi Zhang, Jiangchao Yao et al.
On the generalization capacity of neural networks during generic multimodal reasoning
Takuya Ito, Soham Dan, Mattia Rigotti et al.
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation
Haotian Yan, Ming Wu, Chuang Zhang
3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining
Siming Yan, Yuqi Yang, Yu-Xiao Guo et al.
A Graph is Worth 1-bit Spikes: When Graph Contrastive Learning Meets Spiking Neural Networks
Jintang Li, Huizhe Zhang, Ruofan Wu et al.
dEBORA: Efficient Bilevel Optimization-based low-Rank Adaptation
Emanuele Zangrando, Sara Venturini, Francesco Rinaldi et al.
In Search of the Engram in LLMs: A Neuroscience Perspective on the Memory Functions in AI Models
Minsung Kim, Jea Kwon, Dong-Kyum Kim et al.
Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex
Jea Kwon, Sungjun Lim, Kyungwoo Song et al.
Multi-session, multi-task neural decoding from distinct cell-types and brain regions
Mehdi Azabou, Krystal Pan, Vinam Arora et al.
Rethinking Shapley Value for Negative Interactions in Non-convex Games
Wonjoon Chang, Myeongjin Lee, Jaesik Choi
Offline Model-Based Optimization by Learning to Rank
Rong-Xi Tan, Ke Xue, Shen-Huan Lyu et al.
RecDreamer: Consistent Text-to-3D Generation via Uniform Score Distillation
Chenxi Zheng, Yihong Lin, Bangzhen Liu et al.
CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking
Tarun Suresh, Revanth Gangi Reddy, Yifei Xu et al.
When do GFlowNets learn the right distribution?
Tiago Silva, Rodrigo Alves, Eliezer de Souza da Silva et al.
Comparing Targeting Strategies for Maximizing Social Welfare with Limited Resources
Vibhhu Sharma, Bryan Wilder
Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic
Ruochen Jin, Bojian Hou, Jiancong Xiao et al.
Systematic Relational Reasoning With Epistemic Graph Neural Networks
Irtaza Khalid, Steven Schockaert
Model-Agnostic Knowledge Guided Correction for Improved Neural Surrogate Rollout
Bharat Srikishan, Daniel O'Malley, Mohamed Mehana et al.
LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension
Amaia Cardiel, Eloi Zablocki, Elias Ramzi et al.
pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation
Shentong Mo, Xufang Luo, Dongsheng Li
Comparing noisy neural population dynamics using optimal transport distances
Amin Nejatbakhsh, Victor Geadah, Alex Williams et al.
Dobi-SVD: Differentiable SVD for LLM Compression and Some New Perspectives
Qinsi Wang, Jinghan Ke, Masayoshi Tomizuka et al.
ParFam -- (Neural Guided) Symbolic Regression via Continuous Global Optimization
Philipp Scholl, Katharina Bieker, Hillary Hauger et al.
Linear Representations of Political Perspective Emerge in Large Language Models
Junsol Kim, James Evans, Aaron Schein
Fugatto 1: Foundational Generative Audio Transformer Opus 1
Rafael Valle, Rohan Badlani, Zhifeng Kong et al.
GRAIN: Exact Graph Reconstruction from Gradients
Maria Drencheva, Ivo Petrov, Maximilian Baader et al.
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models
Shuhong Zheng, Zhipeng Bao, Ruoyu Zhao et al.
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Sreyan Ghosh, Sonal Kumar, Zhifeng Kong et al.
PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation
Sang-Hoon Lee, Ha-Yeong Choi, Seong-Whan Lee
On the Inherent Privacy Properties of Discrete Denoising Diffusion Models
Eli Chien, Pan Li, Vamsi Potluru et al.
Object centric architectures enable efficient causal representation learning
Amin Mansouri, Jason Hartford, Yan Zhang et al.
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
Xiangyu Zeng, Kunchang Li, Chenting Wang et al.
Finding and Only Finding Differential Nash Equilibria by Both Pretending to be a Follower
Guodong Zhang, Xuchan Bao
Improving the Sparse Structure Learning of Spiking Neural Networks from the View of Compression Efficiency
Jiangrong Shen, Qi Xu, Gang Pan et al.
A Theory for Token-Level Harmonization in Retrieval-Augmented Generation
Shicheng Xu, Liang Pang, Huawei Shen et al.
Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions
Jinyoung Choi, Junoh Kang, Bohyung Han
Peeking Behind Closed Doors: Risks of LLM Evaluation by Private Data Curators
Pratyush Maini, Hritik Bansal
SoftCVI: Contrastive variational inference with self-generated soft labels
Daniel Ward, Mark Beaumont, Matteo Fasiolo
Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Gate
Byung Hyun Lee, Sungjin Lim, Seunggyu Lee et al.
Reconstruction-Guided Policy: Enhancing Decision-Making through Agent-Wise State Consistency
Qifan Liang, Yixiang Shan, Haipeng Liu et al.
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs
Jonas Hübotter, Sascha Bongni, Ido Hakimi et al.
Reasoning Elicitation in Language Models via Counterfactual Feedback
Alihan Hüyük, Xinnuo Xu, Jacqueline Maasch et al.
TSVD: Bridging Theory and Practice in Continual Learning with Pre-trained Models
Liangzu Peng, Juan Elenter, Joshua Agterberg et al.
Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Models
Jinxu Lin, Linwei Tao, Minjing Dong et al.
Fundamental Limitations on Subquadratic Alternatives to Transformers
Josh Alman, Hantao Yu
Three-in-One: Fast and Accurate Transducer for Hybrid-Autoregressive ASR
Hainan Xu, Travis Bartley, Vladimir Bataev et al.
Improving Instruction-Following in Language Models through Activation Steering
Alessandro Stolfo, Vidhisha Balachandran, Safoora Yousefi et al.
Unearthing Skill-level Insights for Understanding Trade-offs of Foundation Models
Mazda Moayeri, Vidhisha Balachandran, Varun Chandrasekaran et al.
Intelligence at the Edge of Chaos
Shiyang Zhang, Aakash Patel, Syed Rizvi et al.
Multimodal Situational Safety
Kaiwen Zhou, Chengzhi Liu, Xuandong Zhao et al.
Analysing The Spectral Biases in Generative Models
Amitoj Miglani, Shweta Singh, Vidit Aggarwal
InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma
Xiaoxuan Hou, Jiayi Yuan, Joel Z Leibo et al.
Provence: efficient and robust context pruning for retrieval-augmented generation
Nadezhda Chirkova, Thibault Formal, Vassilina Nikoulina et al.
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
Jiannan Huang, Jun Hao Liew, Hanshu Yan et al.
SOREL: A Stochastic Algorithm for Spectral Risks Minimization
Yuze Ge, Rujun Jiang
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning
Moritz Reuss, Jyothish Pari, Pulkit Agrawal et al.
Conformal Structured Prediction
Botong Zhang, Shuo Li, Osbert Bastani
Enhancing Language Model Agents using Diversity of Thoughts
Vijay Chandra Lingam, Behrooz Tehrani, sujay sanghavi et al.
ThinK: Thinner Key Cache by Query-Driven Pruning
Yuhui Xu, Zhanming Jie, Hanze Dong et al.
Multiplicative Logit Adjustment Approximates Neural-Collapse-Aware Decision Boundary Adjustment
Naoya Hasegawa, Issei Sato
Adaptive Camera Sensor for Vision Models
Eunsu Baek, Sung-hwan Han, Taesik Gong et al.
Topological data analysis on noisy quantum computers
Ismail Akhalwaya, Shashanka Ubaru, Kenneth Clarkson et al.
VCR-Graphormer: A Mini-batch Graph Transformer via Virtual Connections
Dongqi Fu, Zhigang Hua, Yan Xie et al.
A new framework for evaluating model out-of-distribution generalisation for the biochemical domain
Raul Fernandez-Diaz, Hoang Thanh Lam, Vanessa Lopez et al.
Size-Generalizable RNA Structure Evaluation by Exploring Hierarchical Geometries
Zongzhao Li, Jiacheng Cen, Wenbing Huang et al.
Semialgebraic Neural Networks: From roots to representations
S David Mis, Matti Lassas, Maarten V de Hoop
Learning General-purpose Biomedical Volume Representations using Randomized Synthesis
Neel Dey, Benjamin Billot, Hallee Wong et al.
Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning
Nan Jiang, Chengxiao Wang, Kevin Liu et al.
Multi-Field Adaptive Retrieval
Millicent Li, Tongfei Chen, Ben Van Durme et al.
PIORF: Physics-Informed Ollivier-Ricci Flow for Long–Range Interactions in Mesh Graph Neural Networks
Youn-Yeol Yu, Jeongwhan Choi, Jaehyeon Park et al.
ReAttention: Training-Free Infinite Context with Finite Attention Scope
Xiaoran Liu, Ruixiao Li, Zhigeng Liu et al.
KLay: Accelerating Arithmetic Circuits for Neurosymbolic AI
Jaron Maene, Vincent Derkinderen, Pedro Zuidberg Dos Martires
Strategist: Self-improvement of LLM Decision Making via Bi-Level Tree Search
Jonathan Light, Min Cai, Weiqin Chen et al.
DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References
Xueyi Liu, Jianibieke Adalibieke, Qianwei Han et al.
ContextGNN: Beyond Two-Tower Recommendation Systems
Yiwen Yuan, Zecheng Zhang, Xinwei He et al.
Private Mechanism Design via Quantile Estimation
Yuanyuan Yang, Tao Xiao, Bhuvesh Kumar et al.
Revealing and Mitigating Over-Attention in Knowledge Editing
Pinzheng Wang, Zecheng Tang, Keyan Zhou et al.