Most Cited ICLR "autoregressive inference" Papers

6,124 papers found • Page 11 of 31

#2001

Air Quality Prediction with Physics-Guided Dual Neural ODEs in Open Systems

jindong tian, Yuxuan Liang, Ronghui Xu et al.

ICLR 2025oralarXiv:2410.19892
18
citations
#2002

Minimum width for universal approximation using ReLU networks on compact domain

Namjun Kim, Chanho Min, Sejun Park

ICLR 2024arXiv:2309.10402
18
citations
#2003

Locality Sensitive Sparse Encoding for Learning World Models Online

Zichen Liu, Chao Du, Wee Sun Lee et al.

ICLR 2024arXiv:2401.13034
18
citations
#2004

Adversarial AutoMixup

Huafeng Qin, Xin Jin, Yun Jiang et al.

ICLR 2024spotlightarXiv:2312.11954
18
citations
#2005

Benchmarking Algorithms for Federated Domain Generalization

Ruqi Bai, Saurabh Bagchi, David Inouye

ICLR 2024spotlightarXiv:2307.04942
18
citations
#2006

Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood

yaxuan zhu, Jianwen Xie, Yingnian Wu et al.

ICLR 2024spotlightarXiv:2309.05153
18
citations
#2007

SemiReward: A General Reward Model for Semi-supervised Learning

Siyuan Li, Weiyang Jin, Zedong Wang et al.

ICLR 2024arXiv:2310.03013
18
citations
#2008

DarkBench: Benchmarking Dark Patterns in Large Language Models

Esben Kran, Hieu Minh Nguyen, Akash Kundu et al.

ICLR 2025arXiv:2503.10728
18
citations
#2009

Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits

Qiwei Di, Tao Jin, Yue Wu et al.

ICLR 2024arXiv:2310.00968
18
citations
#2010

SLMRec: Distilling Large Language Models into Small for Sequential Recommendation

Wujiang Xu, Qitian Wu, Zujie Liang et al.

ICLR 2025oralarXiv:2405.17890
18
citations
#2011

Generalization through variance: how noise shapes inductive biases in diffusion models

John Vastola

ICLR 2025arXiv:2504.12532
18
citations
#2012

Active Learning for Neural PDE Solvers

Daniel Musekamp, Marimuthu Kalimuthu, David Holzmüller et al.

ICLR 2025arXiv:2408.01536
18
citations
#2013

Aioli: A Unified Optimization Framework for Language Model Data Mixing

Mayee Chen, Michael Hu, Nicholas Lourie et al.

ICLR 2025arXiv:2411.05735
18
citations
#2014

Conformal Prediction via Regression-as-Classification

Etash Guha, Shlok Natarajan, Thomas Möllenhoff et al.

ICLR 2024arXiv:2404.08168
18
citations
#2015

Adaptive Sharpness-Aware Pruning for Robust Sparse Networks

Anna Bair, Hongxu Yin, Maying Shen et al.

ICLR 2024arXiv:2306.14306
18
citations
#2016

GS-CPR: Efficient Camera Pose Refinement via 3D Gaussian Splatting

Changkun Liu, Shuai Chen, Yash Bhalgat et al.

ICLR 2025arXiv:2408.11085
18
citations
#2017

Palu: KV-Cache Compression with Low-Rank Projection

Chi-Chih Chang, Wei-Cheng Lin, Chien-Yu Lin et al.

ICLR 2025
18
citations
#2018

Contextual Document Embeddings

John X. Morris, Alexander Rush

ICLR 2025arXiv:2410.02525
18
citations
#2019

De novo Protein Design Using Geometric Vector Field Networks

weian mao, Muzhi Zhu, Zheng Sun et al.

ICLR 2024spotlightarXiv:2310.11802
18
citations
#2020

Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequences

Niklas Schmidinger, Lisa Schneckenreiter, Philipp Seidl et al.

ICLR 2025arXiv:2411.04165
18
citations
#2021

Towards Faithful XAI Evaluation via Generalization-Limited Backdoor Watermark

Mengxi Ya, Yiming Li, Tao Dai et al.

ICLR 2024
18
citations
#2022

Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling

Yuxuan YAO, Han Wu, Mingyang LIU et al.

ICLR 2025arXiv:2410.03777
18
citations
#2023

Understanding and Enhancing the Transferability of Jailbreaking Attacks

Runqi Lin, Bo Han, Fengwang Li et al.

ICLR 2025arXiv:2502.03052
18
citations
#2024

VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning

Yongshuo Zong, Ondrej Bohdal, Timothy Hospedales

ICLR 2025arXiv:2403.13164
18
citations
#2025

Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory

Nikola Zubic, Federico Soldà, Aurelio Sulser et al.

ICLR 2025arXiv:2405.16674
18
citations
#2026

MiniPLM: Knowledge Distillation for Pre-training Language Models

Yuxian Gu, Hao Zhou, Fandong Meng et al.

ICLR 2025arXiv:2410.17215
18
citations
#2027

Improving Reasoning Performance in Large Language Models via Representation Engineering

Bertram Højer, Oliver Jarvis, Stefan Heinrich

ICLR 2025arXiv:2504.19483
18
citations
#2028

Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency

Jerry Yao-Chieh Hu, Wei-Po Wang, Ammar Gilani et al.

ICLR 2025arXiv:2411.16525
18
citations
#2029

Scalable Influence and Fact Tracing for Large Language Model Pretraining

Tyler Chang, Dheeraj Rajagopal, Tolga Bolukbasi et al.

ICLR 2025arXiv:2410.17413
18
citations
#2030

Label-Agnostic Forgetting: A Supervision-Free Unlearning in Deep Models

Shaofei Shen, Chenhao Zhang, Yawen Zhao et al.

ICLR 2024arXiv:2404.00506
18
citations
#2031

Efficient Multi-agent Reinforcement Learning by Planning

Qihan Liu, Jianing Ye, Xiaoteng Ma et al.

ICLR 2024arXiv:2405.11778
18
citations
#2032

Perm: A Parametric Representation for Multi-Style 3D Hair Modeling

Chengan He, Xin Sun, Zhixin Shu et al.

ICLR 2025arXiv:2407.19451
18
citations
#2033

CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

Junyan Li, Delin Chen, Yining Hong et al.

ICLR 2024arXiv:2311.03354
18
citations
#2034

Discretization-invariance? On the Discretization Mismatch Errors in Neural Operators

Wenhan Gao, Ruichen Xu, Yuefan Deng et al.

ICLR 2025
18
citations
#2035

Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression

Runtian Zhai, Bingbin Liu, Andrej Risteski et al.

ICLR 2024spotlightarXiv:2306.00788
18
citations
#2036

MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation

Akio Hayakawa, Masato Ishii, Takashi Shibuya et al.

ICLR 2025arXiv:2405.17842
18
citations
#2037

Boosting Neural Combinatorial Optimization for Large-Scale Vehicle Routing Problems

Fu Luo, Xi Lin, Yaoxin Wu et al.

ICLR 2025
18
citations
#2038

Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding

Yanming Liu, Xinyue Peng, Jiannan Cao et al.

ICLR 2025arXiv:2410.01671
18
citations
#2039

Tracing Representation Progression: Analyzing and Enhancing Layer-Wise Similarity

Jiachen Jiang, Jinxin Zhou, Zhihui Zhu

ICLR 2025arXiv:2406.14479
18
citations
#2040

End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon

Guillaume Bono, Leonid Antsfeld, Boris Chidlovskii et al.

ICLR 2024arXiv:2309.16634
18
citations
#2041

Energy-based Automated Model Evaluation

Ru Peng, Heming Zou, Haobo Wang et al.

ICLR 2024arXiv:2401.12689
18
citations
#2042

Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning

Haozhe Ma, Zhengding Luo, Thanh Vinh Vo et al.

ICLR 2025arXiv:2408.03029
18
citations
#2043

No Preference Left Behind: Group Distributional Preference Optimization

Binwei Yao, Zefan Cai, Yun-Shiuan Chuang et al.

ICLR 2025arXiv:2412.20299
18
citations
#2044

MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection

Xi Jiang, Jian Li, Hanqiu Deng et al.

ICLR 2025arXiv:2410.09453
18
citations
#2045

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

Xiyao Wang, Ruijie Zheng, Yanchao Sun et al.

ICLR 2024arXiv:2310.07220
18
citations
#2046

VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking

Runyi Hu, Jie Zhang, Yiming Li et al.

ICLR 2025oralarXiv:2501.14195
18
citations
#2047

From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency

Kaiyue Wen, Huaqing Zhang, Hongzhou Lin et al.

ICLR 2025arXiv:2410.05459
17
citations
#2048

GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation

Hongyin Zhang, Pengxiang Ding, Shangke Lyu et al.

ICLR 2025arXiv:2502.09268
17
citations
#2049

Learning Graph Quantized Tokenizers

Limei Wang, Kaveh Hassani, Si Zhang et al.

ICLR 2025arXiv:2410.13798
17
citations
#2050

CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion

Shoubin Yu, Jaehong Yoon, Mohit Bansal

ICLR 2025arXiv:2402.05889
17
citations
#2051

Spiking Vision Transformer with Saccadic Attention

Shuai Wang, Malu Zhang, Dehao Zhang et al.

ICLR 2025oralarXiv:2502.12677
17
citations
#2052

An Engorgio Prompt Makes Large Language Model Babble on

Jianshuo Dong, Ziyuan Zhang, Qingjie Zhang et al.

ICLR 2025arXiv:2412.19394
17
citations
#2053

Defining and extracting generalizable interaction primitives from DNNs

Lu Chen, Siyu Lou, Benhao Huang et al.

ICLR 2024arXiv:2401.16318
17
citations
#2054

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

Kairong Luo, Haodong Wen, Shengding Hu et al.

ICLR 2025arXiv:2503.12811
17
citations
#2055

Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures

Junxuan Wang, Xuyang Ge, Wentao Shu et al.

ICLR 2025arXiv:2410.06672
17
citations
#2056

OmniKV: Dynamic Context Selection for Efficient Long-Context LLMs

Jitai Hao, Yuke Zhu, Tian Wang et al.

ICLR 2025
17
citations
#2057

Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model

Karsten Roth, Lukas Thede, A. Sophia Koepke et al.

ICLR 2024spotlightarXiv:2310.17653
17
citations
#2058

Locality-aware Gaussian Compression for Fast and High-quality Rendering

Seungjoo Shin, Jaesik Park, Sunghyun Cho

ICLR 2025arXiv:2501.05757
17
citations
#2059

Unleashing the Potential of Fractional Calculus in Graph Neural Networks with FROND

Qiyu Kang, Kai Zhao, Qinxu Ding et al.

ICLR 2024spotlightarXiv:2404.17099
17
citations
#2060

CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph

Haitao Lin, Guojiang Zhao, Odin Zhang et al.

ICLR 2025arXiv:2406.10840
17
citations
#2061

SPADE: Semi-supervised Anomaly Detection under Distribution Mismatch

Chun-Liang Li, Tomas Pfister, Kihyuk Sohn et al.

ICLR 2024arXiv:2212.00173
17
citations
#2062

Graph Sparsification via Mixture of Graphs

Guibin Zhang, Xiangguo SUN, Yanwei Yue et al.

ICLR 2025arXiv:2405.14260
17
citations
#2063

What Makes a Good Prune? Maximal Unstructured Pruning for Maximal Cosine Similarity

Gabryel Mason-Williams, Fredrik Dahlqvist

ICLR 2024
17
citations
#2064

Non-Exchangeable Conformal Risk Control

António Farinhas, Chrysoula Zerva, Dennis Ulmer et al.

ICLR 2024arXiv:2310.01262
17
citations
#2065

DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models

Sohyun An, Hayeon Lee, Jaehyeong Jo et al.

ICLR 2024arXiv:2305.16943
17
citations
#2066

Towards Enhancing Time Series Contrastive Learning: A Dynamic Bad Pair Mining Approach

Xiang Lan, Hanshu Yan, Shenda Hong et al.

ICLR 2024arXiv:2302.03357
17
citations
#2067

MetaMetrics: Calibrating Metrics for Generation Tasks Using Human Preferences

Genta Winata, David Anugraha, Lucky Susanto et al.

ICLR 2025arXiv:2410.02381
17
citations
#2068

Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention

Weitai Kang, Mengxue Qu, Jyoti Kini et al.

ICLR 2025arXiv:2405.18295
17
citations
#2069

Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation

Kai Huang, Hanyun Yin, Heng Huang et al.

ICLR 2024arXiv:2309.13192
17
citations
#2070

Simulating Human-like Daily Activities with Desire-driven Autonomy

Yiding Wang, Yuxuan Chen, Fangwei Zhong et al.

ICLR 2025oralarXiv:2412.06435
17
citations
#2071

Scalable Discrete Diffusion Samplers: Combinatorial Optimization and Statistical Physics

Sebastian Sanokowski, Wilhelm Berghammer, Haoyu Wang et al.

ICLR 2025arXiv:2502.08696
17
citations
#2072

Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold

Jun Chen, Haishan Ye, Mengmeng Wang et al.

ICLR 2024arXiv:2308.10547
17
citations
#2073

RefactorBench: Evaluating Stateful Reasoning in Language Agents Through Code

Dhruv Gautam, Spandan Garg, Jinu Jang et al.

ICLR 2025arXiv:2503.07832
17
citations
#2074

Can We Talk Models Into Seeing the World Differently?

Paul Gavrikov, Jovita Lukasik, Steffen Jung et al.

ICLR 2025arXiv:2403.09193
17
citations
#2075

Sliced Denoising: A Physics-Informed Molecular Pre-Training Method

yuyan ni, Shikun Feng, Wei-Ying Ma et al.

ICLR 2024arXiv:2311.02124
17
citations
#2076

Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark

Yili Wang, Yixin Liu, Xu Shen et al.

ICLR 2025arXiv:2406.15523
17
citations
#2077

Efficient Learning with Sine-Activated Low-Rank Matrices

Yiping Ji, Hemanth Saratchandran, Cameron Gordon et al.

ICLR 2025arXiv:2403.19243
17
citations
#2078

Blending Imitation and Reinforcement Learning for Robust Policy Improvement

Xuefeng Liu, Takuma Yoneda, Rick Stevens et al.

ICLR 2024spotlightarXiv:2310.01737
17
citations
#2079

FedCompass: Efficient Cross-Silo Federated Learning on Heterogeneous Client Devices Using a Computing Power-Aware Scheduler

Zilinghan Li, Pranshu Chaturvedi, Shilan He et al.

ICLR 2024arXiv:2309.14675
17
citations
#2080

Self-Supervised Contrastive Learning for Long-term Forecasting

Junwoo Park, Daehoon Gwak, Jaegul Choo et al.

ICLR 2024arXiv:2402.02023
17
citations
#2081

Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion

Dexuan Ding, Lei Wang, Liyun Zhu et al.

ICLR 2025arXiv:2410.01506
17
citations
#2082

u-$\mu$P: The Unit-Scaled Maximal Update Parametrization

Charles Blake, Constantin Eichenberg, Josef Dean et al.

ICLR 2025
17
citations
#2083

Cross-Entropy Is All You Need To Invert the Data Generating Process

Patrik Reizinger, Alice Bizeul, Attila Juhos et al.

ICLR 2025arXiv:2410.21869
17
citations
#2084

Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for Pruning Recurrent SNN

Biswadeep Chakraborty, Beomseok Kang, Harshit Kumar et al.

ICLR 2024arXiv:2403.03409
17
citations
#2085

PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection

Botao Ren, Xue Yang, Yi Yu et al.

ICLR 2025arXiv:2410.08210
17
citations
#2086

Prompting Fairness: Integrating Causality to Debias Large Language Models

Jingling Li, Zeyu Tang, Xiaoyu Liu et al.

ICLR 2025arXiv:2403.08743
17
citations
#2087

Predictive, scalable and interpretable knowledge tracing on structured domains

Hanqi Zhou, Robert Bamler, Charley Wu et al.

ICLR 2024spotlightarXiv:2403.13179
17
citations
#2088

Leave-one-out Distinguishability in Machine Learning

Jiayuan Ye, Anastasia Borovykh, Soufiane Hayou et al.

ICLR 2024arXiv:2309.17310
17
citations
#2089

Permute-and-Flip: An optimally stable and watermarkable decoder for LLMs

Xuandong Zhao, Lei Li, Yu-Xiang Wang

ICLR 2025arXiv:2402.05864
17
citations
#2090

LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units

Zeyu Liu, Gourav Datta, Anni Li et al.

ICLR 2024arXiv:2402.04882
17
citations
#2091

Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning

Qinghao Ye, Xianhan Zeng, Fu Li et al.

ICLR 2025arXiv:2503.07906
17
citations
#2092

Tighter Privacy Auditing of DP-SGD in the Hidden State Threat Model

Tudor Cebere, Aurélien Bellet, Nicolas Papernot

ICLR 2025arXiv:2405.14457
17
citations
#2093

Optimal Transport for Time Series Imputation

Hao Wang, zhengnan li, Haoxuan Li et al.

ICLR 2025oral
17
citations
#2094

Towards Understanding Factual Knowledge of Large Language Models

Xuming Hu, Junzhe Chen, Xiaochuan Li et al.

ICLR 2024oral
17
citations
#2095

Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs

Sreyan Ghosh, Chandra Kiran Evuru, Sonal Kumar et al.

ICLR 2025arXiv:2405.15683
17
citations
#2096

DeLLMa: Decision Making Under Uncertainty with Large Language Models

Ollie Liu, Deqing Fu, Dani Yogatama et al.

ICLR 2025arXiv:2402.02392
17
citations
#2097

U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models

Song Mei

ICLR 2025arXiv:2404.18444
17
citations
#2098

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping

Yue Yang, Shuibo Zhang, Kaipeng Zhang et al.

ICLR 2025arXiv:2410.08695
17
citations
#2099

NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks

Wenxi Wang, Yang Hu, Mohit Tiwari et al.

ICLR 2024arXiv:2110.14053
17
citations
#2100

Adaptive Methods through the Lens of SDEs: Theoretical Insights on the Role of Noise

Enea Monzio Compagnoni, Tianlin Liu, Rustem Islamov et al.

ICLR 2025arXiv:2411.15958
17
citations
#2101

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Yuzhe Gu, Wenwei Zhang, Chengqi Lyu et al.

ICLR 2025arXiv:2503.02846
17
citations
#2102

Learning Interpretable Hierarchical Dynamical Systems Models from Time Series Data

Manuel Brenner, Elias Weber, Georgia Koppe et al.

ICLR 2025arXiv:2410.04814
17
citations
#2103

What Matters in Learning from Large-Scale Datasets for Robot Manipulation

Vaibhav Saxena, Matthew Bronars, Nadun Ranawaka Arachchige et al.

ICLR 2025arXiv:2506.13536
17
citations
#2104

TULIP: Token-length Upgraded CLIP

Ivona Najdenkoska, Mohammad Mahdi Derakhshani, Yuki Asano et al.

ICLR 2025arXiv:2410.10034
17
citations
#2105

AutoBencher: Towards Declarative Benchmark Construction

XIANG LI, Farzaan Kaiyom, Evan Liu et al.

ICLR 2025arXiv:2407.08351
17
citations
#2106

Quamba: A Post-Training Quantization Recipe for Selective State Space Models

Hung-Yueh Chiang, Chi-Chih Chang, Natalia Frumkin et al.

ICLR 2025arXiv:2410.13229
17
citations
#2107

ThinkBot: Embodied Instruction Following with Thought Chain Reasoning

Guanxing Lu, Ziwei Wang, Changliu Liu et al.

ICLR 2025arXiv:2312.07062
17
citations
#2108

From Risk to Uncertainty: Generating Predictive Uncertainty Measures via Bayesian Estimation

Nikita Kotelevskii, Vladimir Kondratyev, Martin Takáč et al.

ICLR 2025arXiv:2402.10727
17
citations
#2109

A Probabilistic Perspective on Unlearning and Alignment for Large Language Models

Yan Scholten, Stephan Günnemann, Leo Schwinn

ICLR 2025arXiv:2410.03523
17
citations
#2110

EControl: Fast Distributed Optimization with Compression and Error Control

Yuan Gao, Rustem Islamov, Sebastian Stich

ICLR 2024arXiv:2311.05645
17
citations
#2111

Closed-Form Merging of Parameter-Efficient Modules for Federated Continual Learning

Riccardo Salami, Pietro Buzzega, Matteo Mosconi et al.

ICLR 2025arXiv:2410.17961
17
citations
#2112

SparseFormer: Sparse Visual Recognition via Limited Latent Tokens

Ziteng Gao, Zhan Tong, Limin Wang et al.

ICLR 2024arXiv:2304.03768
17
citations
#2113

Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality

Xuxi Chen, Yu Yang, Zhangyang Wang et al.

ICLR 2024arXiv:2310.06982
17
citations
#2114

CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis

Xiaoxiao Sun, Xingjian Leng, Zijian Wang et al.

ICLR 2024arXiv:2310.04414
17
citations
#2115

Image Inpainting via Iteratively Decoupled Probabilistic Modeling

Wenbo Li, Xin Yu, Kun Zhou et al.

ICLR 2024spotlightarXiv:2212.02963
17
citations
#2116

Controllable Context Sensitivity and the Knob Behind It

Julian Minder, Kevin Du, Niklas Stoehr et al.

ICLR 2025arXiv:2411.07404
17
citations
#2117

Training Neural Networks as Recognizers of Formal Languages

Alexandra Butoi, Ghazal Khalighinejad, Anej Svete et al.

ICLR 2025arXiv:2411.07107
17
citations
#2118

Learning Efficient Positional Encodings with Graph Neural Networks

Charilaos Kanatsoulis, Evelyn Choi, Stefanie Jegelka et al.

ICLR 2025arXiv:2502.01122
17
citations
#2119

CAMIL: Context-Aware Multiple Instance Learning for Cancer Detection and Subtyping in Whole Slide Images

olga fourkioti, Matt De Vries, Chris Bakal

ICLR 2024spotlightarXiv:2305.05314
17
citations
#2120

Learning Clustering-based Prototypes for Compositional Zero-Shot Learning

Hongyu Qu, Jianan Wei, Xiangbo Shu et al.

ICLR 2025arXiv:2502.06501
17
citations
#2121

A CLIP-Powered Framework for Robust and Generalizable Data Selection

Suorong Yang, Peng Ye, Wanli Ouyang et al.

ICLR 2025arXiv:2410.11215
17
citations
#2122

R-MAE: Regions Meet Masked Autoencoders

Duy-Kien Nguyen, Yanghao Li, Vaibhav Aggarwal et al.

ICLR 2024arXiv:2306.05411
17
citations
#2123

Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity

Shuo Xie, Mohamad Amin Mohamadi, Zhiyuan Li

ICLR 2025arXiv:2410.08198
17
citations
#2124

Swift4D: Adaptive divide-and-conquer Gaussian Splatting for compact and efficient reconstruction of dynamic scene

Jiahao Wu, Rui Peng, Zhiyan Wang et al.

ICLR 2025
17
citations
#2125

Identifying Representations for Intervention Extrapolation

Sorawit (James) Saengkyongam, Elan Rosenfeld, Pradeep K Ravikumar et al.

ICLR 2024arXiv:2310.04295
17
citations
#2126

Generative Flows on Synthetic Pathway for Drug Design

Seonghwan Seo, Minsu Kim, Tony Shen et al.

ICLR 2025arXiv:2410.04542
17
citations
#2127

Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding

Tatsunori Taniai, Ryo Igarashi, Yuta Suzuki et al.

ICLR 2024arXiv:2403.11686
17
citations
#2128

Improving Domain Generalization with Domain Relations

Huaxiu Yao, Xinyu Yang, Xinyi Pan et al.

ICLR 2024spotlightarXiv:2302.02609
17
citations
#2129

When should we prefer Decision Transformers for Offline Reinforcement Learning?

Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard et al.

ICLR 2024arXiv:2305.14550
17
citations
#2130

Generating Images with 3D Annotations Using Diffusion Models

Wufei Ma, Qihao Liu, Jiahao Wang et al.

ICLR 2024spotlightarXiv:2306.08103
17
citations
#2131

CyberHost: A One-stage Diffusion Framework for Audio-driven Talking Body Generation

Gaojie Lin, Jianwen Jiang, Chao Liang et al.

ICLR 2025
17
citations
#2132

Block-Attention for Efficient Prefilling

Dongyang Ma, Yan Wang, Tian Lan

ICLR 2025arXiv:2409.15355
17
citations
#2133

Self-supervised Representation Learning from Random Data Projectors

Yi Sui, Tongzi Wu, Jesse Cresswell et al.

ICLR 2024arXiv:2310.07756
17
citations
#2134

CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects

Yoonyoung Cho, Junhyek Han, Yoontae Cho et al.

ICLR 2024arXiv:2403.10760
16
citations
#2135

DiffEnc: Variational Diffusion with a Learned Encoder

Beatrix M. G. Nielsen, Anders Christensen, Andrea Dittadi et al.

ICLR 2024arXiv:2310.19789
16
citations
#2136

Endless Jailbreaks with Bijection Learning

Brian R.Y. Huang, Max Li, Leonard Tang

ICLR 2025arXiv:2410.01294
16
citations
#2137

Multimarginal Generative Modeling with Stochastic Interpolants

Michael Albergo, Nicholas Boffi, Michael Lindsey et al.

ICLR 2024arXiv:2310.03695
16
citations
#2138

DRoC: Elevating Large Language Models for Complex Vehicle Routing via Decomposed Retrieval of Constraints

Xia Jiang, Yaoxin Wu, Chenhao Zhang et al.

ICLR 2025
16
citations
#2139

Mirage: Model-agnostic Graph Distillation for Graph Classification

Mridul Gupta, Sahil Manchanda, HARIPRASAD KODAMANA et al.

ICLR 2024arXiv:2310.09486
16
citations
#2140

Jointly-Learned Exit and Inference for a Dynamic Neural Network

Florence Regol, Joud Chataoui, Mark Coates

ICLR 2024arXiv:2310.09163
16
citations
#2141

COME: Test-time Adaption by Conservatively Minimizing Entropy

Qingyang Zhang, Yatao Bian, Xinke Kong et al.

ICLR 2025arXiv:2410.10894
16
citations
#2142

MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization

Yougang Lyu, Lingyong Yan, Zihan Wang et al.

ICLR 2025oralarXiv:2410.07672
16
citations
#2143

Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability

Ivan Lee, Nan Jiang, Taylor Berg-Kirkpatrick

ICLR 2024arXiv:2310.08049
16
citations
#2144

Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models

Jun Luo, Chen Chen, Shandong Wu

ICLR 2025arXiv:2410.10114
16
citations
#2145

Masked Completion via Structured Diffusion with White-Box Transformers

Druv Pai, Sam Buchanan, Ziyang Wu et al.

ICLR 2024arXiv:2404.02446
16
citations
#2146

InstructDET: Diversifying Referring Object Detection with Generalized Instructions

Ronghao Dang, Jiangyan Feng, Haodong Zhang et al.

ICLR 2024arXiv:2310.05136
16
citations
#2147

On Calibration of LLM-based Guard Models for Reliable Content Moderation

Hongfu Liu, Hengguan Huang, Xiangming Gu et al.

ICLR 2025arXiv:2410.10414
16
citations
#2148

MetaOOD: Automatic Selection of OOD Detection Models

Yuehan Qin, Yichi Zhang, Yi Nian et al.

ICLR 2025arXiv:2410.03074
16
citations
#2149

OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition

Stephen Zhang, Vardan Papyan

ICLR 2025arXiv:2409.13652
16
citations
#2150

Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling

Huangjie Zheng, Zhendong Wang, Jianbo Yuan et al.

ICLR 2024arXiv:2310.06389
16
citations
#2151

Do LLMs estimate uncertainty well in instruction-following?

Juyeon Heo, Miao Xiong, Christina Heinze-Deml et al.

ICLR 2025arXiv:2410.14582
16
citations
#2152

What Matters to You? Towards Visual Representation Alignment for Robot Learning

Thomas Tian, Chenfeng Xu, Masayoshi Tomizuka et al.

ICLR 2024oralarXiv:2310.07932
16
citations
#2153

Learning Flexible Body Collision Dynamics with Hierarchical Contact Mesh Transformer

Youn-Yeol Yu, Jeongwhan Choi, Woojin Cho et al.

ICLR 2024arXiv:2312.12467
16
citations
#2154

S$2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic

Safa Messaoud, Billel Mokeddem, Zhenghai Xue et al.

ICLR 2024arXiv:2405.00987
16
citations
#2155

Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval

Sheryl Hsu, Omar Khattab, Chelsea Finn et al.

ICLR 2025arXiv:2410.23214
16
citations
#2156

Is Factuality Enhancement a Free Lunch For LLMs? Better Factuality Can Lead to Worse Context-Faithfulness

Baolong Bi, Shenghua Liu, Yiwei Wang et al.

ICLR 2025arXiv:2404.00216
16
citations
#2157

SiReRAG: Indexing Similar and Related Information for Multihop Reasoning

Nan Zhang, Prafulla Kumar Choubey, Alexander Fabbri et al.

ICLR 2025arXiv:2412.06206
16
citations
#2158

Robustifying State-space Models for Long Sequences via Approximate Diagonalization

Annan Yu, Arnur Nigmetov, Dmitriy Morozov et al.

ICLR 2024spotlightarXiv:2310.01698
16
citations
#2159

EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos

Jilan Xu, Yifei Huang, Baoqi Pei et al.

ICLR 2025oralarXiv:2504.11732
16
citations
#2160

MrT5: Dynamic Token Merging for Efficient Byte-level Language Models

Julie Kallini, Shikhar Murty, Christopher Manning et al.

ICLR 2025arXiv:2410.20771
16
citations
#2161

CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding

eslam Abdelrahman, Mohamed Ayman Mohamed, Mahmoud Ahmed et al.

ICLR 2024arXiv:2310.06214
16
citations
#2162

Label-Noise Robust Diffusion Models

Byeonghu Na, Yeongmin Kim, HeeSun Bae et al.

ICLR 2024arXiv:2402.17517
16
citations
#2163

Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Dynamic Scenes

Isabella Liu, Hao Su, Xiaolong Wang

ICLR 2025oralarXiv:2404.12379
16
citations
#2164

Can In-context Learning Really Generalize to Out-of-distribution Tasks?

Qixun Wang, Yifei Wang, Xianghua Ying et al.

ICLR 2025arXiv:2410.09695
16
citations
#2165

LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid

Tianyi Zhang, Anshumali Shrivastava

ICLR 2025arXiv:2407.10032
16
citations
#2166

Track-On: Transformer-based Online Point Tracking with Memory

Görkay Aydemir, Xiongyi Cai, Weidi Xie et al.

ICLR 2025oralarXiv:2501.18487
16
citations
#2167

3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds

Hengshuo Chu, Xiang Deng, Qi Lv et al.

ICLR 2025arXiv:2502.20041
16
citations
#2168

Signature Kernel Conditional Independence Tests in Causal Discovery for Stochastic Processes

Georg Manten, Cecilia Casolo, Emilio Ferrucci et al.

ICLR 2025arXiv:2402.18477
16
citations
#2169

Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition

Jiyeon Kim, Hyunji Lee, Hyowon Cho et al.

ICLR 2025arXiv:2410.01380
16
citations
#2170

PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs

Oskar van der Wal, Pietro Lesci, Max Müller-Eberstein et al.

ICLR 2025arXiv:2503.09543
16
citations
#2171

BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Yu Feng, Ben Zhou, Weidong Lin et al.

ICLR 2025arXiv:2404.12494
16
citations
#2172

Black-Box Detection of Language Model Watermarks

Thibaud Gloaguen, Nikola Jovanović, Robin Staab et al.

ICLR 2025arXiv:2405.20777
16
citations
#2173

Refine Knowledge of Large Language Models via Adaptive Contrastive Learning

Yinghui Li, Haojing Huang, Jiayi Kuang et al.

ICLR 2025arXiv:2502.07184
16
citations
#2174

Concept Bottleneck Language Models For Protein Design

Aya Ismail, Tuomas Oikarinen, Amy Wang et al.

ICLR 2025arXiv:2411.06090
16
citations
#2175

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process

Changyao Tian, Chenxin Tao, Jifeng Dai et al.

ICLR 2024arXiv:2306.05423
16
citations
#2176

LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization

Jui-Nan Yen, Si Si, Zhao Meng et al.

ICLR 2025arXiv:2410.20625
16
citations
#2177

Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning

Haque Ishfaq, Guangyuan Wang, Sami Islam et al.

ICLR 2025arXiv:2501.17827
16
citations
#2178

Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards

Xiaoyu Yang, Jie Lu, En Yu

ICLR 2025arXiv:2405.13459
16
citations
#2179

FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models

Zhanwei Zhang, Shizhao Sun, Wenxiao Wang et al.

ICLR 2025arXiv:2411.05823
16
citations
#2180

Probabilistic Language-Image Pre-Training

Sanghyuk Chun, Wonjae Kim, Song Park et al.

ICLR 2025arXiv:2410.18857
16
citations
#2181

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

Yuhang Zang, Hanlin Goh, Joshua Susskind et al.

ICLR 2024arXiv:2401.15914
16
citations
#2182

Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion

Marco Mistretta, Alberto Baldrati, Lorenzo Agnolucci et al.

ICLR 2025arXiv:2502.04263
16
citations
#2183

EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents

Junting Chen, Checheng Yu, Xunzhe Zhou et al.

ICLR 2025arXiv:2410.22662
16
citations
#2184

Re-Thinking Inverse Graphics With Large Language Models

Haiwen Feng, Michael J Black, Weiyang Liu et al.

ICLR 2025arXiv:2404.15228
16
citations
#2185

Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation

Tien Manh Luong, Khai Nguyen, Nhat Ho et al.

ICLR 2024arXiv:2405.10084
16
citations
#2186

Provably Accurate Shapley Value Estimation via Leverage Score Sampling

Christopher Musco, R. Teal Witter

ICLR 2025arXiv:2410.01917
16
citations
#2187

Quadratic models for understanding catapult dynamics of neural networks

Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan et al.

ICLR 2024arXiv:2205.11787
16
citations
#2188

GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models

Mianchu Wang, Rui Yang, Xi Chen et al.

ICLR 2025arXiv:2310.20025
16
citations
#2189

APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding

Xinyu Yang, Tianqi Chen, Beidi Chen

ICLR 2025arXiv:2502.05431
16
citations
#2190

FlowDec: A flow-based full-band general audio codec with high perceptual quality

Simon Welker, Matthew Le, Ricky T. Q. Chen et al.

ICLR 2025arXiv:2503.01485
16
citations
#2191

To Trust or Not to Trust? Enhancing Large Language Models' Situated Faithfulness to External Contexts

Yukun Huang, Sanxing Chen, Hongyi Cai et al.

ICLR 2025arXiv:2410.14675
16
citations
#2192

DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation

Changdae Oh, Yixuan Li, Kyungwoo Song et al.

ICLR 2025arXiv:2410.03782
16
citations
#2193

RobustTSF: Towards Theory and Design of Robust Time Series Forecasting with Anomalies

Hao Cheng, Qingsong Wen, Yang Liu et al.

ICLR 2024arXiv:2402.02032
16
citations
#2194

TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention

Lijie Yang, Zhihao Zhang, Zhuofu Chen et al.

ICLR 2025arXiv:2410.05076
16
citations
#2195

Presto! Distilling Steps and Layers for Accelerating Music Generation

Zachary Novack, Ge Zhu, Jonah Casebeer et al.

ICLR 2025arXiv:2410.05167
16
citations
#2196

Functional Interpolation for Relative Positions improves Long Context Transformers

Shanda Li, Chong You, Guru Guruganesh et al.

ICLR 2024arXiv:2310.04418
16
citations
#2197

Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective

Ming Zhong, Chenxin An, Weizhu Chen et al.

ICLR 2024arXiv:2310.11451
16
citations
#2198

OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones?

Zijian Chen, tingzhu chen, Wenjun Zhang et al.

ICLR 2025arXiv:2412.01175
16
citations
#2199

BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference

Siqi Kou, Lei Gan, Dequan Wang et al.

ICLR 2024arXiv:2310.11142
16
citations
#2200

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

Qiyuan Zhang, Yufei Wang, Tiezheng YU et al.

ICLR 2025arXiv:2410.05193
16
citations