ICLR Papers

Better autoregressive regression with LLMs via regression-aware fine-tuning

Michal Lukasik, Zhao Meng, Harikrishna Narasimhan et al.

ICLR 2025 • poster • 7 citations

Better Instruction-Following Through Minimum Bayes Risk

Ian Wu, Patrick Fernandes, Amanda Bertsch et al.

ICLR 2025 • poster • arXiv:2410.02902

Better than Your Teacher: LLM Agents that learn from Privileged AI Feedback

Sanjiban Choudhury, Paloma Sodhi

ICLR 2025 • poster • arXiv:2410.05434

Beware of Calibration Data for Pruning Large Language Models

Yixin Ji, Yang Xiang, Juntao Li et al.

ICLR 2025 • poster • arXiv:2410.17711 • 8 citations

Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

Jiacheng Ye, Jiahui Gao, Shansan Gong et al.

ICLR 2025 • poster • arXiv:2410.14157 • 75 citations

Beyond Autoregression: Fast LLMs via Self-Distillation Through Time

Justin Deschenaux, Caglar Gulcehre

ICLR 2025 • poster • arXiv:2410.21035 • 25 citations

Beyond Canonicalization: How Tensorial Messages Improve Equivariant Message Passing

Peter Lippmann, Gerrit Gerhartz, Roman Remme et al.

ICLR 2025 • poster • arXiv:2405.15389 • 14 citations

Beyond Circuit Connections: A Non-Message Passing Graph Transformer Approach for Quantum Error Mitigation

Tianyi Bao, Xinyu Ye, Hang Ruan et al.

ICLR 2025 • poster • 2 citations

Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models

Jianqun Zhou, Yuanlei Zheng, Wei Chen et al.

ICLR 2025 • poster • arXiv:2410.23841 • 6 citations

Beyond correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge

Aparna Elangovan, Lei Xu, Jongwoo Ko et al.

ICLR 2025 • poster • arXiv:2410.03775 • 21 citations

Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration

Heyang Zhao, Xingrui Yu, David Bossens et al.

ICLR 2025 • poster • arXiv:2506.20307 • 2 citations

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.

ICLR 2025 • oral • 5 citations

Beyond Graphs: Can Large Language Models Comprehend Hypergraphs?

Yifan Feng, Chengwu Yang, Xingliang Hou et al.

ICLR 2025 • poster • arXiv:2410.10083 • 10 citations

Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness

Qi Zhang, Yifei Wang, Jingyi Cui et al.

ICLR 2025 • poster • arXiv:2410.21331 • 4 citations

Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix

Yingyu Liang, Jiangxuan Long, Zhenmei Shi et al.

ICLR 2025 • poster • arXiv:2410.11261

Beyond Mere Token Analysis: A Hypergraph Metric Space Framework for Defending Against Socially Engineered LLM Attacks

Manohar Kaul, Aditya Saibewar, Sadbhavana Babar

ICLR 2025 • poster

Beyond Model Collapse: Scaling Up with Synthesized Data Requires Verification

Yunzhen Feng, Elvis Dohmatob, Pu Yang et al.

ICLR 2025 • poster • arXiv:2406.07515

Beyond Next Token Prediction: Patch-Level Training for Large Language Models

Chenze Shao, Fandong Meng, Jie Zhou

ICLR 2025 • poster • arXiv:2407.12665 • 2 citations

Beyond Random Augmentations: Pretraining with Hard Views

Fabio Ferreira, Ivo Rapant, Jörg Franke et al.

ICLR 2025 • poster • arXiv:2310.03940 • 1 citation

Beyond Random Masking: When Dropout meets Graph Convolutional Networks

Yuankai Luo, Xiao-Ming Wu, Hao Zhu

ICLR 2025 • poster • 5 citations

Beyond Sequence: Impact of Geometric Context for RNA Property Prediction

Junjie Xu, Artem Moskalev, Tommaso Mansi et al.

ICLR 2025 • poster • arXiv:2410.11933 • 8 citations

Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution

Haiyan Zhao, Heng Zhao, Bo Shen et al.

ICLR 2025 • poster • arXiv:2410.00153 • 16 citations

Beyond single neurons: population response geometry in digital twins of mouse visual cortex

Dario Liscai, Emanuele Luconi, Alessandro Marin Vargas et al.

ICLR 2025 • poster • 1 citation

Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

Rui Hu, Yifan Zhang, Zhuoran Li et al.

ICLR 2025 • poster • arXiv:2410.02596

Beyond Surface Structure: A Causal Assessment of LLMs' Comprehension ability

Yujin Han, Lei Xu, Sirui Chen et al.

ICLR 2025 • poster • arXiv:2411.19456 • 2 citations

Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints

Mihaela Stoian, Eleonora Giunchiglia

ICLR 2025 • poster • arXiv:2502.18237 • 9 citations

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Clemencia Siro, Guy Gur-Ari, Gaurav Mishra et al.

ICLR 2025 • oral • arXiv:2206.04615 • 2192 citations

Beyond Worst-Case Dimensionality Reduction for Sparse Vectors

Sandeep Silwal, David Woodruff, Qiuyi (Richard) Zhang

ICLR 2025 • poster • arXiv:2502.19865

Bias Mitigation in Graph Diffusion Models

Meng Yu, Kun Zhan

ICLR 2025 • poster

Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

Yuejiang Liu, Jubayer Hamid, Annie Xie et al.

ICLR 2025 • oral • arXiv:2408.17355 • 6 citations

Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models

Wenxuan Zhang, Philip Torr, Mohamed Elhoseiny et al.

ICLR 2025 • poster • arXiv:2408.15313 • 23 citations

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Terry Yue Zhuo, Minh Chien Vu, Jenny Chim et al.

ICLR 2025 • poster • arXiv:2406.15877 • 397 citations

BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks

Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi et al.

ICLR 2025 • poster • arXiv:2412.04626 • 5 citations

BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities

Shaozhe Hao, Xuantong LIU, Xianbiao Qi et al.

ICLR 2025 • poster • arXiv:2410.14672 • 4 citations

Bilinear MLPs enable weight-based mechanistic interpretability

Michael Pearce, Thomas Dooms, Alice Rigg et al.

ICLR 2025 • poster • arXiv:2410.08417

BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models

Xingyu Zheng, Xianglong Liu, Haotong Qin et al.

ICLR 2025 • poster • arXiv:2404.05662 • 7 citations

Binary Losses for Density Ratio Estimation

Werner Zellinger

ICLR 2025 • poster • arXiv:2407.01371 • 1 citation

BingoGuard: LLM Content Moderation Tools with Risk Levels

Fan Yin, Philippe Laban, XIANGYU PENG et al.

ICLR 2025 • poster • arXiv:2503.06550 • 14 citations

BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments

Yusuf Roohani, Andrew Lee, Qian Huang et al.

ICLR 2025 • poster • arXiv:2405.17631 • 48 citations

Biologically Constrained Barrel Cortex Model Integrates Whisker Inputs and Replicates Key Brain Network Dynamics

Tianfang Zhu, Dongli Hu, Jiandong Zhou et al.

ICLR 2025 • oral

Biologically Plausible Brain Graph Transformer

Ciyuan Peng, Yuelong Huang, Qichao Dong et al.

ICLR 2025 • poster • arXiv:2502.08958

Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequences

Niklas Schmidinger, Lisa Schneckenreiter, Philipp Seidl et al.

ICLR 2025 • poster • arXiv:2411.04165

BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Yu Feng, Ben Zhou, Weidong Lin et al.

ICLR 2025 • poster • arXiv:2404.12494

BirdSet: A Large-Scale Dataset for Audio Classification in Avian Bioacoustics

Lukas Rauch, Raphael Schwinger, Moritz Wirth et al.

ICLR 2025 • poster • arXiv:2403.10380 • 18 citations

Bisimulation Metric for Model Predictive Control

Yutaka Shimizu, Masayoshi Tomizuka

ICLR 2025 • poster • arXiv:2410.04553 • 2 citations

BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments

Xinghao Wang, Pengyu Wang, Bo Wang et al.

ICLR 2025 • poster • arXiv:2410.23918 • 5 citations

Black-Box Detection of Language Model Watermarks

Thibaud Gloaguen, Nikola Jovanović, Robin Staab et al.

ICLR 2025 • poster • arXiv:2405.20777 • 15 citations

Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition

Xinyu Tian, Shu Zou, Zhaoyuan Yang et al.

ICLR 2025 • poster • arXiv:2502.15809 • 5 citations

BLEND: Behavior-guided Neural Population Dynamics Modeling via Privileged Knowledge Distillation

Zhengrui Guo, Fangxu Zhou, Wei Wu et al.

ICLR 2025 • oral • arXiv:2410.13872 • 3 citations

BlendRL: A Framework for Merging Symbolic and Neural Policy Learning

Hikaru Shindo, Quentin Delfosse, Devendra Singh Dhami et al.

ICLR 2025 • poster • arXiv:2410.11689