Most Cited ICLR "full information feedback" Papers

6,124 papers found • Page 7 of 31

#1201

Jointly Training Large Autoregressive Multimodal Models

Emanuele Aiello, Lili Yu, Yixin Nie et al.

ICLR 2024arXiv:2309.15564
34
citations
#1202

Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain

Marcus J. Min, Yangruibo Ding, Luca Buratti et al.

ICLR 2024arXiv:2310.14053
34
citations
#1203

O(d/T) Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions

Gen Li, Yuling Yan

ICLR 2025arXiv:2409.18959
34
citations
#1204

Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training

Maximillian Chen, Ruoxi Sun, Tomas Pfister et al.

ICLR 2025arXiv:2406.00222
34
citations
#1205

Computational Limits of Low-Rank Adaptation (LoRA) Fine-Tuning for Transformer Models

Jerry Yao-Chieh Hu, Maojiang Su, En-Jui Kuo et al.

ICLR 2025arXiv:2406.03136
34
citations
#1206

SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning

Yichen Wu, Hongming Piao, Long-Kai Huang et al.

ICLR 2025arXiv:2501.13198
34
citations
#1207

Low Rank Matrix Completion via Robust Alternating Minimization in Nearly Linear Time

Yuzhou Gu, Zhao Song, Junze Yin et al.

ICLR 2024arXiv:2302.11068
34
citations
#1208

Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming

Yilun Hao, Yang Zhang, Chuchu Fan

ICLR 2025arXiv:2410.12112
34
citations
#1209

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Yuheng Zhang, Dian Yu, Baolin Peng et al.

ICLR 2025arXiv:2407.00617
34
citations
#1210

ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities

Ezra Karger, Houtan Bastani, Chen Yueh-Han et al.

ICLR 2025arXiv:2409.19839
34
citations
#1211

Text4Seg: Reimagining Image Segmentation as Text Generation

Mengcheng Lan, Chaofeng Chen, Yue Zhou et al.

ICLR 2025arXiv:2410.09855
34
citations
#1212

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards

Xinze Li, Sen Mei, Zhenghao Liu et al.

ICLR 2025arXiv:2410.13509
34
citations
#1213

Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment

Gregor Bachmann, Sotiris Anagnostidis, Albert Pumarola et al.

ICLR 2025arXiv:2501.19309
34
citations
#1214

Scalable Language Model with Generalized Continual Learning

Bohao PENG, Zhuotao Tian, Shu Liu et al.

ICLR 2024arXiv:2404.07470
34
citations
#1215

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Ruizhe Shi, Yuyao Liu, Yanjie Ze et al.

ICLR 2024arXiv:2310.20587
34
citations
#1216

Early Stopping Against Label Noise Without Validation Data

Suqin Yuan, Lei Feng, Tongliang Liu

ICLR 2024arXiv:2502.07551
34
citations
#1217

Let Models Speak Ciphers: Multiagent Debate through Embeddings

Chau Pham, Boyi Liu, Yingxiang Yang et al.

ICLR 2024arXiv:2310.06272
34
citations
#1218

How Does Unlabeled Data Provably Help Out-of-Distribution Detection?

Xuefeng Du, Zhen Fang, Ilias Diakonikolas et al.

ICLR 2024arXiv:2402.03502
34
citations
#1219

What to align in multimodal contrastive learning?

Benoit Dufumier, Javiera Castillo Navarro, Devis Tuia et al.

ICLR 2025arXiv:2409.07402
34
citations
#1220

Neural Monge Map estimation and its applications

Shaojun Ma, Yongxin Chen, Hao-Min Zhou et al.

ICLR 2024arXiv:2106.03812
33
citations
#1221

Revisiting text-to-image evaluation with Gecko: on metrics, prompts, and human rating

Olivia Wiles, Chuhan Zhang, Isabela Albuquerque et al.

ICLR 2025arXiv:2404.16820
33
citations
#1222

On the Relation between Trainability and Dequantization of Variational Quantum Learning Models

Elies Gil-Fuster, Casper Gyurik, Adrian Perez-Salinas et al.

ICLR 2025arXiv:2406.07072
33
citations
#1223

Random Feature Amplification: Feature Learning and Generalization in Neural Networks

Spencer Frei, Niladri Chatterji, Peter L. Bartlett

ICLR 2024arXiv:2202.07626
33
citations
#1224

AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text

Ximing Lu, Melanie Sclar, Skyler Hallinan et al.

ICLR 2025arXiv:2410.04265
33
citations
#1225

Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

Haque Ishfaq, Qingfeng Lan, Pan Xu et al.

ICLR 2024arXiv:2305.18246
33
citations
#1226

LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection

Sifan Zhou, Liang Li, Xinyu Zhang et al.

ICLR 2024arXiv:2401.15865
33
citations
#1227

Scaling Wearable Foundation Models

Girish Narayanswamy, Xin Liu, Kumar Ayush et al.

ICLR 2025arXiv:2410.13638
33
citations
#1228

Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection

Jiawei Liang, Siyuan Liang, Aishan Liu et al.

ICLR 2024spotlightarXiv:2402.11473
33
citations
#1229

Interpreting the Second-Order Effects of Neurons in CLIP

Yossi Gandelsman, Alexei Efros, Jacob Steinhardt

ICLR 2025arXiv:2406.04341
33
citations
#1230

gRNAde: Geometric Deep Learning for 3D RNA inverse design

Chaitanya Joshi, Arian Jamasb, Ramon Viñas et al.

ICLR 2025arXiv:2305.14749
33
citations
#1231

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

Wenda Xu, Rujun Han, Zifeng Wang et al.

ICLR 2025arXiv:2410.11325
33
citations
#1232

Gramian Multimodal Representation Learning and Alignment

Giordano Cicchetti, Eleonora Grassucci, Luigi Sigillo et al.

ICLR 2025arXiv:2412.11959
33
citations
#1233

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

Junkang Wu, Yuexiang Xie, Zhengyi Yang et al.

ICLR 2025arXiv:2407.07880
33
citations
#1234

The Hidden Language of Diffusion Models

Hila Chefer, Oran Lang, Mor Geva et al.

ICLR 2024arXiv:2306.00966
33
citations
#1235

Training Unbiased Diffusion Models From Biased Dataset

Yeongmin Kim, Byeonghu Na, Minsang Park et al.

ICLR 2024arXiv:2403.01189
33
citations
#1236

Logical Languages Accepted by Transformer Encoders with Hard Attention

Pablo Barcelo, Alexander Kozachinskiy, Anthony W. Lin et al.

ICLR 2024arXiv:2310.03817
33
citations
#1237

CABINET: Content Relevance-based Noise Reduction for Table Question Answering

Sohan Patnaik, Heril Changwal, Milan Aggarwal et al.

ICLR 2024spotlightarXiv:2402.01155
33
citations
#1238

Memorization Capacity of Multi-Head Attention in Transformers

Sadegh Mahdavi, Renjie Liao, Christos Thrampoulidis

ICLR 2024spotlightarXiv:2306.02010
33
citations
#1239

Don't Play Favorites: Minority Guidance for Diffusion Models

Soobin Um, Suhyeon Lee, Jong Chul YE

ICLR 2024arXiv:2301.12334
33
citations
#1240

SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training

Kazem Meidani, Parshin Shojaee, Chandan Reddy et al.

ICLR 2024spotlightarXiv:2310.02227
33
citations
#1241

VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing

Xiangpeng Yang, Linchao Zhu, Hehe Fan et al.

ICLR 2025arXiv:2502.17258
33
citations
#1242

Hyper-Connections

Defa Zhu, Hongzhi Huang, Zihao Huang et al.

ICLR 2025arXiv:2409.19606
33
citations
#1243

AgentRefine: Enhancing Agent Generalization through Refinement Tuning

Dayuan Fu, Keqing He, Yejie Wang et al.

ICLR 2025arXiv:2501.01702
33
citations
#1244

It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition

CHEN CHEN, Ruizhe Li, Yuchen Hu et al.

ICLR 2024arXiv:2402.05457
33
citations
#1245

MoDeGPT: Modular Decomposition for Large Language Model Compression

Chi-Heng Lin, Shangqian Gao, James Smith et al.

ICLR 2025arXiv:2408.09632
32
citations
#1246

GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers

Takeru Miyato, Bernhard Jaeger, Max Welling et al.

ICLR 2024arXiv:2310.10375
32
citations
#1247

Self-Boosting Large Language Models with Synthetic Preference Data

Qingxiu Dong, Li Dong, Xingxing Zhang et al.

ICLR 2025arXiv:2410.06961
32
citations
#1248

Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs

Feiyang Kang, Hoang Anh Just, Yifan Sun et al.

ICLR 2024arXiv:2405.02774
32
citations
#1249

CPPO: Continual Learning for Reinforcement Learning with Human Feedback

Han Zhang, Yu Lei, Lin Gui et al.

ICLR 2024
32
citations
#1250

InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences

Hongkai Zheng, Wenda Chu, Bingliang Zhang et al.

ICLR 2025arXiv:2503.11043
32
citations
#1251

Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations

Katie Matton, Robert Ness, John Guttag et al.

ICLR 2025arXiv:2504.14150
32
citations
#1252

Context-Alignment: Activating and Enhancing LLMs Capabilities in Time Series

Yuxiao Hu, Qian Li, Dongxiao Zhang et al.

ICLR 2025arXiv:2501.03747
32
citations
#1253

Harnessing Diversity for Important Data Selection in Pretraining Large Language Models

Chi Zhang, Huaping Zhong, Kuan Zhang et al.

ICLR 2025arXiv:2409.16986
32
citations
#1254

OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation

Yuchen Lin, Chenguo Lin, Jianjin Xu et al.

ICLR 2025arXiv:2501.18982
32
citations
#1255

Interpretable Diffusion via Information Decomposition

Xianghao Kong, Ollie Liu, Han Li et al.

ICLR 2024arXiv:2310.07972
32
citations
#1256

CAKE: Cascading and Adaptive KV Cache Eviction with Layer Preferences

Ziran Qin, Yuchen Cao, Mingbao Lin et al.

ICLR 2025oralarXiv:2503.12491
32
citations
#1257

CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes

Yang Liu, Chuanchen Luo, Zhongkai Mao et al.

ICLR 2025arXiv:2411.00771
32
citations
#1258

ICLR: In-Context Learning of Representations

Core Francisco Park, Andrew Lee, Ekdeep Singh Lubana et al.

ICLR 2025arXiv:2501.00070
32
citations
#1259

Number Cookbook: Number Understanding of Language Models and How to Improve It

Haotong Yang, Yi Hu, Shijia Kang et al.

ICLR 2025arXiv:2411.03766
32
citations
#1260

Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting

Rong Dai, Yonggang Zhang, Ang Li et al.

ICLR 2024arXiv:2402.15070
32
citations
#1261

CausalLM is not optimal for in-context learning

Nan Ding, Tomer Levinboim, Jialin Wu et al.

ICLR 2024arXiv:2308.06912
32
citations
#1262

Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model

Xiu Yuan, Tongzhou Mu, Stone Tao et al.

ICLR 2025arXiv:2412.13630
32
citations
#1263

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

Aiwei Liu, Haoping Bai, Zhiyun Lu et al.

ICLR 2025arXiv:2410.04350
32
citations
#1264

Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix

Yingyu Liang, Jiangxuan Long, Zhenmei Shi et al.

ICLR 2025arXiv:2410.11261
32
citations
#1265

Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape

Rundi Wu, Ruoshi Liu, Carl Vondrick et al.

ICLR 2024arXiv:2305.15399
32
citations
#1266

Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models Trained on Corrupted Data

Asad Aali, Giannis Daras, Brett Levac et al.

ICLR 2025arXiv:2403.08728
32
citations
#1267

STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs

Peijie Dong, Lujun Li, Yuedong Zhong et al.

ICLR 2025arXiv:2408.01803
32
citations
#1268

PeFLL: Personalized Federated Learning by Learning to Learn

Jonathan Scott, Hossein Zakerinia, Christoph Lampert

ICLR 2024arXiv:2306.05515
32
citations
#1269

Exploring Diffusion Time-steps for Unsupervised Representation Learning

Zhongqi Yue, Zhongqi Yue, Jiankun Wang et al.

ICLR 2024arXiv:2401.11430
32
citations
#1270

Language Model Alignment in Multilingual Trolley Problems

Zhijing Jin, Max Kleiman-Weiner, Giorgio Piatti et al.

ICLR 2025oralarXiv:2407.02273
31
citations
#1271

LILO: Learning Interpretable Libraries by Compressing and Documenting Code

Gabriel Grand, Lio Wong, Maddy Bowers et al.

ICLR 2024arXiv:2310.19791
31
citations
#1272

Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View

Xuan Liu, Jie ZHANG, HaoYang Shang et al.

ICLR 2025arXiv:2405.14744
31
citations
#1273

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Peng Xia, Siwei Han, Shi Qiu et al.

ICLR 2025arXiv:2410.10139
31
citations
#1274

From Tokens to Words: On the Inner Lexicon of LLMs

Guy Kaplan, Matanel Oren, Yuval Reif et al.

ICLR 2025arXiv:2410.05864
31
citations
#1275

Initializing Models with Larger Ones

Zhiqiu Xu, Yanjie Chen, Kirill Vishniakov et al.

ICLR 2024spotlightarXiv:2311.18823
31
citations
#1276

Image Inpainting via Tractable Steering of Diffusion Models

Anji Liu, Mathias Niepert, Guy Van den Broeck

ICLR 2024arXiv:2401.03349
31
citations
#1277

InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists

Yulu Gan, Sung Woo Park, Alexander Schubert et al.

ICLR 2024arXiv:2310.00390
31
citations
#1278

Spatio-Temporal Few-Shot Learning via Diffusive Neural Network Generation

Yuan Yuan, Chenyang Shao, Jingtao Ding et al.

ICLR 2024oralarXiv:2402.11922
31
citations
#1279

SEGNO: Generalizing Equivariant Graph Neural Networks with Physical Inductive Biases

Yang Liu, Jiashun Cheng, Haihong Zhao et al.

ICLR 2024spotlightarXiv:2308.13212
31
citations
#1280

GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

Saman Kazemkhani, Aarav Pandya, Daphne Cornelisse et al.

ICLR 2025arXiv:2408.01584
31
citations
#1281

Forgetting Transformer: Softmax Attention with a Forget Gate

Zhixuan Lin, Evgenii Nikishin, Xu He et al.

ICLR 2025arXiv:2503.02130
31
citations
#1282

Herald: A Natural Language Annotated Lean 4 Dataset

Guoxiong Gao, Yutong Wang, Jiedong Jiang et al.

ICLR 2025arXiv:2410.10878
31
citations
#1283

TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval

Leqi Shen, Tianxiang Hao, Tao He et al.

ICLR 2025oralarXiv:2409.01156
31
citations
#1284

Domain-Agnostic Molecular Generation with Chemical Feedback

Yin Fang, Ningyu Zhang, Zhuo Chen et al.

ICLR 2024arXiv:2301.11259
31
citations
#1285

CLIP the Bias: How Useful is Balancing Data in Multimodal Learning?

Ibrahim Alabdulmohsin, Xiao Wang, Andreas Steiner et al.

ICLR 2024arXiv:2403.04547
31
citations
#1286

CViT: Continuous Vision Transformer for Operator Learning

Sifan Wang, Jacob Seidman, Shyam Sankaran et al.

ICLR 2025oralarXiv:2405.13998
31
citations
#1287

CBQ: Cross-Block Quantization for Large Language Models

Xin Ding, Xiaoyu Liu, Zhijun Tu et al.

ICLR 2025arXiv:2312.07950
31
citations
#1288

STAMP: Scalable Task- And Model-agnostic Collaborative Perception

Xiangbo Gao, Runsheng Xu, Jiachen Li et al.

ICLR 2025arXiv:2501.18616
31
citations
#1289

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

Siyuan Qi, Shuo Chen, Yexin Li et al.

ICLR 2024spotlightarXiv:2401.10568
31
citations
#1290

FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs

Sepehr Dehdashtian, Lan Wang, Vishnu Boddeti

ICLR 2024arXiv:2403.15593
31
citations
#1291

System 1.x: Learning to Balance Fast and Slow Planning with Language Models

Swarnadeep Saha, Archiki Prasad, Justin Chen et al.

ICLR 2025arXiv:2407.14414
31
citations
#1292

SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation

Qianxu Wang, Haotong Zhang, Congyue Deng et al.

ICLR 2024arXiv:2310.16838
31
citations
#1293

Sparse autoencoders reveal selective remapping of visual concepts during adaptation

Hyesu Lim, Jinho Choi, Jaegul Choo et al.

ICLR 2025arXiv:2412.05276
31
citations
#1294

Longhorn: State Space Models are Amortized Online Learners

Bo Liu, Rui Wang, Lemeng Wu et al.

ICLR 2025arXiv:2407.14207
31
citations
#1295

GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion

Xueyi Liu, Li Yi

ICLR 2024arXiv:2402.14810
31
citations
#1296

Transformer Fusion with Optimal Transport

Moritz Imfeld, Jacopo Graldi, Marco Giordano et al.

ICLR 2024arXiv:2310.05719
31
citations
#1297

PivotMesh: Generic 3D Mesh Generation via Pivot Vertices Guidance

Haohan Weng, Yikai Wang, Tong Zhang et al.

ICLR 2025arXiv:2405.16890
31
citations
#1298

OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling

Zhicheng YANG, Yiwei Wang, Yinya Huang et al.

ICLR 2025arXiv:2407.09887
31
citations
#1299

LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior

Hanyu Wang, Saksham Suri, Yixuan Ren et al.

ICLR 2025arXiv:2410.21264
31
citations
#1300

NECO: NEural Collapse Based Out-of-distribution detection

Mouïn Ben Ammar, Nacim Belkhir, Sebastian Popescu et al.

ICLR 2024arXiv:2310.06823
31
citations
#1301

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Ziteng Wang, Jun Zhu, Jianfei Chen

ICLR 2025arXiv:2412.14711
31
citations
#1302

Methods for Convex $(L_0,L_1)$-Smooth Optimization: Clipping, Acceleration, and Adaptivity

Eduard Gorbunov, Nazarii Tupitsa, Sayantan Choudhury et al.

ICLR 2025arXiv:2409.14989
31
citations
#1303

McEval: Massively Multilingual Code Evaluation

Linzheng Chai, Shukai Liu, Jian Yang et al.

ICLR 2025arXiv:2406.07436
31
citations
#1304

Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning

Chenyu Zhang, Han Wang, Aritra Mitra et al.

ICLR 2024arXiv:2401.15273
31
citations
#1305

Fair and Efficient Contribution Valuation for Vertical Federated Learning

Zhenan Fan, Huang Fang, Xinglu Wang et al.

ICLR 2024arXiv:2201.02658
31
citations
#1306

From Isolated Conversations to Hierarchical Schemas: Dynamic Tree Memory Representation for LLMs

Alireza Rezazadeh, Zichao Li, Wei Wei et al.

ICLR 2025arXiv:2410.14052
31
citations
#1307

REEF: Representation Encoding Fingerprints for Large Language Models

Jie Zhang, Dongrui Liu, Chen Qian et al.

ICLR 2025arXiv:2410.14273
31
citations
#1308

Spurious Forgetting in Continual Learning of Language Models

Junhao Zheng, Xidi Cai, Shengjie Qiu et al.

ICLR 2025arXiv:2501.13453
30
citations
#1309

Transformer-VQ: Linear-Time Transformers via Vector Quantization

Lucas D. Lingle

ICLR 2024arXiv:2309.16354
30
citations
#1310

GOFA: A Generative One-For-All Model for Joint Graph Language Modeling

Lecheng Kong, Jiarui Feng, Hao Liu et al.

ICLR 2025arXiv:2407.09709
30
citations
#1311

Closing the Curious Case of Neural Text Degeneration

Matthew Finlayson, John Hewitt, Alexander Koller et al.

ICLR 2024arXiv:2310.01693
30
citations
#1312

Feature emergence via margin maximization: case studies in algebraic tasks

Depen Morwani, Benjamin Edelman, Costin-Andrei Oncescu et al.

ICLR 2024spotlightarXiv:2311.07568
30
citations
#1313

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Junyan Ye, Baichuan Zhou, Zilong Huang et al.

ICLR 2025arXiv:2410.09732
30
citations
#1314

LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding

Doohyuk Jang, Sihwan Park, June Yong Yang et al.

ICLR 2025arXiv:2410.03355
30
citations
#1315

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Sihang Li, Jin Huang, Jiaxi Zhuang et al.

ICLR 2025arXiv:2408.15545
30
citations
#1316

Image Watermarks are Removable using Controllable Regeneration from Clean Noise

Yepeng Liu, Yiren Song, Hai Ci et al.

ICLR 2025arXiv:2410.05470
30
citations
#1317

Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs

Qi Wu, Yubo Zhao, Yifan Wang et al.

ICLR 2025arXiv:2405.17013
30
citations
#1318

Long-Short-Range Message-Passing: A Physics-Informed Framework to Capture Non-Local Interaction for Scalable Molecular Dynamics Simulation

Yunyang Li, Yusong Wang, Lin Huang et al.

ICLR 2024arXiv:2304.13542
30
citations
#1319

Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer

Yanjun Zhao, Sizhe Dang, Haishan Ye et al.

ICLR 2025
30
citations
#1320

Achieving Human Parity in Content-Grounded Datasets Generation

Asaf Yehudai, Boaz Carmeli, Yosi Mass et al.

ICLR 2024arXiv:2401.14367
30
citations
#1321

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

Xiaojuan Wang, Boyang Zhou, Brian Curless et al.

ICLR 2025arXiv:2408.15239
30
citations
#1322

Early Neuron Alignment in Two-layer ReLU Networks with Small Initialization

Hancheng Min, Enrique Mallada, Rene Vidal

ICLR 2024arXiv:2307.12851
30
citations
#1323

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Jiacheng Chen, Tianhao Liang, Sherman Siu et al.

ICLR 2025arXiv:2410.10563
30
citations
#1324

Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark

Tsung-Han Wu, Giscard Biamby, Jerome Quenum et al.

ICLR 2025arXiv:2407.13766
30
citations
#1325

DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation

Zilin Si, Gu Zhang, Qingwei Ben et al.

ICLR 2024arXiv:2403.08716
30
citations
#1326

Backdoor Federated Learning by Poisoning Backdoor-Critical Layers

Haomin Zhuang, Mingxian Yu, Hao Wang et al.

ICLR 2024arXiv:2308.04466
30
citations
#1327

Learning to Reject with a Fixed Predictor: Application to Decontextualization

Christopher Mohri, Daniel Andor, Eunsol Choi et al.

ICLR 2024arXiv:2301.09044
30
citations
#1328

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Wenbo Hu, Jia-Chen Gu, Zi-Yi Dou et al.

ICLR 2025arXiv:2410.08182
30
citations
#1329

Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge

Haomiao Xiong, Zongxin Yang, Jiazuo Yu et al.

ICLR 2025arXiv:2501.13468
30
citations
#1330

Understanding In-Context Learning from Repetitions

Jianhao (Elliott) Yan, Jin Xu, Chiyu Song et al.

ICLR 2024arXiv:2310.00297
30
citations
#1331

Multi-Scale Representations by Varying Window Attention for Semantic Segmentation

Haotian Yan, Ming Wu, Chuang Zhang

ICLR 2024arXiv:2404.16573
30
citations
#1332

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

zehan wang, Ziang Zhang, Minjie Hong et al.

ICLR 2025arXiv:2407.11895
30
citations
#1333

Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making

Jeonghye Kim, Su Young Lee, Woojun Kim et al.

ICLR 2024spotlightarXiv:2310.03022
30
citations
#1334

On Large Language Model Continual Unlearning

Chongyang Gao, Lixu Wang, Kaize Ding et al.

ICLR 2025arXiv:2407.10223
30
citations
#1335

InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences

Chenyang Zhu, Kai Li, Yue Ma et al.

ICLR 2025arXiv:2412.01197
30
citations
#1336

Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data

Zhiyuan Zhou, Andy Peng, Qiyang Li et al.

ICLR 2025arXiv:2412.07762
30
citations
#1337

Efficient Dictionary Learning with Switch Sparse Autoencoders

Anish Mudide, Josh Engels, Eric Michaud et al.

ICLR 2025arXiv:2410.08201
30
citations
#1338

RetroBridge: Modeling Retrosynthesis with Markov Bridges

Ilia Igashov, Arne Schneuing, Marwin Segler et al.

ICLR 2024spotlightarXiv:2308.16212
30
citations
#1339

GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling

Jixun Yao, Hexin Liu, CHEN CHEN et al.

ICLR 2025arXiv:2502.02942
30
citations
#1340

Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning

Moritz Reuss, Jyothish Pari, Pulkit Agrawal et al.

ICLR 2025arXiv:2412.12953
30
citations
#1341

Understanding Addition in Transformers

Philip Quirke, Fazl Barez

ICLR 2024arXiv:2310.13121
30
citations
#1342

Learning the greatest common divisor: explaining transformer predictions

François Charton

ICLR 2024spotlightarXiv:2308.15594
29
citations
#1343

Can Knowledge Editing Really Correct Hallucinations?

Baixiang Huang, Canyu Chen, Xiongxiao Xu et al.

ICLR 2025arXiv:2410.16251
29
citations
#1344

ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding

Zhengzhuo Xu, Bowen Qu, Yiyan Qi et al.

ICLR 2025arXiv:2409.03277
29
citations
#1345

Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection

Lichen Bai, Shitong Shao, zikai zhou et al.

ICLR 2025arXiv:2412.10891
29
citations
#1346

Multimodal Situational Safety

Kaiwen Zhou, Chengzhi Liu, Xuandong Zhao et al.

ICLR 2025arXiv:2410.06172
29
citations
#1347

Object-Aware Inversion and Reassembly for Image Editing

Zhen Yang, Ganggui Ding, Wen Wang et al.

ICLR 2024arXiv:2310.12149
29
citations
#1348

Parallelizing non-linear sequential models over the sequence length

Yi Heng Lim, Qi Zhu, Joshua Selfridge et al.

ICLR 2024arXiv:2309.12252
29
citations
#1349

Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors

Weixuan Wang, JINGYUAN YANG, Wei Peng

ICLR 2025arXiv:2410.12299
29
citations
#1350

Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse

Maojia Song, Shang Hong Sim, Rishabh Bhardwaj et al.

ICLR 2025arXiv:2409.11242
29
citations
#1351

Ghost on the Shell: An Expressive Representation of General 3D Shapes

Zhen Liu, Yao Feng, Yuliang Xiu et al.

ICLR 2024arXiv:2310.15168
29
citations
#1352

Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding

Zhongyi Shui, Jianpeng Zhang, Weiwei Cao et al.

ICLR 2025arXiv:2501.14548
29
citations
#1353

Holistically Evaluating the Environmental Impact of Creating Language Models

Jacob Morrison, Clara Na, Jared Fernandez et al.

ICLR 2025arXiv:2503.05804
29
citations
#1354

Learning Energy Decompositions for Partial Inference in GFlowNets

Hyosoon Jang, Minsu Kim, Sungsoo Ahn

ICLR 2024arXiv:2310.03301
29
citations
#1355

How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework

Yinuo Ren, Haoxuan Chen, Grant Rotskoff et al.

ICLR 2025arXiv:2410.03601
29
citations
#1356

Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations

Xinyue Xu, Yi Qin, Lu Mi et al.

ICLR 2024arXiv:2401.14142
29
citations
#1357

ASID: Active Exploration for System Identification in Robotic Manipulation

Marius Memmel, Andrew Wagenmaker, Chuning Zhu et al.

ICLR 2024arXiv:2404.12308
29
citations
#1358

A Formal Framework for Understanding Length Generalization in Transformers

Xinting Huang, Andy Yang, Satwik Bhattamishra et al.

ICLR 2025arXiv:2410.02140
29
citations
#1359

Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models

Zhenyu Pan, Haozheng Luo, Manling Li et al.

ICLR 2025arXiv:2403.17359
29
citations
#1360

Biased Temporal Convolution Graph Network for Time Series Forecasting with Missing Values

Xiaodan Chen, Xiucheng Li, Bo Liu et al.

ICLR 2024oral
29
citations
#1361

DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life

Yu Ying Chiu, Liwei Jiang, Yejin Choi

ICLR 2025oralarXiv:2410.02683
29
citations
#1362

PolyVoice: Language Models for Speech to Speech Translation

Qianqian Dong, Zhiying Huang, Qiao Tian et al.

ICLR 2024arXiv:2306.02982
29
citations
#1363

CAT-3DGS: A Context-Adaptive Triplane Approach to Rate-Distortion-Optimized 3DGS Compression

Yu-Ting Zhan, Cheng-Yuan Ho, He-Bi Yang et al.

ICLR 2025arXiv:2503.00357
29
citations
#1364

Multimodal Patient Representation Learning with Missing Modalities and Labels

Zhenbang Wu, Anant Dadu, Nicholas Tustison et al.

ICLR 2024
29
citations
#1365

Batched Low-Rank Adaptation of Foundation Models

Yeming Wen, Swarat Chaudhuri

ICLR 2024arXiv:2312.05677
29
citations
#1366

On the Stability of Expressive Positional Encodings for Graphs

Yinan Huang, William Lu, Joshua Robinson et al.

ICLR 2024arXiv:2310.02579
29
citations
#1367

Machine Unlearning Fails to Remove Data Poisoning Attacks

Martin Pawelczyk, Jimmy Di, Yiwei Lu et al.

ICLR 2025arXiv:2406.17216
29
citations
#1368

Posterior Sampling Based on Gradient Flows of the MMD with Negative Distance Kernel

Paul Hagemann, Johannes Hertrich, Fabian Altekrüger et al.

ICLR 2024arXiv:2310.03054
29
citations
#1369

Beyond Autoregression: Fast LLMs via Self-Distillation Through Time

Justin Deschenaux, Caglar Gulcehre

ICLR 2025arXiv:2410.21035
29
citations
#1370

Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron

Yiran Zhao, Wenxuan Zhang, Yuxi Xie et al.

ICLR 2025
29
citations
#1371

Copula Conformal prediction for multi-step time series prediction

Sophia Sun, Rose Yu

ICLR 2024oral
29
citations
#1372

GraphArena: Evaluating and Exploring Large Language Models on Graph Computation

Jianheng Tang, Qifan Zhang, Yuhan Li et al.

ICLR 2025arXiv:2407.00379
29
citations
#1373

Can Large Language Models Understand Symbolic Graphics Programs?

Zeju Qiu, Weiyang Liu, Haiwen Feng et al.

ICLR 2025arXiv:2408.08313
29
citations
#1374

UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model in Data Science

Yazheng Yang, Yuqi Wang, Guang Liu et al.

ICLR 2024arXiv:2307.09249
29
citations
#1375

Scaling physics-informed hard constraints with mixture-of-experts

Nithin Chalapathi, Yiheng Du, Aditi Krishnapriyan

ICLR 2024oralarXiv:2402.13412
29
citations
#1376

Energy-Weighted Flow Matching for Offline Reinforcement Learning

Shiyuan Zhang, Weitong Zhang, Quanquan Gu

ICLR 2025arXiv:2503.04975
29
citations
#1377

PhyloGFN: Phylogenetic inference with generative flow networks

MING YANG ZHOU, Zichao Yan, Elliot Layne et al.

ICLR 2024arXiv:2310.08774
29
citations
#1378

Contrastive Learning is Spectral Clustering on Similarity Graph

Zhiquan Tan, Yifan Zhang, Jingqin Yang et al.

ICLR 2024arXiv:2303.15103
29
citations
#1379

CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning

Hao Cui, Zahra Shamsi, Gowoon Cheon et al.

ICLR 2025arXiv:2503.13517
29
citations
#1380

Reducing Hallucinations in Large Vision-Language Models via Latent Space Steering

Sheng Liu, Haotian Ye, James Y Zou

ICLR 2025
29
citations
#1381

PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training

Cong Chen, Mingyu Liu, Chenchen Jing et al.

ICLR 2025arXiv:2503.06486
29
citations
#1382

Your Mixture-of-Experts LLM Is Secretly an Embedding Model for Free

Ziyue Li, Tianyi Zhou

ICLR 2025arXiv:2410.10814
29
citations
#1383

The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling

Andre Cornman, Jacob West-Roberts, Antonio Camargo et al.

ICLR 2025
29
citations
#1384

NOLA: Compressing LoRA using Linear Combination of Random Basis

Soroush Abbasi Koohpayegani, K L Navaneet, Parsa Nooralinejad et al.

ICLR 2024oralarXiv:2310.02556
28
citations
#1385

Competing Large Language Models in Multi-Agent Gaming Environments

Jen-Tse Huang, Eric John Li, Man Ho LAM et al.

ICLR 2025
28
citations
#1386

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Zimu Lu, Aojun Zhou, Ke Wang et al.

ICLR 2025arXiv:2410.08196
28
citations
#1387

Improving Uncertainty Estimation through Semantically Diverse Language Generation

Lukas Aichberger, Kajetan Schweighofer, Mykyta Ielanskyi et al.

ICLR 2025arXiv:2406.04306
28
citations
#1388

Nonconvex Stochastic Optimization under Heavy-Tailed Noises: Optimal Convergence without Gradient Clipping

Zijian Liu, Zhengyuan Zhou

ICLR 2025arXiv:2412.19529
28
citations
#1389

GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding

Dongping Chen, Yue Huang, Siyuan Wu et al.

ICLR 2025oralarXiv:2406.10819
28
citations
#1390

Unified Generative Modeling of 3D Molecules with Bayesian Flow Networks

Yuxuan Song, Jingjing Gong, Hao Zhou et al.

ICLR 2024arXiv:2403.15441
28
citations
#1391

DREAM: Dual Structured Exploration with Mixup for Open-set Graph Domain Adaption

Nan Yin, Mengzhu Wang, Mengzhu Wang et al.

ICLR 2024
28
citations
#1392

Denoising Autoregressive Transformers for Scalable Text-to-Image Generation

Jiatao Gu, Yuyang Wang, Yizhe Zhang et al.

ICLR 2025arXiv:2410.08159
28
citations
#1393

The LLM Surgeon

Tycho van der Ouderaa, Markus Nagel, Mart van Baalen et al.

ICLR 2024arXiv:2312.17244
28
citations
#1394

Inherently Interpretable Time Series Classification via Multiple Instance Learning

Joseph Early, Gavin Cheung, Kurt Cutajar et al.

ICLR 2024spotlightarXiv:2311.10049
28
citations
#1395

The Superposition of Diffusion Models Using the Itô Density Estimator

Marta Skreta, Lazar Atanackovic, Joey Bose et al.

ICLR 2025arXiv:2412.17762
28
citations
#1396

Compressed Context Memory for Online Language Model Interaction

Jang-Hyun Kim, Junyoung Yeom, Sangdoo Yun et al.

ICLR 2024arXiv:2312.03414
28
citations
#1397

A Geometric Framework for Understanding Memorization in Generative Models

Brendan Ross, Hamidreza Kamkari, Tongzi Wu et al.

ICLR 2025arXiv:2411.00113
28
citations
#1398

CREAM: Consistency Regularized Self-Rewarding Language Models

Zhaoyang Wang, Weilei He, Zhiyuan Liang et al.

ICLR 2025arXiv:2410.12735
28
citations
#1399

Diffusion-based Neural Network Weights Generation

Bedionita Soro, Bruno Andreis, Hayeon Lee et al.

ICLR 2025arXiv:2402.18153
28
citations
#1400

From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions

Changle Qu, Sunhao Dai, Xiaochi Wei et al.

ICLR 2025arXiv:2410.08197
28
citations