Most Cited ICLR "linear recurrent networks" Papers
6,124 papers found • Page 7 of 31
Jointly Training Large Autoregressive Multimodal Models
Emanuele Aiello, Lili Yu, Yixin Nie et al.
Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
Marcus J. Min, Yangruibo Ding, Luca Buratti et al.
O(d/T) Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions
Gen Li, Yuling Yan
Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training
Maximillian Chen, Ruoxi Sun, Tomas Pfister et al.
Computational Limits of Low-Rank Adaptation (LoRA) Fine-Tuning for Transformer Models
Jerry Yao-Chieh Hu, Maojiang Su, En-Jui Kuo et al.
SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning
Yichen Wu, Hongming Piao, Long-Kai Huang et al.
Low Rank Matrix Completion via Robust Alternating Minimization in Nearly Linear Time
Yuzhou Gu, Zhao Song, Junze Yin et al.
Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming
Yilun Hao, Yang Zhang, Chuchu Fan
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
Yuheng Zhang, Dian Yu, Baolin Peng et al.
ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
Ezra Karger, Houtan Bastani, Chen Yueh-Han et al.
Text4Seg: Reimagining Image Segmentation as Text Generation
Mengcheng Lan, Chaofeng Chen, Yue Zhou et al.
RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Xinze Li, Sen Mei, Zhenghao Liu et al.
Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment
Gregor Bachmann, Sotiris Anagnostidis, Albert Pumarola et al.
Scalable Language Model with Generalized Continual Learning
Bohao Peng, Zhuotao Tian, Shu Liu et al.
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Ruizhe Shi, Yuyao Liu, Yanjie Ze et al.
Early Stopping Against Label Noise Without Validation Data
Suqin Yuan, Lei Feng, Tongliang Liu
Let Models Speak Ciphers: Multiagent Debate through Embeddings
Chau Pham, Boyi Liu, Yingxiang Yang et al.
How Does Unlabeled Data Provably Help Out-of-Distribution Detection?
Xuefeng Du, Zhen Fang, Ilias Diakonikolas et al.
What to align in multimodal contrastive learning?
Benoit Dufumier, Javiera Castillo Navarro, Devis Tuia et al.
Neural Monge Map estimation and its applications
Shaojun Ma, Yongxin Chen, Hao-Min Zhou et al.
Revisiting text-to-image evaluation with Gecko: on metrics, prompts, and human rating
Olivia Wiles, Chuhan Zhang, Isabela Albuquerque et al.
On the Relation between Trainability and Dequantization of Variational Quantum Learning Models
Elies Gil-Fuster, Casper Gyurik, Adrian Perez-Salinas et al.
Random Feature Amplification: Feature Learning and Generalization in Neural Networks
Spencer Frei, Niladri Chatterji, Peter L. Bartlett
AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
Ximing Lu, Melanie Sclar, Skyler Hallinan et al.
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq, Qingfeng Lan, Pan Xu et al.
LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection
Sifan Zhou, Liang Li, Xinyu Zhang et al.
Scaling Wearable Foundation Models
Girish Narayanswamy, Xin Liu, Kumar Ayush et al.
Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection
Jiawei Liang, Siyuan Liang, Aishan Liu et al.
Interpreting the Second-Order Effects of Neurons in CLIP
Yossi Gandelsman, Alexei Efros, Jacob Steinhardt
gRNAde: Geometric Deep Learning for 3D RNA inverse design
Chaitanya Joshi, Arian Jamasb, Ramon Viñas et al.
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Wenda Xu, Rujun Han, Zifeng Wang et al.
Gramian Multimodal Representation Learning and Alignment
Giordano Cicchetti, Eleonora Grassucci, Luigi Sigillo et al.
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
Junkang Wu, Yuexiang Xie, Zhengyi Yang et al.
The Hidden Language of Diffusion Models
Hila Chefer, Oran Lang, Mor Geva et al.
Training Unbiased Diffusion Models From Biased Dataset
Yeongmin Kim, Byeonghu Na, Minsang Park et al.
Logical Languages Accepted by Transformer Encoders with Hard Attention
Pablo Barcelo, Alexander Kozachinskiy, Anthony W. Lin et al.
CABINET: Content Relevance-based Noise Reduction for Table Question Answering
Sohan Patnaik, Heril Changwal, Milan Aggarwal et al.
Memorization Capacity of Multi-Head Attention in Transformers
Sadegh Mahdavi, Renjie Liao, Christos Thrampoulidis
Don't Play Favorites: Minority Guidance for Diffusion Models
Soobin Um, Suhyeon Lee, Jong Chul Ye
SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training
Kazem Meidani, Parshin Shojaee, Chandan Reddy et al.
VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing
Xiangpeng Yang, Linchao Zhu, Hehe Fan et al.
Hyper-Connections
Defa Zhu, Hongzhi Huang, Zihao Huang et al.
AgentRefine: Enhancing Agent Generalization through Refinement Tuning
Dayuan Fu, Keqing He, Yejie Wang et al.
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen, Ruizhe Li, Yuchen Hu et al.
MoDeGPT: Modular Decomposition for Large Language Model Compression
Chi-Heng Lin, Shangqian Gao, James Smith et al.
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
Takeru Miyato, Bernhard Jaeger, Max Welling et al.
Self-Boosting Large Language Models with Synthetic Preference Data
Qingxiu Dong, Li Dong, Xingxing Zhang et al.
Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs
Feiyang Kang, Hoang Anh Just, Yifan Sun et al.
CPPO: Continual Learning for Reinforcement Learning with Human Feedback
Han Zhang, Yu Lei, Lin Gui et al.
InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences
Hongkai Zheng, Wenda Chu, Bingliang Zhang et al.
Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations
Katie Matton, Robert Ness, John Guttag et al.
Context-Alignment: Activating and Enhancing LLMs Capabilities in Time Series
Yuxiao Hu, Qian Li, Dongxiao Zhang et al.
Harnessing Diversity for Important Data Selection in Pretraining Large Language Models
Chi Zhang, Huaping Zhong, Kuan Zhang et al.
OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation
Yuchen Lin, Chenguo Lin, Jianjin Xu et al.
Interpretable Diffusion via Information Decomposition
Xianghao Kong, Ollie Liu, Han Li et al.
CAKE: Cascading and Adaptive KV Cache Eviction with Layer Preferences
Ziran Qin, Yuchen Cao, Mingbao Lin et al.
CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes
Yang Liu, Chuanchen Luo, Zhongkai Mao et al.
ICLR: In-Context Learning of Representations
Core Francisco Park, Andrew Lee, Ekdeep Singh Lubana et al.
Number Cookbook: Number Understanding of Language Models and How to Improve It
Haotong Yang, Yi Hu, Shijia Kang et al.
Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting
Rong Dai, Yonggang Zhang, Ang Li et al.
CausalLM is not optimal for in-context learning
Nan Ding, Tomer Levinboim, Jialin Wu et al.
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
Xiu Yuan, Tongzhou Mu, Stone Tao et al.
TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights
Aiwei Liu, Haoping Bai, Zhiyun Lu et al.
Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix
Yingyu Liang, Jiangxuan Long, Zhenmei Shi et al.
Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape
Rundi Wu, Ruoshi Liu, Carl Vondrick et al.
Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models Trained on Corrupted Data
Asad Aali, Giannis Daras, Brett Levac et al.
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
Peijie Dong, Lujun Li, Yuedong Zhong et al.
PeFLL: Personalized Federated Learning by Learning to Learn
Jonathan Scott, Hossein Zakerinia, Christoph Lampert
Exploring Diffusion Time-steps for Unsupervised Representation Learning
Zhongqi Yue, Jiankun Wang et al.
Language Model Alignment in Multilingual Trolley Problems
Zhijing Jin, Max Kleiman-Weiner, Giorgio Piatti et al.
LILO: Learning Interpretable Libraries by Compressing and Documenting Code
Gabriel Grand, Lio Wong, Maddy Bowers et al.
Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View
Xuan Liu, Jie Zhang, Haoyang Shang et al.
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Peng Xia, Siwei Han, Shi Qiu et al.
From Tokens to Words: On the Inner Lexicon of LLMs
Guy Kaplan, Matanel Oren, Yuval Reif et al.
Initializing Models with Larger Ones
Zhiqiu Xu, Yanjie Chen, Kirill Vishniakov et al.
Image Inpainting via Tractable Steering of Diffusion Models
Anji Liu, Mathias Niepert, Guy Van den Broeck
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists
Yulu Gan, Sung Woo Park, Alexander Schubert et al.
Spatio-Temporal Few-Shot Learning via Diffusive Neural Network Generation
Yuan Yuan, Chenyang Shao, Jingtao Ding et al.
SEGNO: Generalizing Equivariant Graph Neural Networks with Physical Inductive Biases
Yang Liu, Jiashun Cheng, Haihong Zhao et al.
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS
Saman Kazemkhani, Aarav Pandya, Daphne Cornelisse et al.
Forgetting Transformer: Softmax Attention with a Forget Gate
Zhixuan Lin, Evgenii Nikishin, Xu He et al.
Herald: A Natural Language Annotated Lean 4 Dataset
Guoxiong Gao, Yutong Wang, Jiedong Jiang et al.
TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
Leqi Shen, Tianxiang Hao, Tao He et al.
Domain-Agnostic Molecular Generation with Chemical Feedback
Yin Fang, Ningyu Zhang, Zhuo Chen et al.
CLIP the Bias: How Useful is Balancing Data in Multimodal Learning?
Ibrahim Alabdulmohsin, Xiao Wang, Andreas Steiner et al.
CViT: Continuous Vision Transformer for Operator Learning
Sifan Wang, Jacob Seidman, Shyam Sankaran et al.
CBQ: Cross-Block Quantization for Large Language Models
Xin Ding, Xiaoyu Liu, Zhijun Tu et al.
STAMP: Scalable Task- And Model-agnostic Collaborative Perception
Xiangbo Gao, Runsheng Xu, Jiachen Li et al.
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Siyuan Qi, Shuo Chen, Yexin Li et al.
FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs
Sepehr Dehdashtian, Lan Wang, Vishnu Boddeti
System 1.x: Learning to Balance Fast and Slow Planning with Language Models
Swarnadeep Saha, Archiki Prasad, Justin Chen et al.
SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation
Qianxu Wang, Haotong Zhang, Congyue Deng et al.
Sparse autoencoders reveal selective remapping of visual concepts during adaptation
Hyesu Lim, Jinho Choi, Jaegul Choo et al.
Longhorn: State Space Models are Amortized Online Learners
Bo Liu, Rui Wang, Lemeng Wu et al.
GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion
Xueyi Liu, Li Yi
Transformer Fusion with Optimal Transport
Moritz Imfeld, Jacopo Graldi, Marco Giordano et al.
PivotMesh: Generic 3D Mesh Generation via Pivot Vertices Guidance
Haohan Weng, Yikai Wang, Tong Zhang et al.
OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling
Zhicheng Yang, Yiwei Wang, Yinya Huang et al.
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
Hanyu Wang, Saksham Suri, Yixuan Ren et al.
NECO: NEural Collapse Based Out-of-distribution detection
Mouïn Ben Ammar, Nacim Belkhir, Sebastian Popescu et al.
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
Ziteng Wang, Jun Zhu, Jianfei Chen
Methods for Convex $(L_0,L_1)$-Smooth Optimization: Clipping, Acceleration, and Adaptivity
Eduard Gorbunov, Nazarii Tupitsa, Sayantan Choudhury et al.
McEval: Massively Multilingual Code Evaluation
Linzheng Chai, Shukai Liu, Jian Yang et al.
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
Chenyu Zhang, Han Wang, Aritra Mitra et al.
Fair and Efficient Contribution Valuation for Vertical Federated Learning
Zhenan Fan, Huang Fang, Xinglu Wang et al.
From Isolated Conversations to Hierarchical Schemas: Dynamic Tree Memory Representation for LLMs
Alireza Rezazadeh, Zichao Li, Wei Wei et al.
REEF: Representation Encoding Fingerprints for Large Language Models
Jie Zhang, Dongrui Liu, Chen Qian et al.
Spurious Forgetting in Continual Learning of Language Models
Junhao Zheng, Xidi Cai, Shengjie Qiu et al.
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Lucas D. Lingle
GOFA: A Generative One-For-All Model for Joint Graph Language Modeling
Lecheng Kong, Jiarui Feng, Hao Liu et al.
Closing the Curious Case of Neural Text Degeneration
Matthew Finlayson, John Hewitt, Alexander Koller et al.
Feature emergence via margin maximization: case studies in algebraic tasks
Depen Morwani, Benjamin Edelman, Costin-Andrei Oncescu et al.
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Junyan Ye, Baichuan Zhou, Zilong Huang et al.
LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding
Doohyuk Jang, Sihwan Park, June Yong Yang et al.
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Sihang Li, Jin Huang, Jiaxi Zhuang et al.
Image Watermarks are Removable using Controllable Regeneration from Clean Noise
Yepeng Liu, Yiren Song, Hai Ci et al.
Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs
Qi Wu, Yubo Zhao, Yifan Wang et al.
Long-Short-Range Message-Passing: A Physics-Informed Framework to Capture Non-Local Interaction for Scalable Molecular Dynamics Simulation
Yunyang Li, Yusong Wang, Lin Huang et al.
Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer
Yanjun Zhao, Sizhe Dang, Haishan Ye et al.
Achieving Human Parity in Content-Grounded Datasets Generation
Asaf Yehudai, Boaz Carmeli, Yosi Mass et al.
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation
Xiaojuan Wang, Boyang Zhou, Brian Curless et al.
Early Neuron Alignment in Two-layer ReLU Networks with Small Initialization
Hancheng Min, Enrique Mallada, Rene Vidal
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Jiacheng Chen, Tianhao Liang, Sherman Siu et al.
Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark
Tsung-Han Wu, Giscard Biamby, Jerome Quenum et al.
DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation
Zilin Si, Gu Zhang, Qingwei Ben et al.
Backdoor Federated Learning by Poisoning Backdoor-Critical Layers
Haomin Zhuang, Mingxian Yu, Hao Wang et al.
Learning to Reject with a Fixed Predictor: Application to Decontextualization
Christopher Mohri, Daniel Andor, Eunsol Choi et al.
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
Wenbo Hu, Jia-Chen Gu, Zi-Yi Dou et al.
Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge
Haomiao Xiong, Zongxin Yang, Jiazuo Yu et al.
Understanding In-Context Learning from Repetitions
Jianhao (Elliott) Yan, Jin Xu, Chiyu Song et al.
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation
Haotian Yan, Ming Wu, Chuang Zhang
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
Zehan Wang, Ziang Zhang, Minjie Hong et al.
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
Jeonghye Kim, Su Young Lee, Woojun Kim et al.
On Large Language Model Continual Unlearning
Chongyang Gao, Lixu Wang, Kaize Ding et al.
InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences
Chenyang Zhu, Kai Li, Yue Ma et al.
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Zhiyuan Zhou, Andy Peng, Qiyang Li et al.
Efficient Dictionary Learning with Switch Sparse Autoencoders
Anish Mudide, Josh Engels, Eric Michaud et al.
RetroBridge: Modeling Retrosynthesis with Markov Bridges
Ilia Igashov, Arne Schneuing, Marwin Segler et al.
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling
Jixun Yao, Hexin Liu, Chen Chen et al.
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning
Moritz Reuss, Jyothish Pari, Pulkit Agrawal et al.
Understanding Addition in Transformers
Philip Quirke, Fazl Barez
Learning the greatest common divisor: explaining transformer predictions
François Charton
Can Knowledge Editing Really Correct Hallucinations?
Baixiang Huang, Canyu Chen, Xiongxiao Xu et al.
ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding
Zhengzhuo Xu, Bowen Qu, Yiyan Qi et al.
Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection
Lichen Bai, Shitong Shao, Zikai Zhou et al.
Multimodal Situational Safety
Kaiwen Zhou, Chengzhi Liu, Xuandong Zhao et al.
Object-Aware Inversion and Reassembly for Image Editing
Zhen Yang, Ganggui Ding, Wen Wang et al.
Parallelizing non-linear sequential models over the sequence length
Yi Heng Lim, Qi Zhu, Joshua Selfridge et al.
Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors
Weixuan Wang, Jingyuan Yang, Wei Peng
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
Maojia Song, Shang Hong Sim, Rishabh Bhardwaj et al.
Ghost on the Shell: An Expressive Representation of General 3D Shapes
Zhen Liu, Yao Feng, Yuliang Xiu et al.
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding
Zhongyi Shui, Jianpeng Zhang, Weiwei Cao et al.
Holistically Evaluating the Environmental Impact of Creating Language Models
Jacob Morrison, Clara Na, Jared Fernandez et al.
Learning Energy Decompositions for Partial Inference in GFlowNets
Hyosoon Jang, Minsu Kim, Sungsoo Ahn
How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework
Yinuo Ren, Haoxuan Chen, Grant Rotskoff et al.
Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations
Xinyue Xu, Yi Qin, Lu Mi et al.
ASID: Active Exploration for System Identification in Robotic Manipulation
Marius Memmel, Andrew Wagenmaker, Chuning Zhu et al.
A Formal Framework for Understanding Length Generalization in Transformers
Xinting Huang, Andy Yang, Satwik Bhattamishra et al.
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models
Zhenyu Pan, Haozheng Luo, Manling Li et al.
Biased Temporal Convolution Graph Network for Time Series Forecasting with Missing Values
Xiaodan Chen, Xiucheng Li, Bo Liu et al.
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life
Yu Ying Chiu, Liwei Jiang, Yejin Choi
PolyVoice: Language Models for Speech to Speech Translation
Qianqian Dong, Zhiying Huang, Qiao Tian et al.
CAT-3DGS: A Context-Adaptive Triplane Approach to Rate-Distortion-Optimized 3DGS Compression
Yu-Ting Zhan, Cheng-Yuan Ho, He-Bi Yang et al.
Multimodal Patient Representation Learning with Missing Modalities and Labels
Zhenbang Wu, Anant Dadu, Nicholas Tustison et al.
Batched Low-Rank Adaptation of Foundation Models
Yeming Wen, Swarat Chaudhuri
On the Stability of Expressive Positional Encodings for Graphs
Yinan Huang, William Lu, Joshua Robinson et al.
Machine Unlearning Fails to Remove Data Poisoning Attacks
Martin Pawelczyk, Jimmy Di, Yiwei Lu et al.
Posterior Sampling Based on Gradient Flows of the MMD with Negative Distance Kernel
Paul Hagemann, Johannes Hertrich, Fabian Altekrüger et al.
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Justin Deschenaux, Caglar Gulcehre
Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron
Yiran Zhao, Wenxuan Zhang, Yuxi Xie et al.
Copula Conformal prediction for multi-step time series prediction
Sophia Sun, Rose Yu
GraphArena: Evaluating and Exploring Large Language Models on Graph Computation
Jianheng Tang, Qifan Zhang, Yuhan Li et al.
Can Large Language Models Understand Symbolic Graphics Programs?
Zeju Qiu, Weiyang Liu, Haiwen Feng et al.
UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model in Data Science
Yazheng Yang, Yuqi Wang, Guang Liu et al.
Scaling physics-informed hard constraints with mixture-of-experts
Nithin Chalapathi, Yiheng Du, Aditi Krishnapriyan
Energy-Weighted Flow Matching for Offline Reinforcement Learning
Shiyuan Zhang, Weitong Zhang, Quanquan Gu
PhyloGFN: Phylogenetic inference with generative flow networks
Ming Yang Zhou, Zichao Yan, Elliot Layne et al.
Contrastive Learning is Spectral Clustering on Similarity Graph
Zhiquan Tan, Yifan Zhang, Jingqin Yang et al.
CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning
Hao Cui, Zahra Shamsi, Gowoon Cheon et al.
Reducing Hallucinations in Large Vision-Language Models via Latent Space Steering
Sheng Liu, Haotian Ye, James Y Zou
PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training
Cong Chen, Mingyu Liu, Chenchen Jing et al.
Your Mixture-of-Experts LLM Is Secretly an Embedding Model for Free
Ziyue Li, Tianyi Zhou
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling
Andre Cornman, Jacob West-Roberts, Antonio Camargo et al.
NOLA: Compressing LoRA using Linear Combination of Random Basis
Soroush Abbasi Koohpayegani, K L Navaneet, Parsa Nooralinejad et al.
Competing Large Language Models in Multi-Agent Gaming Environments
Jen-Tse Huang, Eric John Li, Man Ho Lam et al.
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Zimu Lu, Aojun Zhou, Ke Wang et al.
Improving Uncertainty Estimation through Semantically Diverse Language Generation
Lukas Aichberger, Kajetan Schweighofer, Mykyta Ielanskyi et al.
Nonconvex Stochastic Optimization under Heavy-Tailed Noises: Optimal Convergence without Gradient Clipping
Zijian Liu, Zhengyuan Zhou
GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding
Dongping Chen, Yue Huang, Siyuan Wu et al.
Unified Generative Modeling of 3D Molecules with Bayesian Flow Networks
Yuxuan Song, Jingjing Gong, Hao Zhou et al.
DREAM: Dual Structured Exploration with Mixup for Open-set Graph Domain Adaption
Nan Yin, Mengzhu Wang et al.
Denoising Autoregressive Transformers for Scalable Text-to-Image Generation
Jiatao Gu, Yuyang Wang, Yizhe Zhang et al.
The LLM Surgeon
Tycho van der Ouderaa, Markus Nagel, Mart van Baalen et al.
Inherently Interpretable Time Series Classification via Multiple Instance Learning
Joseph Early, Gavin Cheung, Kurt Cutajar et al.
The Superposition of Diffusion Models Using the Itô Density Estimator
Marta Skreta, Lazar Atanackovic, Joey Bose et al.
Compressed Context Memory for Online Language Model Interaction
Jang-Hyun Kim, Junyoung Yeom, Sangdoo Yun et al.
A Geometric Framework for Understanding Memorization in Generative Models
Brendan Ross, Hamidreza Kamkari, Tongzi Wu et al.
CREAM: Consistency Regularized Self-Rewarding Language Models
Zhaoyang Wang, Weilei He, Zhiyuan Liang et al.
Diffusion-based Neural Network Weights Generation
Bedionita Soro, Bruno Andreis, Hayeon Lee et al.
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
Changle Qu, Sunhao Dai, Xiaochi Wei et al.