Most Cited ICLR "repobench-c-8k evaluation" Papers

6,124 papers found • Page 6 of 31

#1001

Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models

Gabriele Corso, Yilun Xu, Valentin De Bortoli et al.

ICLR 2024 • arXiv:2310.13102

#1002

Few-Shot Detection of Machine-Generated Text using Style Representations

Rafael Rivera Soto, Kailin Koch, Aleem Khan et al.

ICLR 2024 • arXiv:2401.06712

#1003

Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models

Shuai Zhao, Xiaohan Wang, Linchao Zhu et al.

ICLR 2024 • arXiv:2305.18010

#1004

On the expressiveness and spectral bias of KANs

Yixuan Wang, Jonathan Siegel, Ziming Liu et al.

ICLR 2025 • arXiv:2410.01803

#1005

Does CLIP’s generalization performance mainly stem from high train-test similarity?

Prasanna Mayilvahanan, Thaddäus Wiedemer, Evgenia Rusak et al.

ICLR 2024 • arXiv:2310.09562

#1006

DELTA: Dense Efficient Long-Range 3D Tracking for Any Video

Tuan Ngo, Peiye Zhuang, Evangelos Kalogerakis et al.

ICLR 2025 • arXiv:2410.24211

#1007

Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement

Xueyao Zhang, Xiaohui Zhang, Kainan Peng et al.

ICLR 2025 • arXiv:2502.07243

#1008

Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts

Ahmed Hendawy, Jan Peters, Carlo D'Eramo

ICLR 2024 • arXiv:2311.11385

#1009

Benchmarking and Improving Generator-Validator Consistency of Language Models

Xiang Li, Vaishnavi Shrivastava, Siyan Li et al.

ICLR 2024 • arXiv:2310.01846

#1010

Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment

Geyang Guo, Ranchi Zhao, Tianyi Tang et al.

ICLR 2024 • arXiv:2311.04072

#1011

SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration

Heming Xia, Yongqi Li, Jun Zhang et al.

ICLR 2025 • arXiv:2410.06916

#1012

Quality-Diversity through AI Feedback

Herbie Bradley, Andrew Dai, Hannah Teufel et al.

ICLR 2024 • arXiv:2310.13032

#1013

Thought Propagation: An Analogical Approach to Complex Reasoning with Large Language Models

Junchi Yu, Ran He, Rex Ying

ICLR 2024 • arXiv:2310.03965

#1014

Leveraging Optimization for Adaptive Attacks on Image Watermarks

Nils Lukas, Abdelrahman Ahmed, Lucas Fenaux et al.

ICLR 2024 • arXiv:2309.16952

#1015

Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities

Zheyuan Zhang, Fengyuan Hu, Jayjun Lee et al.

ICLR 2025 • arXiv:2410.17385

#1016

PaPaGei: Open Foundation Models for Optical Physiological Signals

Arvind Pillai, Dimitris Spathis, Fahim Kawsar et al.

ICLR 2025 • arXiv:2410.20542

#1017

Scaling Speech-Text Pre-training with Synthetic Interleaved Data

Aohan Zeng, Zhengxiao Du, Mingdao Liu et al.

ICLR 2025 • arXiv:2411.17607

#1018

Sparse Autoencoders Do Not Find Canonical Units of Analysis

Patrick Leask, Bart Bussmann, Michael Pearce et al.

ICLR 2025 • arXiv:2502.04878

#1019

Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?

Cheng Han, Qifan Wang, Yiming Cui et al.

ICLR 2024 • arXiv:2401.12902

#1020

SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models

Daniel Levy, Siba Smarak Panigrahi, Sékou-Oumar Kaba et al.

ICLR 2025 • arXiv:2502.03638

#1021

Fusing Models with Complementary Expertise

Hongyi Wang, Felipe Polo, Yuekai Sun et al.

ICLR 2024 • arXiv:2310.01542

#1022

SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation

Koichi Namekata, Sherwin Bahmani, Ziyi Wu et al.

ICLR 2025 • arXiv:2411.04989

#1023

Dual RL: Unification and New Methods for Reinforcement and Imitation Learning

Harshit Sikchi, Qinqing Zheng, Amy Zhang et al.

ICLR 2024 • spotlight • arXiv:2302.08560

#1024

Neural Optimal Transport with General Cost Functionals

Arip Asadulaev, Alexander Korotin, Vage Egiazarian et al.

ICLR 2024 • arXiv:2205.15403

#1025

VCR-Graphormer: A Mini-batch Graph Transformer via Virtual Connections

Dongqi Fu, Zhigang Hua, Yan Xie et al.

ICLR 2024 • arXiv:2403.16030

#1026

DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning

Zhengxiang Shi, Aldo Lipani

ICLR 2024 • arXiv:2309.05173

#1027

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Chenguo Lin, Panwang Pan, Bangbang Yang et al.

ICLR 2025 • arXiv:2501.16764

#1028

Human-inspired Episodic Memory for Infinite Context LLMs

Zafeirios Fountas, Martin A Benfeghoul, Adnan Oomerjee et al.

ICLR 2025 • oral • arXiv:2407.09450

#1029

Yet Another ICU Benchmark: A Flexible Multi-Center Framework for Clinical ML

Robin van de Water, Hendrik Schmidt, Paul Elbers et al.

ICLR 2024 • oral • arXiv:2306.05109

#1030

Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning

Mohamed Elsayed, A. Rupam Mahmood

ICLR 2024 • arXiv:2404.00781

#1031

A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation

Zhengbo Wang, Jian Liang, Lijun Sheng et al.

ICLR 2024 • arXiv:2402.04087

#1032

Intriguing Properties of Data Attribution on Diffusion Models

Xiaosen Zheng, Tianyu Pang, Chao Du et al.

ICLR 2024 • arXiv:2311.00500

#1033

Looped Transformers for Length Generalization

Ying Fan, Yilun Du, Kannan Ramchandran et al.

ICLR 2025 • arXiv:2409.15647

#1034

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Xiao Fu, Xian Liu, Xintao Wang et al.

ICLR 2025 • arXiv:2412.07759

#1035

Teaching Language Models to Hallucinate Less with Synthetic Tasks

Erik Jones, Hamid Palangi, Clarisse Ribeiro et al.

ICLR 2024 • arXiv:2310.06827

#1036

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

Yaxi Lu, Shenzhi Yang, Cheng Qian et al.

ICLR 2025 • arXiv:2410.12361

#1037

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Kexun Zhang, Weiran Yao, Zuxin Liu et al.

ICLR 2025 • arXiv:2408.07060

#1038

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Xiangyu Qi, Boyi Wei, Nicholas Carlini et al.

ICLR 2025 • arXiv:2412.07097

#1039

How Feature Learning Can Improve Neural Scaling Laws

Blake Bordelon, Alexander Atanasov, Cengiz Pehlevan

ICLR 2025 • arXiv:2409.17858

#1040

Mastering Memory Tasks with World Models

Mohammad Reza Samsami, Artem Zholus, Janarthanan Rajendran et al.

ICLR 2024 • oral • arXiv:2403.04253

#1041

WOODS: Benchmarks for Out-of-Distribution Generalization in Time Series

Irina Rish, Kartik Ahuja, Mohammad Javad Darvishi Bayazi et al.

ICLR 2024


#1042

Reverse Diffusion Monte Carlo

Xunpeng Huang, Hanze Dong, Yifan Hao et al.

ICLR 2024 • arXiv:2307.02037

#1043

Trajectory attention for fine-grained video motion control

Zeqi Xiao, Wenqi Ouyang, Yifan Zhou et al.

ICLR 2025 • oral • arXiv:2411.19324

#1044

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Dan Fu, Hermann Kumbong, Eric Nguyen et al.

ICLR 2024 • arXiv:2311.05908

#1045

A Unified and General Framework for Continual Learning

Zhenyi Wang, Yan Li, Li Shen et al.

ICLR 2024 • arXiv:2403.13249

#1046

EG4D: Explicit Generation of 4D Object without Score Distillation

Qi Sun, Zhiyang Guo, Ziyu Wan et al.

ICLR 2025 • oral • arXiv:2405.18132

#1047

Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series Data

YongKyung Oh, Dongyoung Lim, Sungil Kim

ICLR 2024 • spotlight • arXiv:2402.14989

#1048

Combining Induction and Transduction for Abstract Reasoning

Wen-Ding Li, Keya Hu, Carter Larsen et al.

ICLR 2025 • arXiv:2411.02272

#1049

LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors

Sheng Jin, Xueying Jiang, Jiaxing Huang et al.

ICLR 2024 • arXiv:2402.04630

#1050

SLiMe: Segment Like Me

Aliasghar Khani, Saeid Asgari, Aditya Sanghi et al.

ICLR 2024 • arXiv:2309.03179

#1051

Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators

Daniel Geng, Andrew Owens

ICLR 2024 • arXiv:2401.18085

#1052

Unlearning or Obfuscating? Jogging the Memory of Unlearned LLMs via Benign Relearning

Shengyuan Hu, Yiwei Fu, Steven Wu et al.

ICLR 2025 • arXiv:2406.13356

#1053

A-Bench: Are LMMs Masters at Evaluating AI-generated Images?

Zicheng Zhang, Haoning Wu, Chunyi Li et al.

ICLR 2025 • arXiv:2406.03070

#1054

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models

Bodhisattwa Prasad Majumder, Harshit Surana, Dhruv Agarwal et al.

ICLR 2025 • arXiv:2407.01725

#1055

Fourier Transporter: Bi-Equivariant Robotic Manipulation in 3D

Haojie Huang, Owen Howell, Dian Wang et al.

ICLR 2024 • arXiv:2401.12046

#1056

DartControl: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control

Kaifeng Zhao, Gen Li, Siyu Tang

ICLR 2025 • arXiv:2410.05260

#1057

Retrieval-Enhanced Contrastive Vision-Text Models

Ahmet Iscen, Mathilde Caron, Alireza Fathi et al.

ICLR 2024 • arXiv:2306.07196

#1058

SANA: Efficient High-Resolution Text-to-Image Synthesis with Linear Diffusion Transformers

Enze Xie, Junsong Chen, Junyu Chen et al.

ICLR 2025


#1059

Think while You Generate: Discrete Diffusion with Planned Denoising

Sulin Liu, Juno Nam, Andrew Campbell et al.

ICLR 2025 • arXiv:2410.06264

#1060

V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection

Yichao Shen, Zigang Geng, Yuhui Yuan et al.

ICLR 2024 • arXiv:2308.04409

#1061

Rethinking Channel Dependence for Multivariate Time Series Forecasting: Learning from Leading Indicators

Lifan Zhao, Yanyan Shen

ICLR 2024 • arXiv:2401.17548

#1062

Uni-Sign: Toward Unified Sign Language Understanding at Scale

Zecheng Li, Wengang Zhou, Weichao Zhao et al.

ICLR 2025 • arXiv:2501.15187

#1063

PanoDiffusion: 360-degree Panorama Outpainting via Diffusion

Tianhao Wu, Chuanxia Zheng, Tat-Jen Cham

ICLR 2024 • arXiv:2307.03177

#1064

MgNO: Efficient Parameterization of Linear Operators via Multigrid

Juncai He, Xinliang Liu, Jinchao Xu

ICLR 2024 • arXiv:2310.19809

#1065

Benign Overfitting and Grokking in ReLU Networks for XOR Cluster Data

Zhiwei Xu, Yutong Wang, Spencer Frei et al.

ICLR 2024 • arXiv:2310.02541

#1066

EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models

Koichi Namekata, Amirmojtaba Sabour, Sanja Fidler et al.

ICLR 2024 • arXiv:2401.11739

#1067

Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation

Yiming Wang, Pei Zhang, Baosong Yang et al.

ICLR 2025 • arXiv:2410.13640

#1068

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models

Zuxin Liu, Jesse Zhang, Kavosh Asadi et al.

ICLR 2024 • arXiv:2310.05905

#1069

Attention with Markov: A Curious Case of Single-layer Transformers

Ashok Makkuva, Marco Bondaschi, Adway Girish et al.

ICLR 2025 • arXiv:2402.04161

#1070

Strong Model Collapse

Elvis Dohmatob, Yunzhen Feng, Arjun Subramonian et al.

ICLR 2025 • arXiv:2410.04840

#1071

Theory, Analysis, and Best Practices for Sigmoid Self-Attention

Jason Ramapuram, Federico Danieli, Eeshan Gunesh Dhekane et al.

ICLR 2025 • arXiv:2409.04431

#1072

Training-Free Activation Sparsity in Large Language Models

James Liu, Pragaash Ponnusamy, Tianle Cai et al.

ICLR 2025 • arXiv:2408.14690

#1073

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Angelika Romanou, Negar Foroutan, Anna Sotnikova et al.

ICLR 2025 • arXiv:2411.19799

#1074

Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching

Aleksandar Makelov, Georg Lange, Atticus Geiger et al.

ICLR 2024 • arXiv:2311.17030

#1075

Watermark Anything With Localized Messages

Tom Sander, Pierre Fernandez, Alain Oliviero Durmus et al.

ICLR 2025 • arXiv:2411.07231

#1076

When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations

Aleksandar Petrov, Philip Torr, Adel Bibi

ICLR 2024 • arXiv:2310.19698

#1077

On Scaling Up 3D Gaussian Splatting Training

Hexu Zhao, Haoyang Weng, Daohan Lu et al.

ICLR 2025 • arXiv:2406.18533

#1078

Improving Pretraining Data Using Perplexity Correlations

Tristan Thrush, Christopher Potts, Tatsunori Hashimoto

ICLR 2025 • arXiv:2409.05816

#1079

PolyGCL: Graph Contrastive Learning via Learnable Spectral Polynomial Filters

Jingyu Chen, Runlin Lei, Zhewei Wei

ICLR 2024 • spotlight

#1080

Synthetic continued pretraining

Zitong Yang, Neil Band, Shuangping Li et al.

ICLR 2025 • arXiv:2409.07431

#1081

Enabling Efficient Equivariant Operations in the Fourier Basis via Gaunt Tensor Products

Shengjie Luo, Tianlang Chen, Aditi Krishnapriyan

ICLR 2024 • spotlight • arXiv:2401.10216

#1082

AirPhyNet: Harnessing Physics-Guided Neural Networks for Air Quality Prediction

Kethmi Hirushini Hettige, Jiahao Ji, Shili Xiang et al.

ICLR 2024 • oral • arXiv:2402.03784

#1083

PINNACLE: PINN Adaptive ColLocation and Experimental points selection

Gregory Kang Ruey Lau, Apivich Hemachandra, See-Kiong Ng et al.

ICLR 2024 • spotlight • arXiv:2404.07662

#1084

Elastic Feature Consolidation For Cold Start Exemplar-Free Incremental Learning

Simone Magistri, Tomaso Trinci, Albin Soutif-Cormerais et al.

ICLR 2024 • arXiv:2402.03917

#1085

Complete and Efficient Graph Transformers for Crystal Material Property Prediction

Keqiang Yan, Cong Fu, Xiaofeng Qian et al.

ICLR 2024 • arXiv:2403.11857

#1086

What is Wrong with Perplexity for Long-context Language Modeling?

Lizhe Fang, Yifei Wang, Zhaoyang Liu et al.

ICLR 2025 • arXiv:2410.23771

#1087

STanHop: Sparse Tandem Hopfield Model for Memory-Enhanced Time Series Prediction

Yu-Hsuan Wu, Jerry Hu, Weijian Li et al.

ICLR 2024 • oral • arXiv:2312.17346

#1088

Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting

Yu Liu, Baoxiong Jia, Ruijie Lu et al.

ICLR 2025 • arXiv:2502.19459

#1089

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Yucheng Li, Huiqiang Jiang, Qianhui Wu et al.

ICLR 2025 • arXiv:2412.10319

#1090

TESTAM: A Time-Enhanced Spatio-Temporal Attention Model with Mixture of Experts

Hyunwook Lee, Sungahn Ko

ICLR 2024 • oral • arXiv:2403.02600

#1091

Persistent Pre-training Poisoning of LLMs

Yiming Zhang, Javier Rando, Ivan Evtimov et al.

ICLR 2025 • arXiv:2410.13722

#1092

No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models

Seyedmorteza Sadat, Manuel Kansy, Otmar Hilliges et al.

ICLR 2025 • arXiv:2407.02687

#1093

Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets

Dominique Beaini, Shenyang (Andy) Huang, Joao Cunha et al.

ICLR 2024 • arXiv:2310.04292

#1094

LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation

Fangxun Shu, Yue Liao, Lei Zhang et al.

ICLR 2025 • arXiv:2408.15881

#1095

CLEX: Continuous Length Extrapolation for Large Language Models

Guanzheng Chen, Xin Li, Zaiqiao Meng et al.

ICLR 2024 • arXiv:2310.16450

#1096

Multi-granularity Correspondence Learning from Long-term Noisy Videos

Yijie Lin, Jie Zhang, Zhenyu Huang et al.

ICLR 2024 • oral • arXiv:2401.16702

#1097

GROOT: Learning to Follow Instructions by Watching Gameplay Videos

Shaofei Cai, Bowei Zhang, Zihao Wang et al.

ICLR 2024 • spotlight • arXiv:2310.08235

#1098

Teach LLMs to Phish: Stealing Private Information from Language Models

Ashwinee Panda, Christopher Choquette-Choo, Zhengming Zhang et al.

ICLR 2024 • arXiv:2403.00871

#1099

Sequential Controlled Langevin Diffusions

Junhua Chen, Lorenz Richter, Julius Berner et al.

ICLR 2025 • arXiv:2412.07081

#1100

Grokking as a First Order Phase Transition in Two Layer Networks

Noa Rubin, Inbar Seroussi, Zohar Ringel

ICLR 2024 • arXiv:2310.03789

#1101

Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage

Zhi Gao, Bofei Zhang, Pengxiang Li et al.

ICLR 2025 • arXiv:2412.15606

#1102

Variational Best-of-N Alignment

Afra Amini, Tim Vieira, Elliott Ash et al.

ICLR 2025 • arXiv:2407.06057

#1103

PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback

Souradip Chakraborty, Amrit Bedi, Alec Koppel et al.

ICLR 2024 • arXiv:2308.02585

#1104

Restructuring Vector Quantization with the Rotation Trick

Christopher Fifty, Ronald Junkins, Dennis Duan et al.

ICLR 2025 • arXiv:2410.06424

#1105

TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning

Dongming Wu, Jiahao Chang, Fan Jia et al.

ICLR 2024 • arXiv:2310.06753

#1106

Dynamic Diffusion Transformer

Wangbo Zhao, Yizeng Han, Jiasheng Tang et al.

ICLR 2025 • arXiv:2410.03456

#1107

ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning

Ruchika Chavhan, Da Li, Timothy Hospedales

ICLR 2025 • arXiv:2405.19237

#1108

SafeDreamer: Safe Reinforcement Learning with World Models

Weidong Huang, Jiaming Ji, Chunhe Xia et al.

ICLR 2024 • arXiv:2307.07176

#1109

ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning

Xiao Yu, Baolin Peng, Vineeth Vajipey et al.

ICLR 2025 • arXiv:2410.02052

#1110

Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions

Michael Zhang, W. Bradley Knox, Eunsol Choi

ICLR 2025 • arXiv:2410.13788

#1111

SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation

Mingjie Li, Wai Man Si, Michael Backes et al.

ICLR 2025 • arXiv:2501.01765

#1112

LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning

Zhe Li, Weihao Yuan, Yisheng He et al.

ICLR 2025 • arXiv:2410.07093

#1113

Skip-Attention: Improving Vision Transformers by Paying Less Attention

Shashank Venkataramanan, Amir Ghodrati, Yuki Asano et al.

ICLR 2024 • arXiv:2301.02240

#1114

Compositional Entailment Learning for Hyperbolic Vision-Language Models

Avik Pal, Max van Spengler, Guido D'Amely di Melendugno et al.

ICLR 2025 • arXiv:2410.06912

#1115

Efficient Evolutionary Search Over Chemical Space with Large Language Models

Haorui Wang, Marta Skreta, Cher-Tian Ser et al.

ICLR 2025 • arXiv:2406.16976

#1116

Partitioning Message Passing for Graph Fraud Detection

Wei Zhuo, Zemin Liu, Bryan Hooi et al.

ICLR 2024 • arXiv:2412.00020

#1117

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Peng Xu, Wei Ping, Xianchao Wu et al.

ICLR 2025 • arXiv:2407.14482

#1118

Fully Hyperbolic Convolutional Neural Networks for Computer Vision

Ahmad Bdeir, Kristian Schwethelm, Niels Landwehr

ICLR 2024 • arXiv:2303.15919

#1119

MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods

Mara Finkelstein, Markus Freitag

ICLR 2024 • arXiv:2309.10966

#1120

Deconstructing What Makes a Good Optimizer for Autoregressive Language Models

Rosie Zhao, Depen Morwani, David Brandfonbrener et al.

ICLR 2025


#1121

Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective

Neta Shaul, Itai Gat, Marton Havasi et al.

ICLR 2025 • arXiv:2412.03487

#1122

TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models

Ziyao Shangguan, Chuhan Li, Yuxuan Ding et al.

ICLR 2025 • oral • arXiv:2410.23266

#1123

Making RL with Preference-based Feedback Efficient via Randomization

Runzhe Wu, Wen Sun

ICLR 2024 • arXiv:2310.14554

#1124

Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

Tong Wu, Shujian Zhang, Kaiqiang Song et al.

ICLR 2025 • arXiv:2410.09102

#1125

FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs

Zhiting Fan, Ruizhe Chen, Tianxiang Hu et al.

ICLR 2025 • arXiv:2410.19317

#1126

Transformers Provably Solve Parity Efficiently with Chain of Thought

Juno Kim, Taiji Suzuki

ICLR 2025 • arXiv:2410.08633

#1127

Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video

Shashank Venkataramanan, Mamshad Nayeem Rizve, Joao Carreira et al.

ICLR 2024 • arXiv:2310.08584

#1128

DeciMamba: Exploring the Length Extrapolation Potential of Mamba

Assaf Ben-Kish, Itamar Zimerman, Shady Abu-Hussein et al.

ICLR 2025 • arXiv:2406.14528

#1129

ControlAR: Controllable Image Generation with Autoregressive Models

Zongming Li, Tianheng Cheng, Shoufa Chen et al.

ICLR 2025 • arXiv:2410.02705

#1130

COMBO: Compositional World Models for Embodied Multi-Agent Cooperation

Hongxin Zhang, Zeyuan Wang, Qiushi Lyu et al.

ICLR 2025 • arXiv:2404.10775

#1131

Generative Sliced MMD Flows with Riesz Kernels

Johannes Hertrich, Christian Wald, Fabian Altekrüger et al.

ICLR 2024 • arXiv:2305.11463

#1132

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-Time Alignment

Yuancheng Xu, Udari Sehwag, Alec Koppel et al.

ICLR 2025 • arXiv:2410.08193

#1133

Preserving Diversity in Supervised Fine-Tuning of Large Language Models

Ziniu Li, Congliang Chen, Tian Xu et al.

ICLR 2025 • arXiv:2408.16673

#1134

Do LLM Agents Have Regret? A Case Study in Online Learning and Games

Chanwoo Park, Xiangyu Liu, Asuman Ozdaglar et al.

ICLR 2025 • arXiv:2403.16843

#1135

Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond

Qizhou Wang, Jin Zhou, (Andrew) Zhanke Zhou et al.

ICLR 2025 • arXiv:2502.19301

#1136

How to Fine-Tune Vision Models with SGD

Ananya Kumar, Ruoqi Shen, Sebastien Bubeck et al.

ICLR 2024 • arXiv:2211.09359

#1137

LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses

Xin Liu, Muhammad Khalifa, Lu Wang

ICLR 2024 • arXiv:2310.19208

#1138

Large Language Models are Efficient Learners of Noise-Robust Speech Recognition

Yuchen Hu, Chen Chen, Chao-Han Huck Yang et al.

ICLR 2024 • spotlight • arXiv:2401.10446

#1139

Momentum Benefits Non-iid Federated Learning Simply and Provably

Ziheng Cheng, Xinmeng Huang, Pengfei Wu et al.

ICLR 2024 • arXiv:2306.16504

#1140

Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction

Jiatong Shi, Hirofumi Inaguma, Xutai Ma et al.

ICLR 2024 • spotlight • arXiv:2310.02720

#1141

AgentStudio: A Toolkit for Building General Virtual Agents

Longtao Zheng, Zhiyuan Huang, Zhenghai Xue et al.

ICLR 2025 • arXiv:2403.17918

#1142

Diffusion Bridge Implicit Models

Kaiwen Zheng, Guande He, Jianfei Chen et al.

ICLR 2025 • arXiv:2405.15885

#1143

Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors

Ido Amos, Jonathan Berant, Ankit Gupta

ICLR 2024 • arXiv:2310.02980

#1144

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Xuehai He, Weixi Feng, Kaizhi Zheng et al.

ICLR 2025 • arXiv:2406.08407

#1145

FreeVS: Generative View Synthesis on Free Driving Trajectory

Qitai Wang, Lue Fan, Yuqi Wang et al.

ICLR 2025 • arXiv:2410.18079

#1146

Provable Compositional Generalization for Object-Centric Learning

Thaddäus Wiedemer, Jack Brady, Alexander Panfilov et al.

ICLR 2024 • arXiv:2310.05327

#1147

PAD: Personalized Alignment of LLMs at Decoding-time

Ruizhe Chen, Xiaotian Zhang, Meng Luo et al.

ICLR 2025 • arXiv:2410.04070

#1148

Reasoning with Latent Diffusion in Offline Reinforcement Learning

Siddarth Venkatraman, Shivesh Khaitan, Ravi Tej Akella et al.

ICLR 2024 • oral • arXiv:2309.06599

#1149

CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning

Ji Qi, Ming Ding, Weihan Wang et al.

ICLR 2025 • arXiv:2402.04236

#1150

Federated Recommendation with Additive Personalization

Zhiwei Li, Guodong Long, Tianyi Zhou

ICLR 2024 • arXiv:2301.09109

#1151

Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion

Chaodong Xiao, Minghan Li, Zhengqiang Zhang et al.

ICLR 2025 • arXiv:2410.15091

#1152

SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects

Jiayi Liu, Denys Iliash, Angel Chang et al.

ICLR 2025 • arXiv:2410.16499

#1153

Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

Yongxin Guo, Zhenglin Cheng, Xiaoying Tang et al.

ICLR 2025 • arXiv:2405.14297

#1154

OpenTab: Advancing Large Language Models as Open-domain Table Reasoners

Kezhi Kong, Jiani Zhang, Zhengyuan Shen et al.

ICLR 2024 • arXiv:2402.14361

#1155

Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws

Yiding Jiang, Allan Zhou, Zhili Feng et al.

ICLR 2025 • arXiv:2410.11820

#1156

Knowledge Distillation Based on Transformed Teacher Matching

Kaixiang Zheng, En-Hui Yang

ICLR 2024 • arXiv:2402.11148

#1157

MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Peng Jin, Bo Zhu, Yuan Li et al.

ICLR 2025 • arXiv:2410.07348

#1158

VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs

Ling Yang, Ye Tian, Minkai Xu et al.

ICLR 2024 • arXiv:2308.02117

#1159

Dynamic Sparse Training with Structured Sparsity

Mike Lasby, Anna Golubeva, Utku Evci et al.

ICLR 2024 • arXiv:2305.02299

#1160

Bespoke Solvers for Generative Flow Models

Neta Shaul, Juan Perez, Ricky T. Q. Chen et al.

ICLR 2024 • spotlight • arXiv:2310.19075

#1161

Towards Energy Efficient Spiking Neural Networks: An Unstructured Pruning Framework

Xinyu Shi, Jianhao Ding, Zecheng Hao et al.

ICLR 2024 • spotlight

#1162

Social-Transmotion: Promptable Human Trajectory Prediction

Saeed Saadatnejad, Yang Gao, Kaouther Messaoud et al.

ICLR 2024 • oral • arXiv:2312.16168

#1163

Reconstructive Visual Instruction Tuning

Haochen Wang, Anlin Zheng, Yucheng Zhao et al.

ICLR 2025 • arXiv:2410.09575

#1164

Subtractive Mixture Models via Squaring: Representation and Learning

Lorenzo Loconte, Aleksanteri Sladek, Stefan Mengel et al.

ICLR 2024 • spotlight • arXiv:2310.00724

#1165

TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation

Juntong Shi, Minkai Xu, Harper Hua et al.

ICLR 2025 • arXiv:2410.20626

#1166

Preference Optimization for Reasoning with Pseudo Feedback

Fangkai Jiao, Geyang Guo, Xingxing Zhang et al.

ICLR 2025 • arXiv:2411.16345

#1167

Revisiting Link Prediction: a data perspective

Haitao Mao, Juanhui Li, Harry Shomer et al.

ICLR 2024 • arXiv:2310.00793

#1168

Sufficient Context: A New Lens on Retrieval Augmented Generation Systems

Hailey Joren, Jianyi Zhang, Chun-Sung Ferng et al.

ICLR 2025 • arXiv:2411.06037

#1169

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Yoad Tewel, Rinon Gal, Dvir Samuel et al.

ICLR 2025 • arXiv:2411.07232

#1170

A Closer Look at Machine Unlearning for Large Language Models

Xiaojian Yuan, Tianyu Pang, Chao Du et al.

ICLR 2025 • arXiv:2410.08109

#1171

One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

Tao Liu, Kai Wang, Senmao Li et al.

ICLR 2025 • arXiv:2501.13554

#1172

LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs

Yuhao Wu, Ming Shan Hee, Zhiqiang Hu et al.

ICLR 2025 • arXiv:2409.02076

#1173

PTaRL: Prototype-based Tabular Representation Learning via Space Calibration

Hangting Ye, Wei Fan, Xiaozhuang Song et al.

ICLR 2024 • spotlight • arXiv:2407.05364

#1174

Can LLMs Understand Time Series Anomalies?

Zihao Zhou, Rose Yu

ICLR 2025 • arXiv:2410.05440

#1175

A New Perspective on Shampoo's Preconditioner

Depen Morwani, Itai Shapira, Nikhil Vyas et al.

ICLR 2025 • arXiv:2406.17748

#1176

ReMasker: Imputing Tabular Data with Masked Autoencoding

Tianyu Du, Luca Melis, Ting Wang

ICLR 2024 • arXiv:2309.13793

#1177

Do Generated Data Always Help Contrastive Learning?

Yifei Wang, Jizhe Zhang, Yisen Wang

ICLR 2024 • arXiv:2403.12448

#1178

How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models

Pascal Chang, Jingwei Tang, Markus Gross et al.

ICLR 2024 • oral • arXiv:2504.03072

#1179

EmbodiedSAM: Online Segment Any 3D Thing in Real Time

Xiuwei Xu, Huangxing Chen, Linqing Zhao et al.

ICLR 2025 • arXiv:2408.11811

#1180

HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction

Shengji Tang, Weicai Ye, Peng Ye et al.

ICLR 2025 • arXiv:2410.06245

#1181

Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI

Robert Hönig, Javier Rando, Nicholas Carlini et al.

ICLR 2025 • arXiv:2406.12027

#1182

Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering

Ziyu Zhao, Tao Shen, Didi Zhu et al.

ICLR 2025 • arXiv:2409.16167

#1183

Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning

Zhuoyan Xu, Zhenmei Shi, Junyi Wei et al.

ICLR 2024 • arXiv:2402.15017

#1184

nGPT: Normalized Transformer with Representation Learning on the Hypersphere

Ilya Loshchilov, Cheng-Ping Hsieh, Simeng Sun et al.

ICLR 2025 • arXiv:2410.01131

#1185

The Consensus Game: Language Model Generation via Equilibrium Search

Athul Jacob, Yikang Shen, Gabriele Farina et al.

ICLR 2024 • spotlight • arXiv:2310.09139

#1186

ToolGen: Unified Tool Retrieval and Calling via Generation

Renxi Wang, Xudong Han, Lei Ji et al.

ICLR 2025 • arXiv:2410.03439

#1187

Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training

Maximillian Chen, Ruoxi Sun, Tomas Pfister et al.

ICLR 2025 • arXiv:2406.00222

#1188

Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step

Mingyuan Zhou, Huangjie Zheng, Yi Gu et al.

ICLR 2025 • arXiv:2410.14919

#1189

X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale

Haoran Xu, Kenton Murray, Philipp Koehn et al.

ICLR 2025 • arXiv:2410.03115

#1190

$R^2$-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning

Mintong Kang, Bo Li

ICLR 2025 • arXiv:2407.05557

#1191

DEEM: Diffusion models serve as the eyes of large language models for image perception

Run Luo, Yunshui Li, Longze Chen et al.

ICLR 2025 • arXiv:2405.15232

#1192

Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming

Yilun Hao, Yang Zhang, Chuchu Fan

ICLR 2025 • arXiv:2410.12112

#1193

Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs

Jonas Hübotter, Sascha Bongni, Ido Hakimi et al.

ICLR 2025 • arXiv:2410.08020

#1194

Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain

Marcus J. Min, Yangruibo Ding, Luca Buratti et al.

ICLR 2024 • arXiv:2310.14053

#1195

Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs

Woomin Song, Seunghyuk Oh, Sangwoo Mo et al.

ICLR 2024 • arXiv:2404.10308

#1196

Jointly Training Large Autoregressive Multimodal Models

Emanuele Aiello, Lili Yu, Yixin Nie et al.

ICLR 2024 • arXiv:2309.15564

#1197

Low Rank Matrix Completion via Robust Alternating Minimization in Nearly Linear Time

Yuzhou Gu, Zhao Song, Junze Yin et al.

ICLR 2024 • arXiv:2302.11068

#1198

Spurious Feature Diversification Improves Out-of-distribution Generalization

Lin Yong, Lu Tan, Yifan Hao et al.

ICLR 2024 • arXiv:2309.17230

#1199

VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning

Yichao Liang, Nishanth Kumar, Hao Tang et al.

ICLR 2025 • arXiv:2410.23156

#1200

Scalable Language Model with Generalized Continual Learning

Bohao Peng, Zhuotao Tian, Shu Liu et al.

ICLR 2024 • arXiv:2404.07470
