Most Cited ICLR "language model applications" Papers

6,124 papers found • Page 20 of 31


#3801

Self-Improving Robust Preference Optimization

Eugene Choi, Arash Ahmadian, Matthieu Geist et al.

ICLR 2025 • arXiv:2406.01660

#3802

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Sreyan Ghosh, Sonal Kumar, Zhifeng Kong et al.

ICLR 2025 • arXiv:2410.02056

#3803

Fugatto 1: Foundational Generative Audio Transformer Opus 1

Rafael Valle, Rohan Badlani, Zhifeng Kong et al.

ICLR 2025

#3804

Linear Representations of Political Perspective Emerge in Large Language Models

Junsol Kim, James Evans, Aaron Schein

ICLR 2025 • arXiv:2503.02080

#3805

Looking Backward: Streaming Video-to-Video Translation with Feature Banks

Feng Liang, Akio Kodaira, Chenfeng Xu et al.

ICLR 2025 • oral • arXiv:2405.15757

#3806

Why Does the Effective Context Length of LLMs Fall Short?

Chenxin An, Jun Zhang, Ming Zhong et al.

ICLR 2025 • arXiv:2410.18745

#3807

Dobi-SVD: Differentiable SVD for LLM Compression and Some New Perspectives

Qinsi Wang, Jinghan Ke, Masayoshi Tomizuka et al.

ICLR 2025 • arXiv:2502.02723

#3808

Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation

Shengjie Ma, Chengjin Xu, Xuhui Jiang et al.

ICLR 2025 • arXiv:2407.10805

#3809

Model-Agnostic Knowledge Guided Correction for Improved Neural Surrogate Rollout

Bharat Srikishan, Daniel O'Malley, Mohamed Mehana et al.

ICLR 2025 • arXiv:2503.10048

#3810

Singular Subspace Perturbation Bounds via Rectangular Random Matrix Diffusions

Peiyao Lai, Oren Mangoubi

ICLR 2025 • arXiv:2406.02502

#3811

Leveraging Variable Sparsity to Refine Pareto Stationarity in Multi-Objective Optimization

Zeou Hu, Yaoliang Yu

ICLR 2025

#3812

On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions

Omer Madmon, Idan Pipano, Itamar Jacob Reinman et al.

ICLR 2025 • arXiv:2405.11517

#3813

Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic

Ruochen Jin, Bojian Hou, Jiancong Xiao et al.

ICLR 2025 • arXiv:2407.07089

#3814

ToolGen: Unified Tool Retrieval and Calling via Generation

Renxi Wang, Xudong Han, Lei Ji et al.

ICLR 2025 • arXiv:2410.03439

#3815

When do GFlowNets learn the right distribution?

Tiago Silva, Rodrigo Alves, Eliezer de Souza da Silva et al.

ICLR 2025

#3816

CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking

Tarun Suresh, Revanth Gangi Reddy, Yifei Xu et al.

ICLR 2025 • arXiv:2412.01007

#3817

Equivariant Denoisers Cannot Copy Graphs: Align Your Graph Diffusion Models

Najwa Laabid, Severi Rissanen, Markus Heinonen et al.

ICLR 2025 • arXiv:2405.17656

#3818

Towards a Complete Logical Framework for GNN Expressiveness

Tuo Xu

ICLR 2025

#3819

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Zachary Ankner, Cody Blakeney, Kartik Sreenivasan et al.

ICLR 2025 • arXiv:2405.20541

#3820

Efficient and Context-Aware Label Propagation for Zero-/Few-Shot Training-Free Adaptation of Vision-Language Model

Yushu Li, Yongyi Su, Adam Goodge et al.

ICLR 2025 • arXiv:2412.18303

#3821

Rethinking Shapley Value for Negative Interactions in Non-convex Games

Wonjoon Chang, Myeongjin Lee, Jaesik Choi

ICLR 2025

#3822

Matérn Kernels for Tunable Implicit Surface Reconstruction

Maximilian Weiherer, Bernhard Egger

ICLR 2025 • arXiv:2409.15466

#3823

In Search of the Engram in LLMs: A Neuroscience Perspective on the Memory Functions in AI Models

Minsung Kim, Jea Kwon, Dong-Kyum Kim et al.

ICLR 2025

#3824

MAESTRO: Masked Encoding Set Transformer with Self-Distillation

Matthew Lee, Jaesik Kim, Matei Ionita et al.

ICLR 2025

#3825

dEBORA: Efficient Bilevel Optimization-based low-Rank Adaptation

Emanuele Zangrando, Sara Venturini, Francesco Rinaldi et al.

ICLR 2025

#3826

Multi-session, multi-task neural decoding from distinct cell-types and brain regions

Mehdi Azabou, Krystal Pan, Vinam Arora et al.

ICLR 2025

#3827

Offline Model-Based Optimization by Learning to Rank

Rong-Xi Tan, Ke Xue, Shen-Huan Lyu et al.

ICLR 2025 • arXiv:2410.11502

#3828

Open-CK: A Large Multi-Physics Fields Coupling benchmarks in Combustion Kinetics

Zaige Fei, Fan Xu, Junyuan Mao et al.

ICLR 2025 • oral

#3829

RecDreamer: Consistent Text-to-3D Generation via Uniform Score Distillation

Chenxi Zheng, Yihong Lin, Bangzhen Liu et al.

ICLR 2025 • arXiv:2502.12640

#3830

CameraCtrl: Enabling Camera Control for Video Diffusion Models

Hao He, Yinghao Xu, Yuwei Guo et al.

ICLR 2025

#3831

No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Botao Ye, Sifei Liu, Haofei Xu et al.

ICLR 2025 • arXiv:2410.24207

#3832

Comparing Targeting Strategies for Maximizing Social Welfare with Limited Resources

Vibhhu Sharma, Bryan Wilder

ICLR 2025 • arXiv:2411.07414

#3833

MotionClone: Training-Free Motion Cloning for Controllable Video Generation

Pengyang Ling, Jiazi Bu, Pan Zhang et al.

ICLR 2025 • oral • arXiv:2406.05338

#3834

Systematic Relational Reasoning With Epistemic Graph Neural Networks

Irtaza Khalid, Steven Schockaert

ICLR 2025 • arXiv:2407.17396

#3835

LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension

Amaia Cardiel, Eloi Zablocki, Elias Ramzi et al.

ICLR 2025 • arXiv:2409.11919

#3836

Continuity-Preserving Convolutional Autoencoders for Learning Continuous Latent Dynamical Models from Images

Aiqing Zhu, Yuting Pan, Qianxiao Li

ICLR 2025 • arXiv:2502.00754

#3837

pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation

Shentong Mo, Xufang Luo, Dongsheng Li

ICLR 2025

#3838

Comparing noisy neural population dynamics using optimal transport distances

Amin Nejatbakhsh, Victor Geadah, Alex Williams et al.

ICLR 2025 • arXiv:2412.14421

#3839

ParFam -- (Neural Guided) Symbolic Regression via Continuous Global Optimization

Philipp Scholl, Katharina Bieker, Hillary Hauger et al.

ICLR 2025

#3840

ACES: Automatic Cohort Extraction System for Event-Stream Datasets

Justin Xu, Jack Gallifant, Alistair Johnson et al.

ICLR 2025 • arXiv:2406.19653

#3841

GRAIN: Exact Graph Reconstruction from Gradients

Maria Drencheva, Ivo Petrov, Maximilian Baader et al.

ICLR 2025 • arXiv:2503.01838

#3842

Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

Shuhong Zheng, Zhipeng Bao, Ruoyu Zhao et al.

ICLR 2025 • arXiv:2411.05005

#3843

Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models

Aniruddha Kembhavi, Mohit Bansal, Amita Kamath et al.

ICLR 2025

#3844

PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation

Sang-Hoon Lee, Ha-Yeong Choi, Seong-Whan Lee

ICLR 2025 • arXiv:2408.07547

#3845

See It from My Perspective: How Language Affects Cultural Bias in Image Understanding

Amith Ananthram, Elias Stengel-Eskin, Mohit Bansal et al.

ICLR 2025 • arXiv:2406.11665

#3846

Streaming Algorithms For $\ell_p$ Flows and $\ell_p$ Regression

Amit Chakrabarti, Jeffrey Jiang, David Woodruff et al.

ICLR 2025

#3847

Improving the Sparse Structure Learning of Spiking Neural Networks from the View of Compression Efficiency

Jiangrong Shen, Qi Xu, Gang Pan et al.

ICLR 2025 • arXiv:2502.13572

#3848

A Theory for Token-Level Harmonization in Retrieval-Augmented Generation

Shicheng Xu, Liang Pang, Huawei Shen et al.

ICLR 2025 • arXiv:2406.00944

#3849

Peeking Behind Closed Doors: Risks of LLM Evaluation by Private Data Curators

Pratyush Maini, Hritik Bansal

ICLR 2025 • arXiv:2503.04756

#3850

Reassessing EMNLP 2024’s Best Paper: Does Divergence-Based Calibration for MIAs Hold Up?

Pratyush Maini, Anshuman Suri

ICLR 2025 • oral

#3851

SoftCVI: Contrastive variational inference with self-generated soft labels

Daniel Ward, Mark Beaumont, Matteo Fasiolo

ICLR 2025 • arXiv:2407.15687

#3852

ContraDiff: Planning Towards High Return States via Contrastive Learning

Yixiang Shan, Zhengbang Zhu, Ting Long et al.

ICLR 2025

#3853

Class Distribution-induced Attention Map for Open-vocabulary Semantic Segmentations

Dong Un Kang, Hayeon Kim, Se Young Chun

ICLR 2025

#3854

Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Gate

Byung Hyun Lee, Sungjin Lim, Seunggyu Lee et al.

ICLR 2025 • arXiv:2506.22806

#3855

Reconstruction-Guided Policy: Enhancing Decision-Making through Agent-Wise State Consistency

Qifan Liang, Yixiang Shan, Haipeng Liu et al.

ICLR 2025

#3856

BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL

Yu Heng Hung, Kai-Jie Lin, Yu-Heng Lin et al.

ICLR 2025 • arXiv:2505.21974

#3857

Reasoning Elicitation in Language Models via Counterfactual Feedback

Alihan Hüyük, Xinnuo Xu, Jacqueline Maasch et al.

ICLR 2025 • arXiv:2410.03767

#3858

TSVD: Bridging Theory and Practice in Continual Learning with Pre-trained Models

Liangzu Peng, Juan Elenter, Joshua Agterberg et al.

ICLR 2025

#3859

Coreset Spectral Clustering

Ben Jourdan, Gregory Schwartzman, Peter Macgregor et al.

ICLR 2025 • arXiv:2503.07227

#3860

Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Models

Jinxu Lin, Linwei Tao, Minjing Dong et al.

ICLR 2025 • arXiv:2410.18639

#3861

Fundamental Limitations on Subquadratic Alternatives to Transformers

Josh Alman, Hantao Yu

ICLR 2025 • arXiv:2410.04271

#3862

Start Smart: Leveraging Gradients For Enhancing Mask-based XAI Methods

Buelent Uendes, Shujian Yu, Mark Hoogendoorn

ICLR 2025

#3863

Improving Instruction-Following in Language Models through Activation Steering

Alessandro Stolfo, Vidhisha Balachandran, Safoora Yousefi et al.

ICLR 2025 • arXiv:2410.12877

#3864

Unearthing Skill-level Insights for Understanding Trade-offs of Foundation Models

Mazda Moayeri, Vidhisha Balachandran, Varun Chandrasekaran et al.

ICLR 2025 • arXiv:2410.13826

#3865

Intelligence at the Edge of Chaos

Shiyang Zhang, Aakash Patel, Syed Rizvi et al.

ICLR 2025 • arXiv:2410.02536

#3866

Multimodal Situational Safety

Kaiwen Zhou, Chengzhi Liu, Xuandong Zhao et al.

ICLR 2025 • arXiv:2410.06172

#3867

Analysing The Spectral Biases in Generative Models

Amitoj Miglani, Shweta Singh, Vidit Aggarwal

ICLR 2025

#3868

Learning Continually by Spectral Regularization

Alex Lewandowski, Michał Bortkiewicz, Saurabh Kumar et al.

ICLR 2025 • arXiv:2406.06811

#3869

MGDA Converges under Generalized Smoothness, Provably

Qi Zhang, Peiyao Xiao, Shaofeng Zou et al.

ICLR 2025 • arXiv:2405.19440

#3870

Boundary constrained Gaussian processes for robust physics-informed machine learning of linear partial differential equations

David Dalton, Alan Lazarus, Hao Gao et al.

ICLR 2025

#3871

Bayesian Regularization of Latent Representation

Chukwudi Paul Obite, Zhi Chang, Keyan Wu et al.

ICLR 2025

#3872

Boosting Ray Search Procedure of Hard-label Attacks with Transfer-based Priors

Chen Ma, Xinjie Xu, Shuyu Cheng et al.

ICLR 2025 • arXiv:2507.17577

#3873

Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning

Moritz Reuss, Jyothish Pari, Pulkit Agrawal et al.

ICLR 2025 • arXiv:2412.12953

#3874

DEPfold: RNA Secondary Structure Prediction as Dependency Parsing

Ke Wang, Shay B Cohen

ICLR 2025

#3875

Open-Source vs Close-Source: The Context Utilization Challenge

Litu Ou

ICLR 2025

#3876

Aria-MIDI: A Dataset of Piano MIDI Files for Symbolic Music Modeling

Louis Bradshaw, Simon Colton

ICLR 2025 • arXiv:2504.15071

#3877

Variational Bayesian Pseudo-Coreset

Hyungi Lee, Seungyoo Lee, Juho Lee

ICLR 2025 • arXiv:2502.21143

#3878

ThinK: Thinner Key Cache by Query-Driven Pruning

Yuhui Xu, Zhanming Jie, Hanze Dong et al.

ICLR 2025 • arXiv:2407.21018

#3879

Scaling up the Banded Matrix Factorization Mechanism for Large Scale Differentially Private ML

Ryan McKenna

ICLR 2025

#3880

BoneMet: An Open Large-Scale Multi-Modal Murine Dataset for Breast Cancer Bone Metastasis Diagnosis and Prognosis

Tiankuo Chu, Fudong Lin, Shubo Wang et al.

ICLR 2025

#3881

Adaptive Camera Sensor for Vision Models

Eunsu Baek, Sung-hwan Han, Taesik Gong et al.

ICLR 2025 • arXiv:2503.02170

#3882

Size-Generalizable RNA Structure Evaluation by Exploring Hierarchical Geometries

Zongzhao Li, Jiacheng Cen, Wenbing Huang et al.

ICLR 2025

#3883

Semialgebraic Neural Networks: From roots to representations

S David Mis, Matti Lassas, Maarten V de Hoop

ICLR 2025 • arXiv:2501.01564

#3884

Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning

Nan Jiang, Chengxiao Wang, Kevin Liu et al.

ICLR 2025 • arXiv:2311.13721

#3885

ADAPT: Attentive Self-Distillation and Dual-Decoder Prediction Fusion for Continual Panoptic Segmentation

Ze Yang, Shichao Dong, Ruibo Li et al.

ICLR 2025

#3886

Flow With What You Know

Scott Hawley

ICLR 2025

#3887

KLay: Accelerating Arithmetic Circuits for Neurosymbolic AI

Jaron Maene, Vincent Derkinderen, Pedro Zuidberg Dos Martires

ICLR 2025 • arXiv:2410.11415

#3888

Difference-of-submodular Bregman Divergence

Masanari Kimura, Takahiro Kawashima, Tasuku Soma et al.

ICLR 2025

#3889

Transformers are Universal In-context Learners

Takashi Furuya, Maarten V de Hoop, Gabriel Peyré

ICLR 2025 • arXiv:2408.01367

#3890

Differential learning kinetics govern the transition from memorization to generalization during in-context learning

Alex Nguyen, Gautam Reddy Nallamala

ICLR 2025 • arXiv:2412.00104

#3891

GameGen-X: Interactive Open-world Game Video Generation

Haoxuan Che, Xuanhua He, Quande Liu et al.

ICLR 2025 • arXiv:2411.00769

#3892

Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers

Shijie Chen, Bernal Jimenez Gutierrez, Yu Su

ICLR 2025 • arXiv:2410.02642

#3893

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

Xueyi Liu, Jianibieke Adalibieke, Qianwei Han et al.

ICLR 2025 • arXiv:2502.09614

#3894

SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning

Hojoon Lee, Dongyoon Hwang, Donghu Kim et al.

ICLR 2025 • arXiv:2410.09754

#3895

Revealing and Mitigating Over-Attention in Knowledge Editing

Pinzheng Wang, Zecheng Tang, Keyan Zhou et al.

ICLR 2025 • arXiv:2502.14838

#3896

MotherNet: Fast Training and Inference via Hyper-Network Transformers

Andreas Mueller, Carlo Curino, Raghu Ramakrishnan

ICLR 2025

#3897

Optimistic Games for Combinatorial Bayesian Optimization with Application to Protein Design

Melis Ilayda Bal, Pier Giuseppe Sessa, Mojmir Mutny et al.

ICLR 2025 • arXiv:2409.18582

#3898

POGEMA: A Benchmark Platform for Cooperative Multi-Agent Pathfinding

Alexey Skrynnik, Anton Andreychuk, Anatolii Borzilov et al.

ICLR 2025 • arXiv:2407.14931

#3899

Probabilistic Conformal Prediction with Approximate Conditional Validity

Vincent Plassier, Alexander Fishkov, Mohsen Guizani et al.

ICLR 2025 • arXiv:2407.01794

#3900

Provable Convergence Bounds for Hybrid Dynamical Sampling and Optimization

Matthew Burns, Qingyuan Hou, Michael Huang

ICLR 2025

#3901

Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control

Devdhar Patel, Hava Siegelmann

ICLR 2025 • oral • arXiv:2410.08979

#3902

Large Scale Knowledge Washing

Yu Wang, Ruihan Wu, Zexue He et al.

ICLR 2025 • arXiv:2405.16720

#3903

MorphoDiff: Cellular Morphology Painting with Diffusion Models

Zeinab Navidi, Jun Ma, Esteban Miglietta et al.

ICLR 2025

#3904

PN-GAIL: Leveraging Non-optimal Information from Imperfect Demonstrations

Qiang Liu, Huiqiao Fu, Kaiqiang Tang et al.

ICLR 2025

#3905

Rethinking Graph Neural Networks From A Geometric Perspective Of Node Features

Feng Ji, Yanan Zhao, Kai Zhao et al.

ICLR 2025

#3906

InfoGS: Efficient Structure-Aware 3D Gaussians via Lightweight Information Shaping

Yunchao Zhang, Guandao Yang, Leonidas Guibas et al.

ICLR 2025

#3907

IV-mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis

Shitong Shao, Zikai Zhou, Lichen Bai et al.

ICLR 2025 • oral • arXiv:2410.04171

#3908

Animate Your Thoughts: Reconstruction of Dynamic Natural Vision from Human Brain Activity

Yizhuo Lu, Changde Du, Chong Wang et al.

ICLR 2025 • oral

#3909

Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate

Yexiang Liu, Jie Cao, Zekun Li et al.

ICLR 2025

#3910

Multi-objective Differentiable Neural Architecture Search

Rhea Sukthanker, Arber Zela, Benedikt Staffler et al.

ICLR 2025 • arXiv:2402.18213

#3911

Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues

Riccardo Grazzi, Julien Siems, Arber Zela et al.

ICLR 2025 • arXiv:2411.12537

#3912

RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models

Tanqiu Jiang, Changjiang Li, Fenglong Ma et al.

ICLR 2025 • arXiv:2502.12794

#3913

CAX: Cellular Automata Accelerated in JAX

Maxence Faldor, Antoine Cully

ICLR 2025 • arXiv:2410.02651

#3914

Modeling dynamic social vision highlights gaps between deep learning and humans

Kathy Garcia, Emalie McMahon, Colin Conwell et al.

ICLR 2025

#3915

MoLEx: Mixture of Layer Experts for Fine-tuning with Sparse Upcycling

Rachel Teo, Tan Nguyen

ICLR 2025

#3916

Transformer Meets Twicing: Harnessing Unattended Residual Information

Laziz Abdullaev, Tan Nguyen

ICLR 2025 • arXiv:2503.00687

#3917

RAG-SR: Retrieval-Augmented Generation for Neural Symbolic Regression

Hengzhe Zhang, Qi Chen, Bing Xue et al.

ICLR 2025

#3918

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Angelika Romanou, Negar Foroutan, Anna Sotnikova et al.

ICLR 2025 • arXiv:2411.19799

#3919

Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment

Chenliang Li, Siliang Zeng, Zeyi Liao et al.

ICLR 2025

#3920

Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models

Jung Hyun Lee, June Yong Yang, Byeongho Heo et al.

ICLR 2025 • arXiv:2407.12863

#3921

Inverse Attention Agents for Multi-Agent Systems

Qian Long, Ruoyan Li, Minglu Zhao et al.

ICLR 2025 • arXiv:2410.21794

#3922

SelectFormer in Data Markets: Privacy-Preserving and Efficient Data Selection for Transformers with Multi-Party Computation

Xu Ouyang, Felix Xiaozhu Lin, Yangfeng Ji

ICLR 2025

#3923

Generalizing Reasoning Problems to Longer Lengths

Changnan Xiao, Bing Liu

ICLR 2025

#3924

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

Andy K Zhang, Neil Perry, Riya Dulepet et al.

ICLR 2025 • arXiv:2408.08926

#3925

Learn-by-interact: A Data-Centric Framework For Self-Adaptive Agents in Realistic Environments

Hongjin Su, Ruoxi Sun, Jinsung Yoon et al.

ICLR 2025 • arXiv:2501.10893

#3926

Verifying Properties of Binary Neural Networks Using Sparse Polynomial Optimization

Jianting Yang, Srecko Durasinovic, Jean Bernard Lasserre et al.

ICLR 2025 • arXiv:2405.17049

#3927

GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

Saman Kazemkhani, Aarav Pandya, Daphne Cornelisse et al.

ICLR 2025 • arXiv:2408.01584

#3928

EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation

Jiaxiang Tang, Max Li, Zekun Hao et al.

ICLR 2025 • arXiv:2409.18114

#3929

Action Sequence Augmentation for Action Anticipation

Yihui Qiu, Deepu Rajan

ICLR 2025 • oral

#3930

IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking

Shubham Dipak Ugare, Rohan Gumaste, Tarun Suresh et al.

ICLR 2025 • arXiv:2410.07295

#3931

AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit Topologies

Jian Gao, Weidong Cao, Junyi Yang et al.

ICLR 2025 • arXiv:2503.00205

#3932

The Last Iterate Advantage: Empirical Auditing and Principled Heuristic Analysis of Differentially Private SGD

Milad Nasr, Thomas Steinke, Borja Balle et al.

ICLR 2025 • arXiv:2410.06186

#3933

Near-Exact Privacy Amplification for Matrix Mechanisms

Christopher Choquette-Choo, Arun Ganesh, Saminul Haque et al.

ICLR 2025

#3934

Learning to Select Nodes in Branch and Bound with Sufficient Tree Representation

Sijia Zhang, Shuli Zeng, Shaoang Li et al.

ICLR 2025

#3935

Looking into User’s Long-term Interests through the Lens of Conservative Evidential Learning

Dingrong Wang, Krishna Neupane, Ervine Zheng et al.

ICLR 2025

#3936

Computational Explorations of Total Variation Distance

Arnab Bhattacharyya, Sutanu Gayen, Kuldeep S. Meel et al.

ICLR 2025 • arXiv:2412.10370

#3937

TopoLM: brain-like spatio-functional organization in a topographic language model

Neil Rathi, Johannes Mehrer, Badr AlKhamissi et al.

ICLR 2025 • arXiv:2410.11516

#3938

Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

Siyan Zhao, Mingyi Hong, Yang Liu et al.

ICLR 2025 • arXiv:2502.09597

#3939

Convex Formulations for Training Two-Layer ReLU Neural Networks

Karthik Prakhya, Tolga Birdal, Alp Yurtsever

ICLR 2025 • arXiv:2410.22311

#3940

Accelerating Task Generalisation with Multi-Level Skill Hierarchies

Thomas Cannon, Özgür Şimşek

ICLR 2025 • oral • arXiv:2411.02998

#3941

SSOLE: Rethinking Orthogonal Low-rank Embedding for Self-Supervised Learning

Lun Huang, Qiang Qiu, Guillermo Sapiro

ICLR 2025

#3942

Large Convolutional Model Tuning via Filter Subspace

Wei Chen, Zichen Miao, Qiang Qiu

ICLR 2025 • arXiv:2403.00269

#3943

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization

Yuxin Jiang, Bo Huang, Yufei Wang et al.

ICLR 2025 • arXiv:2408.07471

#3944

DPaI: Differentiable Pruning at Initialization with Node-Path Balance Principle

Lichuan Xiang, Quan Nguyen-Tri, Lan-Cuong Nguyen et al.

ICLR 2025

#3945

Balancing Act: Diversity and Consistency in Large Language Model Ensembles

Ahmed Abdulaal, Chen Jin, Nina Montaña-Brown et al.

ICLR 2025

#3946

DELTA: Dense Efficient Long-Range 3D Tracking for Any Video

Tuan Ngo, Peiye Zhuang, Evangelos Kalogerakis et al.

ICLR 2025 • arXiv:2410.24211

#3947

Tailoring Mixup to Data for Calibration

Quentin Bouniot, Pavlo Mozharovskyi, Florence d'Alché-Buc

ICLR 2025 • arXiv:2311.01434

#3948

BlendRL: A Framework for Merging Symbolic and Neural Policy Learning

Hikaru Shindo, Quentin Delfosse, Devendra Singh Dhami et al.

ICLR 2025 • arXiv:2410.11689

#3949

MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction

Jing Yang, Minyue Jiang, Sen Yang et al.

ICLR 2025 • arXiv:2410.07733

#3950

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Xiangyu Qi, Boyi Wei, Nicholas Carlini et al.

ICLR 2025 • arXiv:2412.07097

#3951

LeanVec: Searching vectors faster by making them fit

Ishwar Bhati, Cecilia Aguerrebere, Mark Hildebrand et al.

ICLR 2025 • arXiv:2312.16335

#3952

Efficient Dictionary Learning with Switch Sparse Autoencoders

Anish Mudide, Josh Engels, Eric Michaud et al.

ICLR 2025 • arXiv:2410.08201

#3953

Curriculum-aware Training for Discriminating Molecular Property Prediction Models

Hansi Yang, Quanming Yao, James Kwok

ICLR 2025

#3954

Rationalizing and Augmenting Dynamic Graph Neural Networks

Guibin Zhang, Yiyan Qi, Ziyang Cheng et al.

ICLR 2025 • oral

#3955

Maintaining Structural Integrity in Parameter Spaces for Parameter Efficient Fine-tuning

Chongjie Si, Xuehui Wang, Xue Yang et al.

ICLR 2025 • arXiv:2405.14739

#3956

On the Adversarial Risk of Test Time Adaptation: An Investigation into Realistic Test-Time Data Poisoning

Yongyi Su, Yushu Li, Nanqing Liu et al.

ICLR 2025 • arXiv:2410.04682

#3957

Evidential Learning-based Certainty Estimation for Robust Dense Feature Matching

Lile Cai, Chuan Sheng Foo, Xun Xu et al.

ICLR 2025

#3958

Policy Design in Long-run Welfare Dynamics

Jiduan Wu, Rediet Abebe, Moritz Hardt et al.

ICLR 2025 • arXiv:2503.00632

#3959

KAA: Kolmogorov-Arnold Attention for Enhancing Attentive Graph Neural Networks

Taoran Fang, Tianhong Gao, Chunping Wang et al.

ICLR 2025 • arXiv:2501.13456

#3960

DECO: Unleashing the Potential of ConvNets for Query-based Detection and Segmentation

Xinghao Chen, Siwei Li, Yijing Yang et al.

ICLR 2025 • arXiv:2312.13735

#3961

A Theoretical Framework for Partially-Observed Reward States in RLHF

Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano et al.

ICLR 2025

#3962

SPA-Bench: A Comprehensive Benchmark for Smartphone Agent Evaluation

Jingxuan Chen, Derek Yuen, Bin Xie et al.

ICLR 2025 • arXiv:2410.15164

#3963

DeeperForward: Enhanced Forward-Forward Training for Deeper and Better Performance

Liang Sun, Yang Zhang, Weizhao He et al.

ICLR 2025

#3964

SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding

Zhenyu Yang, Yuhang Hu, Zemin Du et al.

ICLR 2025 • oral • arXiv:2502.10810

#3965

Spherical Tree-Sliced Wasserstein Distance

Viet-Hoang Tran, Thanh Chu, Minh-Khoi Nguyen-Nhat et al.

ICLR 2025 • arXiv:2503.11249

#3966

MAI: A Multi-turn Aggregation-Iteration Model for Composed Image Retrieval

Yanzhe Chen, Zhiwen Yang, Jinglin Xu et al.

ICLR 2025

#3967

Disentangled Representation Learning with the Gromov-Monge Gap

Théo Uscidda, Luca Eyring, Karsten Roth et al.

ICLR 2025 • arXiv:2407.07829

#3968

kNN Attention Demystified: A Theoretical Exploration for Scalable Transformers

Themistoklis Haris

ICLR 2025

#3969

Towards Unified Human Motion-Language Understanding via Sparse Interpretable Characterization

Guangtao Lyu, Chenghao Xu, Jiexi Yan et al.

ICLR 2025 • oral

#3970

Efficient Low-Bit Quantization with Adaptive Scales for Multi-Task Co-Training

Boyu Liu, Haoyu Huang, Linlin Yang et al.

ICLR 2025

#3971

Regularizing Energy among Training Samples for Out-of-Distribution Generalization

Yiting Chen, Qitian Wu, Junchi Yan

ICLR 2025

#3972

Rethinking and Improving Autoformalization: Towards a Faithful Metric and a Dependency Retrieval-based Approach

Qi Liu, Xinhao Zheng, Xudong Lu et al.

ICLR 2025

#3973

Learning Structured Universe Graph with Outlier OOD Detection for Partial Matching

Zetian Jiang, Jiaxin Lu, Haizhao Fan et al.

ICLR 2025

#3974

What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models

Ahmed Imtiaz Humayun, Ibtihel Amara, Cristina Nader Vasconcelos et al.

ICLR 2025 • arXiv:2408.08307

#3975

To Clip or not to Clip: the Dynamics of SGD with Gradient Clipping in High-Dimensions

Noah Marshall, Ke Liang Xiao, Atish Agarwala et al.

ICLR 2025 • arXiv:2406.11733

#3976

UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP

Wenzheng Pan, Hao Xiong, Jiale Ma et al.

ICLR 2025

#3977

Forget the Data and Fine-Tuning! Just Fold the Network to Compress

Dong Wang, Haris Šikić, Lothar Thiele et al.

ICLR 2025 • arXiv:2502.10216

#3978

Statistical Advantages of Perturbing Cosine Router in Mixture of Experts

Huy Nguyen, Pedram Akbarian Saravi, Trang Pham et al.

ICLR 2025 • arXiv:2405.14131

#3979

Learning Geometric Reasoning Networks For Robot Task And Motion Planning

Smail Ait Bouhsain, Rachid Alami, Thierry Simeon

ICLR 2025

#3980

Prompting Fairness: Integrating Causality to Debias Large Language Models

Jingling Li, Zeyu Tang, Xiaoyu Liu et al.

ICLR 2025 • arXiv:2403.08743

#3981

Dynamic Negative Guidance of Diffusion Models

Felix Koulischer, Johannes Deleu, Gabriel Raya et al.

ICLR 2025 • arXiv:2410.14398

#3982

Bilinear MLPs enable weight-based mechanistic interpretability

Michael Pearce, Thomas Dooms, Alice Rigg et al.

ICLR 2025 • arXiv:2410.08417

#3983

Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization

Wenkai Yang, Shiqi Shen, Guangyao Shen et al.

ICLR 2025 • arXiv:2406.11431

#3984

DocMIA: Document-Level Membership Inference Attacks against DocVQA Models

Khanh Nguyen, Raouf Kerkouche, Mario Fritz et al.

ICLR 2025 • arXiv:2502.03692

#3985

Fine-Tuning Token-Based Large Multimodal Models: What Works, What Doesn’t and What’s Next

Zhulin Hu, Yan Ma, Jiadi Su et al.

ICLR 2025

#3986

Training-Free Diffusion Model Alignment with Sampling Demons

Po-Hung Yeh, Kuang-Huei Lee, Jun-Cheng Chen

ICLR 2025 • arXiv:2410.05760

#3987

Uncertainty-Aware Decoding with Minimum Bayes Risk

Nico Daheim, Clara Meister, Thomas Möllenhoff et al.

ICLR 2025 • arXiv:2503.05318

#3988

LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics

Thomas Robert, Mher Safaryan, Ionut-Vlad Modoranu et al.

ICLR 2025 • arXiv:2410.16103

#3989

Tracking objects that change in appearance with phase synchrony

Sabine Muzellec, Drew Linsley, Alekh Ashok et al.

ICLR 2025 • arXiv:2410.02094

#3990

Descent with Misaligned Gradients and Applications to Hidden Convexity

Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar et al.

ICLR 2025

#3991

Diffusion State-Guided Projected Gradient for Inverse Problems

Rayhan Zirvi, Bahareh Tolooshams, Anima Anandkumar

ICLR 2025 • arXiv:2410.03463

#3992

Learning from weak labelers as constraints

Vishwajeet Agrawal, Rattana Pukdee, Nina Balcan et al.

ICLR 2025

#3993

A Distributional Approach to Uncertainty-Aware Preference Alignment Using Offline Demonstrations

Sheng Xu, Bo Yue, Hongyuan Zha et al.

ICLR 2025

#3994

Estimating the Probabilities of Rare Outputs in Language Models

Gabriel Wu, Jacob Hilton

ICLR 2025 • arXiv:2410.13211

#3995

Self-Normalized Resets for Plasticity in Continual Learning

Vivek Farias, Adam Jozefiak

ICLR 2025 • arXiv:2410.20098

#3996

Training on the Test Task Confounds Evaluation and Emergence

Ricardo Dominguez-Olmedo, Florian Eddie Dorner, Moritz Hardt

ICLR 2025 • arXiv:2407.07890

#3997

COME: Test-time Adaption by Conservatively Minimizing Entropy

Qingyang Zhang, Yatao Bian, Xinke Kong et al.

ICLR 2025 • arXiv:2410.10894

#3998

Oracle efficient truncated statistics

Konstantinos Karatapanis, Vasilis Kontonis, Christos Tzamos

ICLR 2025

#3999

Training Free Guided Flow-Matching with Optimal Control

Luran Wang, Chaoran Cheng, Yizhen Liao et al.

ICLR 2025

#4000

SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation

Mingjie Li, Wai Man Si, Michael Backes et al.

ICLR 2025 • arXiv:2501.01765
