Most Cited ICLR "sparse fine-tuning" Papers

6,124 papers found • Page 20 of 31

#3801

Self-Improving Robust Preference Optimization

Eugene Choi, Arash Ahmadian, Matthieu Geist et al.

ICLR 2025 • poster • arXiv:2406.01660
#3802

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Sreyan Ghosh, Sonal Kumar, Zhifeng Kong et al.

ICLR 2025 • poster • arXiv:2410.02056
#3803

Fugatto 1: Foundational Generative Audio Transformer Opus 1

Rafael Valle, Rohan Badlani, Zhifeng Kong et al.

ICLR 2025 • poster
#3804

Linear Representations of Political Perspective Emerge in Large Language Models

Junsol Kim, James Evans, Aaron Schein

ICLR 2025 • poster • arXiv:2503.02080
#3805

Looking Backward: Streaming Video-to-Video Translation with Feature Banks

Feng Liang, Akio Kodaira, Chenfeng Xu et al.

ICLR 2025 • oral • arXiv:2405.15757
#3806

Why Does the Effective Context Length of LLMs Fall Short?

Chenxin An, Jun Zhang, Ming Zhong et al.

ICLR 2025 • poster • arXiv:2410.18745
#3807

Dobi-SVD: Differentiable SVD for LLM Compression and Some New Perspectives

Qinsi Wang, Jinghan Ke, Masayoshi Tomizuka et al.

ICLR 2025 • poster • arXiv:2502.02723
#3808

Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation

Shengjie Ma, Chengjin Xu, Xuhui Jiang et al.

ICLR 2025 • poster • arXiv:2407.10805
#3809

Model-Agnostic Knowledge Guided Correction for Improved Neural Surrogate Rollout

Bharat Srikishan, Daniel O'Malley, Mohamed Mehana et al.

ICLR 2025 • poster • arXiv:2503.10048
#3810

Singular Subspace Perturbation Bounds via Rectangular Random Matrix Diffusions

Peiyao Lai, Oren Mangoubi

ICLR 2025 • poster • arXiv:2406.02502
#3811

Leveraging Variable Sparsity to Refine Pareto Stationarity in Multi-Objective Optimization

Zeou Hu, Yaoliang Yu

ICLR 2025 • poster
#3812

On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions

Omer Madmon, Idan Pipano, Itamar Jacob Reinman et al.

ICLR 2025 • poster • arXiv:2405.11517
#3813

Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic

Ruochen Jin, Bojian Hou, Jiancong Xiao et al.

ICLR 2025 • poster • arXiv:2407.07089
#3814

ToolGen: Unified Tool Retrieval and Calling via Generation

Renxi Wang, Xudong Han, Lei Ji et al.

ICLR 2025 • poster • arXiv:2410.03439
#3815

When do GFlowNets learn the right distribution?

Tiago Silva, Rodrigo Alves, Eliezer de Souza da Silva et al.

ICLR 2025 • poster
#3816

CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking

Tarun Suresh, Revanth Gangi Reddy, Yifei Xu et al.

ICLR 2025 • poster • arXiv:2412.01007
#3817

Equivariant Denoisers Cannot Copy Graphs: Align Your Graph Diffusion Models

Najwa Laabid, Severi Rissanen, Markus Heinonen et al.

ICLR 2025 • poster • arXiv:2405.17656
#3818

Towards a Complete Logical Framework for GNN Expressiveness

Tuo Xu

ICLR 2025 • poster
#3819

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Zachary Ankner, Cody Blakeney, Kartik Sreenivasan et al.

ICLR 2025 • poster • arXiv:2405.20541
#3820

Efficient and Context-Aware Label Propagation for Zero-/Few-Shot Training-Free Adaptation of Vision-Language Model

Yushu Li, Yongyi Su, Adam Goodge et al.

ICLR 2025 • poster • arXiv:2412.18303
#3821

Rethinking Shapley Value for Negative Interactions in Non-convex Games

Wonjoon Chang, Myeongjin Lee, Jaesik Choi

ICLR 2025 • poster
#3822

Matérn Kernels for Tunable Implicit Surface Reconstruction

Maximilian Weiherer, Bernhard Egger

ICLR 2025 • poster • arXiv:2409.15466
#3823

In Search of the Engram in LLMs: A Neuroscience Perspective on the Memory Functions in AI Models

Minsung Kim, Jea Kwon, Dong-Kyum Kim et al.

ICLR 2025 • poster
#3824

MAESTRO: Masked Encoding Set Transformer with Self-Distillation

Matthew Lee, Jaesik Kim, Matei Ionita et al.

ICLR 2025 • poster
#3825

dEBORA: Efficient Bilevel Optimization-based low-Rank Adaptation

Emanuele Zangrando, Sara Venturini, Francesco Rinaldi et al.

ICLR 2025 • poster
#3826

Multi-session, multi-task neural decoding from distinct cell-types and brain regions

Mehdi Azabou, Krystal Pan, Vinam Arora et al.

ICLR 2025 • poster
#3827

Offline Model-Based Optimization by Learning to Rank

Rong-Xi Tan, Ke Xue, Shen-Huan Lyu et al.

ICLR 2025 • poster • arXiv:2410.11502
#3828

Open-CK: A Large Multi-Physics Fields Coupling benchmarks in Combustion Kinetics

Zaige Fei, Fan Xu, Junyuan Mao et al.

ICLR 2025 • oral
#3829

RecDreamer: Consistent Text-to-3D Generation via Uniform Score Distillation

Chenxi Zheng, Yihong Lin, Bangzhen Liu et al.

ICLR 2025 • poster • arXiv:2502.12640
#3830

CameraCtrl: Enabling Camera Control for Video Diffusion Models

Hao He, Yinghao Xu, Yuwei Guo et al.

ICLR 2025 • poster
#3831

No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Botao Ye, Sifei Liu, Haofei Xu et al.

ICLR 2025 • poster • arXiv:2410.24207
#3832

Comparing Targeting Strategies for Maximizing Social Welfare with Limited Resources

Vibhhu Sharma, Bryan Wilder

ICLR 2025 • poster • arXiv:2411.07414
#3833

MotionClone: Training-Free Motion Cloning for Controllable Video Generation

Pengyang Ling, Jiazi Bu, Pan Zhang et al.

ICLR 2025 • oral • arXiv:2406.05338
#3834

Systematic Relational Reasoning With Epistemic Graph Neural Networks

Irtaza Khalid, Steven Schockaert

ICLR 2025 • poster • arXiv:2407.17396
#3835

LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension

Amaia Cardiel, Eloi Zablocki, Elias Ramzi et al.

ICLR 2025 • poster • arXiv:2409.11919
#3836

Continuity-Preserving Convolutional Autoencoders for Learning Continuous Latent Dynamical Models from Images

Aiqing Zhu, Yuting Pan, Qianxiao Li

ICLR 2025 • poster • arXiv:2502.00754
#3837

pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation

Shentong Mo, Xufang Luo, Dongsheng Li

ICLR 2025 • poster
#3838

Comparing noisy neural population dynamics using optimal transport distances

Amin Nejatbakhsh, Victor Geadah, Alex Williams et al.

ICLR 2025 • poster • arXiv:2412.14421
#3839

ParFam -- (Neural Guided) Symbolic Regression via Continuous Global Optimization

Philipp Scholl, Katharina Bieker, Hillary Hauger et al.

ICLR 2025 • poster
#3840

ACES: Automatic Cohort Extraction System for Event-Stream Datasets

Justin Xu, Jack Gallifant, Alistair Johnson et al.

ICLR 2025 • poster • arXiv:2406.19653
#3841

GRAIN: Exact Graph Reconstruction from Gradients

Maria Drencheva, Ivo Petrov, Maximilian Baader et al.

ICLR 2025 • poster • arXiv:2503.01838
#3842

Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

Shuhong Zheng, Zhipeng Bao, Ruoyu Zhao et al.

ICLR 2025 • poster • arXiv:2411.05005
#3843

Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models

Aniruddha Kembhavi, Mohit Bansal, Amita Kamath et al.

ICLR 2025 • poster
#3844

PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation

Sang-Hoon Lee, Ha-Yeong Choi, Seong-Whan Lee

ICLR 2025 • poster • arXiv:2408.07547
#3845

See It from My Perspective: How Language Affects Cultural Bias in Image Understanding

Amith Ananthram, Elias Stengel-Eskin, Mohit Bansal et al.

ICLR 2025 • poster • arXiv:2406.11665
#3846

Streaming Algorithms For $\ell_p$ Flows and $\ell_p$ Regression

Amit Chakrabarti, Jeffrey Jiang, David Woodruff et al.

ICLR 2025 • poster
#3847

Improving the Sparse Structure Learning of Spiking Neural Networks from the View of Compression Efficiency

Jiangrong Shen, Qi Xu, Gang Pan et al.

ICLR 2025 • poster • arXiv:2502.13572
#3848

A Theory for Token-Level Harmonization in Retrieval-Augmented Generation

Shicheng Xu, Liang Pang, Huawei Shen et al.

ICLR 2025 • poster • arXiv:2406.00944
#3849

Peeking Behind Closed Doors: Risks of LLM Evaluation by Private Data Curators

Pratyush Maini, Hritik Bansal

ICLR 2025 • poster • arXiv:2503.04756
#3850

Reassessing EMNLP 2024’s Best Paper: Does Divergence-Based Calibration for MIAs Hold Up?

Pratyush Maini, Anshuman Suri

ICLR 2025 • oral
#3851

SoftCVI: Contrastive variational inference with self-generated soft labels

Daniel Ward, Mark Beaumont, Matteo Fasiolo

ICLR 2025 • poster • arXiv:2407.15687
#3852

ContraDiff: Planning Towards High Return States via Contrastive Learning

Yixiang Shan, Zhengbang Zhu, Ting Long et al.

ICLR 2025 • poster
#3853

Class Distribution-induced Attention Map for Open-vocabulary Semantic Segmentations

Dong Un Kang, Hayeon Kim, Se Young Chun

ICLR 2025 • poster
#3854

Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Gate

Byung Hyun Lee, Sungjin Lim, Seunggyu Lee et al.

ICLR 2025 • poster • arXiv:2506.22806
#3855

Reconstruction-Guided Policy: Enhancing Decision-Making through Agent-Wise State Consistency

Qifan Liang, Yixiang Shan, Haipeng Liu et al.

ICLR 2025 • poster
#3856

BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL

Yu Heng Hung, Kai-Jie Lin, Yu-Heng Lin et al.

ICLR 2025 • poster • arXiv:2505.21974
#3857

Reasoning Elicitation in Language Models via Counterfactual Feedback

Alihan Hüyük, Xinnuo Xu, Jacqueline Maasch et al.

ICLR 2025 • poster • arXiv:2410.03767
#3858

TSVD: Bridging Theory and Practice in Continual Learning with Pre-trained Models

Liangzu Peng, Juan Elenter, Joshua Agterberg et al.

ICLR 2025 • poster
#3859

Coreset Spectral Clustering

Ben Jourdan, Gregory Schwartzman, Peter Macgregor et al.

ICLR 2025 • poster • arXiv:2503.07227
#3860

Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Models

Jinxu Lin, Linwei Tao, Minjing Dong et al.

ICLR 2025 • poster • arXiv:2410.18639
#3861

Fundamental Limitations on Subquadratic Alternatives to Transformers

Josh Alman, Hantao Yu

ICLR 2025 • poster • arXiv:2410.04271
#3862

Start Smart: Leveraging Gradients For Enhancing Mask-based XAI Methods

Buelent Uendes, Shujian Yu, Mark Hoogendoorn

ICLR 2025 • poster
#3863

Improving Instruction-Following in Language Models through Activation Steering

Alessandro Stolfo, Vidhisha Balachandran, Safoora Yousefi et al.

ICLR 2025 • poster • arXiv:2410.12877
#3864

Unearthing Skill-level Insights for Understanding Trade-offs of Foundation Models

Mazda Moayeri, Vidhisha Balachandran, Varun Chandrasekaran et al.

ICLR 2025 • poster • arXiv:2410.13826
#3865

Intelligence at the Edge of Chaos

Shiyang Zhang, Aakash Patel, Syed Rizvi et al.

ICLR 2025 • poster • arXiv:2410.02536
#3866

Multimodal Situational Safety

Kaiwen Zhou, Chengzhi Liu, Xuandong Zhao et al.

ICLR 2025 • poster • arXiv:2410.06172
#3867

Analysing The Spectral Biases in Generative Models

Amitoj Miglani, Shweta Singh, Vidit Aggarwal

ICLR 2025 • poster
#3868

Learning Continually by Spectral Regularization

Alex Lewandowski, Michał Bortkiewicz, Saurabh Kumar et al.

ICLR 2025 • poster • arXiv:2406.06811
#3869

MGDA Converges under Generalized Smoothness, Provably

Qi Zhang, Peiyao Xiao, Shaofeng Zou et al.

ICLR 2025 • poster • arXiv:2405.19440
#3870

Boundary constrained Gaussian processes for robust physics-informed machine learning of linear partial differential equations

David Dalton, Alan Lazarus, Hao Gao et al.

ICLR 2025 • poster
#3871

Bayesian Regularization of Latent Representation

Chukwudi Paul Obite, Zhi Chang, Keyan Wu et al.

ICLR 2025 • poster
#3872

Boosting Ray Search Procedure of Hard-label Attacks with Transfer-based Priors

Chen Ma, Xinjie Xu, Shuyu Cheng et al.

ICLR 2025 • poster • arXiv:2507.17577
#3873

Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning

Moritz Reuss, Jyothish Pari, Pulkit Agrawal et al.

ICLR 2025 • poster • arXiv:2412.12953
#3874

DEPfold: RNA Secondary Structure Prediction as Dependency Parsing

Ke Wang, Shay B Cohen

ICLR 2025 • poster
#3875

Open-Source vs Close-Source: The Context Utilization Challenge

Litu Ou

ICLR 2025 • poster
#3876

Aria-MIDI: A Dataset of Piano MIDI Files for Symbolic Music Modeling

Louis Bradshaw, Simon Colton

ICLR 2025 • poster • arXiv:2504.15071
#3877

Variational Bayesian Pseudo-Coreset

Hyungi Lee, Seungyoo Lee, Juho Lee

ICLR 2025 • poster • arXiv:2502.21143
#3878

ThinK: Thinner Key Cache by Query-Driven Pruning

Yuhui Xu, Zhanming Jie, Hanze Dong et al.

ICLR 2025 • poster • arXiv:2407.21018
#3879

Scaling up the Banded Matrix Factorization Mechanism for Large Scale Differentially Private ML

Ryan McKenna

ICLR 2025 • poster
#3880

BoneMet: An Open Large-Scale Multi-Modal Murine Dataset for Breast Cancer Bone Metastasis Diagnosis and Prognosis

Tiankuo Chu, Fudong Lin, Shubo Wang et al.

ICLR 2025 • poster
#3881

Adaptive Camera Sensor for Vision Models

Eunsu Baek, Sung-hwan Han, Taesik Gong et al.

ICLR 2025 • poster • arXiv:2503.02170
#3882

Size-Generalizable RNA Structure Evaluation by Exploring Hierarchical Geometries

Zongzhao Li, Jiacheng Cen, Wenbing Huang et al.

ICLR 2025 • poster
#3883

Semialgebraic Neural Networks: From roots to representations

S David Mis, Matti Lassas, Maarten V de Hoop

ICLR 2025 • poster • arXiv:2501.01564
#3884

Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning

Nan Jiang, Chengxiao Wang, Kevin Liu et al.

ICLR 2025 • poster • arXiv:2311.13721
#3885

ADAPT: Attentive Self-Distillation and Dual-Decoder Prediction Fusion for Continual Panoptic Segmentation

Ze Yang, Shichao Dong, Ruibo Li et al.

ICLR 2025 • poster
#3886

Flow With What You Know

Scott Hawley

ICLR 2025 • poster
#3887

KLay: Accelerating Arithmetic Circuits for Neurosymbolic AI

Jaron Maene, Vincent Derkinderen, Pedro Zuidberg Dos Martires

ICLR 2025 • poster • arXiv:2410.11415
#3888

Difference-of-submodular Bregman Divergence

Masanari Kimura, Takahiro Kawashima, Tasuku Soma et al.

ICLR 2025 • poster
#3889

Transformers are Universal In-context Learners

Takashi Furuya, Maarten V de Hoop, Gabriel Peyré

ICLR 2025 • poster • arXiv:2408.01367
#3890

Differential learning kinetics govern the transition from memorization to generalization during in-context learning

Alex Nguyen, Gautam Reddy Nallamala

ICLR 2025 • poster • arXiv:2412.00104
#3891

GameGen-X: Interactive Open-world Game Video Generation

Haoxuan Che, Xuanhua He, Quande Liu et al.

ICLR 2025 • poster • arXiv:2411.00769
#3892

Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers

Shijie Chen, Bernal Jimenez Gutierrez, Yu Su

ICLR 2025 • poster • arXiv:2410.02642
#3893

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

Xueyi Liu, Jianibieke Adalibieke, Qianwei Han et al.

ICLR 2025 • poster • arXiv:2502.09614
#3894

SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning

Hojoon Lee, Dongyoon Hwang, Donghu Kim et al.

ICLR 2025 • poster • arXiv:2410.09754
#3895

Revealing and Mitigating Over-Attention in Knowledge Editing

Pinzheng Wang, Zecheng Tang, Keyan Zhou et al.

ICLR 2025 • poster • arXiv:2502.14838
#3896

MotherNet: Fast Training and Inference via Hyper-Network Transformers

Andreas Mueller, Carlo Curino, Raghu Ramakrishnan

ICLR 2025 • poster
#3897

Optimistic Games for Combinatorial Bayesian Optimization with Application to Protein Design

Melis Ilayda Bal, Pier Giuseppe Sessa, Mojmir Mutny et al.

ICLR 2025 • poster • arXiv:2409.18582
#3898

POGEMA: A Benchmark Platform for Cooperative Multi-Agent Pathfinding

Alexey Skrynnik, Anton Andreychuk, Anatolii Borzilov et al.

ICLR 2025 • poster • arXiv:2407.14931
#3899

Probabilistic Conformal Prediction with Approximate Conditional Validity

Vincent Plassier, Alexander Fishkov, Mohsen Guizani et al.

ICLR 2025 • poster • arXiv:2407.01794
#3900

Provable Convergence Bounds for Hybrid Dynamical Sampling and Optimization

Matthew Burns, Qingyuan Hou, Michael Huang

ICLR 2025 • poster
#3901

Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control

Devdhar Patel, Hava Siegelmann

ICLR 2025 • oral • arXiv:2410.08979
#3902

Large Scale Knowledge Washing

Yu Wang, Ruihan Wu, Zexue He et al.

ICLR 2025 • poster • arXiv:2405.16720
#3903

MorphoDiff: Cellular Morphology Painting with Diffusion Models

Zeinab Navidi, Jun Ma, Esteban Miglietta et al.

ICLR 2025 • poster
#3904

PN-GAIL: Leveraging Non-optimal Information from Imperfect Demonstrations

Qiang Liu, Huiqiao Fu, Kaiqiang Tang et al.

ICLR 2025 • poster
#3905

Rethinking Graph Neural Networks From A Geometric Perspective Of Node Features

Feng Ji, Yanan Zhao, Kai Zhao et al.

ICLR 2025 • poster
#3906

InfoGS: Efficient Structure-Aware 3D Gaussians via Lightweight Information Shaping

Yunchao Zhang, Guandao Yang, Leonidas Guibas et al.

ICLR 2025 • poster
#3907

IV-mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis

Shitong Shao, Zikai Zhou, Lichen Bai et al.

ICLR 2025 • oral • arXiv:2410.04171
#3908

Animate Your Thoughts: Reconstruction of Dynamic Natural Vision from Human Brain Activity

Yizhuo Lu, Changde Du, Chong Wang et al.

ICLR 2025 • oral
#3909

Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate

Yexiang Liu, Jie Cao, Zekun Li et al.

ICLR 2025 • poster
#3910

Multi-objective Differentiable Neural Architecture Search

Rhea Sukthanker, Arber Zela, Benedikt Staffler et al.

ICLR 2025 • poster • arXiv:2402.18213
#3911

Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues

Riccardo Grazzi, Julien Siems, Arber Zela et al.

ICLR 2025 • poster • arXiv:2411.12537
#3912

RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models

Tanqiu Jiang, Changjiang Li, Fenglong Ma et al.

ICLR 2025 • poster • arXiv:2502.12794
#3913

CAX: Cellular Automata Accelerated in JAX

Maxence Faldor, Antoine Cully

ICLR 2025 • poster • arXiv:2410.02651
#3914

Modeling dynamic social vision highlights gaps between deep learning and humans

Kathy Garcia, Emalie McMahon, Colin Conwell et al.

ICLR 2025 • poster
#3915

MoLEx: Mixture of Layer Experts for Fine-tuning with Sparse Upcycling

Rachel Teo, Tan Nguyen

ICLR 2025 • poster
#3916

Transformer Meets Twicing: Harnessing Unattended Residual Information

Laziz Abdullaev, Tan Nguyen

ICLR 2025 • poster • arXiv:2503.00687
#3917

RAG-SR: Retrieval-Augmented Generation for Neural Symbolic Regression

Hengzhe Zhang, Qi Chen, Bing Xue et al.

ICLR 2025 • poster
#3918

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Angelika Romanou, Negar Foroutan, Anna Sotnikova et al.

ICLR 2025 • poster • arXiv:2411.19799
#3919

Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment

Chenliang Li, Siliang Zeng, Zeyi Liao et al.

ICLR 2025 • poster
#3920

Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models

Jung Hyun Lee, June Yong Yang, Byeongho Heo et al.

ICLR 2025 • poster • arXiv:2407.12863
#3921

Inverse Attention Agents for Multi-Agent Systems

Qian Long, Ruoyan Li, Minglu Zhao et al.

ICLR 2025 • poster • arXiv:2410.21794
#3922

SelectFormer in Data Markets: Privacy-Preserving and Efficient Data Selection for Transformers with Multi-Party Computation

Xu Ouyang, Felix Xiaozhu Lin, Yangfeng Ji

ICLR 2025 • poster
#3923

Generalizing Reasoning Problems to Longer Lengths

Changnan Xiao, Bing Liu

ICLR 2025 • poster
#3924

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

Andy K Zhang, Neil Perry, Riya Dulepet et al.

ICLR 2025 • poster • arXiv:2408.08926
#3925

Learn-by-interact: A Data-Centric Framework For Self-Adaptive Agents in Realistic Environments

Hongjin Su, Ruoxi Sun, Jinsung Yoon et al.

ICLR 2025 • poster • arXiv:2501.10893
#3926

Verifying Properties of Binary Neural Networks Using Sparse Polynomial Optimization

Jianting Yang, Srecko Durasinovic, Jean Bernard Lasserre et al.

ICLR 2025 • poster • arXiv:2405.17049
#3927

GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

Saman Kazemkhani, Aarav Pandya, Daphne Cornelisse et al.

ICLR 2025 • poster • arXiv:2408.01584
#3928

EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation

Jiaxiang Tang, Max Li, Zekun Hao et al.

ICLR 2025 • poster • arXiv:2409.18114
#3929

Action Sequence Augmentation for Action Anticipation

Yihui Qiu, Deepu Rajan

ICLR 2025 • oral
#3930

IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking

Shubham Dipak Ugare, Rohan Gumaste, Tarun Suresh et al.

ICLR 2025 • poster • arXiv:2410.07295
#3931

AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit Topologies

Jian Gao, Weidong Cao, Junyi Yang et al.

ICLR 2025 • poster • arXiv:2503.00205
#3932

The Last Iterate Advantage: Empirical Auditing and Principled Heuristic Analysis of Differentially Private SGD

Milad Nasr, Thomas Steinke, Borja Balle et al.

ICLR 2025 • poster • arXiv:2410.06186
#3933

Near-Exact Privacy Amplification for Matrix Mechanisms

Christopher Choquette-Choo, Arun Ganesh, Saminul Haque et al.

ICLR 2025 • poster
#3934

Learning to Select Nodes in Branch and Bound with Sufficient Tree Representation

Sijia Zhang, Shuli Zeng, Shaoang Li et al.

ICLR 2025 • poster
#3935

Looking into User’s Long-term Interests through the Lens of Conservative Evidential Learning

Dingrong Wang, Krishna Neupane, Ervine Zheng et al.

ICLR 2025 • poster
#3936

Computational Explorations of Total Variation Distance

Arnab Bhattacharyya, Sutanu Gayen, Kuldeep S. Meel et al.

ICLR 2025 • poster • arXiv:2412.10370
#3937

TopoLM: brain-like spatio-functional organization in a topographic language model

Neil Rathi, Johannes Mehrer, Badr AlKhamissi et al.

ICLR 2025 • poster • arXiv:2410.11516
#3938

Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

Siyan Zhao, Mingyi Hong, Yang Liu et al.

ICLR 2025 • poster • arXiv:2502.09597
#3939

Convex Formulations for Training Two-Layer ReLU Neural Networks

Karthik Prakhya, Tolga Birdal, Alp Yurtsever

ICLR 2025 • poster • arXiv:2410.22311
#3940

Accelerating Task Generalisation with Multi-Level Skill Hierarchies

Thomas Cannon, Özgür Şimşek

ICLR 2025 • oral • arXiv:2411.02998
#3941

SSOLE: Rethinking Orthogonal Low-rank Embedding for Self-Supervised Learning

Lun Huang, Qiang Qiu, Guillermo Sapiro

ICLR 2025 • poster
#3942

Large Convolutional Model Tuning via Filter Subspace

Wei Chen, Zichen Miao, Qiang Qiu

ICLR 2025 • poster • arXiv:2403.00269
#3943

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization

Yuxin Jiang, Bo Huang, Yufei Wang et al.

ICLR 2025 • poster • arXiv:2408.07471
#3944

DPaI: Differentiable Pruning at Initialization with Node-Path Balance Principle

Lichuan Xiang, Quan Nguyen-Tri, Lan-Cuong Nguyen et al.

ICLR 2025 • poster
#3945

Balancing Act: Diversity and Consistency in Large Language Model Ensembles

Ahmed Abdulaal, Chen Jin, Nina Montaña-Brown et al.

ICLR 2025 • poster
#3946

DELTA: Dense Efficient Long-Range 3D Tracking for Any Video

Tuan Ngo, Peiye Zhuang, Evangelos Kalogerakis et al.

ICLR 2025 • poster • arXiv:2410.24211
#3947

Tailoring Mixup to Data for Calibration

Quentin Bouniot, Pavlo Mozharovskyi, Florence d'Alché-Buc

ICLR 2025 • poster • arXiv:2311.01434
#3948

BlendRL: A Framework for Merging Symbolic and Neural Policy Learning

Hikaru Shindo, Quentin Delfosse, Devendra Singh Dhami et al.

ICLR 2025 • poster • arXiv:2410.11689
#3949

MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction

Jing Yang, Minyue Jiang, Sen Yang et al.

ICLR 2025 • poster • arXiv:2410.07733
#3950

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Xiangyu Qi, Boyi Wei, Nicholas Carlini et al.

ICLR 2025 • poster • arXiv:2412.07097
#3951

LeanVec: Searching vectors faster by making them fit

Ishwar Bhati, Cecilia Aguerrebere, Mark Hildebrand et al.

ICLR 2025 • poster • arXiv:2312.16335
#3952

Efficient Dictionary Learning with Switch Sparse Autoencoders

Anish Mudide, Josh Engels, Eric Michaud et al.

ICLR 2025 • poster • arXiv:2410.08201
#3953

Curriculum-aware Training for Discriminating Molecular Property Prediction Models

Hansi Yang, Quanming Yao, James Kwok

ICLR 2025 • poster
#3954

Rationalizing and Augmenting Dynamic Graph Neural Networks

Guibin Zhang, Yiyan Qi, Ziyang Cheng et al.

ICLR 2025 • oral
#3955

Maintaining Structural Integrity in Parameter Spaces for Parameter Efficient Fine-tuning

Chongjie Si, Xuehui Wang, Xue Yang et al.

ICLR 2025 • poster • arXiv:2405.14739
#3956

On the Adversarial Risk of Test Time Adaptation: An Investigation into Realistic Test-Time Data Poisoning

Yongyi Su, Yushu Li, Nanqing Liu et al.

ICLR 2025 • poster • arXiv:2410.04682
#3957

Evidential Learning-based Certainty Estimation for Robust Dense Feature Matching

Lile Cai, Chuan Sheng Foo, Xun Xu et al.

ICLR 2025 • poster
#3958

Policy Design in Long-run Welfare Dynamics

Jiduan Wu, Rediet Abebe, Moritz Hardt et al.

ICLR 2025 • poster • arXiv:2503.00632
#3959

KAA: Kolmogorov-Arnold Attention for Enhancing Attentive Graph Neural Networks

Taoran Fang, Tianhong Gao, Chunping Wang et al.

ICLR 2025 • poster • arXiv:2501.13456
#3960

DECO: Unleashing the Potential of ConvNets for Query-based Detection and Segmentation

Xinghao Chen, Siwei Li, Yijing Yang et al.

ICLR 2025 • poster • arXiv:2312.13735
#3961

A Theoretical Framework for Partially-Observed Reward States in RLHF

Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano et al.

ICLR 2025 • poster
#3962

SPA-Bench: A Comprehensive Benchmark for Smartphone Agent Evaluation

Jingxuan Chen, Derek Yuen, Bin Xie et al.

ICLR 2025 • poster • arXiv:2410.15164
#3963

DeeperForward: Enhanced Forward-Forward Training for Deeper and Better Performance

Liang Sun, Yang Zhang, Weizhao He et al.

ICLR 2025 • poster
#3964

SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding

Zhenyu Yang, Yuhang Hu, Zemin Du et al.

ICLR 2025 • oral • arXiv:2502.10810
#3965

Spherical Tree-Sliced Wasserstein Distance

Viet-Hoang Tran, Thanh Chu, Minh-Khoi Nguyen-Nhat et al.

ICLR 2025 • poster • arXiv:2503.11249
#3966

MAI: A Multi-turn Aggregation-Iteration Model for Composed Image Retrieval

Yanzhe Chen, Zhiwen Yang, Jinglin Xu et al.

ICLR 2025 • poster
#3967

Disentangled Representation Learning with the Gromov-Monge Gap

Théo Uscidda, Luca Eyring, Karsten Roth et al.

ICLR 2025 • poster • arXiv:2407.07829
#3968

kNN Attention Demystified: A Theoretical Exploration for Scalable Transformers

Themistoklis Haris

ICLR 2025 • poster
#3969

Towards Unified Human Motion-Language Understanding via Sparse Interpretable Characterization

Guangtao Lyu, Chenghao Xu, Jiexi Yan et al.

ICLR 2025 • oral
#3970

Efficient Low-Bit Quantization with Adaptive Scales for Multi-Task Co-Training

Boyu Liu, Haoyu Huang, Linlin Yang et al.

ICLR 2025 • poster
#3971

Regularizing Energy among Training Samples for Out-of-Distribution Generalization

Yiting Chen, Qitian Wu, Junchi Yan

ICLR 2025 • poster
#3972

Rethinking and Improving Autoformalization: Towards a Faithful Metric and a Dependency Retrieval-based Approach

Qi Liu, Xinhao Zheng, Xudong Lu et al.

ICLR 2025 • poster
#3973

Learning Structured Universe Graph with Outlier OOD Detection for Partial Matching

Zetian Jiang, Jiaxin Lu, Haizhao Fan et al.

ICLR 2025 • poster
#3974

What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models

Ahmed Imtiaz Humayun, Ibtihel Amara, Cristina Nader Vasconcelos et al.

ICLR 2025 • poster • arXiv:2408.08307
#3975

To Clip or not to Clip: the Dynamics of SGD with Gradient Clipping in High-Dimensions

Noah Marshall, Ke Liang Xiao, Atish Agarwala et al.

ICLR 2025 • poster • arXiv:2406.11733
#3976

UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP

Wenzheng Pan, Hao Xiong, Jiale Ma et al.

ICLR 2025 • poster
#3977

Forget the Data and Fine-Tuning! Just Fold the Network to Compress

Dong Wang, Haris Šikić, Lothar Thiele et al.

ICLR 2025 • poster • arXiv:2502.10216
#3978

Statistical Advantages of Perturbing Cosine Router in Mixture of Experts

Huy Nguyen, Pedram Akbarian Saravi, Trang Pham et al.

ICLR 2025 • poster • arXiv:2405.14131
#3979

Learning Geometric Reasoning Networks For Robot Task And Motion Planning

Smail Ait Bouhsain, Rachid Alami, Thierry Simeon

ICLR 2025 • poster
#3980

Prompting Fairness: Integrating Causality to Debias Large Language Models

Jingling Li, Zeyu Tang, Xiaoyu Liu et al.

ICLR 2025 • poster • arXiv:2403.08743
#3981

Dynamic Negative Guidance of Diffusion Models

Felix Koulischer, Johannes Deleu, Gabriel Raya et al.

ICLR 2025 • poster • arXiv:2410.14398
#3982

Bilinear MLPs enable weight-based mechanistic interpretability

Michael Pearce, Thomas Dooms, Alice Rigg et al.

ICLR 2025 • poster • arXiv:2410.08417
#3983

Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization

Wenkai Yang, Shiqi Shen, Guangyao Shen et al.

ICLR 2025 • poster • arXiv:2406.11431
#3984

DocMIA: Document-Level Membership Inference Attacks against DocVQA Models

Khanh Nguyen, Raouf Kerkouche, Mario Fritz et al.

ICLR 2025 • poster • arXiv:2502.03692
#3985

Fine-Tuning Token-Based Large Multimodal Models: What Works, What Doesn’t and What’s Next

Zhulin Hu, Yan Ma, Jiadi Su et al.

ICLR 2025 • poster
#3986

Training-Free Diffusion Model Alignment with Sampling Demons

Po-Hung Yeh, Kuang-Huei Lee, Jun-Cheng Chen

ICLR 2025 • poster • arXiv:2410.05760
#3987

Uncertainty-Aware Decoding with Minimum Bayes Risk

Nico Daheim, Clara Meister, Thomas Möllenhoff et al.

ICLR 2025 • poster • arXiv:2503.05318
#3988

LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics

Thomas Robert, Mher Safaryan, Ionut-Vlad Modoranu et al.

ICLR 2025 • poster • arXiv:2410.16103
#3989

Tracking objects that change in appearance with phase synchrony

Sabine Muzellec, Drew Linsley, Alekh Ashok et al.

ICLR 2025 • poster • arXiv:2410.02094
#3990

Descent with Misaligned Gradients and Applications to Hidden Convexity

Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar et al.

ICLR 2025 • poster
#3991

Diffusion State-Guided Projected Gradient for Inverse Problems

Rayhan Zirvi, Bahareh Tolooshams, Anima Anandkumar

ICLR 2025 • poster • arXiv:2410.03463
#3992

Learning from weak labelers as constraints

Vishwajeet Agrawal, Rattana Pukdee, Nina Balcan et al.

ICLR 2025 • poster
#3993

A Distributional Approach to Uncertainty-Aware Preference Alignment Using Offline Demonstrations

Sheng Xu, Bo Yue, Hongyuan Zha et al.

ICLR 2025 • poster
#3994

Estimating the Probabilities of Rare Outputs in Language Models

Gabriel Wu, Jacob Hilton

ICLR 2025 • poster • arXiv:2410.13211
#3995

Self-Normalized Resets for Plasticity in Continual Learning

Vivek Farias, Adam Jozefiak

ICLR 2025 • poster • arXiv:2410.20098
#3996

Training on the Test Task Confounds Evaluation and Emergence

Ricardo Dominguez-Olmedo, Florian Eddie Dorner, Moritz Hardt

ICLR 2025 • poster • arXiv:2407.07890
#3997

COME: Test-time Adaption by Conservatively Minimizing Entropy

Qingyang Zhang, Yatao Bian, Xinke Kong et al.

ICLR 2025 • poster • arXiv:2410.10894
#3998

Oracle efficient truncated statistics

Konstantinos Karatapanis, Vasilis Kontonis, Christos Tzamos

ICLR 2025 • poster
#3999

Training Free Guided Flow-Matching with Optimal Control

Luran Wang, Chaoran Cheng, Yizhen Liao et al.

ICLR 2025 • poster
#4000

SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation

Mingjie Li, Wai Man Si, Michael Backes et al.

ICLR 2025 • poster • arXiv:2501.01765