Most Cited ICLR "model safety amplification" Papers

6,124 papers found • Page 31 of 31

#6001

SocioDojo: Building Lifelong Analytical Agents with Real-world Text and Time Series

Junyan Cheng, Peter Chin

ICLR 2024spotlight
#6002

HyperPLR: Hypergraph Generation through Projection, Learning, and Reconstruction

Weihuang Wen, Tianshu Yu

ICLR 2025
#6003

Modeling Unseen Environments with Language-guided Composable Causal Components in Reinforcement Learning

Xinyue Wang, Biwei Huang

ICLR 2025arXiv:2505.08361
#6004

A unique M-pattern for micro-expression spotting in long videos

Jinxuan Wang, Shiting Xu, Tong Zhang

ICLR 2024
#6005

HiRA: Parameter-Efficient Hadamard High-Rank Adaptation for Large Language Models

Qiushi Huang, Tom Ko, Zhan ZHUANG et al.

ICLR 2025
#6006

Modeling state-dependent communication between brain regions with switching nonlinear dynamical systems

Orren Karniol-Tambour, David Zoltowski, E. Mika Diamanti et al.

ICLR 2024
#6007

Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models

Yong-Hyun Park, Chieh-Hsin Lai, Satoshi Hayakawa et al.

ICLR 2025
#6008

Bad-PFL: Exploiting Backdoor Attacks against Personalized Federated Learning

Mingyuan Fan, Zhanyi Hu, Fuyi Wang et al.

ICLR 2025
#6009

PhysPDE: Rethinking PDE Discovery and a Physical HYpothesis Selection Benchmark

Mingquan Feng, Yixin Huang, Yizhou Liu et al.

ICLR 2025
#6010

K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models

Jaehyung Seo, Heuiseok Lim

ICLR 2025
#6011

Long-Short Decision Transformer: Bridging Global and Local Dependencies for Generalized Decision-Making

Jincheng Wang, Penny Karanasou, Pengyuan Wei et al.

ICLR 2025
#6012

Faithful Vision-Language Interpretation via Concept Bottleneck Models

Songning Lai, Lijie Hu, Junxiao Wang et al.

ICLR 2024
#6013

Multi-Resolution Decomposable Diffusion Model for Non-Stationary Time Series Anomaly Detection

Guojin Zhong, pan wang, Jin Yuan et al.

ICLR 2025oral
#6014

Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets

Yihuan Mao, Chengjie Wu, Xi Chen et al.

ICLR 2024oral
#6015

Threaten Spiking Neural Networks through Combining Rate and Temporal Information

Zecheng Hao, Tong Bu, Xinyu Shi et al.

ICLR 2024oral
#6016

Learning to Search from Demonstration Sequences

Dixant Mittal, Liwei Kang, Wee Sun Lee

ICLR 2025
#6017

A Newborn Embodied Turing Test for Comparing Object Segmentation Across Animals and Machines

Manju Garimella, Denizhan Pak, Justin Wood et al.

ICLR 2024
#6018

Joint Gradient Balancing for Data Ordering in Finite-Sum Multi-Objective Optimization

Hansi Yang, James Kwok

ICLR 2025
#6019

Efficient Top-m Data Values Identification for Data Selection

Xiaoqiang Lin, Xinyi Xu, See-Kiong Ng et al.

ICLR 2025
#6020

Manifold Constraint Reduces Exposure Bias in Accelerated Diffusion Sampling

ICLR 2025
#6021

Divergence of Neural Tangent Kernel in Classification Problems

Zixiong Yu, Songtao Tian, Guhan Chen

ICLR 2025
#6022

Extending Power of Nature from Binary to Real-Valued Graph Learning in Real World

Chunshu Wu, Ruibing Song, Chuan Liu et al.

ICLR 2024
#6023

W-PCA Based Gradient-Free Proxy for Efficient Search of Lightweight Language Models

Shang Wang

ICLR 2025arXiv:2504.15983
#6024

Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding

Yuanhao Xiong, Long Zhao, Boqing Gong et al.

ICLR 2024oralarXiv:2303.16341
#6025

Cross-Domain Off-Policy Evaluation and Learning for Contextual Bandits

Yuta Natsubori, Masataka Ushiku, Yuta Saito

ICLR 2025
#6026

Certified Adversarial Robustness for Rate Encoded Spiking Neural Networks

Bhaskar Mukhoty, Hilal AlQuabeh, Giulia De Masi et al.

ICLR 2024
#6027

Improving Unsupervised Constituency Parsing via Maximizing Semantic Information

Junjie Chen, Xiangheng He, Yusuke Miyao et al.

ICLR 2025arXiv:2410.02558
#6028

Denoising Levy Probabilistic Models

Dario Shariatian, Umut Simsekli, Alain Oliviero Durmus

ICLR 2025
#6029

Flat Reward in Policy Parameter Space Implies Robust Reinforcement Learning

HyunKyu Lee, Sung Whan Yoon

ICLR 2025
#6030

Regularized Proportional Fairness Mechanism for Resource Allocation Without Money

Sujay Bhatt, Alec Koppel, Sumitra Ganesh et al.

ICLR 2025arXiv:2501.01111
#6031

Making Transformer Decoders Better Differentiable Indexers

Wuchao Li, Kai Zheng, Defu Lian et al.

ICLR 2025
#6032

Retri3D: 3D Neural Graphics Representation Retrieval

Yushi Guan, Daniel Kwan, Jean Dandurand et al.

ICLR 2025
#6033

RingAttention with Blockwise Transformers for Near-Infinite Context

Hao Liu, Matei Zaharia, Pieter Abbeel

ICLR 2024
#6034

Why In-Context Learning Models are Good Few-Shot Learners?

Shiguang Wu, Yaqing Wang, Quanming Yao

ICLR 2025
#6035

Co$^{\mathbf{3}}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion

Xingqun Qi, Yatian Wang, Hengyuan Zhang et al.

ICLR 2025oral
#6036

Debiasing Attention Mechanism in Transformer without Demographics

Shenyu Lu, Yipei Wang, Xiaoqian Wang

ICLR 2024
#6037

Oracle efficient truncated statistics

Konstantinos Karatapanis, Vasilis Kontonis, Christos Tzamos

ICLR 2025
#6038

Graph Transformers on EHRs: Better Representation Improves Downstream Performance

Raphael Poulain, Rahmatollah Beheshti

ICLR 2024oral
#6039

An Illustrated Guide to Automatic Sparse Differentiation

Adrian Hill, Guillaume Dalle, Alexis Montoison

ICLR 2025
#6040

Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds

Shuangqi Li, Hieu Le, Jingyi Xu et al.

ICLR 2025
#6041

Scalable Modular Network: A Framework for Adaptive Learning via Agreement Routing

Minyang Hu, Hong Chang, Bingpeng Ma et al.

ICLR 2024
#6042

Improved Regret Bounds for Non-Convex Online-Within-Online Meta Learning

Jiechao GUAN, Hui Xiong

ICLR 2024
#6043

Can One Modality Model Synergize Training of Other Modality Models?

Jae-Jun Lee, Sung Whan Yoon

ICLR 2025
#6044

Do WGANs succeed because they minimize the Wasserstein Distance? Lessons from Discrete Generators

Ariel Elnekave, Yair Weiss

ICLR 2025
#6045

Integral Performance Approximation for Continuous-Time Reinforcement Learning Control

Brent Wallace, Jennie Si

ICLR 2025
#6046

Asymmetric Factorized Bilinear Operation for Vision Transformer

Junjie Wu, Qilong Wang, Jiangtao Xie et al.

ICLR 2025
#6047

A Theoretically-Principled Sparse, Connected, and Rigid Graph Representation of Molecules

Shih-Hsin Wang, Yuhao Huang, Justin Baker et al.

ICLR 2025
#6048

Efficient Cross-Episode Meta-RL

Gresa Shala, André Biedenkapp, Pierre Krack et al.

ICLR 2025
#6049

Hypergraph Dynamic System

Jielong Yan, Yifan Feng, Shihui Ying et al.

ICLR 2024
#6050

Atomas: Hierarchical Adaptive Alignment on Molecule-Text for Unified Molecule Understanding and Generation

Yikun Zhang, Geyan Ye, Chaohao Yuan et al.

ICLR 2025
#6051

Logic-Logit: A Logic-Based Approach to Choice Modeling

Shuhan Zhang, Wendi Ren, Shuang Li

ICLR 2025
#6052

Differential Transformer

Tianzhu Ye, Li Dong, Yuqing Xia et al.

ICLR 2025arXiv:2410.05258
#6053

VideoGLUE: Video General Understanding Evaluation of Foundation Models

Boqing Gong, Yin Cui, Long Zhao et al.

ICLR 2025oral
#6054

Boosting the Adversarial Robustness of Graph Neural Networks: An OOD Perspective

Kuan Li, YiWen Chen, Yang Liu et al.

ICLR 2024
#6055

Discriminator-Guided Embodied Planning for LLM Agent

Haofu Qian, Chenjia Bai, Jiatao Zhang et al.

ICLR 2025
#6056

Training Free Guided Flow-Matching with Optimal Control

Luran Wang, Chaoran Cheng, Yizhen Liao et al.

ICLR 2025
#6057

RLCD: Reinforcement Learning from Contrastive Distillation for LM Alignment

Kevin Yang, Dan Klein, Asli Celikyilmaz et al.

ICLR 2024
#6058

Brain Bandit: A Biologically Grounded Neural Network for Efficient Control of Exploration

Chen Jiang, Jiahui An, Yating Liu et al.

ICLR 2025
#6059

You Only Query Once: An Efficient Label-Only Membership Inference Attack

Yutong Wu, Han Qiu, Shangwei Guo et al.

ICLR 2024
#6060

Mind the GAP: Glimpse-based Active Perception improves generalization and sample efficiency of visual reasoning

Oleh Kolner, Thomas Ortner, Stanisław Woźniak et al.

ICLR 2025arXiv:2409.20213
#6061

An Image Is Worth 1000 Lies: Transferability of Adversarial Images across Prompts on Vision-Language Models

Haochen Luo, Jindong Gu, Fengyuan Liu et al.

ICLR 2024spotlight
#6062

Unveiling Options with Neural Network Decomposition

Mahdi Alikhasi, Levi Lelis

ICLR 2024oral
#6063

Interpretable Sparse System Identification: Beyond Recent Deep Learning Techniques on Time-Series Prediction

Liu Xiaoyi, Duxin Chen, Wenjia Wei et al.

ICLR 2024
#6064

FedInverse: Evaluating Privacy Leakage in Federated Learning

DI WU, Jun Bai, Yiliao Song et al.

ICLR 2024
#6065

CircuitNet 2.0: An Advanced Dataset for Promoting Machine Learning Innovations in Realistic Chip Design Environment

Xun Jiang, zhuomin chai, Yuxiang Zhao et al.

ICLR 2024
#6066

Enhancing Vision-Language Model with Unmasked Token Alignment

Hongsheng Li, Jihao Liu, Boxiao Liu et al.

ICLR 2025arXiv:2405.19009
#6067

BTBS-LNS: Binarized-Tightening, Branch and Search on Learning LNS Policies for MIP

Hao Yuan, wenli ouyang, Changwen Zhang et al.

ICLR 2025
#6068

On the Scalability and Memory Efficiency of Semidefinite Programs for Lipschitz Constant Estimation of Neural Networks

Zi Wang, Bin Hu, Aaron Havens et al.

ICLR 2024
#6069

Optimal criterion for feature learning of two-layer linear neural network in high dimensional interpolation regime

Keita Suzuki, Taiji Suzuki

ICLR 2024
#6070

HGM³: Hierarchical Generative Masked Motion Modeling with Hard Token Mining

Minjae Jeong, Yechan Hwang, Jaejin Lee et al.

ICLR 2025
#6071

The Illustrated AlphaFold

Elana Simon, Jake Silberg

ICLR 2025
#6072

Robust Transfer of Safety-Constrained Reinforcement Learning Agents

Markel Zubia, Thiago Simão, Nils Jansen

ICLR 2025
#6073

Rethinking CNN’s Generalization to Backdoor Attack from Frequency Domain

Quanrui Rao, Lin Wang, Wuying Liu

ICLR 2024
#6074

Noise-conditioned Energy-based Annealed Rewards (NEAR): A Generative Framework for Imitation Learning from Observation

Anish Abhijit Diwan, Julen Urain, Jens Kober et al.

ICLR 2025arXiv:2501.14856
#6075

Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation

Zhilong Zhang, Yihao Sun, Junyin Ye et al.

ICLR 2024oral
#6076

Local convergence of simultaneous min-max algorithms to differential equilibrium on Riemannian manifold

Sixin Zhang

ICLR 2025arXiv:2405.13392
#6077

VBH-GNN: Variational Bayesian Heterogeneous Graph Neural Networks for Cross-subject Emotion Recognition

Chenyu Liu, XINLIANG ZHOU, Zhengri Zhu et al.

ICLR 2024oral
#6078

Neural Rate Control for Learned Video Compression

yiwei zhang, Guo Lu, Yunuo Chen et al.

ICLR 2024oral
#6079

PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Xuan Ju, Ailing Zeng, Yuxuan Bian et al.

ICLR 2024
#6080

BroGNet: Momentum-Conserving Graph Neural Stochastic Differential Equation for Learning Brownian Dynamics

Suresh Suresh, Jayadeva Jayadeva, Sayan Ranu et al.

ICLR 2024
#6081

A deep inverse-mapping model for a flapping robotic wing

Hadar Sharvit, Raz Karl, Tsevi Beatus

ICLR 2025arXiv:2502.09378
#6082

Youku Dense Caption: A Large-scale Chinese Video Dense Caption Dataset and Benchmarks

Zixuan Xiong, Guangwei Xu, wenkai zhang et al.

ICLR 2025
#6083

Structural Fairness-aware Active Learning for Graph Neural Networks

Haoyu Han, Xiaorui Liu, Li Ma et al.

ICLR 2024
#6084

Improved Efficiency Based on Learned Saccade and Continuous Scene Reconstruction From Foveated Visual Sampling

Jiayang Liu, Yiming Bu, Daniel Tso et al.

ICLR 2024spotlight
#6085

FlickerFusion: Intra-trajectory Domain Generalizing Multi-agent Reinforcement Learning

Woosung Koh, Wonbeen Oh, Siyeol Kim et al.

ICLR 2025
#6086

Plug, Play, and Generalize: Length Extrapolation with Pointer-Augmented Neural Memory

Svetha Venkatesh, Kien Do, Hung Le et al.

ICLR 2025
#6087

Improved Convergence Rate for Diffusion Probabilistic Models

Gen Li, Yuchen Jiao

ICLR 2025
#6088

Rapid Selection and Ordering of In-Context Demonstrations via Prompt Embedding Clustering

Kha Pham, Hung Le, Man Ngo et al.

ICLR 2025
#6089

SymDiff: Equivariant Diffusion via Stochastic Symmetrisation

Leo Zhang, Kianoosh Ashouritaklimi, Yee Whye Teh et al.

ICLR 2025arXiv:2410.06262
#6090

Negatively Correlated Ensemble Reinforcement Learning for Online Diverse Game Level Generation

Ziqi Wang, Chengpeng Hu, Jialin Liu et al.

ICLR 2024
#6091

AttEXplore: Attribution for Explanation with model parameters eXploration

Zhiyu Zhu, Huaming Chen, Jiayu Zhang et al.

ICLR 2024
#6092

MA$^2$E: Addressing Partial Observability in Multi-Agent Reinforcement Learning with Masked Auto-Encoder

Sehyeok Kang, Yongsik Lee, Gahee Kim et al.

ICLR 2025
#6093

Local Composite Saddle Point Optimization

Site Bai, Brian Bullins

ICLR 2024
#6094

Prediction Risk and Estimation Risk of the Ridgeless Least Squares Estimator under General Assumptions on Regression Errors

Sungyoon Lee, Sokbae Lee

ICLR 2025arXiv:2305.12883
#6095

Transformer-Modulated Diffusion Models for Probabilistic Multivariate Time Series Forecasting

Yuxin Li, Wenchao Chen, Xinyue Hu et al.

ICLR 2024
#6096

sRGB Real Noise Modeling via Noise-Aware Sampling with Normalizing Flows

Dongjin Kim, Donggoo Jung, Sungyong Baik et al.

ICLR 2024
#6097

A Visual Dive into Conditional Flow Matching

Anne Gagneux, Ségolène Martin, Rémi Emonet et al.

ICLR 2025
#6098

Language Model Detectors Are Easily Optimized Against

Charlotte Nicks, Eric Mitchell, Rafael Rafailov et al.

ICLR 2024
#6099

Pre-training of Foundation Adapters for LLM Fine-tuning

Linh The Nguyen, Dat Quoc Nguyen

ICLR 2025
#6100

Relation-Aware Diffusion for Heterogeneous Graphs with Partially Observed Features

Daeho Um, Yoonji Lee, Jiwoong Park et al.

ICLR 2025
#6101

Login

ICLR 2024arXiv:1006.2411
#6102

Prompt Gradient Projection for Continual Learning

Jingyang Qiao, Zhizhong Zhang, Xin Tan et al.

ICLR 2024spotlight
#6103

Do vision models perceive objects like toddlers ?

Arthur Aubret, Jochen Triesch

ICLR 2025
#6104

Efficient Neuron Segmentation in Electron Microscopy by Affinity-Guided Queries

Hang Chen, Chufeng Tang, Xiao Li et al.

ICLR 2025
#6105

Effective post-training embedding compression via temperature control in contrastive training

georgiana dinu, Corey Barrett, Yi Xiang et al.

ICLR 2025
#6106

In vivo cell-type and brain region classification via multimodal contrastive learning

Han Yu, Hanrui Lyu, YiXun Xu et al.

ICLR 2025
#6107

VoxDialogue: Can Spoken Dialogue Systems Understand Information Beyond Words?

Xize Cheng, Ruofan Hu, Xiaoda Yang et al.

ICLR 2025
#6108

Neural Neighborhood Search for Multi-agent Path Finding

Zhongxia Yan, Cathy Wu

ICLR 2024oral
#6109

Manipulating dropout reveals an optimal balance of efficiency and robustness in biological and machine visual systems

Jacob Prince, Gabriel Fajardo, George Alvarez et al.

ICLR 2024oral
#6110

MMD-Regularized Unbalanced Optimal Transport

SakethaNath Jagarlapudi, Pratik Jawanpuria, Piyushi Manupriya

ICLR 2025
#6111

Hybrid Regularization Improves Diffusion-based Inverse Problem Solving

Hongkun Dou, Zeyu Li, Jinyang Du et al.

ICLR 2025
#6112

Improved Sampling Algorithms for Lévy-Itô Diffusion Models

Vadim Popov, Assel Yermekova, Tasnima Sadekova et al.

ICLR 2025
#6113

Neural Stochastic Differential Equations for Uncertainty-Aware Offline RL

Cevahir Koprulu, Franck Djeumou, ufuk topcu

ICLR 2025
#6114

Tackling the Data Heterogeneity in Asynchronous Federated Learning with Cached Update Calibration

Yujia Wang, Yuanpu Cao, Jingcheng Wu et al.

ICLR 2024
#6115

Bias Mitigation in Graph Diffusion Models

Meng Yu, Kun Zhan

ICLR 2025
#6116

Adaptive Stochastic Gradient Algorithm for Black-box Multi-Objective Learning

Feiyang YE, YUEMING LYU, Xuehao Wang et al.

ICLR 2024
#6117

DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning

Jing Xiong, Zixuan Li, Chuanyang Zheng et al.

ICLR 2024arXiv:2310.02954
#6118

A Stochastic Approach to the Subset Selection Problem via Mirror Descent

Dan Greenstein, Elazar Gershuni, Ilan Ben-Bassat et al.

ICLR 2025
#6119

Optimality of Matrix Mechanism on $\ell_p^p$-metric

Zongrui Zou, Jingcheng Liu, Jalaj Upadhyay

ICLR 2025arXiv:2406.02140
#6120

Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher

Yong Guo, Shulian Zhang, Haolin Pan et al.

ICLR 2025arXiv:2410.04140
#6121

Scalable Extraction of Training Data from Aligned, Production Language Models

Milad Nasr, Javier Rando, Nicholas Carlini et al.

ICLR 2025
#6122

Beyond Mere Token Analysis: A Hypergraph Metric Space Framework for Defending Against Socially Engineered LLM Attacks

Manohar Kaul, Aditya Saibewar, Sadbhavana Babar

ICLR 2025
#6123

Has the Deep Neural Network learned the Stochastic Process? An Evaluation Viewpoint

Harshit Kumar, Beomseok Kang, Biswadeep Chakraborty et al.

ICLR 2025arXiv:2402.15163
#6124

NeRM: Learning Neural Representations for High-Framerate Human Motion Synthesis

Dong Wei, Huaijiang Sun, Bin Li et al.

ICLR 2024oral