Most Cited 2025 "causal order" Papers

22,274 papers found • Page 95 of 112

#18801

LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics

Thomas Robert, Mher Safaryan, Ionut-Vlad Modoranu et al.

ICLR 2025arXiv:2410.16103
#18802

Tracking objects that change in appearance with phase synchrony

Sabine Muzellec, Drew Linsley, Alekh Ashok et al.

ICLR 2025arXiv:2410.02094
#18803

Descent with Misaligned Gradients and Applications to Hidden Convexity

Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar et al.

ICLR 2025
#18804

Diffusion State-Guided Projected Gradient for Inverse Problems

Rayhan Zirvi, Bahareh Tolooshams, anima anandkumar

ICLR 2025arXiv:2410.03463
#18805

Learning from weak labelers as constraints

Vishwajeet Agrawal, Rattana Pukdee, Nina Balcan et al.

ICLR 2025
#18806

A Distributional Approach to Uncertainty-Aware Preference Alignment Using Offline Demonstrations

Sheng Xu, Bo Yue, Hongyuan Zha et al.

ICLR 2025
#18807

Estimating the Probabilities of Rare Outputs in Language Models

Gabriel Wu, Jacob Hilton

ICLR 2025arXiv:2410.13211
#18808

Self-Normalized Resets for Plasticity in Continual Learning

Vivek Farias, Adam Jozefiak

ICLR 2025arXiv:2410.20098
#18809

Training on the Test Task Confounds Evaluation and Emergence

Ricardo Dominguez-Olmedo, Florian Eddie Dorner, Moritz Hardt

ICLR 2025arXiv:2407.07890
#18810

COME: Test-time Adaption by Conservatively Minimizing Entropy

Qingyang Zhang, Yatao Bian, Xinke Kong et al.

ICLR 2025arXiv:2410.10894
#18811

Oracle efficient truncated statistics

Konstantinos Karatapanis, Vasilis Kontonis, Christos Tzamos

ICLR 2025
#18812

Training Free Guided Flow-Matching with Optimal Control

Luran Wang, Chaoran Cheng, Yizhen Liao et al.

ICLR 2025
#18813

SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation

Mingjie Li, Wai Man Si, Michael Backes et al.

ICLR 2025arXiv:2501.01765
#18814

BTBS-LNS: Binarized-Tightening, Branch and Search on Learning LNS Policies for MIP

Hao Yuan, wenli ouyang, Changwen Zhang et al.

ICLR 2025
#18815

Pre-training of Foundation Adapters for LLM Fine-tuning

Linh The Nguyen, Dat Quoc Nguyen

ICLR 2025
#18816

NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation

Zhiyuan Liu, Yanchen Luo, Han Huang et al.

ICLR 2025arXiv:2502.12638
#18817

NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals

Wei-Bang Jiang, Yansen Wang, Bao-liang Lu et al.

ICLR 2025oralarXiv:2409.00101
#18818

Conformalized Interactive Imitation Learning: Handling Expert Shift and Intermittent Feedback

Michelle Zhao, Henny Admoni, Reid Simmons et al.

ICLR 2025arXiv:2410.08852
#18819

A Computational Framework for Modeling Emergence of Color Vision in the Human Brain

Atsunobu Kotani, Yi-Ren Ng

ICLR 2025arXiv:2408.16916
#18820

From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency

Kaiyue Wen, Huaqing Zhang, Hongzhou Lin et al.

ICLR 2025arXiv:2410.05459
#18821

Unsupervised Multiple Kernel Learning for Graphs via Ordinality Preservation

Yan Sun, Stanley Kok

ICLR 2025
#18822

Collaborative Discrete-Continuous Black-Box Prompt Learning for Language Models

Hualin Zhang, Haozhen Zhang, Zhekai Liu et al.

ICLR 2025
#18823

Generalizable Human Gaussians from Single-View Image

Jinnan Chen, Chen Li, Jianfeng Zhang et al.

ICLR 2025arXiv:2406.06050
#18824

Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD

Ze Peng, Jian Zhang, Yisen Wang et al.

ICLR 2025arXiv:2601.01465
#18825

Efficient Interpolation between Extragradient and Proximal Methods for Weak MVIs

Thomas Pethick, Ioannis Mavrothalassitis, Volkan Cevher

ICLR 2025
#18826

Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models

Simon Schrodi, David T. Hoffmann, Max Argus et al.

ICLR 2025arXiv:2404.07983
#18827

VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking

Runyi Hu, Jie Zhang, Yiming Li et al.

ICLR 2025oralarXiv:2501.14195
#18828

Flaws of ImageNet, Computer Vision's Favourite Dataset

Nikita Kisel, Illia Volkov, Kateřina Hanzelková et al.

ICLR 2025arXiv:2412.00076
#18829

Influence Functions for Scalable Data Attribution in Diffusion Models

Bruno Mlodozeniec, Runa Eschenhagen, Juhan Bae et al.

ICLR 2025arXiv:2410.13850
#18830

Lossy Compression with Pretrained Diffusion Models

jeremy vonderfecht, Feng Liu

ICLR 2025arXiv:2501.09815
#18831

Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning

Haozhe Ma, Zhengding Luo, Thanh Vinh Vo et al.

ICLR 2025arXiv:2408.03029
#18832

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Jintao Zhang, Jia wei, Pengle Zhang et al.

ICLR 2025arXiv:2410.02367
#18833

GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training

Renqiu Xia, mingsheng li, Hancheng Ye et al.

ICLR 2025arXiv:2412.11863
#18834

Safety Layers in Aligned Large Language Models: The Key to LLM Security

Shen Li, Liuyi Yao, Lan Zhang et al.

ICLR 2025arXiv:2408.17003
#18835

PIN: Prolate Spheroidal Wave Function-based Implicit Neural Representations

Viraj Dhananjaya Bandara Jayasundara Jayasundara Mudiyanselage, Heng Zhao, Demetrio Labate et al.

ICLR 2025
#18836

Learning Harmonized Representations for Speculative Sampling

Lefan Zhang, Xiaodan Wang, Yanhua Huang et al.

ICLR 2025arXiv:2408.15766
#18837

Extendable and Iterative Structure Learning Strategy for Bayesian Networks

Hamid Kalantari, Russell Greiner, Pouria Ramazi

ICLR 2025
#18838

KinFormer: Generalizable Dynamical Symbolic Regression for Catalytic Organic Reaction Kinetics

Jindou Chen, Jidong Tian, Liang Wu et al.

ICLR 2025
#18839

Transformers Provably Solve Parity Efficiently with Chain of Thought

Juno Kim, Taiji Suzuki

ICLR 2025arXiv:2410.08633
#18840

CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification

Mingkun Zhang, Keping Bi, Wei Chen et al.

ICLR 2025arXiv:2502.18176
#18841

Models trained with unnormalized density functions: A need for a course correction

Rishal Aggarwal, Daniel Penaherrera, Justin Shao et al.

ICLR 2025
#18842

REMEDY: Recipe Merging Dynamics in Large Vision-Language Models

Didi Zhu, Yibing Song, tao shen et al.

ICLR 2025
#18843

Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality

Guanyu Zhou, Yibo Yan, Xin Zou et al.

ICLR 2025arXiv:2410.04780
#18844

Noise Separation guided Candidate Label Reconstruction for Noisy Partial Label Learning

Xiaorui Peng, Yuheng Jia, Fuchao Yang et al.

ICLR 2025
#18845

ILLUSION: Unveiling Truth with a Comprehensive Multi-Modal, Multi-Lingual Deepfake Dataset

Kartik Thakral, Rishabh Ranjan, Akanksha Singh et al.

ICLR 2025
#18846

MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Chenxi Wang, Xiang Chen, Ningyu Zhang et al.

ICLR 2025arXiv:2410.11779
#18847

Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning

Anh Tong, Thanh Nguyen-Tang, Dongeun Lee et al.

ICLR 2025arXiv:2503.01329
#18848

Projection Head is Secretly an Information Bottleneck

Zhuo Ouyang, Kaiwen Hu, Qi Zhang et al.

ICLR 2025arXiv:2503.00507
#18849

Boltzmann Semantic Score: A Semantic Metric for Evaluating Large Vision Models Using Large Language Models

Ali Khajegili Mirabadi, Katherine Rich, Hossein Farahani et al.

ICLR 2025
#18850

Unsupervised Zero-Shot Reinforcement Learning via Dual-Value Forward-Backward Representation

Jingbo Sun, Songjun Tu, Qichao Zhang et al.

ICLR 2025
#18851

RuAG: Learned-rule-augmented Generation for Large Language Models

Yudi Zhang, Pei Xiao, Lu Wang et al.

ICLR 2025arXiv:2411.03349
#18852

SOAP: Improving and Stabilizing Shampoo using Adam for Language Modeling

Nikhil Vyas, Depen Morwani, Rosie Zhao et al.

ICLR 2025
#18853

When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers

Hongkang Li, Yihua Zhang, shuai ZHANG et al.

ICLR 2025arXiv:2504.10957
#18854

Improving Deep Regression with Tightness

Shihao Zhang, Yuguang Yan, Angela Yao

ICLR 2025arXiv:2502.09122
#18855

GSE: Group-wise Sparse and Explainable Adversarial Attacks

Shpresim Sadiku, Moritz Wagner, Sebastian Pokutta

ICLR 2025arXiv:2311.17434
#18856

Understanding Methods for Scalable MCTS

Will Knipe

ICLR 2025
#18857

The impact of allocation strategies in subset learning on the expressive power of neural networks

Ofir Schlisselberg, Ran Darshan

ICLR 2025arXiv:2502.06300
#18858

Wavelet Diffusion Neural Operator

Peiyan Hu, Rui Wang, Xiang Zheng et al.

ICLR 2025arXiv:2412.04833
#18859

VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text

Tianyu Zhang, Suyuchen Wang, Lu Li et al.

ICLR 2025arXiv:2406.06462
#18860

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation

Lu Li, Tianyu Zhang, Zhiqi Bu et al.

ICLR 2025arXiv:2406.07529
#18861

OccProphet: Pushing the Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with an Observer-Forecaster-Refiner Framework

Junliang Chen, Huaiyuan Xu, Yi Wang et al.

ICLR 2025oral
#18862

Nonlinear Sequence Embedding by Monotone Variational Inequality

Jonathan Y. Zhou, Yao Xie

ICLR 2025
#18863

Agree to Disagree: Demystifying Homogeneous Deep Ensembles through Distributional Equivalence

Yipei Wang, Xiaoqian Wang

ICLR 2025
#18864

Quantum (Inspired) $D^2$-sampling with Applications

Poojan Shah, Ragesh Jaiswal

ICLR 2025arXiv:2405.13351
#18865

Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct

Christopher Ackerman, Nina Panickssery

ICLR 2025oralarXiv:2410.02064
#18866

Discovering Clone Negatives via Adaptive Contrastive Learning for Image-Text Matching

Renjie Pan, Jihao Dong, Hua Yang

ICLR 2025
#18867

Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization

Juntao Dai, Taiye Chen, Yaodong Yang et al.

ICLR 2025arXiv:2503.18130
#18868

IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

Xinchen Zhang, Ling Yang, Guohao Li et al.

ICLR 2025arXiv:2410.07171
#18869

Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks

Wangjia Yu, Xiaomeng Fu, Qiao Li et al.

ICLR 2025
#18870

Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?

xueru wen, Jie Lou, Yaojie Lu et al.

ICLR 2025arXiv:2410.05584
#18871

Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations

Katie Matton, Robert Ness, John Guttag et al.

ICLR 2025arXiv:2504.14150
#18872

ProtPainter: Draw or Drag Protein via Topology-guided Diffusion

Zhengxi Lu, Shizhuo Cheng, Yuru Jiang et al.

ICLR 2025arXiv:2504.14274
#18873

Redefining the task of Bioactivity Prediction

Yanwen Huang, Bowen Gao, Yinjun JIA et al.

ICLR 2025
#18874

CtD: Composition through Decomposition in Emergent Communication

Boaz Carmeli, Ron Meir, Yonatan Belinkov

ICLR 2025arXiv:2601.10169
#18875

Reframing Structure-Based Drug Design Model Evaluation via Metrics Correlated to Practical Needs

Bowen Gao, Haichuan Tan, Yanwen Huang et al.

ICLR 2025
#18876

Score-based Self-supervised MRI Denoising

Jiachen Tu, Yaokun Shi, Fan Lam

ICLR 2025arXiv:2505.05631
#18877

Revisit Micro-batch Clipping: Adaptive Data Pruning via Gradient Manipulation

Lun Wang

ICLR 2025arXiv:2408.16204
#18878

Local Patterns Generalize Better for Novel Anomalies

Yalong Jiang

ICLR 2025oral
#18879

Model Risk-sensitive Offline Reinforcement Learning

Gwangpyo Yoo, Honguk Woo

ICLR 2025
#18880

Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

Rui Hu, Yifan Zhang, Zhuoran Li et al.

ICLR 2025arXiv:2410.02596
#18881

Offline RL in Regular Decision Processes: Sample Efficiency via Language Metrics

Ahana Deb, Roberto Cipollone, Anders Jonsson et al.

ICLR 2025
#18882

TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval

Leqi Shen, Tianxiang Hao, Tao He et al.

ICLR 2025oralarXiv:2409.01156
#18883

FIRING-Net: A filtered feature recycling network for speech enhancement

Xinmeng Xu, Yiqun Zhang, Jizhen Li et al.

ICLR 2025
#18884

ZooProbe: A Data Engine for Evaluating, Exploring, and Evolving Large-scale Training Data for Multimodal LLMs

Yi-Kai Zhang, Shiyin Lu, Qing-Guo Chen et al.

ICLR 2025
#18885

Simple yet Effective Incomplete Multi-view Clustering: Similarity-level Imputation and Intra-view Hybrid-group Prototype Construction

Shengju Yu, Zhibin Dong, Siwei Wang et al.

ICLR 2025
#18886

UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization

Peiwen Yuan, Shaoxiong Feng, Yiwei Li et al.

ICLR 2025arXiv:2502.11454
#18887

Personality Alignment of Large Language Models

Minjun Zhu, Yixuan Weng, Linyi Yang et al.

ICLR 2025oralarXiv:2408.11779
#18888

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Sihang Li, Jin Huang, Jiaxi Zhuang et al.

ICLR 2025arXiv:2408.15545
#18889

UniRestore3D: A Scalable Framework For General Shape Restoration

Yuang Wang, Yujian Zhang, Sida Peng et al.

ICLR 2025
#18890

Offline Hierarchical Reinforcement Learning via Inverse Optimization

Carolin Schmidt, Daniele Gammelli, James Harrison et al.

ICLR 2025arXiv:2410.07933
#18891

Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis

Hongkang Li, Songtao Lu, Pin-Yu Chen et al.

ICLR 2025arXiv:2410.02167
#18892

Adversarially Robust Anomaly Detection through Spurious Negative Pair Mitigation

Hossein Mirzaei Sadeghlou, Mojtaba Nafez, Jafar Habibi et al.

ICLR 2025
#18893

T2V2: A Unified Non-Autoregressive Model for Speech Recognition and Synthesis via Multitask Learning

Nabarun Goswami, Hanqin Wang, Tatsuya Harada

ICLR 2025oral
#18894

One Hundred Neural Networks and Brains Watching Videos: Lessons from Alignment

Christina Sartzetaki, Gemma Roig, Cees G Snoek et al.

ICLR 2025oral
#18895

TD-Paint: Faster Diffusion Inpainting Through Time-Aware Pixel Conditioning

Tsiry MAYET, Pourya Shamsolmoali, Simon Bernard et al.

ICLR 2025arXiv:2410.09306
#18896

One for all and all for one: Efficient computation of partial Wasserstein distances on the line

Laetitia Chapel, Romain Tavenard

ICLR 2025
#18897

Learning Dynamics of Deep Matrix Factorization Beyond the Edge of Stability

Avrajit Ghosh, Soo Min Kwon, Rongrong Wang et al.

ICLR 2025
#18898

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Peng Xia, Kangyu Zhu, Haoran Li et al.

ICLR 2025arXiv:2410.13085
#18899

Hotspot-Driven Peptide Design via Multi-Fragment Autoregressive Extension

Jiahan Li, Tong Chen, Shitong Luo et al.

ICLR 2025arXiv:2411.18463
#18900

Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning

Haoxin Lin, Yu-Yan Xu, Yihao Sun et al.

ICLR 2025arXiv:2405.17031
#18901

Semantic Temporal Abstraction via Vision-Language Model Guidance for Efficient Reinforcement Learning

Tian-Shuo Liu, Xu-Hui Liu, Ruifeng Chen et al.

ICLR 2025oral
#18902

Scaling FP8 training to trillion-token LLMs

Maxim Fishman, Brian Chmiel, Ron Banner et al.

ICLR 2025arXiv:2409.12517
#18903

Enhancing Pre-trained Representation Classifiability can Boost its Interpretability

ICLR 2025arXiv:2510.24105
#18904

Reassessing How to Compare and Improve the Calibration of Machine Learning Models

Muthu Chidambaram, Rong Ge

ICLR 2025arXiv:2406.04068
#18905

On Stochastic Contextual Bandits with Knapsacks in Small Budget Regime

Hengquan Guo, Xin Liu

ICLR 2025
#18906

For Better or For Worse? Learning Minimum Variance Features With Label Augmentation

Muthu Chidambaram, Rong Ge

ICLR 2025arXiv:2402.06855
#18907

Residual-MPPI: Online Policy Customization for Continuous Control

Pengcheng Wang, Chenran Li, Catherine Weaver et al.

ICLR 2025arXiv:2407.00898
#18908

ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination

Xinxin Zhao, Wenzhe Cai, Likun Tang et al.

ICLR 2025arXiv:2410.09874
#18909

Restructuring Vector Quantization with the Rotation Trick

Christopher Fifty, Ronald Junkins, Dennis Duan et al.

ICLR 2025arXiv:2410.06424
#18910

SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection

Jingjie Zhang, Hanqun Cao, Zijun Gao et al.

ICLR 2025arXiv:2502.07384
#18911

EmbedLLM: Learning Compact Representations of Large Language Models

Richard Zhuang, Tianhao Wu, Zhaojin Wen et al.

ICLR 2025arXiv:2410.02223
#18912

TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees

Weibin Liao, Xu Chu, Yasha Wang

ICLR 2025arXiv:2410.12854
#18913

Bridging the Gap Between f-divergences and Bayes Hilbert Spaces

Linus Lach, Alexander Fottner, Yarema Okhrin

ICLR 2025
#18914

DeepTAGE: Deep Temporal-Aligned Gradient Enhancement for Optimizing Spiking Neural Networks

Wei Liu, Li Yang, Mingxuan Zhao et al.

ICLR 2025oral
#18915

Round and Round We Go! What makes Rotary Positional Encodings useful?

Federico Barbero, Alex Vitvitskyi, Christos Perivolaropoulos et al.

ICLR 2025arXiv:2410.06205
#18916

SysCaps: Language Interfaces for Simulation Surrogates of Complex Systems

Patrick Emami, Zhaonan Li, Saumya Sinha et al.

ICLR 2025arXiv:2405.19653
#18917

Revisit the Open Nature of Open Vocabulary Semantic Segmentation

Qiming Huang, Han Hu, Jianbo Jiao

ICLR 2025
#18918

Multi-Scale Fusion for Object Representation

Rongzhen Zhao, Vivienne Huiling Wang, Juho Kannala et al.

ICLR 2025arXiv:2410.01539
#18919

Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data

Jiajie Li, Brian Quaranto, Chenhui Xu et al.

ICLR 2025arXiv:2501.15326
#18920

GOLD: Graph Out-of-Distribution Detection via Implicit Adversarial Latent Generation

Danny Wang, Ruihong Qiu, Guangdong Bai et al.

ICLR 2025arXiv:2502.05780
#18921

SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations

Zhaorun Chen, Francesco Pinto, Minzhou Pan et al.

ICLR 2025arXiv:2412.06878
#18922

Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

Tong Wu, Shujian Zhang, Kaiqiang Song et al.

ICLR 2025arXiv:2410.09102
#18923

Data Pruning by Information Maximization

Haoru Tan, Sitong Wu, Wei Huang et al.

ICLR 2025arXiv:2506.01701
#18924

An Evolved Universal Transformer Memory

Edoardo Cetin, Qi Sun, Tianyu Zhao et al.

ICLR 2025arXiv:2410.13166
#18925

Memory Mosaics

Jianyu Zhang, Niklas Nolte, Ranajoy Sadhukhan et al.

ICLR 2025arXiv:2405.06394
#18926

Improving Long-Text Alignment for Text-to-Image Diffusion Models

Luping Liu, Chao Du, Tianyu Pang et al.

ICLR 2025arXiv:2410.11817
#18927

Enhancing End-to-End Autonomous Driving with Latent World Model

Yingyan Li, Lue Fan, Jiawei He et al.

ICLR 2025arXiv:2406.08481
#18928

On the Computation of the Fisher Information in Continual Learning

Gido van de Ven

ICLR 2025arXiv:2502.11756
#18929

CREAM: Consistency Regularized Self-Rewarding Language Models

Zhaoyang Wang, Weilei He, Zhiyuan Liang et al.

ICLR 2025arXiv:2410.12735
#18930

3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery

Xiuyuan Hu, Guoqing Liu, Can Chen et al.

ICLR 2025arXiv:2502.05107
#18931

A Geometric Framework for Understanding Memorization in Generative Models

Brendan Ross, Hamidreza Kamkari, Tongzi Wu et al.

ICLR 2025arXiv:2411.00113
#18932

TLDR: Token-Level Detective Reward Model for Large Vision Language Models

Deqing Fu, Tong Xiao, Rui Wang et al.

ICLR 2025arXiv:2410.04734
#18933

Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection

Guangsheng Bao, Yanbin Zhao, Juncai He et al.

ICLR 2025arXiv:2412.11506
#18934

SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget

Zihao Wang, Bin CUI, Shaoduo Gan

ICLR 2025arXiv:2404.04793
#18935

Towards Domain Adaptive Neural Contextual Bandits

Ziyan Wang, Xiaoming Huo, Hao Wang

ICLR 2025arXiv:2406.09564
#18936

Investigating Pattern Neurons in Urban Time Series Forecasting

Chengxin Wang, Yiran Zhao, shaofeng cai et al.

ICLR 2025
#18937

Wayward Concepts In Multimodal Models

Brandon Trabucco, Max Gurinas, Kyle Doherty et al.

ICLR 2025
#18938

Can Watermarks be Used to Detect LLM IP Infringement For Free?

Zhengyue Zhao, Xiaogeng Liu, Somesh Jha et al.

ICLR 2025
#18939

Learning Diagrams: A Graphical Language for Compositional Training Regimes

Mason Lary, Richard Samuelson, Alexander Wilentz et al.

ICLR 2025
#18940

Neural Approximate Mirror Maps for Constrained Diffusion Models

Berthy Feng, Ricardo Baptista, Katherine Bouman

ICLR 2025arXiv:2406.12816
#18941

GANDALF: Generative AttentioN based Data Augmentation and predictive modeLing Framework for personalized cancer treatment

Aishwarya Jayagopal, Yanrong Zhang, Robert Walsh et al.

ICLR 2025
#18942

Provably Safeguarding a Classifier from OOD and Adversarial Samples

Nicolas Atienza, Johanne Cohen, Christophe Labreuche et al.

ICLR 2025arXiv:2501.10202
#18943

On the Fourier analysis in the SO(3) space : the EquiLoPO Network

Dmitrii Zhemchuzhnikov, Sergei Grudinin

ICLR 2025
#18944

Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards

Xiaoyu Yang, Jie Lu, En Yu

ICLR 2025arXiv:2405.13459
#18945

Bridging the Gap between Variational Inference and Stochastic Gradient MCMC in Function Space

Mengjing Wu, Junyu Xuan, Jie Lu

ICLR 2025
#18946

Grammar Reinforcement Learning: path and cycle counting in graphs with a Context-Free Grammar and Transformer approach

Jason Piquenot, Maxime Berar, Romain Raveaux et al.

ICLR 2025
#18947

HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere

Hatef Otroshi Shahreza, Sébastien Marcel

ICLR 2025arXiv:2411.08470
#18948

Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance

Dongmin Park, Sebin Kim, Taehong Moon et al.

ICLR 2025arXiv:2410.22376
#18949

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Boyu Gou, Demi Ruohan Wang, Boyuan Zheng et al.

ICLR 2025arXiv:2410.05243
#18950

Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning

Hai Zhang, Boyuan Zheng, Tianying Ji et al.

ICLR 2025arXiv:2405.12001
#18951

BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Yu Feng, Ben Zhou, Weidong Lin et al.

ICLR 2025arXiv:2404.12494
#18952

Decentralized Optimization with Coupled Constraints

Demyan Yarmoshik, Alexander Rogozin, Nikita Kiselev et al.

ICLR 2025arXiv:2407.02020
#18953

A Visual Dive into Conditional Flow Matching

Anne Gagneux, Ségolène Martin, Rémi Emonet et al.

ICLR 2025
#18954

Youku Dense Caption: A Large-scale Chinese Video Dense Caption Dataset and Benchmarks

Zixuan Xiong, Guangwei Xu, wenkai zhang et al.

ICLR 2025
#18955

Large Language Models Often Say One Thing and Do Another

Ruoxi Xu, Hongyu Lin, Xianpei Han et al.

ICLR 2025arXiv:2503.07003
#18956

Enhancing Vision-Language Model with Unmasked Token Alignment

Hongsheng Li, Jihao Liu, Boxiao Liu et al.

ICLR 2025arXiv:2405.19009
#18957

A3D: Does Diffusion Dream about 3D Alignment?

Savva Ignatyev, Nina Konovalova, Daniil Selikhanovych et al.

ICLR 2025arXiv:2406.15020
#18958

Making Transformer Decoders Better Differentiable Indexers

Wuchao Li, Kai Zheng, Defu Lian et al.

ICLR 2025
#18959

The KoLMogorov Test: Compression by Code Generation

Ori Yoran, Kunhao Zheng, Fabian Gloeckle et al.

ICLR 2025arXiv:2503.13992
#18960

Long Context Compression with Activation Beacon

Peitian Zhang, Zheng Liu, Shitao Xiao et al.

ICLR 2025arXiv:2401.03462
#18961

K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models

Jaehyung Seo, Heuiseok Lim

ICLR 2025
#18962

AutoUAD: Hyper-parameter Optimization for Unsupervised Anomaly Detection

Wei Dai, Jicong Fan

ICLR 2025
#18963

FlashMask: Efficient and Rich Mask Extension of FlashAttention

Guoxia Wang, Jinle Zeng, Xiyuan Xiao et al.

ICLR 2025arXiv:2410.01359
#18964

CipherPrune: Efficient and Scalable Private Transformer Inference

Yancheng Zhang, Jiaqi Xue, Mengxin Zheng et al.

ICLR 2025arXiv:2502.16782
#18965

FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling

zhengqiang ZHANG, Ruihuang Li, Lei Zhang

ICLR 2025arXiv:2410.18410
#18966

Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion

Chaodong Xiao, Minghan Li, zhengqiang ZHANG et al.

ICLR 2025arXiv:2410.15091
#18967

Data Selection via Optimal Control for Language Models

Yuxian Gu, Li Dong, Hongning Wang et al.

ICLR 2025arXiv:2410.07064
#18968

TeaserGen: Generating Teasers for Long Documentaries

Weihan Xu, Paul Pu Liang, Haven Kim et al.

ICLR 2025arXiv:2410.05586
#18969

VVC-Gym: A Fixed-Wing UAV Reinforcement Learning Environment for Multi-Goal Long-Horizon Problems

Xudong Gong, Feng Dawei, Kele Xu et al.

ICLR 2025oral
#18970

Scaling Laws for Downstream Task Performance in Machine Translation

Berivan Isik, NATALIA PONOMAREVA, Hussein Hazimeh et al.

ICLR 2025
#18971

Ranking-aware adapter for text-driven image ordering with CLIP

Wei-Hsiang Yu, Yen-Yu Lin, Ming-Hsuan Yang et al.

ICLR 2025arXiv:2412.06760
#18972

CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning

Hao Cui, Zahra Shamsi, Gowoon Cheon et al.

ICLR 2025arXiv:2503.13517
#18973

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

Xin Wang, Yu Zheng, Zhongwei Wan et al.

ICLR 2025arXiv:2403.07378
#18974

LASER: A Neuro-Symbolic Framework for Learning Spatio-Temporal Scene Graphs with Weak Supervision

Jiani Huang, Ziyang Li, Mayur Naik et al.

ICLR 2025oral
#18975

Federated Continual Learning Goes Online: Uncertainty-Aware Memory Management for Vision Tasks and Beyond

Giuseppe Serra, Florian Buettner

ICLR 2025arXiv:2405.18925
#18976

Diversity-Rewarded CFG Distillation

Geoffrey Cideron, Andrea Agostinelli, Johan Ferret et al.

ICLR 2025arXiv:2410.06084
#18977

Backdooring Vision-Language Models with Out-Of-Distribution Data

Weimin Lyu, Michael Yao, Saumya Gupta et al.

ICLR 2025arXiv:2410.01264
#18978

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Chankyu Lee, Rajarshi Roy, Mengyao Xu et al.

ICLR 2025arXiv:2405.17428
#18979

GenXD: Generating Any 3D and 4D Scenes

Yuyang Zhao, Chung-Ching Lin, Kevin Lin et al.

ICLR 2025oralarXiv:2411.02319
#18980

Meta-Continual Learning of Neural Fields

Seungyoon Woo, Junhyeog Yun, Gunhee Kim

ICLR 2025arXiv:2504.05806
#18981

Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving

Xiang Li, Pengfei Li, Yupeng Zheng et al.

ICLR 2025oralarXiv:2502.07309
#18982

Adversarial Attacks on Data Attribution

Xinhe Wang, Pingbang Hu, Junwei Deng et al.

ICLR 2025arXiv:2409.05657
#18983

DPLM-2: A Multimodal Diffusion Protein Language Model

Xinyou Wang, Zaixiang Zheng, Fei YE et al.

ICLR 2025arXiv:2410.13782
#18984

Continuous Autoregressive Modeling with Stochastic Monotonic Alignment for Speech Synthesis

Weiwei Lin, Chenhang HE

ICLR 2025arXiv:2502.01084
#18985

Trained Transformer Classifiers Generalize and Exhibit Benign Overfitting In-Context

Spencer Frei, Gal Vardi

ICLR 2025arXiv:2410.01774
#18986

Grounding Multimodal Large Language Model in GUI World

Weixian Lei, Difei Gao, Mike Zheng Shou

ICLR 2025
#18987

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Haotian Zhang, Mingfei Gao, Zhe Gan et al.

ICLR 2025arXiv:2409.20566
#18988

NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative

Asmar Nadeem, Faegheh Sardari, Robert Dawes et al.

ICLR 2025oralarXiv:2406.06499
#18989

Learning View-invariant World Models for Visual Robotic Manipulation

Jing-Cheng Pang, Nan Tang, Kaiyuan Li et al.

ICLR 2025
#18990

Exploring Local Memorization in Diffusion Models via Bright Ending Attention

Chen Chen, Daochang Liu, Mubarak Shah et al.

ICLR 2025arXiv:2410.21665
#18991

Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank

Wenhao Zhan, Scott Fujimoto, Zheqing Zhu et al.

ICLR 2025arXiv:2410.01101
#18992

Towards Generalization Bounds of GCNs for Adversarially Robust Node Classification

Wen Wen, Han Li, Tieliang Gong et al.

ICLR 2025
#18993

Restating the Proof of Linear Convergence for Linear GNNs

Huayi Tang, Yuhe Guo, Yong Liu et al.

ICLR 2025
#18994

TGB-Seq Benchmark: Challenging Temporal GNNs with Complex Sequential Dynamics

Lu Yi, Jie Peng, Yanping Zheng et al.

ICLR 2025oralarXiv:2502.02975
#18995

Process Reward Model with Q-value Rankings

Wendi Li, Yixuan Li

ICLR 2025arXiv:2410.11287
#18996

UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models

Fanghua Yu, Jinjin Gu, Jinfan Hu et al.

ICLR 2025arXiv:2503.17221
#18997

Efficient Cross-Episode Meta-RL

Gresa Shala, André Biedenkapp, Pierre Krack et al.

ICLR 2025
#18998

Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning

Linjiajie Fang, Ruoxue Liu, Jing Zhang et al.

ICLR 2025arXiv:2405.20555
#18999

TIGeR: Unifying Text-to-Image Generation and Retrieval with Large Multimodal Models

Leigang Qu, Haochuan Li, Tan Wang et al.

ICLR 2025arXiv:2406.05814
#19000

Rethinking Neural Multi-Objective Combinatorial Optimization via Neat Weight Embedding

Jinbiao Chen, Zhiguang Cao, Jiahai Wang et al.

ICLR 2025