Most Cited 2024 "sft" Papers

12,324 papers found • Page 62 of 62

#12201

Channel Vision Transformers: An Image Is Worth 1 x 16 x 16 Words

Yujia Bao, Srinivasan Sivanandan, THEOFANIS KARALETSOS

ICLR 2024posterarXiv:2309.16108
#12202

Solving High Frequency and Multi-Scale PDEs with Gaussian Processes

Shikai Fang, Madison Cooley, Da Long et al.

ICLR 2024posterarXiv:2311.04465
#12203

Adversarial Attacks on Fairness of Graph Neural Networks

Binchi Zhang, Yushun Dong, Chen Chen et al.

ICLR 2024posterarXiv:2310.13822
#12204

Task structure and nonlinearity jointly determine learned representational geometry

Matteo Alleman, Jack Lindsey, Stefano Fusi

ICLR 2024posterarXiv:2401.13558
#12205

ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models

Iman Mirzadeh, Keivan Alizadeh-Vahid, Sachin Mehta et al.

ICLR 2024posterarXiv:2310.04564
#12206

Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph

Jiashuo Sun, Chengjin Xu, Lumingyuan Tang et al.

ICLR 2024posterarXiv:2307.07697
#12207

Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Yingyu Lin, Yian Ma, Yu-Xiang Wang et al.

ICLR 2024posterarXiv:2310.14661
#12208

Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models

Erfan Shayegani, Yue Dong, Nael Abu-Ghazaleh

ICLR 2024spotlightarXiv:2307.14539
#12209

Graph Transformers on EHRs: Better Representation Improves Downstream Performance

Raphael Poulain, Rahmatollah Beheshti

ICLR 2024oral
#12210

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Ning Miao, Yee Whye Teh, Tom Rainforth

ICLR 2024posterarXiv:2308.00436
#12211

Scalable Modular Network: A Framework for Adaptive Learning via Agreement Routing

Minyang Hu, Hong Chang, Bingpeng Ma et al.

ICLR 2024poster
#12212

Improved Regret Bounds for Non-Convex Online-Within-Online Meta Learning

Jiechao GUAN, Hui Xiong

ICLR 2024poster
#12213

Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis

Jonghyun Lee, Hansam Cho, YoungJoon Yoo et al.

ICLR 2024posterarXiv:2401.09048
#12214

Recursive Generalization Transformer for Image Super-Resolution

Zheng Chen, Yulun Zhang, Jinjin Gu et al.

ICLR 2024posterarXiv:2303.06373
#12215

Score Models for Offline Goal-Conditioned Reinforcement Learning

Harshit Sikchi, Rohan Chitnis, Ahmed Touati et al.

ICLR 2024posterarXiv:2311.02013
#12216

Treatment Effects Estimation By Uniform Transformer

Ruoqi Yu, Shulei Wang

ICLR 2024posterarXiv:2008.03738
#12217

Representation Deficiency in Masked Language Modeling

Yu Meng, Jitin Krishnan, Sinong Wang et al.

ICLR 2024posterarXiv:2302.02060
#12218

Sampling Multimodal Distributions with the Vanilla Score: Benefits of Data-Based Initialization

Frederic Koehler, Thuy-Duong Vuong

ICLR 2024posterarXiv:2310.01762
#12219

MaGIC: Multi-modality Guided Image Completion

Hao Wang, Yongsheng Yu, Tiejian Luo et al.

ICLR 2024posterarXiv:2305.11818
#12220

Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation

Jiaxu Wang, Ziyi Zhang, Renjing Xu

ICLR 2024posterarXiv:2401.14354
#12221

HoloNets: Spectral Convolutions do extend to Directed Graphs

Christian Koke, Daniel Cremers

ICLR 2024posterarXiv:2310.02232
#12222

Searching for High-Value Molecules Using Reinforcement Learning and Transformers

Raj Ghugare, Santiago Miret, Adriana Hugessen et al.

ICLR 2024posterarXiv:2310.02902
#12223

Interpretable Meta-Learning of Physical Systems

Matthieu Blanke, marc lelarge

ICLR 2024posterarXiv:2312.00477
#12224

An Image Is Worth 1000 Lies: Transferability of Adversarial Images across Prompts on Vision-Language Models

Haochen Luo, Jindong Gu, Fengyuan Liu et al.

ICLR 2024spotlight
#12225

Fast Value Tracking for Deep Reinforcement Learning

Frank Shih, Faming Liang

ICLR 2024oralarXiv:2403.13178
#12226

Interpretable Sparse System Identification: Beyond Recent Deep Learning Techniques on Time-Series Prediction

Liu Xiaoyi, Duxin Chen, Wenjia Wei et al.

ICLR 2024poster
#12227

FedInverse: Evaluating Privacy Leakage in Federated Learning

DI WU, Jun Bai, Yiliao Song et al.

ICLR 2024poster
#12228

CircuitNet 2.0: An Advanced Dataset for Promoting Machine Learning Innovations in Realistic Chip Design Environment

Xun Jiang, zhuomin chai, Yuxiang Zhao et al.

ICLR 2024poster
#12229

Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?

Tokio Kajitsuka, Issei Sato

ICLR 2024posterarXiv:2307.14023
#12230

Self-Supervised Contrastive Learning for Long-term Forecasting

Junwoo Park, Daehoon Gwak, Jaegul Choo et al.

ICLR 2024posterarXiv:2402.02023
#12231

Federated Orthogonal Training: Mitigating Global Catastrophic Forgetting in Continual Federated Learning

Yavuz Faruk Bakman, Duygu Nur Yaldiz, Yahya Ezzeldin et al.

ICLR 2024posterarXiv:2309.01289
#12232

Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization

Yibing Liu, Chris Xing TIAN, Haoliang Li et al.

ICLR 2024spotlightarXiv:2306.02879
#12233

Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

Xuefei Ning, Zinan Lin, Zixuan Zhou et al.

ICLR 2024posterarXiv:2307.15337
#12234

Rethinking CNN’s Generalization to Backdoor Attack from Frequency Domain

Quanrui Rao, Lin Wang, Wuying Liu

ICLR 2024poster
#12235

Variance Reduced Halpern Iteration for Finite-Sum Monotone Inclusions

Xufeng Cai, Ahmet Alacaoglu, Jelena Diakonikolas

ICLR 2024posterarXiv:2310.02987
#12236

Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation

Zhilong Zhang, Yihao Sun, Junyin Ye et al.

ICLR 2024oral
#12237

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts

Hanan Gani, Shariq Bhat, Muzammal Naseer et al.

ICLR 2024posterarXiv:2310.10640
#12238

VBH-GNN: Variational Bayesian Heterogeneous Graph Neural Networks for Cross-subject Emotion Recognition

Chenyu Liu, XINLIANG ZHOU, Zhengri Zhu et al.

ICLR 2024oral
#12239

Neural Rate Control for Learned Video Compression

yiwei zhang, Guo Lu, Yunuo Chen et al.

ICLR 2024oral
#12240

PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Xuan Ju, Ailing Zeng, Yuxuan Bian et al.

ICLR 2024poster
#12241

Harnessing Density Ratios for Online Reinforcement Learning

Philip Amortila, Dylan Foster, Nan Jiang et al.

ICLR 2024spotlightarXiv:2401.09681
#12242

Sliced Denoising: A Physics-Informed Molecular Pre-Training Method

yuyan ni, Shikun Feng, Wei-Ying Ma et al.

ICLR 2024posterarXiv:2311.02124
#12243

Improved Efficiency Based on Learned Saccade and Continuous Scene Reconstruction From Foveated Visual Sampling

Jiayang Liu, Yiming Bu, Daniel Tso et al.

ICLR 2024spotlight
#12244

Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection

Jiawei Liang, Siyuan Liang, Aishan Liu et al.

ICLR 2024spotlightarXiv:2402.11473
#12245

Negatively Correlated Ensemble Reinforcement Learning for Online Diverse Game Level Generation

Ziqi Wang, Chengpeng Hu, Jialin Liu et al.

ICLR 2024poster
#12246

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

Zhaoyi Zhou, Chuning Zhu, Runlong Zhou et al.

ICLR 2024posterarXiv:2310.19308
#12247

Local Composite Saddle Point Optimization

Site Bai, Brian Bullins

ICLR 2024poster
#12248

ASID: Active Exploration for System Identification in Robotic Manipulation

Marius Memmel, Andrew Wagenmaker, Chuning Zhu et al.

ICLR 2024posterarXiv:2404.12308
#12249

Simple Hierarchical Planning with Diffusion

Chang Chen, Fei Deng, Kenji Kawaguchi et al.

ICLR 2024oralarXiv:2401.02644
#12250

sRGB Real Noise Modeling via Noise-Aware Sampling with Normalizing Flows

Dongjin Kim, Donggoo Jung, Sungyong Baik et al.

ICLR 2024poster
#12251

Improving Generalization of Alignment with Human Preferences through Group Invariant Learning

Rui Zheng, Wei Shen, Yuan Hua et al.

ICLR 2024spotlightarXiv:2310.11971
#12252

Dynamic Sparse Training with Structured Sparsity

Mike Lasby, Anna Golubeva, Utku Evci et al.

ICLR 2024posterarXiv:2305.02299
#12253

DENEVIL: TOWARDS DECIPHERING AND NAVIGATING THE ETHICAL VALUES OF LARGE LANGUAGE MODELS VIA INSTRUCTION LEARNING

Shitong Duan, Xiaoyuan Yi, Peng Zhang et al.

ICLR 2024oralarXiv:2310.11053
#12254

Robustifying State-space Models for Long Sequences via Approximate Diagonalization

Annan Yu, Arnur Nigmetov, Dmitriy Morozov et al.

ICLR 2024spotlightarXiv:2310.01698
#12255

Generative Adversarial Equilibrium Solvers

Denizalp Goktas, David Parkes, Ian Gemp et al.

ICLR 2024posterarXiv:2302.06607
#12256

Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models

Shuai Zhao, Xiaohan Wang, Linchao Zhu et al.

ICLR 2024posterarXiv:2305.18010
#12257

FedImpro: Measuring and Improving Client Update in Federated Learning

Zhenheng Tang, Yonggang Zhang, Shaohuai Shi et al.

ICLR 2024posterarXiv:2402.07011
#12258

Improving LoRA in Privacy-preserving Federated Learning

Youbang Sun, Zitao Li, Yaliang Li et al.

ICLR 2024posterarXiv:2403.12313
#12259

Efficient Inverse Multiagent Learning

Denizalp Goktas, Amy Greenwald, Sadie Zhao et al.

ICLR 2024spotlightarXiv:2502.14160
#12260

Neural Neighborhood Search for Multi-agent Path Finding

Zhongxia Yan, Cathy Wu

ICLR 2024oral
#12261

Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models

Shuai Fu, Shuai Fu, Xiequn Wang et al.

ICLR 2024spotlightarXiv:2408.13979
#12262

FasterViT: Fast Vision Transformers with Hierarchical Attention

Ali Hatamizadeh, Greg Heinrich, Hongxu Yin et al.

ICLR 2024posterarXiv:2306.06189
#12263

C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion

Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee et al.

ICLR 2024posterarXiv:2403.14119
#12264

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Jingxiang Sun, Bo Zhang, Ruizhi Shao et al.

ICLR 2024posterarXiv:2310.16818
#12265

Plugin estimators for selective classification with out-of-distribution detection

Harikrishna Narasimhan, Aditya Krishna Menon, Wittawat Jitkrittum et al.

ICLR 2024posterarXiv:2301.12386
#12266

P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering

Chuyu Zhang, Hui Ren, Xuming He

ICLR 2024posterarXiv:2401.09266
#12267

Adaptive Stochastic Gradient Algorithm for Black-box Multi-Objective Learning

Feiyang YE, YUEMING LYU, Xuehao Wang et al.

ICLR 2024poster
#12268

Intriguing Properties of Data Attribution on Diffusion Models

Xiaosen Zheng, Tianyu Pang, Chao Du et al.

ICLR 2024posterarXiv:2311.00500
#12269

How Does Unlabeled Data Provably Help Out-of-Distribution Detection?

Xuefeng Du, Zhen Fang, Ilias Diakonikolas et al.

ICLR 2024posterarXiv:2402.03502
#12270

GlucoBench: Curated List of Continuous Glucose Monitoring Datasets with Prediction Benchmarks

Renat Sergazinov, Elizabeth Chun, Valeriya Rogovchenko et al.

ICLR 2024posterarXiv:2410.05780
#12271

Look, Remember and Reason: Grounded Reasoning in Videos with Language Models

Apratim Bhattacharyya, Sunny Panchal, Reza Pourreza et al.

ICLR 2024oralarXiv:2306.17778
#12272

Pushing Boundaries: Mixup's Influence on Neural Collapse

Quinn Fisher, Haoming Meng, Vardan Papyan

ICLR 2024posterarXiv:2402.06171
#12273

LLCP: Learning Latent Causal Processes for Reasoning-based Video Question Answer

Guangyi Chen, Yuke Li, Xiao Liu et al.

ICLR 2024oral
#12274

Implicit regularization of deep residual networks towards neural ODEs

Pierre Marion, Yu-Han Wu, Michael Sander et al.

ICLR 2024spotlightarXiv:2309.01213
#12275

Provably Efficient UCB-type Algorithms For Learning Predictive State Representations

Ruiquan Huang, Yingbin Liang, Jing Yang

ICLR 2024posterarXiv:2307.00405
#12276

Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models

Shikun Sun, Longhui Wei, Zhicai Wang et al.

ICLR 2024poster
#12277

Compressing Latent Space via Least Volume

Qiuyi Chen, Mark Fuge

ICLR 2024poster
#12278

CoLiDE: Concomitant Linear DAG Estimation

Seyed Saman Saboksayr, Gonzalo Mateos, Mariano Tepper

ICLR 2024posterarXiv:2310.02895
#12279

Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory

Yiting Chen, Zhanpeng Zhou, Junchi Yan

ICLR 2024posterarXiv:2310.06756
#12280

A Unified Framework for Bayesian Optimization under Contextual Uncertainty

Sebastian Shenghong Tay, Chuan-Sheng Foo, Daisuke Urano et al.

ICLR 2024poster
#12281

Learning Large DAGs is Harder than you Think: Many Losses are Minimal for the Wrong DAG

Jonas Seng, Matej Zečević, Devendra Singh Dhami et al.

ICLR 2024poster
#12282

Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization

Joe Benton, Valentin De Bortoli, Arnaud Doucet et al.

ICLR 2024spotlightarXiv:2308.03686
#12283

Active Retrosynthetic Planning Aware of Route Quality

Luotian Yuan, Yemin Yu, Ying Wei et al.

ICLR 2024poster
#12284

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency

Bowen Song, Soo Min Kwon, Zecheng Zhang et al.

ICLR 2024spotlightarXiv:2307.08123
#12285

Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model

Yinan Zheng, Jianxiong Li, Dongjie Yu et al.

ICLR 2024posterarXiv:2401.10700
#12286

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy

Pingzhi Li, Zhenyu Zhang, Prateek Yadav et al.

ICLR 2024spotlightarXiv:2310.01334
#12287

Non-Exchangeable Conformal Risk Control

António Farinhas, Chrysoula Zerva, Dennis Ulmer et al.

ICLR 2024posterarXiv:2310.01262
#12288

Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM

Eliya Nachmani, Alon Levkovitch, Roy Hirsch et al.

ICLR 2024posterarXiv:2305.15255
#12289

USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields

Moyang Li, Peng Wang, Lingzhe Zhao et al.

ICLR 2024posterarXiv:2310.02687
#12290

Are Bert Family Good Instruction Followers? A Study on Their Potential And Limitations

yisheng xiao, Juntao Li, Zechen Sun et al.

ICLR 2024poster
#12291

Synergistic Patch Pruning for Vision Transformer: Unifying Intra- & Inter-Layer Patch Importance

Yuyao Zhang, Lan Wei, Nikolaos Freris

ICLR 2024poster
#12292

Early Stopping Against Label Noise Without Validation Data

Suqin Yuan, Lei Feng, Tongliang Liu

ICLR 2024posterarXiv:2502.07551
#12293

Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning

Joey Hejna, Rafael Rafailov, Harshit Sikchi et al.

ICLR 2024poster
#12294

Unknown Domain Inconsistency Minimization for Domain Generalization

Seungjae Shin, HeeSun Bae, Byeonghu Na et al.

ICLR 2024posterarXiv:2403.07329
#12295

Enhancing Transfer Learning with Flexible Nonparametric Posterior Sampling

Hyungi Lee, Giung Nam, Edwin Fong et al.

ICLR 2024posterarXiv:2403.07282
#12296

Finite Scalar Quantization: VQ-VAE Made Simple

Fabian Mentzer, David Minnen, Eirikur Agustsson et al.

ICLR 2024posterarXiv:2309.15505
#12297

Fixed-Budget Differentially Private Best Arm Identification

Zhirui Chen, P. N. Karthik, Yeow Meng Chee et al.

ICLR 2024posterarXiv:2401.09073
#12298

Rethinking Backdoor Attacks on Dataset Distillation: A Kernel Method Perspective

Ming-Yu Chung, Sheng-Yen Chou, Chia-Mu Yu et al.

ICLR 2024posterarXiv:2311.16646
#12299

Neural Contractive Dynamical Systems

Hadi Beik Mohammadi, Søren Hauberg, Georgios Arvanitidis et al.

ICLR 2024spotlightarXiv:2401.09352
#12300

Energy-based Automated Model Evaluation

Ru Peng, Heming Zou, Haobo Wang et al.

ICLR 2024posterarXiv:2401.12689
#12301

FreeDyG: Frequency Enhanced Continuous-Time Dynamic Graph Model for Link Prediction

Yuxing Tian, Yiyan Qi, Fan Guo

ICLR 2024oral
#12302

SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution

Wenlong Zhang, Xiaohui Li, Xiangyu Chen et al.

ICLR 2024spotlightarXiv:2309.03020
#12303

Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games

Stephen McAleer, John Banister Lanier, Kevin A. Wang et al.

ICLR 2024poster
#12304

Towards Robust and Efficient Cloud-Edge Elastic Model Adaptation via Selective Entropy Distillation

Yaofo Chen, Shuaicheng Niu, Yaowei Wang et al.

ICLR 2024posterarXiv:2402.17316
#12305

Polynormer: Polynomial-Expressive Graph Transformer in Linear Time

Chenhui Deng, Zichao Yue, Zhiru Zhang

ICLR 2024posterarXiv:2403.01232
#12306

Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning

Mustafa Shukor, Alexandre Rame, Corentin Dancette et al.

ICLR 2024posterarXiv:2310.00647
#12307

A Differentially Private Clustering Algorithm for Well-Clustered Graphs

Weiqiang He, Hendrik Fichtenberger, Pan Peng

ICLR 2024posterarXiv:2403.14332
#12308

The Trickle-down Impact of Reward Inconsistency on RLHF

Lingfeng Shen, Lingfeng Shen, Sihao Chen et al.

ICLR 2024poster
#12309

Contrastive Learning is Spectral Clustering on Similarity Graph

Zhiquan Tan, Yifan Zhang, Jingqin Yang et al.

ICLR 2024posterarXiv:2303.15103
#12310

Better Neural PDE Solvers Through Data-Free Mesh Movers

Peiyan Hu, Yue Wang, Zhi-Ming Ma

ICLR 2024posterarXiv:2312.05583
#12311

Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency

Yannis Kalantidis, Mert Bulent SARIYILDIZ, Rafael Rezende et al.

ICLR 2024posterarXiv:2402.09237
#12312

Memorization Capacity of Multi-Head Attention in Transformers

Sadegh Mahdavi, Renjie Liao, Christos Thrampoulidis

ICLR 2024spotlightarXiv:2306.02010
#12313

LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models

Gunho Park, baeseong park, Minsub Kim et al.

ICLR 2024posterarXiv:2206.09557
#12314

Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts

Ruipeng Zhang, Ziqing Fan, Jiangchao Yao et al.

ICLR 2024posterarXiv:2405.18861
#12315

Adaptive Retrieval and Scalable Indexing for k-NN Search with Cross-Encoders

Nishant Yadav, Nicholas Monath, Manzil Zaheer et al.

ICLR 2024posterarXiv:2405.03651
#12316

Enhancing Group Fairness in Online Settings Using Oblique Decision Forests

Somnath Basu Roy Chowdhury, Nicholas Monath, Ahmad Beirami et al.

ICLR 2024spotlightarXiv:2310.11401
#12317

True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning

Weihao Tan, Wentao Zhang, Shanqi Liu et al.

ICLR 2024poster
#12318

Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

Jiawei Ge, Shange Tang, Jianqing Fan et al.

ICLR 2024posterarXiv:2311.15961
#12319

A Sublinear Adversarial Training Algorithm

Yeqi Gao, Lianke Qin, Zhao Song et al.

ICLR 2024posterarXiv:2208.05395
#12320

PhyloGFN: Phylogenetic inference with generative flow networks

MING YANG ZHOU, Zichao Yan, Elliot Layne et al.

ICLR 2024posterarXiv:2310.08774
#12321

Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images

Kuofeng Gao, Yang Bai, Jindong Gu et al.

ICLR 2024oralarXiv:2401.11170
#12322

Performance Gaps in Multi-view Clustering under the Nested Matrix-Tensor Model

Hugo Lebeau, Mohamed El Amine Seddik, José Henrique Goulart

ICLR 2024posterarXiv:2402.10677
#12323

ZeRO++: Extremely Efficient Collective Communication for Large Model Training

Guanhua Wang, Heyang Qin, Sam Jacobs et al.

ICLR 2024poster
#12324

Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications

Paul Liang, Chun Kai Ling, Yun Cheng et al.

ICLR 2024posterarXiv:2306.04539