Most Cited ICLR "intent-aligned systems" Papers

6,124 papers found • Page 20 of 31

#3801

MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field

Kaizhi Yang, Xiaoshuai Zhang, Zhiao Huang et al.

ICLR 2024spotlightarXiv:2303.05703
#3802

Contextual Bandits with Online Neural Regression

Rohan Deb, Yikun Ban, Shiliang Zuo et al.

ICLR 2024posterarXiv:2312.07145
#3803

Predictive auxiliary objectives in deep RL mimic learning in the brain

Ching Fang, Kimberly Stachenfeld

ICLR 2024posterarXiv:2310.06089
#3804

Score Regularized Policy Optimization through Diffusion Behavior

Huayu Chen, Cheng Lu, Zhengyi Wang et al.

ICLR 2024posterarXiv:2310.07297
#3805

WildChat: 1M ChatGPT Interaction Logs in the Wild

Wenting Zhao, Xiang Ren, Jack Hessel et al.

ICLR 2024oralarXiv:2405.01470
#3806

DAM: Towards a Foundation Model for Forecasting

Luke Darlow, Qiwen Deng, Ahmed Hassan et al.

ICLR 2024oral
#3807

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation

Junyoung Seo, Wooseok Jang, Min-Seop Kwak et al.

ICLR 2024posterarXiv:2303.07937
#3808

Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL

Hao Sun, Alihan Hüyük, Mihaela van der Schaar

ICLR 2024posterarXiv:2309.06553
#3809

Towards Transparent Time Series Forecasting

Krzysztof Kacprzyk, Tennison Liu, Mihaela van der Schaar

ICLR 2024poster
#3810

Traveling Waves Encode The Recent Past and Enhance Sequence Learning

T. Anderson Keller, Lyle Muller, Terrence Sejnowski et al.

ICLR 2024posterarXiv:2309.08045
#3811

GROOT: Learning to Follow Instructions by Watching Gameplay Videos

Shaofei Cai, Bowei Zhang, Zihao Wang et al.

ICLR 2024spotlightarXiv:2310.08235
#3812

FROSTER: Frozen CLIP is A Strong Teacher for Open-Vocabulary Action Recognition

Xiaohu Huang, Hao Zhou, Kun Yao et al.

ICLR 2024oralarXiv:2402.03241
#3813

Continual Learning in the Presence of Spurious Correlations: Analyses and a Simple Baseline

Donggyu Lee, Sangwon Jung, Taesup Moon

ICLR 2024poster
#3814

Data Debugging with Shapley Importance over Machine Learning Pipelines

Bojan Karlaš, David Dao, Matteo Interlandi et al.

ICLR 2024poster
#3815

Empirical Likelihood for Fair Classification

Pangpang Liu, Yichuan Zhao

ICLR 2024poster
#3816

SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation

Mucong Ding, Bang An, Yuancheng Xu et al.

ICLR 2024posterarXiv:2410.02512
#3817

Deep Reinforcement Learning for Modelling Protein Complexes

Ziqi Gao, Tao Feng, Jiaxuan You et al.

ICLR 2024posterarXiv:2405.02299
#3818

LRR: Language-Driven Resamplable Continuous Representation against Adversarial Tracking Attacks

Jianlang Chen, Xuhong Ren, Qing Guo et al.

ICLR 2024oralarXiv:2404.06247
#3819

Learning to Embed Time Series Patches Independently

Seunghan Lee, Taeyoung Park, Kibok Lee

ICLR 2024posterarXiv:2312.16427
#3820

Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood

yaxuan zhu, Jianwen Xie, Yingnian Wu et al.

ICLR 2024spotlightarXiv:2309.05153
#3821

Balancing Act: Constraining Disparate Impact in Sparse Models

Meraj Hashemizadeh, Juan Ramirez, Rohan Sukumaran et al.

ICLR 2024posterarXiv:2310.20673
#3822

Flow Matching on General Geometries

Ricky T. Q. Chen, Yaron Lipman

ICLR 2024posterarXiv:2302.03660
#3823

LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading

Yochai Yemini, Aviv Shamsian, Lior Bracha et al.

ICLR 2024posterarXiv:2306.03258
#3824

Bespoke Solvers for Generative Flow Models

Neta Shaul, Juan Perez, Ricky T. Q. Chen et al.

ICLR 2024spotlightarXiv:2310.19075
#3825

DittoGym: Learning to Control Soft Shape-Shifting Robots

Suning Huang, Boyuan Chen, Huazhe Xu et al.

ICLR 2024posterarXiv:2401.13231
#3826

Zero-Shot Robotic Manipulation with Pre-Trained Image-Editing Diffusion Models

Kevin Black, Mitsuhiko Nakamoto, Pranav Atreya et al.

ICLR 2024poster
#3827

VersVideo: Leveraging Enhanced Temporal Diffusion Models for Versatile Video Generation

Jinxi Xiang, Ricong Huang, Jun Zhang et al.

ICLR 2024oral
#3828

BrainLM: A foundation model for brain activity recordings

Josue Ortega Caro, Antonio Henrique de Oliveira Fonseca, Syed Rizvi et al.

ICLR 2024oral
#3829

Fourier Transporter: Bi-Equivariant Robotic Manipulation in 3D

Haojie Huang, Owen Howell, Dian Wang et al.

ICLR 2024posterarXiv:2401.12046
#3830

Faithful Explanations of Black-box NLP Models Using LLM-generated Counterfactuals

Yair Gat, Nitay Calderon, Amir Feder et al.

ICLR 2024posterarXiv:2310.00603
#3831

Large Language Models Cannot Self-Correct Reasoning Yet

Jie Huang, Xinyun Chen, Swaroop Mishra et al.

ICLR 2024posterarXiv:2310.01798
#3832

H-GAP: Humanoid Control with a Generalist Planner

Zhengyao Jiang, Yingchen Xu, Nolan Wagener et al.

ICLR 2024spotlightarXiv:2312.02682
#3833

Explaining Kernel Clustering via Decision Trees

Maximilian Fleissner, Leena Chennuru Vankadara, Debarghya Ghoshdastidar

ICLR 2024posterarXiv:2402.09881
#3834

Select to Perfect: Imitating desired behavior from large multi-agent data

Tim Franzmeyer, Edith Elkind, Philip Torr et al.

ICLR 2024posterarXiv:2405.03735
#3835

Integrating Planning and Deep Reinforcement Learning via Automatic Induction of Task Substructures

Jung-Chun Liu, Chi-Hsien Chang, Shao-Hua Sun et al.

ICLR 2024poster
#3836

GIO: Gradient Information Optimization for Training Dataset Selection

Dante Everaert, Christopher Potts

ICLR 2024spotlightarXiv:2306.11670
#3837

SLiMe: Segment Like Me

Aliasghar Khani, Saeid Asgari, Aditya Sanghi et al.

ICLR 2024posterarXiv:2309.03179
#3838

MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning

Ke Wang, Houxing Ren, Aojun Zhou et al.

ICLR 2024posterarXiv:2310.03731
#3839

BadEdit: Backdooring Large Language Models by Model Editing

Yanzhou Li, Tianlin Li, Kangjie Chen et al.

ICLR 2024posterarXiv:2403.13355
#3840

Neural Monge Map estimation and its applications

Shaojun Ma, Yongxin Chen, Hao-Min Zhou et al.

ICLR 2024posterarXiv:2106.03812
#3841

Rethinking the Power of Graph Canonization in Graph Representation Learning with Stability

Zehao Dong, Muhan Zhang, Philip Payne et al.

ICLR 2024posterarXiv:2309.00738
#3842

On the Sample Complexity of Lipschitz Constant Estimation

Stephen Roberts, Julien Huang, Jan-Peter Calliess

ICLR 2024poster
#3843

Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning

Haobo Song, Haobo SONG, Hao Zhao et al.

ICLR 2024posterarXiv:2407.01320
#3844

Image Background Serves as Good Proxy for Out-of-distribution Data

Sen Pei

ICLR 2024posterarXiv:2307.00519
#3845

FedTrans: Client-Transparent Utility Estimation for Robust Federated Learning

Mingkun Yang, Ran Zhu, Qing Wang et al.

ICLR 2024poster
#3846

CLEX: Continuous Length Extrapolation for Large Language Models

Guanzheng Chen, Xin Li, Zaiqiao Meng et al.

ICLR 2024posterarXiv:2310.16450
#3847

Combining Axes Preconditioners through Kronecker Approximation for Deep Learning

Venkata Sai Surya Subramanyam Duvvuri, Fnu Devvrit, Rohan Anil et al.

ICLR 2024poster
#3848

MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

Xiang Yue, Xingwei Qu, Ge Zhang et al.

ICLR 2024spotlightarXiv:2309.05653
#3849

LEMON: Lossless model expansion

Yite Wang, Jiahao Su, Hanlin Lu et al.

ICLR 2024posterarXiv:2310.07999
#3850

Deceptive Fairness Attacks on Graphs via Meta Learning

Jian Kang, Yinglong Xia, Ross Maciejewski et al.

ICLR 2024posterarXiv:2310.15653
#3851

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

Yi Li, Honghao Lin, David Woodruff

ICLR 2024posterarXiv:2408.08494
#3852

CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

Sreyan Ghosh, Ashish Seth, Sonal Kumar et al.

ICLR 2024posterarXiv:2310.08753
#3853

Towards Robust Out-of-Distribution Generalization Bounds via Sharpness

Yingtian Zou, Kenji Kawaguchi, Yingnan Liu et al.

ICLR 2024spotlightarXiv:2403.06392
#3854

A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models

Enshu Liu, Xuefei Ning, Huazhong Yang et al.

ICLR 2024posterarXiv:2312.07243
#3855

DiffusionSat: A Generative Foundation Model for Satellite Imagery

Samar Khanna, Patrick Liu, Linqi Zhou et al.

ICLR 2024oralarXiv:2312.03606
#3856

Denoising Diffusion Bridge Models

Linqi Zhou, Aaron Lou, Samar Khanna et al.

ICLR 2024posterarXiv:2309.16948
#3857

Causal Inference with Conditional Front-Door Adjustment and Identifiable Variational Autoencoder

Ziqi Xu, Debo Cheng, Jiuyong Li et al.

ICLR 2024posterarXiv:2310.01937
#3858

Impact of Computation in Integral Reinforcement Learning for Continuous-Time Control

Wenhan Cao, Wei Pan

ICLR 2024spotlightarXiv:2402.17375
#3859

Generalized Schrödinger Bridge Matching

Guan-Horng Liu, Yaron Lipman, Maximilian Nickel et al.

ICLR 2024posterarXiv:2310.02233
#3860

Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning

Patrik Okanovic, Roger Waleffe, Vasilis Mageirakos et al.

ICLR 2024posterarXiv:2305.18424
#3861

Privately Aligning Language Models with Reinforcement Learning

Fan Wu, Huseyin Inan, Arturs Backurs et al.

ICLR 2024posterarXiv:2310.16960
#3862

Let's Verify Step by Step

Hunter Lightman, Vineet Kosaraju, Yuri Burda et al.

ICLR 2024posterarXiv:2305.20050
#3863

Uncertainty Quantification via Stable Distribution Propagation

Felix Petersen, Aashwin Mishra, Hilde Kuehne et al.

ICLR 2024posterarXiv:2402.08324
#3864

LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors

Sheng JIn, Xueying Jiang, Jiaxing Huang et al.

ICLR 2024posterarXiv:2402.04630
#3865

SKILL-MIX: a Flexible and Expandable Family of Evaluations for AI Models

Dingli Yu, Simran Kaur, Arushi Gupta et al.

ICLR 2024posterarXiv:2310.17567
#3866

LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents

Jae-Woo Choi, Youngwoo Yoon, Youngwoo Yoon et al.

ICLR 2024posterarXiv:2402.08178
#3867

Fast-ELECTRA for Efficient Pre-training

Chengyu Dong, Liyuan Liu, Hao Cheng et al.

ICLR 2024posterarXiv:2310.07347
#3868

Skip-Attention: Improving Vision Transformers by Paying Less Attention

Shashank Venkataramanan, Amir Ghodrati, Yuki Asano et al.

ICLR 2024posterarXiv:2301.02240
#3869

Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning

Linhao Luo, Yuan-Fang Li, Reza Haffari et al.

ICLR 2024posterarXiv:2310.01061
#3870

Fiber Monte Carlo

Nick Richardson, Deniz Oktay, Yaniv Ovadia et al.

ICLR 2024poster
#3871

A Framework and Benchmark for Deep Batch Active Learning for Regression

David Holzmüller, Viktor Zaverkin, Johannes Kästner et al.

ICLR 2024posterarXiv:2203.09410
#3872

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation

Jiaming Liu, Senqiao Yang, Peidong Jia et al.

ICLR 2024posterarXiv:2306.04344
#3873

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Zhibin Gou, Zhihong Shao, Yeyun Gong et al.

ICLR 2024posterarXiv:2309.17452
#3874

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

Rui Yang, Han Zhong, Jiawei Xu et al.

ICLR 2024spotlightarXiv:2310.12955
#3875

Vanishing Gradients in Reinforcement Finetuning of Language Models

Noam Razin, Hattie Zhou, Omid Saremi et al.

ICLR 2024posterarXiv:2310.20703
#3876

Effective and Efficient Federated Tree Learning on Hybrid Data

Qinbin Li, Chulin Xie, Xiaojun Xu et al.

ICLR 2024posterarXiv:2310.11865
#3877

Boosting the Adversarial Robustness of Graph Neural Networks: An OOD Perspective

Kuan Li, YiWen Chen, Yang Liu et al.

ICLR 2024poster
#3878

SetCSE: Set Operations using Contrastive Learning of Sentence Embeddings

Kang Liu

ICLR 2024posterarXiv:2404.17606
#3879

Unsupervised Pretraining for Fact Verification by Language Model Distillation

Adrian Bazaga, Pietro Lio, Gos Micklem

ICLR 2024posterarXiv:2309.16540
#3880

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami et al.

ICLR 2024posterarXiv:2311.18207
#3881

Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining

Licong Lin, Yu Bai, Song Mei

ICLR 2024posterarXiv:2310.08566
#3882

Improving Convergence and Generalization Using Parameter Symmetries

Bo Zhao, Robert M. Gower, Robin Walters et al.

ICLR 2024posterarXiv:2305.13404
#3883

COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits

Mintong Kang, Nezihe Merve Gürel, Linyi Li et al.

ICLR 2024posterarXiv:2403.11348
#3884

Manifold Preserving Guided Diffusion

Yutong He, Naoki Murata, Chieh-Hsin Lai et al.

ICLR 2024posterarXiv:2311.16424
#3885

Threaten Spiking Neural Networks through Combining Rate and Temporal Information

Zecheng Hao, Tong Bu, Xinyu Shi et al.

ICLR 2024oral
#3886

Federated Recommendation with Additive Personalization

Zhiwei Li, Guodong Long, Tianyi Zhou

ICLR 2024posterarXiv:2301.09109
#3887

Correlated Noise Provably Beats Independent Noise for Differentially Private Learning

Christopher Choquette-Choo, Krishnamurthy Dvijotham, Krishna Pillutla et al.

ICLR 2024posterarXiv:2310.06771
#3888

On the Stability of Expressive Positional Encodings for Graphs

Yinan Huang, William Lu, Joshua Robinson et al.

ICLR 2024posterarXiv:2310.02579
#3889

Evaluating Representation Learning on the Protein Structure Universe

Arian Jamasb, Alex Morehead, Chaitanya Joshi et al.

ICLR 2024posterarXiv:2406.13864
#3890

On the Hardness of Constrained Cooperative Multi-Agent Reinforcement Learning

Ziyi Chen, Yi Zhou, Heng Huang

ICLR 2024poster
#3891

Off-Policy Primal-Dual Safe Reinforcement Learning

Zifan Wu, Bo Tang, Qian Lin et al.

ICLR 2024posterarXiv:2401.14758
#3892

When should we prefer Decision Transformers for Offline Reinforcement Learning?

Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard et al.

ICLR 2024posterarXiv:2305.14550
#3893

ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning

Jiecheng Lu, Xu Han, Shihao Yang

ICLR 2024oralarXiv:2310.09488
#3894

SAS: Structured Activation Sparsification

Yusuke Sekikawa, Shingo Yashima

ICLR 2024poster
#3895

Threshold-Consistent Margin Loss for Open-World Deep Metric Learning

Qin ZHANG, Linghan Xu, Jun Fang et al.

ICLR 2024posterarXiv:2307.04047
#3896

Improved statistical and computational complexity of the mean-field Langevin dynamics under structured data

Atsushi Nitanda, Kazusato Oko, Taiji Suzuki et al.

ICLR 2024poster
#3897

Bridging Neural and Symbolic Representations with Transitional Dictionary Learning

Junyan Cheng, Peter Chin

ICLR 2024posterarXiv:2308.02000
#3898

Scale-Adaptive Diffusion Model for Complex Sketch Synthesis

Jijin Hu, Ke Li, Yonggang Qi et al.

ICLR 2024poster
#3899

Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials

Ivan Grega, Ilyes Batatia, Gábor Csányi et al.

ICLR 2024posterarXiv:2401.16914
#3900

SALMON: Self-Alignment with Instructable Reward Models

Zhiqing Sun, Yikang Shen, Hongxin Zhang et al.

ICLR 2024posterarXiv:2310.05910
#3901

Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs

Feiyang Kang, Hoang Anh Just, Yifan Sun et al.

ICLR 2024posterarXiv:2405.02774
#3902

The Joint Effect of Task Similarity and Overparameterization on Catastrophic Forgetting — An Analytical Model

Daniel Goldfarb, Itay Evron, Nir Weinberger et al.

ICLR 2024posterarXiv:2401.12617
#3903

Compositional Preference Models for Aligning LMs

DONGYOUNG GO, Tomek Korbak, Germàn Kruszewski et al.

ICLR 2024posterarXiv:2310.13011
#3904

Diffusion Posterior Sampling for Linear Inverse Problem Solving: A Filtering Perspective

Zehao Dou, Yang Song

ICLR 2024poster
#3905

Demystifying Local & Global Fairness Trade-offs in Federated Learning Using Partial Information Decomposition

Faisal Hamman, Sanghamitra Dutta

ICLR 2024poster
#3906

Generative Modeling with Phase Stochastic Bridge

Tianrong Chen, Jiatao Gu, Laurent Dinh et al.

ICLR 2024posterarXiv:2310.07805
#3907

Tailoring Self-Rationalizers with Multi-Reward Distillation

Sahana Ramnath, Brihi Joshi, Skyler Hallinan et al.

ICLR 2024posterarXiv:2311.02805
#3908

Controlling Vision-Language Models for Multi-Task Image Restoration

Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao et al.

ICLR 2024posterarXiv:2310.01018
#3909

Measuring Vision-Language STEM Skills of Neural Models

Jianhao Shen, Ye Yuan, Srbuhi Mirzoyan et al.

ICLR 2024posterarXiv:2402.17205
#3910

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Qingyan Guo, Rui Wang, Junliang Guo et al.

ICLR 2024poster
#3911

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers

Kai Shen, Zeqian Ju, Xu Tan et al.

ICLR 2024spotlightarXiv:2304.09116
#3912

How connectivity structure shapes rich and lazy learning in neural circuits

Yuhan Helena Liu, Aristide Baratin, Jonathan Cornford et al.

ICLR 2024posterarXiv:2310.08513
#3913

NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks

Wenxi Wang, Yang Hu, Mohit Tiwari et al.

ICLR 2024posterarXiv:2110.14053
#3914

Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation

Tien Manh Luong, Khai Nguyen, Nhat Ho et al.

ICLR 2024posterarXiv:2405.10084
#3915

Branch-GAN: Improving Text Generation with (not so) Large Language Models

Fredrik Carlsson, Johan Broberg, Erik Hillbom et al.

ICLR 2024poster
#3916

A unique M-pattern for micro-expression spotting in long videos

Jinxuan Wang, Shiting Xu, Tong Zhang

ICLR 2024poster
#3917

Internal Cross-layer Gradients for Extending Homogeneity to Heterogeneity in Federated Learning

Yun-Hin Chan, Rui Zhou, Running Zhao et al.

ICLR 2024spotlightarXiv:2308.11464
#3918

iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Yong Liu, Tengge Hu, Haoran Zhang et al.

ICLR 2024oralarXiv:2310.06625
#3919

Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets

Yihuan Mao, Chengjie Wu, Xi Chen et al.

ICLR 2024oral
#3920

Demystifying Embedding Spaces using Large Language Models

Guy Tennenholtz, Yinlam Chow, ChihWei Hsu et al.

ICLR 2024posterarXiv:2310.04475
#3921

Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.

Raj Ghugare, Matthieu Geist, Glen Berseth et al.

ICLR 2024oralarXiv:2401.11237
#3922

Learning Thresholds with Latent Values and Censored Feedback

Jiahao Zhang, Tao Lin, Weiqiang Zheng et al.

ICLR 2024posterarXiv:2312.04653
#3923

Extending Power of Nature from Binary to Real-Valued Graph Learning in Real World

Chunshu Wu, Ruibing Song, Chuan Liu et al.

ICLR 2024poster
#3924

Guess & Sketch: Language Model Guided Transpilation

Celine Lee, Abdulrahman Mahmoud, Michal Kurek et al.

ICLR 2024posterarXiv:2309.14396
#3925

An Investigation of Representation and Allocation Harms in Contrastive Learning

Subha Maity, Mayank Agarwal, Mikhail Yurochkin et al.

ICLR 2024posterarXiv:2310.01583
#3926

Channel Vision Transformers: An Image Is Worth 1 x 16 x 16 Words

Yujia Bao, Srinivasan Sivanandan, THEOFANIS KARALETSOS

ICLR 2024posterarXiv:2309.16108
#3927

ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models

Iman Mirzadeh, Keivan Alizadeh-Vahid, Sachin Mehta et al.

ICLR 2024posterarXiv:2310.04564
#3928

Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Yingyu Lin, Yian Ma, Yu-Xiang Wang et al.

ICLR 2024posterarXiv:2310.14661
#3929

Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models

Erfan Shayegani, Yue Dong, Nael Abu-Ghazaleh

ICLR 2024spotlightarXiv:2307.14539
#3930

Scalable Modular Network: A Framework for Adaptive Learning via Agreement Routing

Minyang Hu, Hong Chang, Bingpeng Ma et al.

ICLR 2024poster
#3931

Recursive Generalization Transformer for Image Super-Resolution

Zheng Chen, Yulun Zhang, Jinjin Gu et al.

ICLR 2024posterarXiv:2303.06373
#3932

Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data

Binghui Li, Yuanzhi Li

ICLR 2025posterarXiv:2410.08503
#3933

Treatment Effects Estimation By Uniform Transformer

Ruoqi Yu, Shulei Wang

ICLR 2024posterarXiv:2008.03738
#3934

Representation Deficiency in Masked Language Modeling

Yu Meng, Jitin Krishnan, Sinong Wang et al.

ICLR 2024posterarXiv:2302.02060
#3935

Sampling Multimodal Distributions with the Vanilla Score: Benefits of Data-Based Initialization

Frederic Koehler, Thuy-Duong Vuong

ICLR 2024posterarXiv:2310.01762
#3936

FedInverse: Evaluating Privacy Leakage in Federated Learning

DI WU, Jun Bai, Yiliao Song et al.

ICLR 2024poster
#3937

Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization

Yibing Liu, Chris Xing TIAN, Haoliang Li et al.

ICLR 2024spotlightarXiv:2306.02879
#3938

Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

Xuefei Ning, Zinan Lin, Zixuan Zhou et al.

ICLR 2024posterarXiv:2307.15337
#3939

Neural Rate Control for Learned Video Compression

yiwei zhang, Guo Lu, Yunuo Chen et al.

ICLR 2024oral
#3940

Sliced Denoising: A Physics-Informed Molecular Pre-Training Method

yuyan ni, Shikun Feng, Wei-Ying Ma et al.

ICLR 2024posterarXiv:2311.02124
#3941

Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection

Jiawei Liang, Siyuan Liang, Aishan Liu et al.

ICLR 2024spotlightarXiv:2402.11473
#3942

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

Zhaoyi Zhou, Chuning Zhu, Runlong Zhou et al.

ICLR 2024posterarXiv:2310.19308
#3943

Simple Hierarchical Planning with Diffusion

Chang Chen, Fei Deng, Kenji Kawaguchi et al.

ICLR 2024oralarXiv:2401.02644
#3944

Dynamic Sparse Training with Structured Sparsity

Mike Lasby, Anna Golubeva, Utku Evci et al.

ICLR 2024posterarXiv:2305.02299
#3945

DENEVIL: TOWARDS DECIPHERING AND NAVIGATING THE ETHICAL VALUES OF LARGE LANGUAGE MODELS VIA INSTRUCTION LEARNING

Shitong Duan, Xiaoyuan Yi, Peng Zhang et al.

ICLR 2024oralarXiv:2310.11053
#3946

Robustifying State-space Models for Long Sequences via Approximate Diagonalization

Annan Yu, Arnur Nigmetov, Dmitriy Morozov et al.

ICLR 2024spotlightarXiv:2310.01698
#3947

Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models

Shuai Zhao, Xiaohan Wang, Linchao Zhu et al.

ICLR 2024posterarXiv:2305.18010
#3948

Efficient Inverse Multiagent Learning

Denizalp Goktas, Amy Greenwald, Sadie Zhao et al.

ICLR 2024spotlightarXiv:2502.14160
#3949

C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion

Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee et al.

ICLR 2024posterarXiv:2403.14119
#3950

Plugin estimators for selective classification with out-of-distribution detection

Harikrishna Narasimhan, Aditya Krishna Menon, Wittawat Jitkrittum et al.

ICLR 2024posterarXiv:2301.12386
#3951

Adaptive Stochastic Gradient Algorithm for Black-box Multi-Objective Learning

Feiyang YE, YUEMING LYU, Xuehao Wang et al.

ICLR 2024poster
#3952

Intriguing Properties of Data Attribution on Diffusion Models

Xiaosen Zheng, Tianyu Pang, Chao Du et al.

ICLR 2024posterarXiv:2311.00500
#3953

Provably Efficient UCB-type Algorithms For Learning Predictive State Representations

Ruiquan Huang, Yingbin Liang, Jing Yang

ICLR 2024posterarXiv:2307.00405
#3954

Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models

Shikun Sun, Longhui Wei, Zhicai Wang et al.

ICLR 2024poster
#3955

Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory

Yiting Chen, Zhanpeng Zhou, Junchi Yan

ICLR 2024posterarXiv:2310.06756
#3956

Learning Large DAGs is Harder than you Think: Many Losses are Minimal for the Wrong DAG

Jonas Seng, Matej Zečević, Devendra Singh Dhami et al.

ICLR 2024poster
#3957

Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization

Joe Benton, Valentin De Bortoli, Arnaud Doucet et al.

ICLR 2024spotlightarXiv:2308.03686
#3958

Active Retrosynthetic Planning Aware of Route Quality

Luotian Yuan, Yemin Yu, Ying Wei et al.

ICLR 2024poster
#3959

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy

Pingzhi Li, Zhenyu Zhang, Prateek Yadav et al.

ICLR 2024spotlightarXiv:2310.01334
#3960

Non-Exchangeable Conformal Risk Control

António Farinhas, Chrysoula Zerva, Dennis Ulmer et al.

ICLR 2024posterarXiv:2310.01262
#3961

Early Stopping Against Label Noise Without Validation Data

Suqin Yuan, Lei Feng, Tongliang Liu

ICLR 2024posterarXiv:2502.07551
#3962

Enhancing Transfer Learning with Flexible Nonparametric Posterior Sampling

Hyungi Lee, Giung Nam, Edwin Fong et al.

ICLR 2024posterarXiv:2403.07282
#3963

Neural Contractive Dynamical Systems

Hadi Beik Mohammadi, Søren Hauberg, Georgios Arvanitidis et al.

ICLR 2024spotlightarXiv:2401.09352
#3964

Energy-based Automated Model Evaluation

Ru Peng, Heming Zou, Haobo Wang et al.

ICLR 2024posterarXiv:2401.12689
#3965

Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games

Stephen McAleer, John Banister Lanier, Kevin A. Wang et al.

ICLR 2024poster
#3966

Towards Robust and Efficient Cloud-Edge Elastic Model Adaptation via Selective Entropy Distillation

Yaofo Chen, Shuaicheng Niu, Yaowei Wang et al.

ICLR 2024posterarXiv:2402.17316
#3967

Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning

Mustafa Shukor, Alexandre Rame, Corentin Dancette et al.

ICLR 2024posterarXiv:2310.00647
#3968

The Trickle-down Impact of Reward Inconsistency on RLHF

Lingfeng Shen, Lingfeng Shen, Sihao Chen et al.

ICLR 2024poster
#3969

Better Neural PDE Solvers Through Data-Free Mesh Movers

Peiyan Hu, Yue Wang, Zhi-Ming Ma

ICLR 2024posterarXiv:2312.05583
#3970

Memorization Capacity of Multi-Head Attention in Transformers

Sadegh Mahdavi, Renjie Liao, Christos Thrampoulidis

ICLR 2024spotlightarXiv:2306.02010
#3971

Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts

Ruipeng Zhang, Ziqing Fan, Jiangchao Yao et al.

ICLR 2024posterarXiv:2405.18861
#3972

Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

Jiawei Ge, Shange Tang, Jianqing Fan et al.

ICLR 2024posterarXiv:2311.15961
#3973

A Sublinear Adversarial Training Algorithm

Yeqi Gao, Lianke Qin, Zhao Song et al.

ICLR 2024posterarXiv:2208.05395
#3974

Performance Gaps in Multi-view Clustering under the Nested Matrix-Tensor Model

Hugo Lebeau, Mohamed El Amine Seddik, José Henrique Goulart

ICLR 2024posterarXiv:2402.10677
#3975

ZeRO++: Extremely Efficient Collective Communication for Large Model Training

Guanhua Wang, Heyang Qin, Sam Jacobs et al.

ICLR 2024poster
#3976

CausalLM is not optimal for in-context learning

Nan Ding, Tomer Levinboim, Jialin Wu et al.

ICLR 2024posterarXiv:2308.06912
#3977

An Unforgeable Publicly Verifiable Watermark for Large Language Models

Aiwei Liu, Leyi Pan, Xuming Hu et al.

ICLR 2024posterarXiv:2307.16230
#3978

Class Probability Matching with Calibrated Networks for Label Shift Adaption

Hongwei Wen, Annika Betken, Hanyuan Hang

ICLR 2024poster
#3979

Memory-Assisted Sub-Prototype Mining for Universal Domain Adaptation

Yuxiang (YU-HSIANG) LAI, Yi Zhou, Xinghong Liu et al.

ICLR 2024posterarXiv:2310.05453
#3980

Ito Diffusion Approximation of Universal Ito Chains for Sampling, Optimization and Boosting

Aleksei Ustimenko, Aleksandr Beznosikov

ICLR 2024posterarXiv:2310.06081
#3981

Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

Pablo Pernías, Dominic Rampas, Mats L. Richter et al.

ICLR 2024poster
#3982

Sparse MoE with Language Guided Routing for Multilingual Machine Translation

Xinyu Zhao, Xuxi Chen, Yu Cheng et al.

ICLR 2024poster
#3983

Neural Architecture Retrieval

Xiaohuan Pei, Yanxi Li, Minjing Dong et al.

ICLR 2024posterarXiv:2307.07919
#3984

Neural SDF Flow for 3D Reconstruction of Dynamic Scenes

wei mao, Richard Hartley, Mathieu Salzmann et al.

ICLR 2024poster
#3985

Compressing LLMs: The Truth is Rarely Pure and Never Simple

AJAY JAISWAL, Zhe Gan, Xianzhi Du et al.

ICLR 2024posterarXiv:2310.01382
#3986

ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search

Yuchen Zhuang, Xiang Chen, Tong Yu et al.

ICLR 2024posterarXiv:2310.13227
#3987

LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models

Zecheng Tang, Zecheng Tang, Chenfei Wu et al.

ICLR 2024posterarXiv:2309.09506
#3988

Learning Mean Field Games on Sparse Graphs: A Hybrid Graphex Approach

Christian Fabian, Kai Cui, Heinz Koeppl

ICLR 2024posterarXiv:2401.12686
#3989

Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions

Satwik Bhattamishra, Arkil Patel, Phil Blunsom et al.

ICLR 2024posterarXiv:2310.03016
#3990

One-hot Generalized Linear Model for Switching Brain State Discovery

Chengrui Li, Soon Ho Kim, Chris Rodgers et al.

ICLR 2024oralarXiv:2310.15263
#3991

Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs

Ilan Naiman, N. Benjamin Erichson, Pu Ren et al.

ICLR 2024posterarXiv:2310.02619
#3992

Annealing Self-Distillation Rectification Improves Adversarial Training

Yu-Yu Wu, Hung-Jui Wang, Shang-Tse Chen

ICLR 2024posterarXiv:2305.12118
#3993

Simplifying, Stabilizing and Scaling Continuous-time Consistency Models

Cheng Lu, Yang Song

ICLR 2025posterarXiv:2410.11081
#3994

Rotation Has Two Sides: Evaluating Data Augmentation for Deep One-class Classification

Guodong Wang, Yunhong Wang, Xiuguo Bao et al.

ICLR 2024spotlight
#3995

Small-scale proxies for large-scale Transformer training instabilities

Mitchell Wortsman, Peter Liu, Lechao Xiao et al.

ICLR 2024posterarXiv:2309.14322
#3996

Lion Secretly Solves a Constrained Optimization: As Lyapunov Predicts

Lizhang Chen, Bo Liu, Kaizhao Liang et al.

ICLR 2024spotlight
#3997

Node2ket: Efficient High-Dimensional Network Embedding in Quantum Hilbert Space

Hao Xiong, Yehui Tang, Yunlin He et al.

ICLR 2024poster
#3998

Procedural Fairness Through Decoupling Objectionable Data Generating Components

Zeyu Tang, Jialu Wang, Yang Liu et al.

ICLR 2024spotlightarXiv:2311.14688
#3999

Scaling Supervised Local Learning with Augmented Auxiliary Networks

Chenxiang Ma, Jibin Wu, Chenyang Si et al.

ICLR 2024posterarXiv:2402.17318
#4000

Scalable Monotonic Neural Networks

Hyunho Kim, Jong-Seok Lee

ICLR 2024poster