Most Cited ICLR Poster Papers
6,124 papers found • Page 19 of 31
Protein Language Model Fitness is a Matter of Preference
Cade Gordon, Amy Lu, Pieter Abbeel
RTDiff: Reverse Trajectory Synthesis via Diffusion for Offline Reinforcement Learning
Qianlan Yang, Yu-Xiong Wang
Do LLM Agents Have Regret? A Case Study in Online Learning and Games
Chanwoo Park, Xiangyu Liu, Asuman Ozdaglar et al.
Compute-Constrained Data Selection
Junjie Oscar Yin, Alexander Rush
Linear SCM Identification in the Presence of Confounders and Gaussian Noise
Vahideh Sanjaroonpouri, Pouria Ramazi
GNNCert: Deterministic Certification of Graph Neural Networks against Adversarial Perturbations
Zaishuo Xia, Han Yang, Binghui Wang et al.
Oracle Efficient Algorithms for Groupwise Regret
Krishna Acharya, Eshwar Ram Arunachaleswaran, Sampath Kannan et al.
Meta Continual Learning Revisited: Implicitly Enhancing Online Hessian Approximation via Variance Reduction
Yichen Wu, Long-Kai Huang, Renzhen Wang et al.
Label-Noise Robust Diffusion Models
Byeonghu Na, Yeongmin Kim, HeeSun Bae et al.
Unveiling the Unseen: Identifiable Clusters in Trained Depthwise Convolutional Kernels
Zahra Babaiee, Peyman Kiasari, Daniela Rus et al.
Towards Poisoning Fair Representations
Tianci Liu, Haoyu Wang, Feijie Wu et al.
Order-Preserving GFlowNets
Yihang Chen, Lukas Mauch
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
Deyao Zhu, Jun Chen, Xiaoqian Shen et al.
FITS: Modeling Time Series with $10k$ Parameters
Zhijian Xu, Ailing Zeng, Qiang Xu
Robust agents learn causal world models
Jonathan Richens, Tom Everitt
Constraint-Free Structure Learning with Smooth Acyclic Orientations
Riccardo Massidda, Francesco Landolfi, Martina Cinquini et al.
SEABO: A Simple Search-Based Method for Offline Imitation Learning
Jiafei Lyu, Xiaoteng Ma, Le Wan et al.
InstructDET: Diversifying Referring Object Detection with Generalized Instructions
Ronghao Dang, Jiangyan Feng, Haodong Zhang et al.
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations
Tianyu Guo, Wei Hu, Song Mei et al.
Efficient Integrators for Diffusion Generative Models
Kushagra Pandey, Maja Rudolph, Stephan Mandt
The Marginal Value of Momentum for Small Learning Rate SGD
Runzhe Wang, Sadhika Malladi, Tianhao Wang et al.
Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
Xiangyu Qi, Yi Zeng, Tinghao Xie et al.
CLIP the Bias: How Useful is Balancing Data in Multimodal Learning?
Ibrahim Alabdulmohsin, Xiao Wang, Andreas Steiner et al.
On the Power of the Weisfeiler-Leman Test for Graph Motif Parameters
Matthias Lanzinger, Pablo Barcelo
Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations
Giovanni De Felice, Andrea Cini, Daniele Zambon et al.
Point2SSM: Learning Morphological Variations of Anatomies from Point Clouds
Jadie Adams, Shireen Elhabian
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models
Ashutosh Baheti, Ximing Lu, Faeze Brahman et al.
Gradual Optimization Learning for Conformational Energy Minimization
Artem Tsypin, Leonid A. Ugadiarov, Kuzma Khrabrov et al.
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Sihan Chen, Xingjian He, Handong Li et al.
FedWon: Triumphing Multi-domain Federated Learning Without Normalization
Weiming Zhuang, Lingjuan Lyu
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models
Jing Liu, Ruihao Gong, Xiuying Wei et al.
Self-supervised Representation Learning from Random Data Projectors
Yi Sui, Tongzi Wu, Jesse Cresswell et al.
Privileged Sensing Scaffolds Reinforcement Learning
Edward Hu, James Springer, Oleh Rybkin et al.
Fully Hyperbolic Convolutional Neural Networks for Computer Vision
Ahmad Bdeir, Kristian Schwethelm, Niels Landwehr
Cameras as Rays: Pose Estimation via Ray Diffusion
Jason Zhang, Amy Lin, Moneish Kumar et al.
Vision-by-Language for Training-Free Compositional Image Retrieval
Shyamgopal Karthik, Karsten Roth, Massimiliano Mancini et al.
Single Motion Diffusion
Sigal Raab, Inbal Leibovitch, Guy Tevet et al.
DeepSPF: Spherical SO(3)-Equivariant Patches for Scan-to-CAD Estimation
Driton Salihu, Adam Misik, Yuankai Wu et al.
Non-negative Contrastive Learning
Yifei Wang, Qi Zhang, Yaoyu Guo et al.
Zero-Shot Robustification of Zero-Shot Models
Dyah Adila, Changho Shin, Linrong Cai et al.
Towards image compression with perfect realism at ultra-low bitrates
Marlene Careil, Matthew J Muckley, Jakob Verbeek et al.
BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference
Siqi Kou, Lei Gan, Dequan Wang et al.
Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization
Mohammad Pedramfar, Yididiya Nadew, Chris Quinn et al.
PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks
Junwei Su, Difan Zou, Chuan Wu
Dynamic Neural Fortresses: An Adaptive Shield for Model Extraction Defense
Siyu Luan, Zhenyi Wang, Li Shen et al.
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Guangsheng Bao, Yanbin Zhao, Zhiyang Teng et al.
Conformal Inductive Graph Neural Networks
Soroush H. Zargarbashi, Aleksandar Bojchevski
Generative Sliced MMD Flows with Riesz Kernels
Johannes Hertrich, Christian Wald, Fabian Altekrüger et al.
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Xichen Pan, Li Dong, Shaohan Huang et al.
BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection
Tinghao Xie, Xiangyu Qi, Ping He et al.
Self-Supervised Dataset Distillation for Transfer Learning
Dong Bok Lee, Seanie Lee, Joonho Ko et al.
LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints
Weidi Xu, Jingwei Wang, Lele Xie et al.
Leave-one-out Distinguishability in Machine Learning
Jiayuan Ye, Anastasia Borovykh, Soufiane Hayou et al.
Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting
Rong Dai, Yonggang Zhang, Ang Li et al.
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?
Wenxuan Li, Alan Yuille, Zongwei Zhou
Enhancing Instance-Level Image Classification with Set-Level Labels
Renyu Zhang, Aly Khan, Yuxin Chen et al.
Hyper Evidential Deep Learning to Quantify Composite Classification Uncertainty
Changbin Li, Kangshuo Li, Yuzhe Ou et al.
Leveraging Generative Models for Unsupervised Alignment of Neural Time Series Data
Ayesha Vermani, Il Memming Park, Josue Nassar
Safe and Robust Watermark Injection with a Single OoD Image
Shuyang Yu, Junyuan Hong, Haobo Zhang et al.
Elucidating the design space of classifier-guided diffusion generation
Jiajun Ma, Tianyang Hu, Wenjia Wang et al.
Periodicity Decoupling Framework for Long-term Series Forecasting
Tao Dai, Beiliang Wu, Peiyuan Liu et al.
General Stability Analysis for Zeroth-Order Optimization Algorithms
Xinyue Liu, Hualin Zhang, Bin Gu et al.
Context is Environment
Sharut Gupta, Stefanie Jegelka, David Lopez-Paz et al.
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Xian Liu, Jian Ren, Aliaksandr Siarohin et al.
EQA-MX: Embodied Question Answering using Multimodal Expression
Md Mofijul Islam, Alexi Gladstone, Riashat Islam et al.
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling
Huangjie Zheng, Zhendong Wang, Jianbo Yuan et al.
DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genomes
Zhihan Zhou, Yanrong Ji, Weijian Li et al.
PanoDiffusion: 360-degree Panorama Outpainting via Diffusion
Tianhao Wu, Chuanxia Zheng, Tat-Jen Cham
Making LLaMA SEE and Draw with SEED Tokenizer
Yuying Ge, Sijie Zhao, Ziyun Zeng et al.
A Characterization Theorem for Equivariant Networks with Point-wise Activations
Marco Pacini, Xiaowen Dong, Bruno Lepri et al.
Exploring the Common Appearance-Boundary Adaptation for Nighttime Optical Flow
Hanyu Zhou, Yi Chang, Haoyue Liu et al.
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Dan Fu, Hermann Kumbong, Eric Nguyen et al.
Continuous Invariance Learning
LIN Yong, Fan Zhou, Lu Tan et al.
IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs
Yuzhen Mao, Martin Ester, Ke Li
Modeling state-dependent communication between brain regions with switching nonlinear dynamical systems
Orren Karniol-Tambour, David Zoltowski, E. Mika Diamanti et al.
Structuring Representation Geometry with Rotationally Equivariant Contrastive Learning
Sharut Gupta, Joshua Robinson, Derek Lim et al.
TEDDY: Trimming Edges with Degree-based Discrimination Strategy
Hyunjin Seo, Jihun Yun, Eunho Yang
RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems
Tianyang Liu, Canwen Xu, Julian McAuley
Structural Fairness-aware Active Learning for Graph Neural Networks
Haoyu Han, Xiaorui Liu, Li Ma et al.
Scalable Neural Network Kernels
Arijit Sehanobish, Krzysztof Choromanski, YUNFAN ZHAO et al.
Language Model Detectors Are Easily Optimized Against
Charlotte Nicks, Eric Mitchell, Rafael Rafailov et al.
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing
Xinyu Hu, Pengfei Tang, Simiao Zuo et al.
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
Junyan Li, Delin Chen, Yining Hong et al.
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes
Ruiquan Huang, Yuan Cheng, Jing Yang et al.
Like Oil and Water: Group Robustness Methods and Poisoning Defenses May Be at Odds
Michael-Andrei Panaitescu-Liess, Yigitcan Kaya, Sicheng Zhu et al.
Geometrically Aligned Transfer Encoder for Inductive Transfer in Regression Tasks
Sung Moon Ko, Sumin Lee, Dae-Woong Jeong et al.
Knowledge Fusion of Large Language Models
Fanqi Wan, Xinting Huang, Deng Cai et al.
Evaluating Language Model Agency Through Negotiations
Tim R. Davidson, Veniamin Veselovsky, Michal Kosinski et al.
DDMI: Domain-agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations
Dogyun Park, Sihyeon Kim, Sojin Lee et al.
Sentence-level Prompts Benefit Composed Image Retrieval
Yang Bai, Xinxing Xu, Yong Liu et al.
Light-MILPopt: Solving Large-scale Mixed Integer Linear Programs with Lightweight Optimizer and Small-scale Training Dataset
Huigen Ye, Hua Xu, Hongyan Wang
SineNet: Learning Temporal Dynamics in Time-Dependent Partial Differential Equations
Xuan Zhang, Jacob Helwig, Yuchao Lin et al.
Test-time Adaptation against Multi-modal Reliability Bias
Mouxing Yang, Yunfan Li, Changqing Zhang et al.
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Victoria Lin, Xilun Chen, Mingda Chen et al.
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis
Izzeddin Gur, Hiroki Furuta, Austin Huang et al.
Outliers with Opposing Signals Have an Outsized Effect on Neural Network Optimization
Elan Rosenfeld, Andrej Risteski
Rethinking the symmetry-preserving circuits for constrained variational quantum algorithms
Ge Yan, Hongxu Chen, Kaisen Pan et al.
Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
Mert Yuksekgonul, Varun Chandrasekaran, Erik Jones et al.
Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies
Xiangyu Liu, Chenghao Deng, Yanchao Sun et al.
Contrastive Difference Predictive Coding
Chongyi Zheng, Ruslan Salakhutdinov, Benjamin Eysenbach
Efficient-3Dim: Learning a Generalizable Single-image Novel-view Synthesizer in One Day
Yifan Jiang, Hao Tang, Jen-Hao Chang et al.
Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval
Yongchao Du, Min Wang, Wengang Zhou et al.
FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs
Sepehr Dehdashtian, Lan Wang, Vishnu Boddeti
Asymptotically Free Sketched Ridge Ensembles: Risks, Cross-Validation, and Tuning
Pratik Patil, Daniel LeJeune
CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling
Seyedmorteza Sadat, Jakob Buhmann, Derek Bradley et al.
Future Language Modeling from Temporal Document History
Changmao Li, Jeffrey Flanigan
Kalman Filter for Online Classification of Non-Stationary Data
Michalis Titsias, Alexandre Galashov, Amal Rannen-Triki et al.
Unbiased Watermark for Large Language Models
Zhengmian Hu, Lichang Chen, Xidong Wu et al.
Behaviour Distillation
Andrei Lupu, Chris Lu, Jarek Liesen et al.
Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
Sujay Bhatt, Alec Koppel, Sumitra Ganesh et al.
Mitigating Emergent Robustness Degradation while Scaling Graph Learning
Xiangchi Yuan, Chunhui Zhang, Yijun Tian et al.
CAMBranch: Contrastive Learning with Augmented MILPs for Branching
Jiacheng Lin, Meng XU, Zhihua Xiong et al.
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks
Hao Chen, Jindong Wang, Ankit Parag Shah et al.
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs
Miao Xiong, Zhiyuan Hu, Xinyang Lu et al.
Supervised Knowledge Makes Large Language Models Better In-context Learners
Linyi Yang, Shuibai Zhang, Zhuohao Yu et al.
Conversational Drug Editing Using Retrieval and Domain Feedback
Shengchao Liu, Jiongxiao Wang, Yijin Yang et al.
Discovering Temporally-Aware Reinforcement Learning Algorithms
Matthew T Jackson, Chris Lu, Louis Kirsch et al.
ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs
Yogesh Verma, Markus Heinonen, Vikas Garg
Input-gradient space particle inference for neural network ensembles
Trung Trinh, Markus Heinonen, Luigi Acerbi et al.
Towards Reliable and Efficient Backdoor Trigger Inversion via Decoupling Benign Features
Xiong Xu, Kunzhe Huang, Yiming Li et al.
Sample-Efficient Linear Representation Learning from Non-IID Non-Isotropic Data
Thomas T. Zhang, Leonardo Felipe Toso, James Anderson et al.
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints
Chaoqi Wang, Yibo Jiang, Chenghao Yang et al.
SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation
Qianxu Wang, Haotong Zhang, Congyue Deng et al.
Experimental Design for Multi-Channel Imaging via Task-Driven Feature Selection
Stefano Blumberg, Paddy Slator, Daniel Alexander
Benign Overfitting and Grokking in ReLU Networks for XOR Cluster Data
Zhiwei Xu, Yutong Wang, Spencer Frei et al.
New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions
Xinzhe Yuan, William de Vazelhes, Bin Gu et al.
AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models
Xiaogeng Liu, Nan Xu, Muhao Chen et al.
Modulate Your Spectrum in Self-Supervised Learning
Xi Weng, Yunhao Ni, Tengwei Song et al.
Graph Parsing Networks
Yunchong Song, Siyuan Huang, Xinbing Wang et al.
Optimal transport based adversarial patch to leverage large scale attack transferability
Pol Labarbarie, Adrien CHAN-HON-TONG, Stéphane Herbin et al.
An Agnostic View on the Cost of Overfitting in (Kernel) Ridge Regression
Lijia Zhou, James Simon, Gal Vardi et al.
A Versatile Causal Discovery Framework to Allow Causally-Related Hidden Variables
Xinshuai Dong, Biwei Huang, Ignavier Ng et al.
KW-Design: Pushing the Limit of Protein Design via Knowledge Refinement
Zhangyang Gao, Cheng Tan, Xingran Chen et al.
MOTOR: A Time-to-Event Foundation Model For Structured Medical Records
Ethan Steinberg, Jason Fries, Yizhe Xu et al.
Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit
Duanyi YAO, Songze Li, Ye XUE et al.
Learning with Mixture of Prototypes for Out-of-Distribution Detection
Haodong Lu, Dong Gong, Shuo Wang et al.
Magnitude Invariant Parametrizations Improve Hypernetwork Learning
Jose Javier Gonzalez Ortiz, John Guttag, Adrian Dalca
Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
Shashank Venkataramanan, Mamshad Nayeem Rizve, Joao Carreira et al.
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou, Zhihong Shao, Yeyun Gong et al.
Forward Learning with Top-Down Feedback: Empirical and Analytical Characterization
Ravi Srinivasan, Francesca Mignacco, Martino Sorbaro et al.
Pseudo-Generalized Dynamic View Synthesis from a Video
Xiaoming Zhao, R Colburn, Fangchang Ma et al.
Maximum Entropy Heterogeneous-Agent Reinforcement Learning
Jiarong Liu, Yifan Zhong, Siyi Hu et al.
Nougat: Neural Optical Understanding for Academic Documents
Lukas Blecher, Guillem Cucurull Preixens, Thomas Scialom et al.
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models
Mengkang Hu, Yao Mu, Xinmiao Yu et al.
Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents
Yang Deng, Wenxuan Zhang, Wai Lam et al.
Illusory Attacks: Information-theoretic detectability matters in adversarial attacks
Tim Franzmeyer, Stephen McAleer, Joao F. Henriques et al.
Independent-Set Design of Experiments for Estimating Treatment and Spillover Effects under Network Interference
Chencheng Cai, Xu Zhang, Edoardo Airoldi
Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency
Soumyadeep Pal, Yuguang Yao, Ren Wang et al.
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models
Licheng Wen, DAOCHENG FU, Xin Li et al.
Chain of Hindsight aligns Language Models with Feedback
Hao Liu, Carmelo Sferrazza, Pieter Abbeel
Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search
Qihao Liu, Adam Kortylewski, Yutong Bai et al.
MAP IT to Visualize Representations
Robert Jenssen
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
Lifan Yuan, Yangyi Chen, Xingyao Wang et al.
TRENDy: Temporal Regression of Effective Nonlinear Dynamics
Matthew Ricci, Guy Pelc, Zoe Piran et al.
Diverse Projection Ensembles for Distributional Reinforcement Learning
Moritz Akiya Zanger, Wendelin Boehmer, Matthijs T. J. Spaan
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity
Aditya Bhatt, Daniel Palenicek, Boris Belousov et al.
Mask-Based Modeling for Neural Radiance Fields
Ganlin Yang, Guoqiang Wei, Zhizheng Zhang et al.
MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection
Yuxue Yang, Lue Fan, Zhaoxiang Zhang
Algorithms for Caching and MTS with reduced number of predictions
Karim Ahmed Abdel Sadek, Marek Elias
Towards Identifiable Unsupervised Domain Translation: A Diversified Distribution Matching Approach
Sagar Shrestha, Xiao Fu
Neural Atoms: Propagating Long-range Interaction in Molecular Graphs through Efficient Communication Channel
Xuan Li, Zhanke Zhou, Jiangchao Yao et al.
Optimistic Bayesian Optimization with Unknown Constraints
Quoc Phong Nguyen, Wan Theng Ruth Chew, Le Song et al.
The Expressive Power of Low-Rank Adaptation
Yuchen Zeng, Kangwook Lee
Retrieval meets Long Context Large Language Models
Peng Xu, Wei Ping, Xianchao Wu et al.
Hiding in Plain Sight: Disguising Data Stealing Attacks in Federated Learning
Kostadin Garov, Dimitar I. Dimitrov, Nikola Jovanović et al.
Deep Reinforcement Learning Guided Improvement Heuristic for Job Shop Scheduling
Cong Zhang, Zhiguang Cao, Wen Song et al.
Pre-Training and Fine-Tuning Generative Flow Networks
Ling Pan, Moksh Jain, Kanika Madan et al.
Debiased Collaborative Filtering with Kernel-Based Causal Balancing
Haoxuan Li, Chunyuan Zheng, Yanghao Xiao et al.
Reverse Forward Curriculum Learning for Extreme Sample and Demo Efficiency
Stone Tao, Arth Shukla, Tse-kai Chan et al.
A Symmetry-Aware Exploration of Bayesian Neural Network Posteriors
Olivier Laurent, Emanuel Aldea, Gianni Franchi
SEA: Sparse Linear Attention with Estimated Attention Mask
Heejun Lee, Jina Kim, Jeff Willette et al.
ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion
Shangyu Wu, Ying Xiong, Yufei CUI et al.
An Emulator for Fine-tuning Large Language Models using Small Language Models
Eric Mitchell, Rafael Rafailov, Archit Sharma et al.
Neural Optimal Transport with General Cost Functionals
Arip Asadulaev, Alexander Korotin, Vage Egiazarian et al.
Bounding the Expected Robustness of Graph Neural Networks Subject to Node Feature Attacks
Yassine ABBAHADDOU, Sofiane ENNADIR, Johannes Lutzeyer et al.
Expressive Losses for Verified Robustness via Convex Combinations
Alessandro De Palma, Rudy R Bunel, Krishnamurthy Dvijotham et al.
Language Model Cascades: Token-Level Uncertainty And Beyond
Neha Gupta, Harikrishna Narasimhan, Wittawat Jitkrittum et al.
Faithful Rule Extraction for Differentiable Rule Learning Models
Xiaxia Wang, David Jaime Tena Cucala, Bernardo Grau et al.
Grokking as a First Order Phase Transition in Two Layer Networks
Noa Rubin, Inbar Seroussi, Zohar Ringel
JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling
Jingyang Zhang, Shiwei Li, Yuanxun Lu et al.
A Hierarchical Bayesian Model for Few-Shot Meta Learning
Minyoung Kim, Timothy Hospedales
Neural Fine-Tuning Search for Few-Shot Learning
Panagiotis Eustratiadis, Łukasz Dudziak, Da Li et al.
FairTune: Optimizing Parameter Efficient Fine Tuning for Fairness in Medical Image Analysis
Raman Dutt, Ondrej Bohdal, Sotirios Tsaftaris et al.
Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework
Eliya Segev, Maya Alroy, Ronen Katsir et al.
DIG In: Evaluating Disparities in Image Generations with Indicators for Geographic Diversity
Melissa Hall, Candace Ross, Adina Williams et al.
Memorization in Self-Supervised Learning Improves Downstream Generalization
Wenhao Wang, Muhammad Ahmad Kaleem, Adam Dziedzic et al.
A Fast and Provable Algorithm for Sparse Phase Retrieval
Jian-Feng Cai, Yu Long, Ruixue WEN et al.
Multilingual Jailbreak Challenges in Large Language Models
Yue Deng, Wenxuan Zhang, Sinno Pan et al.
A 2-Dimensional State Space Layer for Spatial Inductive Bias
Ethan Baron, Itamar Zimerman, Lior Wolf
High-dimensional SGD aligns with emerging outlier eigenspaces
Gerard Ben Arous, Reza Gheissari, Jiaoyang Huang et al.
Det-CGD: Compressed Gradient Descent with Matrix Stepsizes for Non-Convex Optimization
Hanmin Li, Avetik Karagulyan, Peter Richtarik
Code Representation Learning at Scale
Dejiao Zhang, Wasi Ahmad, Ming Tan et al.
AutoChunk: Automated Activation Chunk for Memory-Efficient Deep Learning Inference
Xuanlei Zhao, Shenggan Cheng, Guangyang LU et al.
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou, Matteo Bettini, Sebastian Dittert et al.
MetaPhysiCa: Improving OOD Robustness in Physics-informed Machine Learning
S Chandra Mouli, Muhammad Alam, Bruno Ribeiro
Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
Marcus J. Min, Yangruibo Ding, Luca Buratti et al.
In-Context Learning through the Bayesian Prism
Madhur Panwar, Kabir Ahuja, Navin Goyal
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
Yiyang Zhou, Chenhang Cui, Jaehong Yoon et al.
Protein-Ligand Interaction Prior for Binding-aware 3D Molecule Diffusion Models
Zhilin Huang, Ling Yang, Xiangxin Zhou et al.
"What Data Benefits My Classifier?" Enhancing Model Performance and Interpretability through Influence-Based Data Selection
Anshuman Chhabra, Peizhao Li, Prasant Mohapatra et al.