Most Cited ICLR "activation communication" Papers
6,124 papers found • Page 30 of 31
How to visualize training dynamics in neural networks
Michael Hu, Shreyans Jain, Sangam Chaulagain et al.
Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity
Runyu Zhang, Yang Hu, Na Li
MotherNet: Fast Training and Inference via Hyper-Network Transformers
Andreas Mueller, Carlo Curino, Raghu Ramakrishnan
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Weitai Kang, Mengxue Qu, Jyoti Kini et al.
POGEMA: A Benchmark Platform for Cooperative Multi-Agent Pathfinding
Alexey Skrynnik, Anton Andreychuk, Anatolii Borzilov et al.
HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics
Fan, Sarah Martinson, Erik Wang et al.
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction
Lu Dai, Yijie Xu, Jinhui Ye et al.
CheapNet: Cross-attention on Hierarchical representations for Efficient protein-ligand binding Affinity Prediction
Hyukjun Lim, Sun Kim, Sangseon Lee
Probabilistic Conformal Prediction with Approximate Conditional Validity
Vincent Plassier, Alexander Fishkov, Mohsen Guizani et al.
Forewarned is Forearmed: Harnessing LLMs for Data Synthesis via Failure-induced Exploration
Qintong Li, Jiahui Gao, Sheng Wang et al.
Transformers Provably Learn Two-Mixture of Linear Classification via Gradient Flow
Hongru Yang, Zhangyang Wang, Jason Lee et al.
Safety-Prioritizing Curricula for Constrained Reinforcement Learning
Cevahir Koprulu, Thiago Simão, Nils Jansen et al.
InfoGS: Efficient Structure-Aware 3D Gaussians via Lightweight Information Shaping
Yunchao Zhang, Guandao Yang, Leonidas Guibas et al.
Robust Root Cause Diagnosis using In-Distribution Interventions
Lokesh Nagalapatti, Ashutosh Srivastava, Sunita Sarawagi et al.
IV-mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
Shitong Shao, Zikai Zhou, Lichen Bai et al.
Animate Your Thoughts: Reconstruction of Dynamic Natural Vision from Human Brain Activity
Yizhuo Lu, Changde Du, Chong Wang et al.
Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate
Yexiang Liu, Jie Cao, Zekun Li et al.
MatExpert: Decomposing Materials Discovery By Mimicking Human Experts
Qianggang Ding, Santiago Miret, Bang Liu
Multi-objective Differentiable Neural Architecture Search
Rhea Sukthanker, Arber Zela, Benedikt Staffler et al.
Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues
Riccardo Grazzi, Julien Siems, Arber Zela et al.
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Sakshi, Utkarsh Tyagi, Sonal Kumar et al.
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL
Mathias Jackermeier, Alessandro Abate
CAX: Cellular Automata Accelerated in JAX
Maxence Faldor, Antoine Cully
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Angelika Romanou, Negar Foroutan, Anna Sotnikova et al.
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
Juncheng Li, Kaihang Pan, Zhiqi Ge et al.
Towards domain-invariant Self-Supervised Learning with Batch Styles Standardization
Marin Scalbert, Maria Vakalopoulou, Florent Couzinie-Devy
SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training
Kazem Meidani, Parshin Shojaee, Chandan Reddy et al.
Transformer-Modulated Diffusion Models for Probabilistic Multivariate Time Series Forecasting
Yuxin Li, Wenchao Chen, Xinyue Hu et al.
Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting
Xinlu Zhang, Shiyang Li, Xianjun Yang et al.
Intelligent Switching for Reset-Free RL
Darshan Patil, Janarthanan Rajendran, Glen Berseth et al.
Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game
Simin Li, Jun Guo, Jingqiao Xiu et al.
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models
Keming Lu, Hongyi Yuan, Zheng Yuan et al.
Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators
Daniel Geng, Andrew Owens
Exploring Target Representations for Masked Autoencoders
Xingbin Liu, Jinghao Zhou, Tao Kong et al.
Neural Language of Thought Models
Yi-Fu Wu, Minseung Lee, Sungjin Ahn
Statistical Rejection Sampling Improves Preference Optimization
Tianqi Liu, Yao Zhao, Rishabh Joshi et al.
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
Qingru Zhang, Chandan Singh, Liyuan Liu et al.
Privacy Amplification for Matrix Mechanisms
Christopher Choquette-Choo, Arun Ganesh, Thomas Steinke et al.
Constrained Bi-Level Optimization: Proximal Lagrangian Value Function Approach and Hessian-free Algorithm
Wei Yao, Chengming Yu, Shangzhi Zeng et al.
Geometry-Aware Projective Mapping for Unbounded Neural Radiance Fields
Junoh Lee, Hyunjun Jung, Jinhwi Park et al.
Identifiable Latent Polynomial Causal Models through the Lens of Change
Yuhang Liu, Zhen Zhang, Dong Gong et al.
Adaptive Regret for Bandits Made Possible: Two Queries Suffice
Zhou Lu, Qiuyi (Richard) Zhang, Xinyi Chen et al.
Thin-Shell Object Manipulations With Differentiable Physics Simulations
Yian Wang, Juntian Zheng, Zhehuan Chen et al.
Bayesian Coreset Optimization for Personalized Federated Learning
Prateek Chanda, Shrey Modi, Ganesh Ramakrishnan
Beyond Spatio-Temporal Representations: Evolving Fourier Transform for Temporal Graphs
Anson Simon Bastos, Kuldeep Singh, Abhishek Nadgeri et al.
Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs
Woomin Song, Seunghyuk Oh, Sangwoo Mo et al.
On the Over-Memorization During Natural, Robust and Catastrophic Overfitting
Runqi Lin, Chaojian Yu, Bo Han et al.
Mastering Memory Tasks with World Models
Mohammad Reza Samsami, Artem Zholus, Janarthanan Rajendran et al.
Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond
Tianxin Wei, Bowen Jin, Ruirui Li et al.
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Yung-Sung Chuang, Yujia Xie, Hongyin Luo et al.
Augmenting Transformers with Recursively Composed Multi-grained Representations
Xiang Hu, Qingyang Zhu, Kewei Tu et al.
Learning Conditional Invariances through Non-Commutativity
Abhra Chaudhuri, Serban Georgescu, Anjan Dutta
Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation
Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis et al.
RobustTSF: Towards Theory and Design of Robust Time Series Forecasting with Anomalies
Hao Cheng, Qingsong Wen, Yang Liu et al.
ARGS: Alignment as Reward-Guided Search
Maxim Khanov, Jirayu Burapacheep, Yixuan Li
Let Models Speak Ciphers: Multiagent Debate through Embeddings
Chau Pham, Boyi Liu, Yingxiang Yang et al.
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-free Reinforcement Learning Updates
Nicholas Corrado, Josiah Hanna
Text-to-3D with Classifier Score Distillation
Xin Yu, Yuan-Chen Guo, Yangguang Li et al.
Dirichlet-based Per-Sample Weighting by Transition Matrix for Noisy Label Learning
HeeSun Bae, Seungjae Shin, Byeonghu Na et al.
Local Graph Clustering with Noisy Labels
Artur Back de Luca, Kimon Fountoulakis, Shenghao Yang
DistillSpec: Improving Speculative Decoding via Knowledge Distillation
Yongchao Zhou, Kaifeng Lyu, Ankit Singh Rawat et al.
Zero and Few-shot Semantic Parsing with Ambiguous Inputs
Elias Stengel-Eskin, Kyle Rawlins, Benjamin Van Durme
Large-Vocabulary 3D Diffusion Model with Transformer
Ziang Cao, Fangzhou Hong, Tong Wu et al.
Adversarial Attacks on Fairness of Graph Neural Networks
Binchi Zhang, Yushun Dong, Chen Chen et al.
Task structure and nonlinearity jointly determine learned representational geometry
Matteo Alleman, Jack Lindsey, Stefano Fusi
Graph Transformers on EHRs: Better Representation Improves Downstream Performance
Raphael Poulain, Rahmatollah Beheshti
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning
Ning Miao, Yee Whye Teh, Tom Rainforth
Improved Regret Bounds for Non-Convex Online-Within-Online Meta Learning
Jiechao Guan, Hui Xiong
Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit Sikchi, Rohan Chitnis, Ahmed Touati et al.
Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation
Jiaxu Wang, Ziyi Zhang, Renjing Xu
HoloNets: Spectral Convolutions do extend to Directed Graphs
Christian Koke, Daniel Cremers
Searching for High-Value Molecules Using Reinforcement Learning and Transformers
Raj Ghugare, Santiago Miret, Adriana Hugessen et al.
Interpretable Meta-Learning of Physical Systems
Matthieu Blanke, Marc Lelarge
Interpretable Sparse System Identification: Beyond Recent Deep Learning Techniques on Time-Series Prediction
Liu Xiaoyi, Duxin Chen, Wenjia Wei et al.
CircuitNet 2.0: An Advanced Dataset for Promoting Machine Learning Innovations in Realistic Chip Design Environment
Xun Jiang, Zhuomin Chai, Yuxiang Zhao et al.
Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?
Tokio Kajitsuka, Issei Sato
Self-Supervised Contrastive Learning for Long-term Forecasting
Junwoo Park, Daehoon Gwak, Jaegul Choo et al.
Rethinking CNN’s Generalization to Backdoor Attack from Frequency Domain
Quanrui Rao, Lin Wang, Wuying Liu
Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation
Zhilong Zhang, Yihao Sun, Junyin Ye et al.
VBH-GNN: Variational Bayesian Heterogeneous Graph Neural Networks for Cross-subject Emotion Recognition
Chenyu Liu, Xinliang Zhou, Zhengri Zhu et al.
Harnessing Density Ratios for Online Reinforcement Learning
Philip Amortila, Dylan Foster, Nan Jiang et al.
Improved Efficiency Based on Learned Saccade and Continuous Scene Reconstruction From Foveated Visual Sampling
Jiayang Liu, Yiming Bu, Daniel Tso et al.
Local Composite Saddle Point Optimization
Site Bai, Brian Bullins
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Rui Zheng, Wei Shen, Yuan Hua et al.
Improving LoRA in Privacy-preserving Federated Learning
Youbang Sun, Zitao Li, Yaliang Li et al.
Neural Neighborhood Search for Multi-agent Path Finding
Zhongxia Yan, Cathy Wu
How Does Unlabeled Data Provably Help Out-of-Distribution Detection?
Xuefeng Du, Zhen Fang, Ilias Diakonikolas et al.
GlucoBench: Curated List of Continuous Glucose Monitoring Datasets with Prediction Benchmarks
Renat Sergazinov, Elizabeth Chun, Valeriya Rogovchenko et al.
Look, Remember and Reason: Grounded Reasoning in Videos with Language Models
Apratim Bhattacharyya, Sunny Panchal, Reza Pourreza et al.
Pushing Boundaries: Mixup's Influence on Neural Collapse
Quinn Fisher, Haoming Meng, Vardan Papyan
LLCP: Learning Latent Causal Processes for Reasoning-based Video Question Answer
Guangyi Chen, Yuke Li, Xiao Liu et al.
Implicit regularization of deep residual networks towards neural ODEs
Pierre Marion, Yu-Han Wu, Michael Sander et al.
Compressing Latent Space via Least Volume
Qiuyi Chen, Mark Fuge
CoLiDE: Concomitant Linear DAG Estimation
Seyed Saman Saboksayr, Gonzalo Mateos, Mariano Tepper
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
Yinan Zheng, Jianxiong Li, Dongjie Yu et al.
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
Eliya Nachmani, Alon Levkovitch, Roy Hirsch et al.
Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning
Joey Hejna, Rafael Rafailov, Harshit Sikchi et al.
Unknown Domain Inconsistency Minimization for Domain Generalization
Seungjae Shin, HeeSun Bae, Byeonghu Na et al.
Finite Scalar Quantization: VQ-VAE Made Simple
Fabian Mentzer, David Minnen, Eirikur Agustsson et al.
Fixed-Budget Differentially Private Best Arm Identification
Zhirui Chen, P. N. Karthik, Yeow Meng Chee et al.
FreeDyG: Frequency Enhanced Continuous-Time Dynamic Graph Model for Link Prediction
Yuxing Tian, Yiyan Qi, Fan Guo
Contrastive Learning is Spectral Clustering on Similarity Graph
Zhiquan Tan, Yifan Zhang, Jingqin Yang et al.
LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models
Gunho Park, Baeseong Park, Minsub Kim et al.
True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning
Weihao Tan, Wentao Zhang, Shanqi Liu et al.
Dual Associated Encoder for Face Restoration
Yu-Ju Tsai, Yu-Lun Liu, Lu Qi et al.
Does Writing with Language Models Reduce Content Diversity?
Vishakh Padmakumar, He He
Few-shot Hybrid Domain Adaptation of Image Generator
Hengjia Li, Yang Liu, Linxuan Xia et al.
Adaptive Rational Activations to Boost Deep Reinforcement Learning
Quentin Delfosse, Patrick Schramowski, Martin Mundt et al.
Towards Meta-Pruning via Optimal Transport
Alexander Theus, Olin Geimer, Friedrich Wicke et al.
From Posterior Sampling to Meaningful Diversity in Image Restoration
Noa Cohen, Hila Manor, Yuval Bahat et al.
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng et al.
A Recipe for Improved Certifiable Robustness
Kai Hu, Klas Leino, Zifan Wang et al.
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
Yifu Yuan, Jianye Hao, Yi Ma et al.
Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity
Emmeran Johnson, Ciara Pike-Burke, Patrick Rebeschini
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace, Hugo Yèche, Bernhard Schoelkopf et al.
Label-free Node Classification on Graphs with Large Language Models (LLMs)
Zhikai Chen, Haitao Mao, Hongzhi Wen et al.
Function-space Parameterization of Neural Networks for Sequential Learning
Aidan Scannell, Riccardo Mereu, Paul Chang et al.
Boundary Denoising for Video Activity Localization
Mengmeng Xu, Mattia Soldan, Jialin Gao et al.
How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
Lorenzo Pacchiardi, Alex Chan, Sören Mindermann et al.
Alt-Text with Context: Improving Accessibility for Images on Twitter
Nikita Srivatsan, Sofia Samaniego, Omar Florez et al.
Combinatorial Bandits for Maximum Value Reward Function under Value-Index Feedback
Yiliu Wang, Wei Chen, Milan Vojnovic
Yet Another ICU Benchmark: A Flexible Multi-Center Framework for Clinical ML
Robin van de Water, Hendrik Schmidt, Paul Elbers et al.
PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen, Jincheng Yu, Chongjian Ge et al.
Consistency Training with Learnable Data Augmentation for Graph Anomaly Detection with Limited Supervision
Nan Chen, Zemin Liu, Bryan Hooi et al.
Improving equilibrium propagation without weight symmetry through Jacobian homeostasis
Axel Laborieux, Friedemann Zenke
Object-Aware Inversion and Reassembly for Image Editing
Zhen Yang, Ganggui Ding, Wen Wang et al.
What's In My Big Data?
Yanai Elazar, Akshita Bhagia, Ian Magnusson et al.
Minimum width for universal approximation using ReLU networks on compact domain
Namjun Kim, Chanho Min, Sejun Park
Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning
Na Li, Yuchen Jiao, Hangguan Shan et al.
Generative Human Motion Stylization in Latent Space
Chuan Guo, Yuxuan Mu, Xinxin Zuo et al.
Achieving the Pareto Frontier of Regret Minimization and Best Arm Identification in Multi-Armed Bandits
Wang Chi Cheung, Vincent Tan, Zixin Zhong
Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction
Renjie Pi, Lewei Yao, Jianhua Han et al.
Generalized Policy Iteration using Tensor Approximation for Hybrid Control
Suhan Shetty, Teng Xue, Sylvain Calinon
Diffusion Models for Multi-Task Generative Modeling
Changyou Chen, Han Ding, Bunyamin Sisman et al.
Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata- and Data-driven Reasoning
Ahmed Abdulaal, Adamos Hadjivasiliou, Nina Montaña-Brown et al.
FedCompass: Efficient Cross-Silo Federated Learning on Heterogeneous Client Devices Using a Computing Power-Aware Scheduler
Zilinghan Li, Pranshu Chaturvedi, Shilan He et al.
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni, Benjamin Eysenbach, Erfan Seyedsalehi et al.
Rethinking Information-theoretic Generalization: Loss Entropy Induced PAC Bounds
Yuxin Dong, Tieliang Gong, Hong Chen et al.
DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text
Xianjun Yang, Wei Cheng, Yue Wu et al.
SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models
Xin Zhang, Dong Zhang, Shimin Li et al.
Prompt Learning with Quaternion Networks
Boya Shi, Zhengqin Xu, Shuai Jia et al.
LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Lingfeng Liu, Dong Ni, Hangjie Yuan
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech
Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao et al.
Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks
Yanbo Wang, Jian Liang, Ran He
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs
Kaixuan Ji, Qingyue Zhao, Jiafan He et al.
Ensemble Distillation for Unsupervised Constituency Parsing
Behzad Shayegh, Yanshuai Cao, Xiaodan Zhu et al.
Multi-modal Gaussian Process Variational Autoencoders for Neural and Behavioral Data
Rabia Gondur, Usama Bin Sikandar, Evan Schaffer et al.
ContextRef: Evaluating Referenceless Metrics for Image Description Generation
Elisa Kreiss, Eric Zelikman et al.
Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies
Haanvid Lee, Tri Wahyu Guntara, Jongmin Lee et al.
Polynomial Width is Sufficient for Set Representation with High-dimensional Features
Peihao Wang, Shenghao Yang, Shu Li et al.
Adversarial Causal Bayesian Optimization
Scott Sussex, Pier Giuseppe Sessa, Anastasia Makarova et al.
A Dynamical View of the Question of Why
Mehdi Fatemi, Sindhu Chatralinganadoddi Mariyappa Gowda
GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction
Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri et al.
Generating Images with 3D Annotations Using Diffusion Models
Wufei Ma, Qihao Liu, Jiahao Wang et al.
Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers
Awni Altabaa, Taylor Webb, Jonathan Cohen et al.
Interpretable Diffusion via Information Decomposition
Xianghao Kong, Ollie Liu, Han Li et al.
LQ-LoRA: Low-rank plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
Han Guo, Philip Greengard, Eric Xing et al.
Grounding Language Plans in Demonstrations Through Counterfactual Perturbations
Yanwei Wang, Johnson (Tsun-Hsuan) Wang, Jiayuan Mao et al.
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
Shangbin Feng, Weijia Shi, Yuyang Bai et al.
AUGCAL: Improving Sim2Real Adaptation by Uncertainty Calibration on Augmented Synthetic Images
Prithvijit Chattopadhyay, Bharat Goyal, Boglarka Ecsedi et al.
FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators
Haiping Wang, Yuan Liu, Bing Wang et al.
Transferring Learning Trajectories of Neural Networks
Daiki Chijiwa
Rethinking Branching on Exact Combinatorial Optimization Solver: The First Deep Symbolic Discovery Framework
Yufei Kuang, Jie Wang, Haoyang Liu et al.
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Yizhi Li, Ruibin Yuan, Ge Zhang et al.
Beyond Memorization: Violating Privacy via Inference with Large Language Models
Robin Staab, Mark Vero, Mislav Balunovic et al.
How Realistic Is Your Synthetic Data? Constraining Deep Generative Models for Tabular Data
Mihaela Stoian, Salijona Dyrmishi, Maxime Cordy et al.
Masked Audio Generation using a Single Non-Autoregressive Transformer
Alon Ziv, Itai Gat, Gael Le Lan et al.
Sliced Wasserstein Estimation with Control Variates
Khai Nguyen, Nhat Ho
VONet: Unsupervised Video Object Learning With Parallel U-Net Attention and Object-wise Sequential VAE
Haonan Yu, Wei Xu
Relay Diffusion: Unifying diffusion process across resolutions for image synthesis
Jiayan Teng, Wendi Zheng, Ming Ding et al.
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens
Ziteng Gao, Zhan Tong, Limin Wang et al.
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision
Jiawei Yang, Boris Ivanovic, Or Litany et al.
Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding
Noam Levi, Alon Beck, Yohai Bar-Sinai
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners
Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen et al.
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram
Yeongyeon Na, Minje Park, Yunwon Tae et al.
CLAP: Collaborative Adaptation for Patchwork Learning
Sen Cui, Abudukelimu Wuerkaixi, Weishen Pan et al.
GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data
Sascha Marton, Stefan Lüdtke, Christian Bartelt et al.
Bayesian Low-rank Adaptation for Large Language Models
Adam Yang, Maxime Robeyns, Xi Wang et al.
CrossLoco: Human Motion Driven Control of Legged Robots via Guided Unsupervised Reinforcement Learning
Tianyu Li, Hyunyoung Jung, Matthew Gombolay et al.
TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting
Shiyu Wang, Haixu Wu, Xiaoming Shi et al.
Reverse Diffusion Monte Carlo
Xunpeng Huang, Hanze Dong, Yifan Hao et al.
Interpreting Robustness Proofs of Deep Neural Networks
Debangshu Banerjee, Avaljot Singh, Gagandeep Singh
Incentive-Aware Federated Learning with Training-Time Model Rewards
Zhaoxuan Wu, Mohammad Mohammadi Amiri, Ramesh Raskar et al.
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz, Seung Wook Kim, Jun Gao et al.
ArchLock: Locking DNN Transferability at the Architecture Level with a Zero-Cost Binary Predictor
Tong Zhou, Shaolei Ren, Xiaolin Xu
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu, Weili Nie, De-An Huang et al.
Efficient Sharpness-Aware Minimization for Molecular Graph Transformer Models
Yili Wang, Kaixiong Zhou, Ninghao Liu et al.
When can transformers reason with abstract symbols?
Enric Boix-Adserà, Omid Saremi, Emmanuel Abbe et al.
On Adversarial Training without Perturbing all Examples
Max Losch, Mohamed Omran, David Stutz et al.
Provable Compositional Generalization for Object-Centric Learning
Thaddäus Wiedemer, Jack Brady, Alexander Panfilov et al.
Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching
Aleksandar Makelov, Georg Lange, Atticus Geiger et al.
Defining and extracting generalizable interaction primitives from DNNs
Lu Chen, Siyu Lou, Benhao Huang et al.
Spatially-Aware Transformers for Embodied Agents
Junmo Cho, Jaesik Yoon, Sungjin Ahn
Seer: Language Instructed Video Prediction with Latent Diffusion Models
Xianfan Gu, Chuan Wen, Weirui Ye et al.
Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts
Huy Nguyen, Pedram Akbarian Saravi, Fanqi Yan et al.
NOLA: Compressing LoRA using Linear Combination of Random Basis
Soroush Abbasi Koohpayegani, K L Navaneet, Parsa Nooralinejad et al.
Fast Hyperboloid Decision Tree Algorithms
Philippe Chlenski, Ethan Turok, Antonio Moretti et al.
TiC-CLIP: Continual Training of CLIP Models
Saurabh Garg, Mehrdad Farajtabar, Hadi Pouransari et al.
Tree Cross Attention
Leo Feng, Frederick Tung, Hossein Hajimirsadeghi et al.
Neur2RO: Neural Two-Stage Robust Optimization
Justin Dumouchelle, Esther Julien, Jannis Kurtz et al.