Most Cited ICLR "portable model updates" Papers
Quantum (Inspired) $D^2$-sampling with Applications
Poojan Shah, Ragesh Jaiswal
Local Steps Speed Up Local GD for Heterogeneous Distributed Logistic Regression
Michael Crawshaw, Blake Woodworth, Mingrui Liu
Long-tailed Adversarial Training with Self-Distillation
Seungju Cho, Hongsin Lee, Changick Kim
More Experts Than Galaxies: Conditionally-Overlapping Experts with Biologically-Inspired Fixed Routing
Sagi Shaier, Francisco Pereira, Katharina Kann et al.
Batch normalization is sufficient for universal function approximation in CNNs
Rebekka Burkholz
Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape View
Kaiyue Wen, Zhiyuan Li, Jason Wang et al.
The Ramanujan Library - Automated Discovery on the Hypergraph of Integer Relations
Itay Beit Halachmi, Ido Kaminer
LLCP: Learning Latent Causal Processes for Reasoning-based Video Question Answer
Guangyi Chen, Yuke Li, Xiao Liu et al.
Fiber Monte Carlo
Nick Richardson, Deniz Oktay, Yaniv Ovadia et al.
The Expressive Power of Transformers with Chain of Thought
William Merrill, Ashish Sabharwal
Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models
Shikun Sun, Longhui Wei, Zhicai Wang et al.
Linear Recurrences Accessible to Everyone
Felix Sarnthein
Efficient and Robust Neural Combinatorial Optimization via Wasserstein-Based Coresets
Xu Wang, Fuyou Miao, Wenjie Liu et al.
Compressing Latent Space via Least Volume
Qiuyi Chen, Mark Fuge
Bayesian Image Regression with Soft-thresholded Conditional Autoregressive Prior
Yuliang Xu, Jian Kang
Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos
Yufan Zhou, Zhaobo Qi, Lingshuai Lin et al.
An interpretable error correction method for enhancing code-to-code translation
Min Xue, Artur Andrzejak, Marla Leuther
Self-Supervised Diffusion Models for Electron-Aware Molecular Representation Learning
Gyoung S. Na, Chanyoung Park
A Unified Framework for Bayesian Optimization under Contextual Uncertainty
Sebastian Shenghong Tay, Chuan-Sheng Foo, Daisuke Urano et al.
Learning Large DAGs is Harder than you Think: Many Losses are Minimal for the Wrong DAG
Jonas Seng, Matej Zečević, Devendra Singh Dhami et al.
PhiNets: Brain-inspired Non-contrastive Learning Based on Temporal Prediction Hypothesis
Satoki Ishikawa, Makoto Yamada, Han Bao et al.
Active Retrosynthetic Planning Aware of Route Quality
Luotian Yuan, Yemin Yu, Ying Wei et al.
Zero-Mean Regularized Spectral Contrastive Learning: Implicitly Mitigating Wrong Connections in Positive-Pair Graphs
Xiong Zhou, Xianming Liu, feilong zhang et al.
Neural Field Classifiers via Target Encoding and Classification Loss
Xindi Yang, Zeke Xie, Xiong Zhou et al.
Safety-Prioritizing Curricula for Constrained Reinforcement Learning
Cevahir Koprulu, Thiago Simão, Nils Jansen et al.
Identifying latent state transitions in non-linear dynamical systems
Çağlar Hızlı, Çağatay Yıldız, Matthias Bethge et al.
GaussianAnything: Interactive Point Cloud Flow Matching for 3D Generation
Yushi LAN, Shangchen Zhou, Zhaoyang Lyu et al.
Sparse components distinguish visual pathways & their alignment to neural networks
Ammar I Marvi, Nancy Kanwisher, Meenakshi Khosla
Are Bert Family Good Instruction Followers? A Study on Their Potential And Limitations
yisheng xiao, Juntao Li, Zechen Sun et al.
Synergistic Patch Pruning for Vision Transformer: Unifying Intra- & Inter-Layer Patch Importance
Yuyao Zhang, Lan Wei, Nikolaos Freris
Scaling In-the-Wild Training for Diffusion-based Illumination Harmonization and Editing by Imposing Consistent Light Transport
Lvmin Zhang, Anyi Rao, Maneesh Agrawala
On Bias-Variance Alignment in Deep Models
Lin Chen, Michal Lukasik, Wittawat Jitkrittum et al.
Support is All You Need for Certified VAE Training
Changming Xu, Debangshu Banerjee, Deepak Vasisht et al.
Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning
Joey Hejna, Rafael Rafailov, Harshit Sikchi et al.
Transformers Provably Learn Two-Mixture of Linear Classification via Gradient Flow
Hongru Yang, Zhangyang Wang, Jason Lee et al.
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”
Lukas Berglund, Meg Tong, Maximilian Kaufmann et al.
Variance-enlarged Poisson Learning for Graph-based Semi-Supervised Learning with Extremely Sparse Labeled Data
Xiong Zhou, Xianming Liu, Hao Yu et al.
Forewarned is Forearmed: Harnessing LLMs for Data Synthesis via Failure-induced Exploration
Qintong Li, Jiahui Gao, Sheng Wang et al.
CheapNet: Cross-attention on Hierarchical representations for Efficient protein-ligand binding Affinity Prediction
Hyukjun Lim, Sun Kim, Sangseon Lee
Factual Context Validation and Simplification: A Scalable Method to Enhance GPT Trustworthiness and Efficiency
Tianyi Huang
Solving Homogeneous and Heterogeneous Cooperative Tasks with Greedy Sequential Execution
Shanqi Liu, Dong Xing, Pengjie Gu et al.
Century: A Framework and Dataset for Evaluating Historical Contextualisation of Sensitive Images
Canfer Akbulut, Kevin Robinson, Maribeth Rauh et al.
FreeDyG: Frequency Enhanced Continuous-Time Dynamic Graph Model for Link Prediction
Yuxing Tian, Yiyan Qi, Fan Guo
Concept Bottleneck Generative Models
Aya Abdelsalam Ismail, Julius Adebayo, Hector Corrada Bravo et al.
TC-MoE: Augmenting Mixture of Experts with Ternary Expert Choice
Shen Yan, Xingyan Bin, Sijun Zhang et al.
Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games
Stephen McAleer, John Banister Lanier, Kevin A. Wang et al.
A Computational Framework for Modeling Emergence of Color Vision in the Human Brain
Atsunobu Kotani, Yi-Ren Ng
Synthesizing Realistic fMRI: A Physiological Dynamics-Driven Hierarchical Diffusion Model for Efficient fMRI Acquisition
Yufan Hu, Jiang, Wuyang Li et al.
Improving Neural Network Accuracy by Concurrently Training with a Twin Network
Benjamin Vandersmissen, Lucas Deckers, Jose Oramas
DeepSPF: Spherical SO(3)-Equivariant Patches for Scan-to-CAD Estimation
Driton Salihu, Adam Misik, Yuankai Wu et al.
The Trickle-down Impact of Reward Inconsistency on RLHF
Lingfeng Shen, Sihao Chen et al.
StringLLM: Understanding the String Processing Capability of Large Language Models
Xilong Wang, Hao Fu, Jindong Wang et al.
Probabilistic Adaptation of Black-Box Text-to-Video Models
Sherry Yang, Yilun Du, Bo Dai et al.
Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency
Yannis Kalantidis, Mert Bulent SARIYILDIZ, Rafael Rezende et al.
DreamClean: Restoring Clean Image Using Deep Diffusion Prior
Jie Xiao, Ruili Feng, Han Zhang et al.
How to visualize training dynamics in neural networks
Michael Hu, Shreyans Jain, Sangam Chaulagain et al.
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning
Hai Zhang, Boyuan Zheng, Tianying Ji et al.
Learning and aligning single-neuron invariance manifolds in visual cortex
Mohammad Bashiri, Luca Baroni, Ján Antolík et al.
AstroCompress: A benchmark dataset for multi-purpose compression of astronomical data
Tuan Truong, Rithwik Sudharsan, Yibo Yang et al.
True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning
Weihao Tan, Wentao Zhang, Shanqi Liu et al.
Improving Deep Regression with Tightness
Shihao Zhang, Yuguang Yan, Angela Yao
How much of my dataset did you use? Quantitative Data Usage Inference in Machine Learning
Yao Tong, Jiayuan Ye, Sajjad Zarifzadeh et al.
Private Mechanism Design via Quantile Estimation
Yuanyuan Yang, Tao Xiao, Bhuvesh Kumar et al.
Value-aligned Behavior Cloning for Offline Reinforcement Learning via Bi-level Optimization
Xingyu Jiang, Ning Gao, Xiuhui Zhang et al.
Watch Less, Do More: Implicit Skill Discovery for Video-Conditioned Policy
Wang, Zongqing Lu
RB-Modulation: Training-Free Stylization using Reference-Based Modulation
Litu Rout, Yujia Chen, Nataniel Ruiz et al.
Weaker MVI Condition: Extragradient Methods with Multi-Step Exploration
Yifeng Fan, Yongqiang Li, Bo Chen
ZeRO++: Extremely Efficient Collective Communication for Large Model Training
Guanhua Wang, Heyang Qin, Sam Jacobs et al.
Hybrid Directional Graph Neural Network for Molecules
Junyi An, Chao Qu, Zhipeng Zhou et al.
On the Hardness of Online Nonconvex Optimization with Single Oracle Feedback
Ziwei Guan, Yi Zhou, Yingbin Liang
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
Hao-Tang Tsui, Chien-Yao Wang, Hong-Yuan Liao
SFS: Smarter Code Space Search improves LLM Inference Scaling
Jonathan Light, Yue Wu, Yiyou Sun et al.
Mastering Task Arithmetic: $\tau$Jp as a Key Indicator for Weight Disentanglement
Kotaro Yoshida, Yuji Naraki, Takafumi Horie et al.
Recovery of Causal Graph Involving Latent Variables via Homologous Surrogates
Xiuchuan Li, Jun Wang, Tongliang Liu
MMD Graph Kernel: Effective Metric Learning for Graphs via Maximum Mean Discrepancy
Yan Sun, Jicong Fan
An improved analysis of per-sample and per-update clipping in federated learning
Bo Li, Xiaowen Jiang, Mikkel N. Schmidt et al.
Class Probability Matching with Calibrated Networks for Label Shift Adaption
Hongwei Wen, Annika Betken, Hanyuan Hang
Towards Offline Opponent Modeling with In-context Learning
Yuheng Jing, Kai Li, Bingyun Liu et al.
Mechanistic Interpretability Meets Vision Language Models: Insights and Limitations
Yiming Liu, Yuhui Zhang, Serena Yeung
Learning Polynomial Problems with $SL(2, \mathbb{R})$-Equivariance
Hannah Lawrence, Mitchell Harris
$\texttt{NAISR}$: A 3D Neural Additive Model for Interpretable Shape Representation
Yining Jiao, Carlton ZDANSKI, Julia Kimbell et al.
InterpGNN: Understand and Improve Generalization Ability of Transdutive GNNs through the Lens of Interplay between Train and Test Nodes
Jiawei Sun, Kailai Li, Ruoxin Chen et al.
A Progressive Training Framework for Spiking Neural Networks with Learnable Multi-hierarchical Model
Zecheng Hao, Xinyu Shi, Zihan Huang et al.
Generative Learning for Financial Time Series with Irregular and Scale-Invariant Patterns
Hongbin Huang, Minghua Chen, Xiao Qiao
Chain-of-Thought Provably Enables Learning the (Otherwise) Unlearnable
Chenxiao Yang, Zhiyuan Li, David Wipf
Scaling Long Context Training Data by Long-Distance Referrals
Yonghao Zhuang, Lanxiang Hu, Longfei Yun et al.
Grammar Reinforcement Learning: path and cycle counting in graphs with a Context-Free Grammar and Transformer approach
Jason Piquenot, Maxime Berar, Romain Raveaux et al.
Bridging the Gap between Variational Inference and Stochastic Gradient MCMC in Function Space
Mengjing Wu, Junyu Xuan, Jie Lu
Learning to Relax: Setting Solver Parameters Across a Sequence of Linear System Instances
Mikhail Khodak, Edmond Chow, Nina Balcan et al.
Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models
Pablo Pernías, Dominic Rampas, Mats L. Richter et al.
Medium-Difficulty Samples Constitute Smoothed Decision Boundary for Knowledge Distillation on Pruned Datasets
Yudong Chen, Xuwei Xu, Frank de Hoog et al.
Understanding Convergence and Generalization in Federated Learning through Feature Learning Theory
Wei Huang, Ye Shi, Zhongyi Cai et al.
Robust System Identification: Finite-sample Guarantees and Connection to Regularization
Hank Park, Grani A. Hanasusanto, Yingying Li
GDrag: Towards General-Purpose Interactive Editing with Anti-ambiguity Point Diffusion
Xiaojian Lin, Hanhui Li, Yuhao Cheng et al.
Gaussian-Based Instance-Adaptive Intensity Modeling for Point-Supervised Facial Expression Spotting
Yicheng Deng, Hideaki Hayashi, Hajime Nagahara
BP-Modified Local Loss for Efficient Training of Deep Neural Networks
REN Lianhai, Qianxiao Li
ST-GCond: Self-supervised and Transferable Graph Dataset Condensation
Beining Yang, Qingyun Sun, Cheng Ji et al.
Simulating Training Dynamics to Reconstruct Training Data from Deep Neural Networks
Hanling Tian, Yuhang Liu, Mingzhen He et al.
Sparse MoE with Language Guided Routing for Multilingual Machine Translation
Xinyu Zhao, Xuxi Chen, Yu Cheng et al.
Neural Architecture Retrieval
Xiaohuan Pei, Yanxi Li, Minjing Dong et al.
Neural SDF Flow for 3D Reconstruction of Dynamic Scenes
wei mao, Richard Hartley, Mathieu Salzmann et al.
A new framework for evaluating model out-of-distribution generalisation for the biochemical domain
Raul Fernandez-Diaz, Hoang Thanh Lam, Vanessa Lopez et al.
Numerical Accounting in the Shuffle Model of Differential Privacy
Antti Koskela, Antti Honkela, Mikko Heikkilä
Unsupervised Multiple Kernel Learning for Graphs via Ordinality Preservation
Yan Sun, Stanley Kok
ADOPD: A Large-Scale Document Page Decomposition Dataset
Jiuxiang Gu, Xiangxi Shi, Jason Kuen et al.
Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian distributions
Frank Cole, Yulong Lu
Avoid Overclaims: Summary of Complexity Bounds for Algorithms in Minimization and Minimax Optimization
Siqi Zhang, Yifan Hu
To the Cutoff... and Beyond? A Longitudinal Perspective on LLM Data Contamination
Manley Roberts, Himanshu Thakur, Christine Herlihy et al.
XAIguiFormer: explainable artificial intelligence guided transformer for brain disorder identification
Hanning Guo, Farah Abdellatif, Yu Fu et al.
Minimalistic Predictions for Online Class Constraint Scheduling
Dorian Guyot, Alexandra Lassota
Mutual Effort for Efficiency: A Similarity-based Token Pruning for Vision Transformers in Self-Supervised Learning
Sheng Li, Qitao Tan, Yue Dai et al.
Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks
Wangjia Yu, Xiaomeng Fu, Qiao Li et al.
Multi-Resolution Diffusion Models for Time Series Forecasting
Lifeng Shen, Weiyu Chen, James Kwok
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes
Ruiquan Huang, Yuan Cheng, Jing Yang et al.
Straightness of Rectified Flow: A Theoretical Insight into Wasserstein Convergence
Saptarshi Roy, Vansh Bansal, Purnamrita Sarkar et al.
Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure?
Charles Dawson, Van Tran, Max Li et al.
Multi-Label Node Classification with Label Influence Propagation
Yifei Sun, Zemin Liu, Bryan Hooi et al.
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
David Bell, Yujie Lu, Shinda Huang et al.
Like Oil and Water: Group Robustness Methods and Poisoning Defenses May Be at Odds
Michael-Andrei Panaitescu-Liess, Yigitcan Kaya, Sicheng Zhu et al.
Collaborative Discrete-Continuous Black-Box Prompt Learning for Language Models
Hualin Zhang, Haozhen Zhang, Zhekai Liu et al.
RESuM: A Rare Event Surrogate Model for Physics Detector Design
Ann-Kathrin Schuetz, Alan Poon, Aobo Li
SPD Attack - Prevention of AI Powered Image Editing by Image Immunization
Parth Badgujar, Shorya Singhal, Devansh Bhardwaj
Vision and Language Synergy for Rehearsal Free Continual Learning
Muhammad Anwar Masum, Mahardhika Pratama, Savitha Ramasamy et al.
SOREL: A Stochastic Algorithm for Spectral Risks Minimization
Yuze Ge, Rujun Jiang
Federated Few-Shot Class-Incremental Learning
Muhammad Anwar Masum, Mahardhika Pratama, Lin Liu et al.
FairDen: Fair Density-Based Clustering
Lena Krieger, Anna Beer, Pernille Matthews et al.
Building Blocks of Differentially Private Training
Mahmoud Hegazy, Aymeric Dieuleveut
Boundary Denoising for Video Activity Localization
Mengmeng Xu, Mattia Soldan, Jialin Gao et al.
SIMPL: Scalable and hassle-free optimisation of neural representations from behaviour
Tom George, Pierre Glaser, Kimberly Stachenfeld et al.
Generative Learning for Solving Non-Convex Problem with Multi-Valued Input-Solution Mapping
Enming Liang, Minghua Chen
Deep Networks Learn Features From Local Discontinuities in the Label Function
Prithaj Banerjee, Harish G Ramaswamy, Mahesh Yadav et al.
On Trajectory Augmentations for Off-Policy Evaluation
Ge Gao, Qitong Gao, Xi Yang et al.
Learn hybrid prototypes for multivariate time series anomaly detection
Ke-Yuan Shen
Graph-Constrained Diffusion for End-to-End Path Planning
DINGYUAN SHI, Yongxin Tong, Zimu Zhou et al.
Combinatorial Bandits for Maximum Value Reward Function under Value-Index Feedback
Yiliu Wang, Wei Chen, Milan Vojnovic
From Complexity to Clarity: Analytical Expressions of Deep Neural Network Weights via Clifford Algebra and Convexity
Mert Pilanci
On the Fourier analysis in the SO(3) space : the EquiLoPO Network
Dmitrii Zhemchuzhnikov, Sergei Grudinin
ImplicitSLIM and How it Improves Embedding-based Collaborative Filtering
Ilya Shenbin, Sergey Nikolenko
PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen, Jincheng YU, Chongjian GE et al.
Balancing Bias in Two-sided Markets for Fair Stable Matchings
Siyuan Wu, Leong Hou U, Panagiotis Karras
Convergence of Bayesian Bilevel Optimization
Shi Fu, Fengxiang He, Xinmei Tian et al.
Towards more rigorous evaluations of language models
Desi R Ivanova, Ilija Ilievski, Momchil Konstantinov
Consistency Training with Learnable Data Augmentation for Graph Anomaly Detection with Limited Supervision
Nan Chen, Zemin Liu, Bryan Hooi et al.
Rotation Has Two Sides: Evaluating Data Augmentation for Deep One-class Classification
Guodong Wang, Yunhong Wang, Xiuguo Bao et al.
Digi-Q: Learning VLM Q-Value Functions for Training Device-Control Agents
Hao Bai, Yifei Zhou, Li Li et al.
Lion Secretly Solves a Constrained Optimization: As Lyapunov Predicts
Lizhang Chen, Bo Liu, Kaizhao Liang et al.
Aligned LLMs Are Not Aligned Browser Agents
Priyanshu Kumar, Elaine Lau, Saranya Vijayakumar et al.
On the Effect of Batch Size in Byzantine-Robust Distributed Learning
Yi-Rui Yang, Chang-Wei Shi, Wu-Jun Li
Three-in-One: Fast and Accurate Transducer for Hybrid-Autoregressive ASR
Hainan Xu, Travis Bartley, Vladimir Bataev et al.
Node2ket: Efficient High-Dimensional Network Embedding in Quantum Hilbert Space
Hao Xiong, Yehui Tang, Yunlin He et al.
Towards LLM4QPE: Unsupervised Pretraining of Quantum Property Estimation and A Benchmark
Yehui Tang, Hao Xiong, Nianzu Yang et al.
Sensitivity Verification for Additive Decision Tree Ensembles
Arhaan Ahmad, Tanay Tayal, Ashutosh Gupta et al.
Diffusion Models and Gaussian Flow Matching: Two Sides of the Same Coin
Ruiqi Gao, Emiel Hoogeboom, Jonathan Heek et al.
PAE: Reinforcement Learning from External Knowledge for Efficient Exploration
Zhe Wu, Haofei Lu, Junliang Xing et al.
LOIRE: LifelOng learning on Incremental data via pre-trained language model gRowth Efficiently
Xue Han, Yitong Wang, Junlan Feng et al.
Continual Slow-and-Fast Adaptation of Latent Neural Dynamics (CoSFan): Meta-Learning What-How & When to Adapt
Ryan Missel, Linwei Wang
Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning
Na Li, Yuchen Jiao, Hangguan Shan et al.
Sketching for Convex and Nonconvex Regularized Least Squares with Sharp Guarantees
Yingzhen Yang, Ping Li
Problem-Parameter-Free Federated Learning
Wenjing Yan, Kai Zhang, Xiaolu Wang et al.
From Decoupling to Adaptive Transformation: a Wider Optimization Space for PTQ
Zhaojing Wen, Qiulin Zhang, Yuan Zhang et al.
Exploiting Hidden Symmetry to Improve Objective Perturbation for DP Linear Learners with a Nonsmooth L1-Norm
Du Chen, Geoffrey A. Chua
Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD
Ze Peng, Jian Zhang, Yisen Wang et al.
Reveal Object in Lensless Photography via Region Gaze and Amplification
Xiangjun Yin, Huihui Yue
Fast Imitation via Behavior Foundation Models
Matteo Pirotta, Andrea Tirinzoni, Ahmed Touati et al.
Provably Safeguarding a Classifier from OOD and Adversarial Samples
Nicolas Atienza, Johanne Cohen, Christophe Labreuche et al.
Federated Text-driven Prompt Generation for Vision-Language Models
Chen Qiu, Xingyu Li, Chaithanya Kumar Mummadi et al.
Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction
Renjie Pi, Lewei Yao, Jianhua Han et al.
GANDALF: Generative AttentioN based Data Augmentation and predictive modeLing Framework for personalized cancer treatment
Aishwarya Jayagopal, Yanrong Zhang, Robert Walsh et al.
Reward Dimension Reduction for Scalable Multi-Objective Reinforcement Learning
Giseung Park, Youngchul Sung
Finding and Only Finding Differential Nash Equilibria by Both Pretending to be a Follower
Guodong Zhang, Xuchan Bao
Less is More: Fewer Interpretable Region via Submodular Subset Selection
Ruoyu Chen, Hua Zhang, Siyuan Liang et al.
On the Inherent Privacy Properties of Discrete Denoising Diffusion Models
Eli Chien, Pan Li, Vamsi Potluru et al.
Generalized Policy Iteration using Tensor Approximation for Hybrid Control
Suhan Shetty, Teng Xue, Sylvain Calinon
ACTIVE: Offline Reinforcement Learning via Adaptive Imitation and In-sample $V$-Ensemble
Tianyuan Chen, Ronglong Cai, Faguo Wu et al.
On LLM Knowledge Distillation - A Comparison between Forward KL and Reverse KL
Yihan Cao, Yanbin Kang
Diffusion Models for Multi-Task Generative Modeling
Changyou Chen, Han Ding, Bunyamin Sisman et al.
Score-based free-form architectures for high-dimensional Fokker-Planck equations
Feng Liu, Faguo Wu, Xiao Zhang
Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata- and Data-driven Reasoning
Ahmed Abdulaal, Adamos Hadjivasiliou, Nina Montaña-Brown et al.
Select before Act: Spatially Decoupled Action Repetition for Continuous Control
Buqing Nie, Yangqing Fu, Yue Gao
Fugatto 1: Foundational Generative Audio Transformer Opus 1
Rafael Valle, Rohan Badlani, Zhifeng Kong et al.
Efficient Interpolation between Extragradient and Proximal Methods for Weak MVIs
Thomas Pethick, Ioannis Mavrothalassitis, Volkan Cevher
Latent 3D Graph Diffusion
Yuning You, Ruida Zhou, Jiwoong Park et al.
Does Progress On Object Recognition Benchmarks Improve Generalization on Crowdsourced, Global Data?
Megan Richards, Polina Kirichenko, Diane Bouchacourt et al.
Scalable Monotonic Neural Networks
Hyunho Kim, Jong-Seok Lee
Understanding Methods for Scalable MCTS
Will Knipe
Combining Axes Preconditioners through Kronecker Approximation for Deep Learning
Venkata Sai Surya Subramanyam Duvvuri, Fnu Devvrit, Rohan Anil et al.
RetroInText: A Multimodal Large Language Model Enhanced Framework for Retrosynthetic Planning via In-Context Representation Learning
Chenglong Kang, Xiaoyi Liu, Fei Guo
FedTrans: Client-Transparent Utility Estimation for Robust Federated Learning
Mingkun Yang, Ran Zhu, Qing Wang et al.
The mechanistic basis of data dependence and abrupt learning in an in-context classification task
Gautam Reddy Nallamala
NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling
Kun Wang, Hao Wu, Yifan Duan et al.
Rethinking Information-theoretic Generalization: Loss Entropy Induced PAC Bounds
Yuxin Dong, Tieliang Gong, Hong Chen et al.
SOInter: A Novel Deep Energy-Based Interpretation Method for Explaining Structured Output Models
S. Fatemeh Seyyedsalehi, Mahdieh Baghshah, Hamid Rabiee
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models
Yuda Song, Hanlin Zhang, Carson Eisenach et al.
Connectome Mapping: Shape-Memory Network via Interpretation of Contextual Semantic Information
Kyungsu Lee, Haeyun Lee, Jae Youn Hwang
GAFormer: Enhancing Timeseries Transformers Through Group-Aware Embeddings
Jingyun Xiao, Ran Liu, Eva Dyer
Fat-to-Thin Policy Optimization: Offline Reinforcement Learning with Sparse Policies
Lingwei Zhu, Han Wang, Yukie Nagai
SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models
Xin Zhang, Dong Zhang, Shimin Li et al.
Prompt Learning with Quaternion Networks
Boya Shi, Zhengqin Xu, Shuai Jia et al.
Lost in Prediction: Why Social Media Narratives Don't Help Macroeconomic Forecasting?
Almog Gueta, Roi Reichart, Amir Feder et al.
Global Identifiability of Overcomplete Dictionary Learning via L1 and Volume Minimization
Yuchen Sun, Kejun Huang