Joint Hierarchical Representation Learning of Samples and Features via Informed Tree-Wasserstein Distance

0citations

arXiv:2501.03627

Citations

#2219

in NeurIPS 2025

of 5858 papers

Authors

Data Points

Authors

Ya-Wei Eileen Lin Ronald Coifman Gal Mishne Ronen Talmon

Abstract

High-dimensional data often exhibit hierarchical structures in both modes: samples and features. Yet, most existing approaches for hierarchical representation learning consider only one mode at a time. In this work, we propose an unsupervised method for jointly learning hierarchical representations of samples and features via Tree-Wasserstein Distance (TWD). Our method alternates between the two data modes. It first constructs a tree for one mode, then computes a TWD for the other mode based on that tree, and finally uses the resulting TWD to build the second mode’s tree. By repeatedly alternating through these steps, the method gradually refines both trees and the corresponding TWDs, capturing meaningful hierarchical representations of the data. We provide a theoretical analysis showing that our method converges. We show that our method can be integrated into hyperbolic graph convolutional networks as a pre-processing technique, improving performance in link prediction and node classification tasks. In addition, our method outperforms baselines in sparse approximation and unsupervised Wasserstein distance learning tasks on word-document and single-cell RNA-sequencing datasets.

Citation History

Jan 25, 2026

Jan 26, 2026

Jan 28, 2026