Validity of the hierarchical continuity assumption in experimental scRNA-seq data
Determine to what extent the hierarchical continuity assumption—namely, that mean expression vectors of parent and child nodes in a lineage tree are close in Euclidean space, as operationalized by the continuity loss in the hierarchical k-means (h-k-means) and hierarchical Gaussian mixture model (h-GMM)—holds in experimental single-cell RNA-seq datasets, where differentiation may not be strictly hierarchical and the provided label hierarchy is manually curated.
References
While our assumption on the distribution of data in the nodes of the hierarchy is obviously satisfied for simulated data, it is not clear how this assumption holds in experimental datasets as the differentiation process might not always lead to a hierarchy and the hierarchical prior is usually established manually and might therefore be less reliable .