A long-standing question in biology is whether multipotent somatic stem and progenitor cells (SSPCs) feature molecular properties that could guide their system-independent identification. Population-based transcriptomic studies have so far been unable to provide a definitive answer, given the rarity and heterogeneous nature of these cells. Here, we exploited the resolving power of single-cell RNA-sequencing to develop a computational model that accurately distinguishes SSPCs from differentiated cells across tissues. The resulting classifier is based on the combined expression of 23 genes, including known players in multipotency, proliferation, and tumorigenesis, as well as novel ones, such as Lcp1 and Vgll4, that we functionally validate in intestinal organoids. We show how this approach enables the identification of stem-like cells in still ambiguous systems such as the pancreas and the epidermis, as well as the exploration of lineage commitment hierarchies, thus facilitating the study of biological processes such as cellular differentiation, tissue regeneration, and cancer. Stem Cells 2017;35:2390-2402