Positional Information from Single Cell RNASeq Data via Manifold Learning

Jeremy Lauro; Boris I Shraiman; Madhav Mani; Nicholas Noll

Positional Information from Single Cell RNASeq Data via Manifold Learning

ORAL

Abstract

The connection between developmental cell fate and embryonic position has led many to propose that early gene expression patterns encode an intrinsic coordinate system within each cell. The spatial patterns of early developmental gene expression have been mapped out in several model systems, and recent work has addressed the inverse problem of inferring the position of a cell from its gene expression with the aid of pre-existing maps quantifying the expression of "marker" genes as a function of position. In our work, we pursue the goal of de novo inference of positional information, based on the assumption of continuity in gene expression space combined with the hypothesis that gene expression data approximately lies on a low dimensional manifold. Our method involves two stages: (i) a preliminary dimensional reduction step to de-noise the data and (ii) a machine learning algorithm to parameterize a low-dimensional manifold that approximates the high-dimensional sequencing data. We do not assume an isometry between the high-dimensional dataset and its low-dimensional representation; instead, we take a novel approach by enforcing monotonicity in neighborhood rankings. We verify our methods on early Drosophila embryos using FISH and scRNAseq data, finding good linear agreement between positional coordinates and our learned transcriptomic manifold.

March 18, 2025, 3:42 PM – March 18, 2025, 3:54 PM

Presenters

Jeremy Lauro

University of California, Santa Barbara

Authors

Jeremy Lauro

University of California, Santa Barbara
Boris I Shraiman

University of California, Santa Barbara
Madhav Mani

Northwestern University, Northwestern
Nicholas Noll

Karius