APS Logo

Interpretable and tractable probabilistic models for single-cell RNA sequencing

ORAL

Abstract



We outline a class of models of technical and biological variability for single-cell RNA sequencing (scRNA-seq). This technology can quantify the number of RNA molecules in millions of cells at a genome-wide scale. Due to the volume of the data, typical scRNA-seq analyses are ad hoc and descriptive, typically relying on normalization, background subtraction, and data filtering. We argue that the descriptive worldview is insufficient, as it oversimplifies the intrinsic variability in the transcriptional and experimental processes.

Our approach emphasizes a class of processes that are particularly suited to the analysis of scRNA-seq data. To facilitate biological discovery, these processes are interpretable: each parameter encodes the rate of a biophysical phenomenon. To enable use with large datasets, they are also tractable, translating the convolution of noise sources into the composition of generating functions. We present computationally facile solutions to generic biological systems, which couple stochastic transcriptional driving to downstream RNA processing. In addition, we outline probabilistic strategies for phenomena particular to scRNA-seq, such as non-ergodicity in developmental trajectories, background contamination, and variability in capturing individual molecules.

Publication: Gorin G, Yoshida SR, Pachter L. Transient and delay chemical master equations. bioRxiv; 2022 Oct.<br>Gorin G, Fang M, Chari T, Pachter L. RNA velocity unraveled. PLOS Computational Biology; 2022 Sep.<br>Gorin G, Pachter L. Monod: mechanistic analysis of single-cell RNA sequencing count data. bioRxiv; 2022 Feb.<br>Gorin G, Pachter L. Modeling bursty transcription and splicing with the chemical master equation. Biophysical Journal; 2022 Feb.<br>Gorin G*, Vastola JJ*, Fang M, Pachter L. Interpretable and tractable models of transcriptional noise for the rational design of single-molecule quantification experiments. bioRxiv; 2021 Sep. <br>Gorin G, Pachter L. Length Biases in Single-Cell RNA Sequencing of pre-mRNA. bioRxiv; 2021 Jul. <br>Gorin G, Pachter L. Intrinsic and extrinsic noise are distinguishable in a synthesis–export–degradation model of mRNA production. bioRxiv; 2020 Sep. <br><br>Planned: <br>Gorin G, Fang M, Pachter L. Modeling RNA dynamics by occupation measures.

Presenters

  • Gennady Gorin

    California Institute of Technology

Authors

  • Gennady Gorin

    California Institute of Technology

  • Lior Pachter

    California Institute of Technology, Caltech

  • Meichen Fang

    California Institute of Technology

  • John J Vastola

    Harvard Medical School