Reading-out DNA translocation experiments with (un)supervised Machine Learning

Ángel Díaz Carral; Magnus Ostertag; Aleksandra Radenovic; Maria Fyta

Reading-out DNA translocation experiments with (un)supervised Machine Learning

ORAL

Abstract

DNA molecules can electrophoretically be driven through a nanoscale opening in a material giving rise to rich and measurable ionic current blockades. In this work, we train machine learning (ML) models on experimental ionic blockade data from DNA nucleotide translocation through 2D pores of different diameters. We propose a novel method that at the same time reduces the current traces to a few physical descriptors and trains low-complexity models. We describe each translocation event by four features referring to the structure of the ionic current time series. Training on this lower dimensional data and utilizing deep neural networks (DNN) and convolutional neural networks (CNN) we reach a high accuracy. Compared to more complex baseline schemes such as extreme gradient tree boosting (XGBoost) and recurrent neural network (RNN) based models trained on the full ionic current trace the former perform is comparable or even better. Our findings clearly reveal that the use of the ionic blockade height as a feature together with a proper combination of neural networks and feature extraction provides a strong enhancement in the detection and read-out sensitivity of novel nanopore sequencers.

March 16, 2021, 10:12 AM – March 16, 2021, 10:24 AM

Presenters

Ángel Díaz Carral

University of Stuttgart

Authors

Ángel Díaz Carral

University of Stuttgart
Magnus Ostertag

University of Stuttgart
Aleksandra Radenovic

École Polytechnique Fédérale de Lausanne, Ecole Polytechnique Federale de Lausanne, Laboratory of Nanoscale Biology, School of Engineering, Institute of Bioengineering, EPFL
Maria Fyta

University of Stuttgart