Prospective experimental validation of machine learning for biological sequence design
Invited
Abstract
Prediction of protein functional properties from sequence is a central challenge that would allow us to discover new proteins with specific functionality. Experimental breakthroughs allow data on the relationship between sequence and function to be rapidly acquired that can be used to train and validate machine learning models that predict protein function directly from sequence. However, the cost and latency of wet-lab experiments require methods that find good sequences in few experimental rounds, where each round contains large batches of sequence designs. In this setting, I will discuss model-based optimization approaches that allow us to take advantage of sample inefficient methods and find diverse optimal sequence candidates for experimental evaluation. The potential of this approach is illustrated through the design and experimental validation of viable AAV capsid protein variants for gene therapy applications in addition to the design and validation of peptides as potential therapeutics.
–
Presenters
-
Lucy Colwell
Univ of Cambridge
Authors
-
Lucy Colwell
Univ of Cambridge