APS Logo

Testing Fluorescent Protein Sequence Entropy for Correlation with Protein Properties

ORAL

Abstract

Physics, specifically the Second Law of Thermodynamics, governs protein folding. The resolution of the Levinthal Paradox shows that proteins do not explore all possible configurations randomly; instead they collapse into intermediate states and/or assume their lowest energy native states. The primary sequence dictates the final native states, but there is no solid theoretical framework for identifying the most important amino acids necessary for folding. Currently, multiple sequence alignments of all members of a protein family are assumed to be structurally/functionally important. Validation of such assumptions requires cumbersome computational analysis. Here, we present our initial findings on the relationship between amino acid sequence entropy and observable properties of fluorescent proteins. We used the centralized database FPbase of fluorescent protein (FP) characteristics to facilitate discovery and inferences about FP properties. The FPbase sequence analyses were done as follows: 1) sort FP by length and remove outliers, 2) align FP sequences, and 3) perform Shannon entropy calculations to quantify sequence variability. We concluded that the amino acids essential for FP structure and function typically have low entropy values. This research represents a significant step towards devising a statistical approach to identify key amino acids in FPs and improve our understanding of protein folding.

Presenters

  • Isabela E Camacho

    Santa Clara University

Authors

  • Isabela E Camacho

    Santa Clara University

  • Fernando Skaff

    University of Florida

  • Shyam Gajavelli

    University of Florida