Pairwise Similarity of Polymer Ensembles
POSTER
Abstract
In contrast to small molecules having well-defined molecular structures, polymers are rarely a single, well-defined structure. Instead, they are ensembles composed of polymer chains with various sequences, topologies, and molar mass distributions. While there are numerous approaches to measuring the pairwise similarity of small molecules, accurately calculating the pairwise similarity between polymer ensembles is still challenging. A common approach is to generate a single embedding vector for each ensemble and subsequently compute the distance between these vectors to yield a similarity score. For example, the embedding of a random copolymer can be defined by a weighted average of embedding vectors of the repeat units. However, these common average approaches neglect significant features. Here, we explicitly consider all the entities of the ensembles in the calculation of the similarity score and show that our method captures differences that are neglected using the average method, enabling us to resolve subtle differences between molecular distributions. Our method presents a critical step to quantitatively calculate polymer ensemble similarity and enable nearest neighbor search in polymer database.
Presenters
-
Jiale Shi
Massachusetts Institute of Technology
Authors
-
Jiale Shi
Massachusetts Institute of Technology
-
Dylan Walsh
Massachusetts Institute of Technology
-
Debra J Audus
NIST
-
Bradley D Olsen
Massachusetts Institute of Technology MIT, Massachusetts Institute of Technology