Pairwise Similarity of Polymer Ensembles

Jiale Shi; Dylan Walsh; Debra J Audus; Bradley D Olsen

Pairwise Similarity of Polymer Ensembles

POSTER

Abstract

In contrast to small molecules having well-defined molecular structures, polymers are rarely a single, well-defined structure. Instead, they are ensembles composed of polymer chains with various sequences, topologies, and molar mass distributions. While there are numerous approaches to measuring the pairwise similarity of small molecules, accurately calculating the pairwise similarity between polymer ensembles is still challenging. A common approach is to generate a single embedding vector for each ensemble and subsequently compute the distance between these vectors to yield a similarity score. For example, the embedding of a random copolymer can be defined by a weighted average of embedding vectors of the repeat units. However, these common average approaches neglect significant features. Here, we explicitly consider all the entities of the ensembles in the calculation of the similarity score and show that our method captures differences that are neglected using the average method, enabling us to resolve subtle differences between molecular distributions. Our method presents a critical step to quantitatively calculate polymer ensemble similarity and enable nearest neighbor search in polymer database.

Presenters

Jiale Shi

Massachusetts Institute of Technology

Authors

Jiale Shi

Massachusetts Institute of Technology
Dylan Walsh

Massachusetts Institute of Technology
Debra J Audus

NIST
Bradley D Olsen

Massachusetts Institute of Technology MIT, Massachusetts Institute of Technology