Quantifying Similarity between Polymer Ensembles
ORAL
Abstract
Synthetic polymers are typically stochastic in nature, which means that instead of having a single well-defined structure, they are ensembles with distributions across molecular mass, topology and sequence. Thus, determining the similarity between polymers is significantly more challenging than for small molecules where a number of established methods exist. When dealing with polymer ensembles, a typical approach is to compute the embedding vector or fingerprint for monomer or polymer and then take the weighted average to yield a single embedding vector for the ensemble. This pre-averaging can obfuscate differences between ensembles and can erroneously yield predictions of perfect similarity for two dissimilar ensembles. Using inspiration from the informatics community, we adopt the Earth Mover's Distance (EMD) along with explicit calculation of distances between all constituents. As its name implies, EMD allows for computing the cost of moving an entity, earth or probability, from one location to another. We demonstrate the utility of this technique with a number of case studies ranging from copolymers to experimentally measured molecular mass distributions. Ultimately, similarity can be used for enhanced search in data resources such as a Community Resource for Innovation in Polymer Technology and to accelerate machine learning enabled polymer design.
* This work was primarily funded by the National Science Foundation Convergence Accelerator award number ITE-2134795.
–
Presenters
-
Debra J Audus
NIST
Authors
-
Debra J Audus
NIST
-
Jiale Shi
Massachusetts Institute of Technology
-
Dylan Walsh
Massachusetts Institute of Technology
-
Weizhong Zou
Massachusetts Institute of Technology
-
Nathan J Rebello
Massachusetts Institute of Technology
-
Michael E Deagen
Rensselaer Polytechnic Institute
-
Katharina Fransen
Massachusetts Institute of Technology
-
Xian Gao
University of Notre Dame
-
Bradley D Olsen
Massachusetts Institute of Technology MI, Massachusetts Institute of Technology