In this study, we employ machine learning techniques to improve materials in data scarcity conditions. In particular, we focus on the prediction of the cloud point temperatures of polymer–water systems with thermoresponsive behavior. We compare a model trained directly on the available data with a model based on representations learned through an encoder–decoder model, in turn pre-trained on a larger dataset to generate molecular fingerprints. Our results demonstrate that the embedding-based model significantly outperforms the direct model in predicting the cloud point temperature under the data limitations imposed by rigorous curation. This approach highlights the potential of domain-informed representation learning to tackle complex materials science problems with limited data.
Building similarity graph...
Analyzing shared references across papers
Loading...
Marcela Elisabeth Penoff
Facundo I. Altuna
Luis A. Miccio
Applied Sciences
Consejo Nacional de Investigaciones Científicas y Técnicas
National University of Mar del Plata
Instituto de Investigaciones en Ciencia y Tecnología de Materiales
Building similarity graph...
Analyzing shared references across papers
Loading...
Penoff et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69ada8dfbc08abd80d5bc4c8 — DOI: https://doi.org/10.3390/app16052557