Prediction of Glass Transition Temperature of Polymers Using Simple Machine Learning
Polymer materials have garnered significant attention due to their exceptional mechanical properties and diverse industrial applications. Understanding the glass transition temperature (Tg) of polymers is critical to prevent operational failures at specific temperatures. Traditional methods for meas...
Published in: | Polymers |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Multidisciplinary Digital Publishing Institute (MDPI)
2024
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85203853854&doi=10.3390%2fpolym16172464&partnerID=40&md5=0175c44a415a6ed53ceadef2f2adbd5e |
id |
2-s2.0-85203853854 |
---|---|
spelling |
2-s2.0-85203853854 Fatriansyah J.F.; Linuwih B.D.P.; Andreano Y.; Sari I.S.; Federico A.; Anis M.; Surip S.N.; Jaafar M. Prediction of Glass Transition Temperature of Polymers Using Simple Machine Learning 2024 Polymers 16 17 10.3390/polym16172464 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85203853854&doi=10.3390%2fpolym16172464&partnerID=40&md5=0175c44a415a6ed53ceadef2f2adbd5e Polymer materials have garnered significant attention due to their exceptional mechanical properties and diverse industrial applications. Understanding the glass transition temperature (Tg) of polymers is critical to prevent operational failures at specific temperatures. Traditional methods for measuring Tg, such as differential scanning calorimetry (DSC) and dynamic mechanical analysis, while accurate, are often time-consuming, costly, and susceptible to inaccuracies due to random and uncertain factors. To address these limitations, the aim of the present study is to investigate the potential of Simplified Molecular Input Line Entry System (SMILES) as descriptors in simple machine learning models to predict Tg efficiently and reliably. Five models were utilized: k-nearest neighbors (KNNs), support vector regression (SVR), extreme gradient boosting (XGBoost), artificial neural network (ANN), and recurrent neural network (RNN). SMILES descriptors were converted into numerical data using either One Hot Encoding (OHE) or Natural Language Processing (NLP). The study found that SMILES inputs with fewer than 200 characters were inadequate for accurately describing compound structures, while inputs exceeding 200 characters diminished model performance due to the curse of dimensionality. The ANN model achieved the highest R2 value of 0.79; however, the XGB model, with an R2 value of 0.774, exhibited the highest stability and shorter training times compared to other models, making it the preferred choice for Tg prediction. The efficiency of the OHE method over NLP was demonstrated by faster training times across the KNN, SVR, XGB, and ANN models. Validation of new polymer data showed the XGB model’s robustness, with an average prediction deviation of 9.76 from actual Tg values. These findings underscore the importance of optimizing SMILES conversion methods and model parameters to enhance prediction reliability. Future research should focus on improving model accuracy and generalizability by incorporating additional features and advanced techniques. This study contributes to the development of efficient and reliable predictive models for polymer properties, facilitating the design and application of new polymer materials. © 2024 by the authors. Multidisciplinary Digital Publishing Institute (MDPI) 20734360 English Article All Open Access; Gold Open Access |
author |
Fatriansyah J.F.; Linuwih B.D.P.; Andreano Y.; Sari I.S.; Federico A.; Anis M.; Surip S.N.; Jaafar M. |
spellingShingle |
Fatriansyah J.F.; Linuwih B.D.P.; Andreano Y.; Sari I.S.; Federico A.; Anis M.; Surip S.N.; Jaafar M. Prediction of Glass Transition Temperature of Polymers Using Simple Machine Learning |
author_facet |
Fatriansyah J.F.; Linuwih B.D.P.; Andreano Y.; Sari I.S.; Federico A.; Anis M.; Surip S.N.; Jaafar M. |
author_sort |
Fatriansyah J.F.; Linuwih B.D.P.; Andreano Y.; Sari I.S.; Federico A.; Anis M.; Surip S.N.; Jaafar M. |
title |
Prediction of Glass Transition Temperature of Polymers Using Simple Machine Learning |
title_short |
Prediction of Glass Transition Temperature of Polymers Using Simple Machine Learning |
title_full |
Prediction of Glass Transition Temperature of Polymers Using Simple Machine Learning |
title_fullStr |
Prediction of Glass Transition Temperature of Polymers Using Simple Machine Learning |
title_full_unstemmed |
Prediction of Glass Transition Temperature of Polymers Using Simple Machine Learning |
title_sort |
Prediction of Glass Transition Temperature of Polymers Using Simple Machine Learning |
publishDate |
2024 |
container_title |
Polymers |
container_volume |
16 |
container_issue |
17 |
doi_str_mv |
10.3390/polym16172464 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85203853854&doi=10.3390%2fpolym16172464&partnerID=40&md5=0175c44a415a6ed53ceadef2f2adbd5e |
description |
Polymer materials have garnered significant attention due to their exceptional mechanical properties and diverse industrial applications. Understanding the glass transition temperature (Tg) of polymers is critical to prevent operational failures at specific temperatures. Traditional methods for measuring Tg, such as differential scanning calorimetry (DSC) and dynamic mechanical analysis, while accurate, are often time-consuming, costly, and susceptible to inaccuracies due to random and uncertain factors. To address these limitations, the aim of the present study is to investigate the potential of Simplified Molecular Input Line Entry System (SMILES) as descriptors in simple machine learning models to predict Tg efficiently and reliably. Five models were utilized: k-nearest neighbors (KNNs), support vector regression (SVR), extreme gradient boosting (XGBoost), artificial neural network (ANN), and recurrent neural network (RNN). SMILES descriptors were converted into numerical data using either One Hot Encoding (OHE) or Natural Language Processing (NLP). The study found that SMILES inputs with fewer than 200 characters were inadequate for accurately describing compound structures, while inputs exceeding 200 characters diminished model performance due to the curse of dimensionality. The ANN model achieved the highest R2 value of 0.79; however, the XGB model, with an R2 value of 0.774, exhibited the highest stability and shorter training times compared to other models, making it the preferred choice for Tg prediction. The efficiency of the OHE method over NLP was demonstrated by faster training times across the KNN, SVR, XGB, and ANN models. Validation of new polymer data showed the XGB model’s robustness, with an average prediction deviation of 9.76 from actual Tg values. These findings underscore the importance of optimizing SMILES conversion methods and model parameters to enhance prediction reliability. Future research should focus on improving model accuracy and generalizability by incorporating additional features and advanced techniques. This study contributes to the development of efficient and reliable predictive models for polymer properties, facilitating the design and application of new polymer materials. © 2024 by the authors. |
publisher |
Multidisciplinary Digital Publishing Institute (MDPI) |
issn |
20734360 |
language |
English |
format |
Article |
accesstype |
All Open Access; Gold Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1812871793914937344 |