Optimized Feature Selection and Transformations for Early Stage Prediction of Autism Using Supervised Machine Learning Models

Autism Spectrum Disorder (ASD) is a neurodevelopmental syndrome which cannot be curable but can be predicted in early stage. Early prediction and cure may help to diagnose the autism. In existing methods, prediction of best feature is not identified for detecting the autism in early stage. In this p...

Full description

Bibliographic Details
Published in:International Journal of Modern Education and Computer Science
Main Author: Praveena K.N.; Mahalakshmi R.; Manjunath C.; Zubair A.F.; Karthikeyan P.
Format: Article
Language:English
Published: Modern Education and Computer Science Press 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85179371094&doi=10.5815%2fijmecs.2023.06.06&partnerID=40&md5=d8ab9275a627dcbb246c38e58ac9de4d
id 2-s2.0-85179371094
spelling 2-s2.0-85179371094
Praveena K.N.; Mahalakshmi R.; Manjunath C.; Zubair A.F.; Karthikeyan P.
Optimized Feature Selection and Transformations for Early Stage Prediction of Autism Using Supervised Machine Learning Models
2023
International Journal of Modern Education and Computer Science
15
6
10.5815/ijmecs.2023.06.06
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85179371094&doi=10.5815%2fijmecs.2023.06.06&partnerID=40&md5=d8ab9275a627dcbb246c38e58ac9de4d
Autism Spectrum Disorder (ASD) is a neurodevelopmental syndrome which cannot be curable but can be predicted in early stage. Early prediction and cure may help to diagnose the autism. In existing methods, prediction of best feature is not identified for detecting the autism in early stage. In this proposed research, prediction of ASD has been done by identifying the best feature transformation technique with best ML classifier and finding out the most significant feature for diagnosis of autism in early age. Early-detected ASD datasets pertaining to toddler and child are collected and applied few Feature transformation techniques, comprising log, power-box-cox and yeo-Johnson transformations to these datasets. Then, using these ASD datasets, several classification approaches were applied, and their efficiency was evaluated. Adaboost given 100% accuracy for toddler dataset and whereas, Random forest showed 98.3% accuracy for child datasets. The feature transformations ensuing the best prediction was Log, Power-Box cox and Yeo-Johnson Transformation for toddler and Log transformation for children datasets. After these exploration, various feature selection techniques like univariate (UNI) and recursive feature elimination (RFE) are applied to these transformed datasets to recognize the most significant ASD risk feature to predict the autism in early stage for toddler and child data. It is found that A5 feature is most significant feature for toddler, A4 stands most significant feature for child based on univariate and RFE. This benefits the doctor to provide the suitable diagnosis in their early stage of life. The results of these logical methodologies show that ML methods can yield precise predictions of ASD when they are accurately optimised. This shows that using these models for early ASD detection may be feasible. © 2023, Modern Education and Computer Science Press. All rights reserved.
Modern Education and Computer Science Press
20750161
English
Article
All Open Access; Gold Open Access
author Praveena K.N.; Mahalakshmi R.; Manjunath C.; Zubair A.F.; Karthikeyan P.
spellingShingle Praveena K.N.; Mahalakshmi R.; Manjunath C.; Zubair A.F.; Karthikeyan P.
Optimized Feature Selection and Transformations for Early Stage Prediction of Autism Using Supervised Machine Learning Models
author_facet Praveena K.N.; Mahalakshmi R.; Manjunath C.; Zubair A.F.; Karthikeyan P.
author_sort Praveena K.N.; Mahalakshmi R.; Manjunath C.; Zubair A.F.; Karthikeyan P.
title Optimized Feature Selection and Transformations for Early Stage Prediction of Autism Using Supervised Machine Learning Models
title_short Optimized Feature Selection and Transformations for Early Stage Prediction of Autism Using Supervised Machine Learning Models
title_full Optimized Feature Selection and Transformations for Early Stage Prediction of Autism Using Supervised Machine Learning Models
title_fullStr Optimized Feature Selection and Transformations for Early Stage Prediction of Autism Using Supervised Machine Learning Models
title_full_unstemmed Optimized Feature Selection and Transformations for Early Stage Prediction of Autism Using Supervised Machine Learning Models
title_sort Optimized Feature Selection and Transformations for Early Stage Prediction of Autism Using Supervised Machine Learning Models
publishDate 2023
container_title International Journal of Modern Education and Computer Science
container_volume 15
container_issue 6
doi_str_mv 10.5815/ijmecs.2023.06.06
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85179371094&doi=10.5815%2fijmecs.2023.06.06&partnerID=40&md5=d8ab9275a627dcbb246c38e58ac9de4d
description Autism Spectrum Disorder (ASD) is a neurodevelopmental syndrome which cannot be curable but can be predicted in early stage. Early prediction and cure may help to diagnose the autism. In existing methods, prediction of best feature is not identified for detecting the autism in early stage. In this proposed research, prediction of ASD has been done by identifying the best feature transformation technique with best ML classifier and finding out the most significant feature for diagnosis of autism in early age. Early-detected ASD datasets pertaining to toddler and child are collected and applied few Feature transformation techniques, comprising log, power-box-cox and yeo-Johnson transformations to these datasets. Then, using these ASD datasets, several classification approaches were applied, and their efficiency was evaluated. Adaboost given 100% accuracy for toddler dataset and whereas, Random forest showed 98.3% accuracy for child datasets. The feature transformations ensuing the best prediction was Log, Power-Box cox and Yeo-Johnson Transformation for toddler and Log transformation for children datasets. After these exploration, various feature selection techniques like univariate (UNI) and recursive feature elimination (RFE) are applied to these transformed datasets to recognize the most significant ASD risk feature to predict the autism in early stage for toddler and child data. It is found that A5 feature is most significant feature for toddler, A4 stands most significant feature for child based on univariate and RFE. This benefits the doctor to provide the suitable diagnosis in their early stage of life. The results of these logical methodologies show that ML methods can yield precise predictions of ASD when they are accurately optimised. This shows that using these models for early ASD detection may be feasible. © 2023, Modern Education and Computer Science Press. All rights reserved.
publisher Modern Education and Computer Science Press
issn 20750161
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809678476645498880