Bootstrapping Simulation in Determining the Prognostic Factors of Lung Cancer Disease by Parametric Survival Analysis
Big data analytics focuses on getting useful insights, trends and pattern out of complex and large data. Increasing the sample by resampling the data, in biostatistics expertise, can be employed using the bootstrapping techniques. The world of bootstrapping is very large and expanding where it does...
Published in: | 2023 IEEE International Conference on Computing, ICOCO 2023 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2023
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184855389&doi=10.1109%2fICOCO59262.2023.10398056&partnerID=40&md5=1dccb55ed27db78ad16425fa619a0bd2 |
id |
2-s2.0-85184855389 |
---|---|
spelling |
2-s2.0-85184855389 Muhamad Jamil S.A.; Affendi Abdullah M.A.; Ibrahim N.; Mansor M.M.; Md Ghani N.A. Bootstrapping Simulation in Determining the Prognostic Factors of Lung Cancer Disease by Parametric Survival Analysis 2023 2023 IEEE International Conference on Computing, ICOCO 2023 10.1109/ICOCO59262.2023.10398056 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184855389&doi=10.1109%2fICOCO59262.2023.10398056&partnerID=40&md5=1dccb55ed27db78ad16425fa619a0bd2 Big data analytics focuses on getting useful insights, trends and pattern out of complex and large data. Increasing the sample by resampling the data, in biostatistics expertise, can be employed using the bootstrapping techniques. The world of bootstrapping is very large and expanding where it does not only compute the confidence interval but also perform a standard resampling method. Nevertheless, survival analysis study mostly allows the data to be not normally distributed because of the censored observations. Small number of samples also one of the reasons why this study has to perform bootstrapping to overcome the issues of biasness. Bootstrapping method is said to be one of the best methods in handling skewed data. Thus, by considering bootstrapping method, this study aims to find the most significant prognostic factors of lung cancer disease that affect the survival times with the presence of censored observations by using the parametric survival analysis. Therefore, based on 100, 150, 250 and 600 number of sampling sizes, exponential distribution appeared to fit all the assigned sample sizes. Weibull and log-logistic distribution seems to fit the data only for 100 number of samples. Races and two of the interaction terms in the model appeared to be the most significant prognostic factors affecting the survival time of lung cancer. © 2023 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Muhamad Jamil S.A.; Affendi Abdullah M.A.; Ibrahim N.; Mansor M.M.; Md Ghani N.A. |
spellingShingle |
Muhamad Jamil S.A.; Affendi Abdullah M.A.; Ibrahim N.; Mansor M.M.; Md Ghani N.A. Bootstrapping Simulation in Determining the Prognostic Factors of Lung Cancer Disease by Parametric Survival Analysis |
author_facet |
Muhamad Jamil S.A.; Affendi Abdullah M.A.; Ibrahim N.; Mansor M.M.; Md Ghani N.A. |
author_sort |
Muhamad Jamil S.A.; Affendi Abdullah M.A.; Ibrahim N.; Mansor M.M.; Md Ghani N.A. |
title |
Bootstrapping Simulation in Determining the Prognostic Factors of Lung Cancer Disease by Parametric Survival Analysis |
title_short |
Bootstrapping Simulation in Determining the Prognostic Factors of Lung Cancer Disease by Parametric Survival Analysis |
title_full |
Bootstrapping Simulation in Determining the Prognostic Factors of Lung Cancer Disease by Parametric Survival Analysis |
title_fullStr |
Bootstrapping Simulation in Determining the Prognostic Factors of Lung Cancer Disease by Parametric Survival Analysis |
title_full_unstemmed |
Bootstrapping Simulation in Determining the Prognostic Factors of Lung Cancer Disease by Parametric Survival Analysis |
title_sort |
Bootstrapping Simulation in Determining the Prognostic Factors of Lung Cancer Disease by Parametric Survival Analysis |
publishDate |
2023 |
container_title |
2023 IEEE International Conference on Computing, ICOCO 2023 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/ICOCO59262.2023.10398056 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184855389&doi=10.1109%2fICOCO59262.2023.10398056&partnerID=40&md5=1dccb55ed27db78ad16425fa619a0bd2 |
description |
Big data analytics focuses on getting useful insights, trends and pattern out of complex and large data. Increasing the sample by resampling the data, in biostatistics expertise, can be employed using the bootstrapping techniques. The world of bootstrapping is very large and expanding where it does not only compute the confidence interval but also perform a standard resampling method. Nevertheless, survival analysis study mostly allows the data to be not normally distributed because of the censored observations. Small number of samples also one of the reasons why this study has to perform bootstrapping to overcome the issues of biasness. Bootstrapping method is said to be one of the best methods in handling skewed data. Thus, by considering bootstrapping method, this study aims to find the most significant prognostic factors of lung cancer disease that affect the survival times with the presence of censored observations by using the parametric survival analysis. Therefore, based on 100, 150, 250 and 600 number of sampling sizes, exponential distribution appeared to fit all the assigned sample sizes. Weibull and log-logistic distribution seems to fit the data only for 100 number of samples. Races and two of the interaction terms in the model appeared to be the most significant prognostic factors affecting the survival time of lung cancer. © 2023 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677889116831744 |