The Effectiveness of a Probabilistic Principal Component Analysis Model and Expectation Maximisation Algorithm in Treating Missing Daily Rainfall Data
The reliability and accuracy of a risk assessment of extreme hydro-meteorological events are highly dependent on the quality of the historical rainfall time series data. However, missing data in a time series such as this could result in lower quality data. Therefore, this paper proposes a multiple-...
Published in: | Asia-Pacific Journal of Atmospheric Sciences |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Korean Meteorological Society
2020
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85067665731&doi=10.1007%2fs13143-019-00135-8&partnerID=40&md5=3f3a317155735cc93610d1e59e0197cb |
id |
2-s2.0-85067665731 |
---|---|
spelling |
2-s2.0-85067665731 Chuan Z.L.; Deni S.M.; Fam S.-F.; Ismail N. The Effectiveness of a Probabilistic Principal Component Analysis Model and Expectation Maximisation Algorithm in Treating Missing Daily Rainfall Data 2020 Asia-Pacific Journal of Atmospheric Sciences 56 1 10.1007/s13143-019-00135-8 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85067665731&doi=10.1007%2fs13143-019-00135-8&partnerID=40&md5=3f3a317155735cc93610d1e59e0197cb The reliability and accuracy of a risk assessment of extreme hydro-meteorological events are highly dependent on the quality of the historical rainfall time series data. However, missing data in a time series such as this could result in lower quality data. Therefore, this paper proposes a multiple-imputation algorithm for treating missing data without requiring information from adjoining monitoring stations. The proposed imputation algorithms are based on the M-component probabilistic principal component analysis model and an expectation maximisation algorithm (MPPCA-EM). In order to evaluate the effectiveness of the MPPCA-EM imputation algorithm, six distinct historical daily rainfall time series data were recorded from six monitoring stations. These stations were located at the coastal and inland regions of the East-Coast Economic Region (ECER) Malaysia. The results of analysis show that, when it comes to treating missing historical daily rainfall time series data recorded from coastal monitoring stations, the 2-component probabilistic principal component analysis model and expectation-maximisation algorithm (2PPCA-EM) were found to be superior to the single- and multiple-imputation algorithms proposed in previous studies. On the contrary, the single-imputation algorithms as proposed in previous studies were superior to the MPPCA-EM imputation algorithms when treating missing historical daily rainfall time series data recorded from inland monitoring stations. © 2019, Korean Meteorological Society and Springer Nature B.V. Korean Meteorological Society 19767633 English Article |
author |
Chuan Z.L.; Deni S.M.; Fam S.-F.; Ismail N. |
spellingShingle |
Chuan Z.L.; Deni S.M.; Fam S.-F.; Ismail N. The Effectiveness of a Probabilistic Principal Component Analysis Model and Expectation Maximisation Algorithm in Treating Missing Daily Rainfall Data |
author_facet |
Chuan Z.L.; Deni S.M.; Fam S.-F.; Ismail N. |
author_sort |
Chuan Z.L.; Deni S.M.; Fam S.-F.; Ismail N. |
title |
The Effectiveness of a Probabilistic Principal Component Analysis Model and Expectation Maximisation Algorithm in Treating Missing Daily Rainfall Data |
title_short |
The Effectiveness of a Probabilistic Principal Component Analysis Model and Expectation Maximisation Algorithm in Treating Missing Daily Rainfall Data |
title_full |
The Effectiveness of a Probabilistic Principal Component Analysis Model and Expectation Maximisation Algorithm in Treating Missing Daily Rainfall Data |
title_fullStr |
The Effectiveness of a Probabilistic Principal Component Analysis Model and Expectation Maximisation Algorithm in Treating Missing Daily Rainfall Data |
title_full_unstemmed |
The Effectiveness of a Probabilistic Principal Component Analysis Model and Expectation Maximisation Algorithm in Treating Missing Daily Rainfall Data |
title_sort |
The Effectiveness of a Probabilistic Principal Component Analysis Model and Expectation Maximisation Algorithm in Treating Missing Daily Rainfall Data |
publishDate |
2020 |
container_title |
Asia-Pacific Journal of Atmospheric Sciences |
container_volume |
56 |
container_issue |
1 |
doi_str_mv |
10.1007/s13143-019-00135-8 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85067665731&doi=10.1007%2fs13143-019-00135-8&partnerID=40&md5=3f3a317155735cc93610d1e59e0197cb |
description |
The reliability and accuracy of a risk assessment of extreme hydro-meteorological events are highly dependent on the quality of the historical rainfall time series data. However, missing data in a time series such as this could result in lower quality data. Therefore, this paper proposes a multiple-imputation algorithm for treating missing data without requiring information from adjoining monitoring stations. The proposed imputation algorithms are based on the M-component probabilistic principal component analysis model and an expectation maximisation algorithm (MPPCA-EM). In order to evaluate the effectiveness of the MPPCA-EM imputation algorithm, six distinct historical daily rainfall time series data were recorded from six monitoring stations. These stations were located at the coastal and inland regions of the East-Coast Economic Region (ECER) Malaysia. The results of analysis show that, when it comes to treating missing historical daily rainfall time series data recorded from coastal monitoring stations, the 2-component probabilistic principal component analysis model and expectation-maximisation algorithm (2PPCA-EM) were found to be superior to the single- and multiple-imputation algorithms proposed in previous studies. On the contrary, the single-imputation algorithms as proposed in previous studies were superior to the MPPCA-EM imputation algorithms when treating missing historical daily rainfall time series data recorded from inland monitoring stations. © 2019, Korean Meteorological Society and Springer Nature B.V. |
publisher |
Korean Meteorological Society |
issn |
19767633 |
language |
English |
format |
Article |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1814778507048255488 |