Life Insurance Prediction and Its Sustainability Using Machine Learning Approach
Owning life insurance coverage that is not enough to pay for the expenses is called underinsurance, and it has been found to have a significant influence on the sustainability and financial health of families. However, insurance companies need to have a good profile of potential policyholders. Custo...
Published in: | Sustainability (Switzerland) |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Multidisciplinary Digital Publishing Institute (MDPI)
2023
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85164958209&doi=10.3390%2fsu151310737&partnerID=40&md5=d3abb99cde72ab4334099d3ac2931cae |
id |
2-s2.0-85164958209 |
---|---|
spelling |
2-s2.0-85164958209 Shamsuddin S.N.; Ismail N.; Nur-Firyal R. Life Insurance Prediction and Its Sustainability Using Machine Learning Approach 2023 Sustainability (Switzerland) 15 13 10.3390/su151310737 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85164958209&doi=10.3390%2fsu151310737&partnerID=40&md5=d3abb99cde72ab4334099d3ac2931cae Owning life insurance coverage that is not enough to pay for the expenses is called underinsurance, and it has been found to have a significant influence on the sustainability and financial health of families. However, insurance companies need to have a good profile of potential policyholders. Customer profiling has become one of the essential marketing strategies for any sustainable business, such as the insurance market, to identify potential life insurance purchasers. One well-known method of carrying out customer profiling and segmenting is machine learning. Hence, this study aims to provide a helpful framework for predicting potential life insurance policyholders using a data mining approach with different sampling methods and to lead to a transition to sustainable life insurance industry development. Various samplings, such as the Synthetic Minority Over-sampling Technique, Randomly Under-Sampling, and ensemble (bagging and boosting) techniques, are proposed to handle the imbalanced dataset. The result reveals that the decision tree is the best performer according to ROC and, according to balanced accuracy, F1 score, and GM comparison, Naïve Bayes seems to be the best performer. It is also found that ensemble models do not guarantee high performance in this imbalanced dataset. However, the ensembled and sampling method plays a significant role in overcoming the imbalanced problem. © 2023 by the authors. Multidisciplinary Digital Publishing Institute (MDPI) 20711050 English Article All Open Access; Gold Open Access |
author |
Shamsuddin S.N.; Ismail N.; Nur-Firyal R. |
spellingShingle |
Shamsuddin S.N.; Ismail N.; Nur-Firyal R. Life Insurance Prediction and Its Sustainability Using Machine Learning Approach |
author_facet |
Shamsuddin S.N.; Ismail N.; Nur-Firyal R. |
author_sort |
Shamsuddin S.N.; Ismail N.; Nur-Firyal R. |
title |
Life Insurance Prediction and Its Sustainability Using Machine Learning Approach |
title_short |
Life Insurance Prediction and Its Sustainability Using Machine Learning Approach |
title_full |
Life Insurance Prediction and Its Sustainability Using Machine Learning Approach |
title_fullStr |
Life Insurance Prediction and Its Sustainability Using Machine Learning Approach |
title_full_unstemmed |
Life Insurance Prediction and Its Sustainability Using Machine Learning Approach |
title_sort |
Life Insurance Prediction and Its Sustainability Using Machine Learning Approach |
publishDate |
2023 |
container_title |
Sustainability (Switzerland) |
container_volume |
15 |
container_issue |
13 |
doi_str_mv |
10.3390/su151310737 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85164958209&doi=10.3390%2fsu151310737&partnerID=40&md5=d3abb99cde72ab4334099d3ac2931cae |
description |
Owning life insurance coverage that is not enough to pay for the expenses is called underinsurance, and it has been found to have a significant influence on the sustainability and financial health of families. However, insurance companies need to have a good profile of potential policyholders. Customer profiling has become one of the essential marketing strategies for any sustainable business, such as the insurance market, to identify potential life insurance purchasers. One well-known method of carrying out customer profiling and segmenting is machine learning. Hence, this study aims to provide a helpful framework for predicting potential life insurance policyholders using a data mining approach with different sampling methods and to lead to a transition to sustainable life insurance industry development. Various samplings, such as the Synthetic Minority Over-sampling Technique, Randomly Under-Sampling, and ensemble (bagging and boosting) techniques, are proposed to handle the imbalanced dataset. The result reveals that the decision tree is the best performer according to ROC and, according to balanced accuracy, F1 score, and GM comparison, Naïve Bayes seems to be the best performer. It is also found that ensemble models do not guarantee high performance in this imbalanced dataset. However, the ensembled and sampling method plays a significant role in overcoming the imbalanced problem. © 2023 by the authors. |
publisher |
Multidisciplinary Digital Publishing Institute (MDPI) |
issn |
20711050 |
language |
English |
format |
Article |
accesstype |
All Open Access; Gold Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677777842995200 |