Life Insurance Prediction and Its Sustainability Using Machine Learning Approach

Owning life insurance coverage that is not enough to pay for the expenses is called underinsurance, and it has been found to have a significant influence on the sustainability and financial health of families. However, insurance companies need to have a good profile of potential policyholders. Custo...

Full description

Bibliographic Details
Published in:Sustainability (Switzerland)
Main Author: Shamsuddin S.N.; Ismail N.; Nur-Firyal R.
Format: Article
Language:English
Published: Multidisciplinary Digital Publishing Institute (MDPI) 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85164958209&doi=10.3390%2fsu151310737&partnerID=40&md5=d3abb99cde72ab4334099d3ac2931cae
id 2-s2.0-85164958209
spelling 2-s2.0-85164958209
Shamsuddin S.N.; Ismail N.; Nur-Firyal R.
Life Insurance Prediction and Its Sustainability Using Machine Learning Approach
2023
Sustainability (Switzerland)
15
13
10.3390/su151310737
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85164958209&doi=10.3390%2fsu151310737&partnerID=40&md5=d3abb99cde72ab4334099d3ac2931cae
Owning life insurance coverage that is not enough to pay for the expenses is called underinsurance, and it has been found to have a significant influence on the sustainability and financial health of families. However, insurance companies need to have a good profile of potential policyholders. Customer profiling has become one of the essential marketing strategies for any sustainable business, such as the insurance market, to identify potential life insurance purchasers. One well-known method of carrying out customer profiling and segmenting is machine learning. Hence, this study aims to provide a helpful framework for predicting potential life insurance policyholders using a data mining approach with different sampling methods and to lead to a transition to sustainable life insurance industry development. Various samplings, such as the Synthetic Minority Over-sampling Technique, Randomly Under-Sampling, and ensemble (bagging and boosting) techniques, are proposed to handle the imbalanced dataset. The result reveals that the decision tree is the best performer according to ROC and, according to balanced accuracy, F1 score, and GM comparison, Naïve Bayes seems to be the best performer. It is also found that ensemble models do not guarantee high performance in this imbalanced dataset. However, the ensembled and sampling method plays a significant role in overcoming the imbalanced problem. © 2023 by the authors.
Multidisciplinary Digital Publishing Institute (MDPI)
20711050
English
Article
All Open Access; Gold Open Access
author Shamsuddin S.N.; Ismail N.; Nur-Firyal R.
spellingShingle Shamsuddin S.N.; Ismail N.; Nur-Firyal R.
Life Insurance Prediction and Its Sustainability Using Machine Learning Approach
author_facet Shamsuddin S.N.; Ismail N.; Nur-Firyal R.
author_sort Shamsuddin S.N.; Ismail N.; Nur-Firyal R.
title Life Insurance Prediction and Its Sustainability Using Machine Learning Approach
title_short Life Insurance Prediction and Its Sustainability Using Machine Learning Approach
title_full Life Insurance Prediction and Its Sustainability Using Machine Learning Approach
title_fullStr Life Insurance Prediction and Its Sustainability Using Machine Learning Approach
title_full_unstemmed Life Insurance Prediction and Its Sustainability Using Machine Learning Approach
title_sort Life Insurance Prediction and Its Sustainability Using Machine Learning Approach
publishDate 2023
container_title Sustainability (Switzerland)
container_volume 15
container_issue 13
doi_str_mv 10.3390/su151310737
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85164958209&doi=10.3390%2fsu151310737&partnerID=40&md5=d3abb99cde72ab4334099d3ac2931cae
description Owning life insurance coverage that is not enough to pay for the expenses is called underinsurance, and it has been found to have a significant influence on the sustainability and financial health of families. However, insurance companies need to have a good profile of potential policyholders. Customer profiling has become one of the essential marketing strategies for any sustainable business, such as the insurance market, to identify potential life insurance purchasers. One well-known method of carrying out customer profiling and segmenting is machine learning. Hence, this study aims to provide a helpful framework for predicting potential life insurance policyholders using a data mining approach with different sampling methods and to lead to a transition to sustainable life insurance industry development. Various samplings, such as the Synthetic Minority Over-sampling Technique, Randomly Under-Sampling, and ensemble (bagging and boosting) techniques, are proposed to handle the imbalanced dataset. The result reveals that the decision tree is the best performer according to ROC and, according to balanced accuracy, F1 score, and GM comparison, Naïve Bayes seems to be the best performer. It is also found that ensemble models do not guarantee high performance in this imbalanced dataset. However, the ensembled and sampling method plays a significant role in overcoming the imbalanced problem. © 2023 by the authors.
publisher Multidisciplinary Digital Publishing Institute (MDPI)
issn 20711050
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809677777842995200