Consensus clustering and fuzzy classification for breast cancer prognosis
Extracting usable and useful knowledge from large and complex data sets is a difficult and challenging problem. In this paper, we show how two complementary techniques have been used to tackle this problem in the context of breast cancer. Diagnosis concerns the identification of cancer within a pati...
Published in: | Proceedings - 24th European Conference on Modelling and Simulation, ECMS 2010 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
European Council for Modelling and Simulation
2010
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84857934629&doi=10.7148%2f2010-0015-0022&partnerID=40&md5=21c9e8e42fc925e5b4c9ac30f80a07e5 |
id |
2-s2.0-84857934629 |
---|---|
spelling |
2-s2.0-84857934629 Garibaldi J.M.; Soria D.; Rasmani K.A. Consensus clustering and fuzzy classification for breast cancer prognosis 2010 Proceedings - 24th European Conference on Modelling and Simulation, ECMS 2010 10.7148/2010-0015-0022 https://www.scopus.com/inward/record.uri?eid=2-s2.0-84857934629&doi=10.7148%2f2010-0015-0022&partnerID=40&md5=21c9e8e42fc925e5b4c9ac30f80a07e5 Extracting usable and useful knowledge from large and complex data sets is a difficult and challenging problem. In this paper, we show how two complementary techniques have been used to tackle this problem in the context of breast cancer. Diagnosis concerns the identification of cancer within a patient; in contrast, prognosis concerns the prediction of the ongoing course of the disease, including issues such as the choice of potential treatments such as chemotherapy or drug therapy, in combination with estimation of chances (or length) of survival. Reliable prognosis depends on many factors, including the identification of the type of this heterogeneous disease. We first use a consensus clustering methodology to identify core, well-characterised sub-groups (or classes) of the disease based on a large database of protein biomarkers from over a thousand patients. We then use fuzzy rule induction and simplification algorithms to generate a simple, comprehensible set of rules for use in future model-based classification. The methods are described and their use is illustrated on real-world data. © ECMS. European Council for Modelling and Simulation English Conference paper All Open Access; Green Open Access |
author |
Garibaldi J.M.; Soria D.; Rasmani K.A. |
spellingShingle |
Garibaldi J.M.; Soria D.; Rasmani K.A. Consensus clustering and fuzzy classification for breast cancer prognosis |
author_facet |
Garibaldi J.M.; Soria D.; Rasmani K.A. |
author_sort |
Garibaldi J.M.; Soria D.; Rasmani K.A. |
title |
Consensus clustering and fuzzy classification for breast cancer prognosis |
title_short |
Consensus clustering and fuzzy classification for breast cancer prognosis |
title_full |
Consensus clustering and fuzzy classification for breast cancer prognosis |
title_fullStr |
Consensus clustering and fuzzy classification for breast cancer prognosis |
title_full_unstemmed |
Consensus clustering and fuzzy classification for breast cancer prognosis |
title_sort |
Consensus clustering and fuzzy classification for breast cancer prognosis |
publishDate |
2010 |
container_title |
Proceedings - 24th European Conference on Modelling and Simulation, ECMS 2010 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.7148/2010-0015-0022 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84857934629&doi=10.7148%2f2010-0015-0022&partnerID=40&md5=21c9e8e42fc925e5b4c9ac30f80a07e5 |
description |
Extracting usable and useful knowledge from large and complex data sets is a difficult and challenging problem. In this paper, we show how two complementary techniques have been used to tackle this problem in the context of breast cancer. Diagnosis concerns the identification of cancer within a patient; in contrast, prognosis concerns the prediction of the ongoing course of the disease, including issues such as the choice of potential treatments such as chemotherapy or drug therapy, in combination with estimation of chances (or length) of survival. Reliable prognosis depends on many factors, including the identification of the type of this heterogeneous disease. We first use a consensus clustering methodology to identify core, well-characterised sub-groups (or classes) of the disease based on a large database of protein biomarkers from over a thousand patients. We then use fuzzy rule induction and simplification algorithms to generate a simple, comprehensible set of rules for use in future model-based classification. The methods are described and their use is illustrated on real-world data. © ECMS. |
publisher |
European Council for Modelling and Simulation |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
All Open Access; Green Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1812871802420985856 |