Consensus clustering and fuzzy classification for breast cancer prognosis

Extracting usable and useful knowledge from large and complex data sets is a difficult and challenging problem. In this paper, we show how two complementary techniques have been used to tackle this problem in the context of breast cancer. Diagnosis concerns the identification of cancer within a pati...

Full description

Bibliographic Details
Published in:Proceedings - 24th European Conference on Modelling and Simulation, ECMS 2010
Main Author: Garibaldi J.M.; Soria D.; Rasmani K.A.
Format: Conference paper
Language:English
Published: European Council for Modelling and Simulation 2010
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-84857934629&doi=10.7148%2f2010-0015-0022&partnerID=40&md5=21c9e8e42fc925e5b4c9ac30f80a07e5
id 2-s2.0-84857934629
spelling 2-s2.0-84857934629
Garibaldi J.M.; Soria D.; Rasmani K.A.
Consensus clustering and fuzzy classification for breast cancer prognosis
2010
Proceedings - 24th European Conference on Modelling and Simulation, ECMS 2010


10.7148/2010-0015-0022
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84857934629&doi=10.7148%2f2010-0015-0022&partnerID=40&md5=21c9e8e42fc925e5b4c9ac30f80a07e5
Extracting usable and useful knowledge from large and complex data sets is a difficult and challenging problem. In this paper, we show how two complementary techniques have been used to tackle this problem in the context of breast cancer. Diagnosis concerns the identification of cancer within a patient; in contrast, prognosis concerns the prediction of the ongoing course of the disease, including issues such as the choice of potential treatments such as chemotherapy or drug therapy, in combination with estimation of chances (or length) of survival. Reliable prognosis depends on many factors, including the identification of the type of this heterogeneous disease. We first use a consensus clustering methodology to identify core, well-characterised sub-groups (or classes) of the disease based on a large database of protein biomarkers from over a thousand patients. We then use fuzzy rule induction and simplification algorithms to generate a simple, comprehensible set of rules for use in future model-based classification. The methods are described and their use is illustrated on real-world data. © ECMS.
European Council for Modelling and Simulation

English
Conference paper
All Open Access; Green Open Access
author Garibaldi J.M.; Soria D.; Rasmani K.A.
spellingShingle Garibaldi J.M.; Soria D.; Rasmani K.A.
Consensus clustering and fuzzy classification for breast cancer prognosis
author_facet Garibaldi J.M.; Soria D.; Rasmani K.A.
author_sort Garibaldi J.M.; Soria D.; Rasmani K.A.
title Consensus clustering and fuzzy classification for breast cancer prognosis
title_short Consensus clustering and fuzzy classification for breast cancer prognosis
title_full Consensus clustering and fuzzy classification for breast cancer prognosis
title_fullStr Consensus clustering and fuzzy classification for breast cancer prognosis
title_full_unstemmed Consensus clustering and fuzzy classification for breast cancer prognosis
title_sort Consensus clustering and fuzzy classification for breast cancer prognosis
publishDate 2010
container_title Proceedings - 24th European Conference on Modelling and Simulation, ECMS 2010
container_volume
container_issue
doi_str_mv 10.7148/2010-0015-0022
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-84857934629&doi=10.7148%2f2010-0015-0022&partnerID=40&md5=21c9e8e42fc925e5b4c9ac30f80a07e5
description Extracting usable and useful knowledge from large and complex data sets is a difficult and challenging problem. In this paper, we show how two complementary techniques have been used to tackle this problem in the context of breast cancer. Diagnosis concerns the identification of cancer within a patient; in contrast, prognosis concerns the prediction of the ongoing course of the disease, including issues such as the choice of potential treatments such as chemotherapy or drug therapy, in combination with estimation of chances (or length) of survival. Reliable prognosis depends on many factors, including the identification of the type of this heterogeneous disease. We first use a consensus clustering methodology to identify core, well-characterised sub-groups (or classes) of the disease based on a large database of protein biomarkers from over a thousand patients. We then use fuzzy rule induction and simplification algorithms to generate a simple, comprehensible set of rules for use in future model-based classification. The methods are described and their use is illustrated on real-world data. © ECMS.
publisher European Council for Modelling and Simulation
issn
language English
format Conference paper
accesstype All Open Access; Green Open Access
record_format scopus
collection Scopus
_version_ 1812871802420985856