Consensus clustering and fuzzy classification for breast cancer prognosis

Extracting usable and useful knowledge from large and complex data sets is a difficult and challenging problem. In this paper, we show how two complementary techniques have been used to tackle this problem in the context of breast cancer. Diagnosis concerns the identification of cancer within a pati...

Full description

Bibliographic Details
Published in:Proceedings - 24th European Conference on Modelling and Simulation, ECMS 2010
Main Author: Garibaldi J.M.; Soria D.; Rasmani K.A.
Format: Conference paper
Language:English
Published: European Council for Modelling and Simulation 2010
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-84857934629&doi=10.7148%2f2010-0015-0022&partnerID=40&md5=21c9e8e42fc925e5b4c9ac30f80a07e5
Description
Summary:Extracting usable and useful knowledge from large and complex data sets is a difficult and challenging problem. In this paper, we show how two complementary techniques have been used to tackle this problem in the context of breast cancer. Diagnosis concerns the identification of cancer within a patient; in contrast, prognosis concerns the prediction of the ongoing course of the disease, including issues such as the choice of potential treatments such as chemotherapy or drug therapy, in combination with estimation of chances (or length) of survival. Reliable prognosis depends on many factors, including the identification of the type of this heterogeneous disease. We first use a consensus clustering methodology to identify core, well-characterised sub-groups (or classes) of the disease based on a large database of protein biomarkers from over a thousand patients. We then use fuzzy rule induction and simplification algorithms to generate a simple, comprehensible set of rules for use in future model-based classification. The methods are described and their use is illustrated on real-world data. © ECMS.
ISSN:
DOI:10.7148/2010-0015-0022