Evolving very-compact fuzzy models for gene expression data analysis

Barreto-Sanz, Miguel Arturo (School of Management and Engineering Vaud, HES-SO // University of Applied Sciences Western Switzerland) ; Bujard, Alexandre (School of Management and Engineering Vaud, HES-SO // University of Applied Sciences Western Switzerland) ; Peña-Reyes, Carlos Andrés (School of Management and Engineering Vaud, HES-SO // University of Applied Sciences Western Switzerland)

Selecting predicitve gene pools from thousands of gene expression values is one of the main tasks in microarray data analysis. For this purpose multivariate techniques have proven much better, in terms of predicitve value and biological relevance, than univariate techniques as they are able to capture relevant relationships and interactions between genes. An additional goal for gene-expression profiling is finding models that, besides being predictive, are also understandable so as they can provide some insight on the underlying mechanisms. Models based on fuzzy logic might, potentially, exhibit both characteristics. However, accuracy and interpretability are usually contradictory objectives, and one must accept a trade off between them. Indeed, literature shows that the approaches based on fuzzy logic may be divided in two groups: accurate but complex models (i.e, with many rules using many variables per rule) on one hand, and models with only few short rules (thus, interpretable) but exhibiting limited accuracy. We present in this paper the application of Fuzzy CoCo, our cooperative coevolutionary fuzzy modelling approach, in order to deal efficiently with the accuracy-interpretability tradeoff. Fuzzy CoCo is able to find very compact fuzzy models, in terms of number of rules and number of variables per rule, while still exhibiting high predictive power. To validate the performance of our approach, we tested Fuzzy CoCo on four known data sets addressing each one a form of cancer: Leukemia, colon, lung, and prostate. We compared our results-in terms of maximum number of rules, number of variables per rule, and accuracy-with those of other similar works (i.e., based on fuzzy logic). Our models reached similar or better accuracy while being considerably smaller.


Keywords:
Conference Type:
full paper
Faculty:
Ingénierie et Architecture
School:
HEIG-VD
Institute:
IICT - Institut des Technologies de l'Information et de la Communication
Publisher:
Larnaca, Cyprus, 11-13 November 2012
Date:
2012-11
Larnaca, Cyprus
11-13 November 2012
Pagination:
6 p.
Published in:
2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)
Numeration (vol. no.):
pp. 356-361
DOI:
ISBN:
978-1-4673-4358-9
Appears in Collection:

Note: The status of this file is: restricted


 Record created 2020-02-11, last modified 2020-02-13

Fulltext:
Download fulltext
PDF

Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)