International Journal of Soft Computing

Year: 2016
Volume: 11
Issue: 5
Page No. 334 - 342

Review of the Effect of Feature Selection for Microarray Data on the Classification Accuracy for Cancer Data Sets

Authors : Naeimeh Elkhani and Ravie Chandren Muniyandi

Abstract: DNA microarrays can be used to monitor the expression level of thousands of genes simultaneously and gene microarray data can be used in cancer diagnosis and classification. Many machine learning techniques have been developed for computational analyses of microarray data. A common difficulty for all techniques is the large number of genes compared to the small sample size which has a negative impact on their speed and accuracy. To overcome these limitations, feature selection techniques are applied to distinguish between significant and redundant or irrelevant genes. Feature selection methods are used for two main goals. The first is to identify the relationship between specific diseases and genes. The second is to examine a compact set of discriminative genes to develop a pattern classifier with good generalizability and limited complexity. Here, we review different feature selection methods for cancer microarray data sets and analyze their accuracy. We describe methods commonly used for selecting significant features including filters, wrappers and embedded methods, categorized according to their experimental methodology. We then compare the classification accuracy of the methods for various cancer data sets and their time complexity to make some suggestions regarding the use of suitable methods for cancer data sets.

How to cite this article:

Naeimeh Elkhani and Ravie Chandren Muniyandi, 2016. Review of the Effect of Feature Selection for Microarray Data on the Classification Accuracy for Cancer Data Sets. International Journal of Soft Computing, 11: 334-342.

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved