A short, preliminary version of this paper has been accepted for publication by CIKM ’11.
A NEW SUPERVISED FEATURE SELECTION METHOD FOR PATTERN CLASSIFICATION
Article first published online: 7 SEP 2012
© 2012 Wiley Periodicals, Inc.
Volume 30, Issue 2, pages 342–361, May 2014
How to Cite
Liu, H., Wu, X. and Zhang, S. (2014), A NEW SUPERVISED FEATURE SELECTION METHOD FOR PATTERN CLASSIFICATION. Computational Intelligence, 30: 342–361. doi: 10.1111/j.1467-8640.2012.00465.x
- Issue published online: 6 MAY 2014
- Article first published online: 7 SEP 2012
- Received 2 September 2011; Revised 3 July 2012; Accepted 6 July 2012
- feature selection;
- dimensionality reduction;
- information entropy;
- pattern classification;
With the rapid development of information techniques, the dimensionality of data in many application domains, such as text categorization and bioinformatics, is getting higher and higher. The high-dimensionality data may bring many adverse situations, such as overfitting, poor performance, and low efficiency, to traditional learning algorithms in pattern classification. Feature selection aims at reducing the dimensionality of data and providing discriminative features for pattern learning algorithms. Due to its effectiveness, feature selection is now gaining increasing attentions from a variety of disciplines and currently many efforts have been attempted in this field. In this paper, we propose a new supervised feature selection method to pick important features by using information criteria. Unlike other selection methods, the main characteristic of our method is that it not only takes both maximal relevance to the class labels and minimal redundancy to the selected features into account, but also works like feature clustering in an agglomerative way. To measure the relevance and redundancy of feature exactly, two different information criteria, i.e., mutual information and coefficient of relevance, have been adopted in our method. The performance evaluations on 12 benchmark data sets show that the proposed method can achieve better performance than other popular feature selection methods in most cases.