SEARCH

SEARCH BY CITATION

Keywords:

  • classification;
  • k-nearest-neighbors;
  • standardizing predictors

When employing nearest neighbor classifiers scaling of input variables is often useful. In this paper we propose a small modification in usual data preprocessing: scaling of variables should be done by use of pooled variances instead of overall ones. Thus prediction accuracy is distinctly improved in some situations.

Abstract

When employing nearest neighbor classifiers scaling of input variables is often useful. In this paper we propose a small modification in usual data preprocessing: scaling of variables should be done by use of pooled variances instead of overall ones. Thus prediction accuracy is distinctly improved in some situations. Copyright © 2008 John Wiley & Sons, Ltd.