3. Data Exploration and Preparation

  1. Stéphane Tufféry

Published Online: 20 MAR 2011

DOI: 10.1002/9780470979174.ch3

Data Mining and Statistics for Decision Making

How to Cite

Tufféry, S. (2011) Data Exploration and Preparation, in Data Mining and Statistics for Decision Making, John Wiley & Sons, Ltd, Chichester, UK. doi: 10.1002/9780470979174.ch3

Author Information

  1. University of Rennes, France

Publication History

  1. Published Online: 20 MAR 2011
  2. Published Print: 11 MAR 2011

ISBN Information

Print ISBN: 9780470688298

Online ISBN: 9780470979174

Keywords:

  • Data exploration and preparation;
  • Examining the distribution of variables;
  • Detection of rare or missing values;
  • Detection of aberrant values;
  • Detection of extreme values;
  • Homoscedasticity and heteroscedasticity;
  • Detection of the most discriminating variables;
  • Transformation of variables

Summary

This chapter contains sections titled:

  • The different types of data
  • Examining the distribution of variables
  • Detection of rare or missing values
  • Detection of aberrant values
  • Detection of extreme values
  • Tests of normality
  • Homoscedasticity and heteroscedasticity
  • Detection of the most discriminating variables
  • Transformation of variables
  • Choosing ranges of values of binned variables
  • Creating new variables
  • Detecting interactions
  • Automatic variable selection
  • Detection of collinearity
  • Sampling