Standard Article

Data Mining

  1. David J. Hand

Published Online: 15 JUL 2005

DOI: 10.1002/0470011815.b2a00007

Encyclopedia of Biostatistics

Encyclopedia of Biostatistics

How to Cite

Hand, D. J. 2005. Data Mining. Encyclopedia of Biostatistics. 2.

Author Information

  1. Imperial College London, London, UK

Publication History

  1. Published Online: 15 JUL 2005


Data mining seeks unexpected, interesting, or valuable structures in large data sets. There are two distinct classes of data mining tool, modeling, and pattern discovery. Difficulties arise in coping with data distortion and errors. The tools of data mining hold great promise for scientific and medical advance, but many theoretical questions remain open.


  • large data sets;
  • pattern discovery;
  • anomaly detection;
  • score functions;
  • missing data;
  • distorted data