Data Mining, Software Packages for
Published Online: 15 JUL 2005
Copyright © 2005 John Wiley & Sons, Ltd
Encyclopedia of Biostatistics
How to Cite
Haughton, D. 2005. Data Mining, Software Packages for. Encyclopedia of Biostatistics.
- Published Online: 15 JUL 2005
The term data mining refers to the identification—within a typically large database—of new, valid, and interesting patterns. While data mining has become most popular in the context of, for example, database marketing, most of the methods under the data mining umbrella have been widely applied in biostatistics. We describe which main applications of data mining have arisen recently in biostatistics, and introduce the reader to some of the available data mining software packages with a reference to biostatistical needs.
- association analysis;
- data mining;
- Kohonen maps;
- microarray data;
- neural nets;
- SAS Enterprise Miner, XLMiner