Standard Article
Data Mining, Software Packages for
Published Online: 15 JUL 2005
DOI: 10.1002/0470011815.b2a13093
Copyright © 2005 John Wiley & Sons, Ltd
Book Title: Encyclopedia of Biostatistics
Additional Information
How to Cite
Haughton, D. 2005. Data Mining, Software Packages for. Encyclopedia of Biostatistics.
Abstract
The term data mining refers to the identification, within a typically large database, of new, valid, and interesting patterns. Although data mining is best known from applications such as database marketing, most of the methods under the data mining umbrella have also been widely applied in biostatistics. We describe the main applications of data mining that have recently arisen in biostatistics and introduce the reader to some of the available data mining software packages, with reference to biostatistical needs.
Keywords:
- association analysis;
- CART;
- data mining;
- Kohonen maps;
- MARS;
- microarray data;
- neural nets;
- pharmacovigilance;
- SAS Enterprise Miner;
- XLMiner
