Research Article
Text mining: Generating hypotheses from MEDLINE
Article first published online: 16 DEC 2003
DOI: 10.1002/asi.10389
Copyright © 2003 Wiley Periodicals, Inc.
Issue

Journal of the American Society for Information Science and Technology
Volume 55, Issue 5, pages 396–413, March 2004
Additional Information
How to Cite
Srinivasan, P. (2004), Text mining: Generating hypotheses from MEDLINE. J. Am. Soc. Inf. Sci., 55: 396–413. doi: 10.1002/asi.10389
Publication History
- Issue published online: 10 FEB 2004
- Article first published online: 16 DEC 2003
- Manuscript Revised: 24 SEP 2003
- Manuscript Accepted: 24 SEP 2003
- Manuscript Received: 22 MAY 2003
- Abstract
- Article
- References
- Cited By
Abstract
Hypothesis generation, a crucial initial step for making scientific discoveries, relies on prior knowledge, experience, and intuition. Chance connections made between seemingly distinct subareas sometimes turn out to be fruitful. The goal in text mining is to assist in this process by automatically discovering a small set of interesting hypotheses from a suitable text collection. In this report, we present open and closed text mining algorithms that are built within the discovery framework established by Swanson and Smalheiser. Our algorithms represent topics using metadata profiles. When applied to MEDLINE, these are MeSH based profiles. We present experiments that demonstrate the effectiveness of our algorithms. Specifically, our algorithms successfully generate ranked term lists where the key terms representing novel relationships between topics are ranked high.

1532-2890/asset/olbannerleft.gif?v=1&s=d833098325c9f1060bcbee51adf276c155608167)
1532-2890/asset/olbannercenter.gif?v=1&s=661179918edb4fa732edfd3408eb050a6ce87809)
1532-2890/asset/olbannerright.gif?v=1&s=1ef8a363944134c502cbffa1937878a71b4cc635)