Research Article
Ranking indirect connections in literature-based discovery: The role of medical subject headings
Article first published online: 28 JUN 2006
DOI: 10.1002/asi.20438
Copyright © 2006 Wiley Periodicals, Inc., A Wiley Company
Issue

Journal of the American Society for Information Science and Technology
Volume 57, Issue 11, pages 1427–1439, September 2006
Additional Information
How to Cite
Swanson, D. R., Smalheiser, N. R. and Torvik, V. I. (2006), Ranking indirect connections in literature-based discovery: The role of medical subject headings. J. Am. Soc. Inf. Sci., 57: 1427–1439. doi: 10.1002/asi.20438
Publication History
- Issue published online: 8 AUG 2006
- Article first published online: 28 JUN 2006
- Manuscript Accepted: 22 AUG 2005
- Manuscript Revised: 14 JUL 2005
- Manuscript Received: 14 FEB 2004
- Abstract
- Article
- References
- Cited By
Abstract
Arrowsmith, a computer-assisted process for literature-based discovery, takes as input two disjoint sets of records (A, C) from the Medline database. It produces a list of title words and phrases, B, that are common to A and C, and displays the title context in which each B-term occurs within A and within C. Subject experts then can try to find A–B and B–C title-pairs that together may suggest novel and plausible indirect A–C relationships (via B-terms) that are of particular interest in the absence of any known direct A–C relationship. The list of B-terms typically is so large that it is difficult to find the relatively few that contribute to scientifically interesting connections. The purpose of the present article is to propose and test several techniques for improving the quality of the B-list. These techniques exploit the Medical Subject Headings (MeSH) that are assigned to each input record. A MesH-based concept of literature cohesiveness is defined and plays a key role. The proposed techniques are tested on a published example of indirect connections between migraine and magnesium deficiency. The tests demonstrate how the earlier results can be replicated with a more efficient and more systematic computer-aided process.

1532-2890/asset/olbannerleft.gif?v=1&s=d833098325c9f1060bcbee51adf276c155608167)
1532-2890/asset/olbannercenter.gif?v=1&s=661179918edb4fa732edfd3408eb050a6ce87809)
1532-2890/asset/olbannerright.gif?v=1&s=1ef8a363944134c502cbffa1937878a71b4cc635)