http://archive.lis.unt.edu:2025/resume
Research
Images of similarity: A visual exploration of optimal similarity metrics and scaling properties of TREC topic-document sets
Article first published online: 30 APR 1999
DOI: 10.1002/(SICI)1097-4571(1999)50:8<639::AID-ASI2>3.0.CO;2-C
Copyright © 1999 John Wiley & Sons, Inc.
Issue
1532-2890/asset/cover.gif?v=1&s=c9d33fcfc2cef6291d3cfd7d7be1de8ca023f1c7)
Journal of the American Society for Information Science
Volume 50, Issue 8, pages 639–651, 1999
Additional Information
How to Cite
Rorvig, M. (1999), Images of similarity: A visual exploration of optimal similarity metrics and scaling properties of TREC topic-document sets. J. Am. Soc. Inf. Sci., 50: 639–651. doi: 10.1002/(SICI)1097-4571(1999)50:8<639::AID-ASI2>3.0.CO;2-C
- †
http://archive.lis.unt.edu:2025/resume
Publication History
- Issue published online: 30 APR 1999
- Article first published online: 30 APR 1999
- Manuscript Revised: 4 JUN 1998
- Manuscript Accepted: 4 JUN 1998
- Manuscript Received: 8 JAN 1998
- Abstract
- References
- Cited By
Abstract
Multiple similarity measures for five TREC topic-document sets from the LDC TREC Collection Disk 1 are derived from the full text of documents. Each measure on each set is scaled using SAS MDS under ordinal, interval, and MLE assumptions. The resulting 75 permutations are ploted. It is suggested that cosine-vector and overlap measures for similarity appear to recover optimal data relationships among the documents of the five sets. MLE assumptions appear to be required to model the data adequately.

1532-2890/asset/olbannerleft.gif?v=1&s=d833098325c9f1060bcbee51adf276c155608167)
1532-2890/asset/olbannercenter.gif?v=1&s=661179918edb4fa732edfd3408eb050a6ce87809)
1532-2890/asset/olbannerright.gif?v=1&s=1ef8a363944134c502cbffa1937878a71b4cc635)