Research Article
Statistical principal components analysis for retrieval experiments
Article first published online: 22 JAN 2007
DOI: 10.1002/asi.20537
Copyright © 2007 Wiley Periodicals, Inc., A Wiley Company
Issue

Journal of the American Society for Information Science and Technology
Volume 58, Issue 4, pages 560–574, 15 February 2007
Additional Information
How to Cite
Dinçer, B. T. (2007), Statistical principal components analysis for retrieval experiments. J. Am. Soc. Inf. Sci., 58: 560–574. doi: 10.1002/asi.20537
Publication History
- Issue published online: 16 FEB 2007
- Article first published online: 22 JAN 2007
- Manuscript Accepted: 20 MAY 2006
- Manuscript Revised: 19 MAY 2006
- Manuscript Received: 23 JAN 2006
- Abstract
- Article
- References
- Cited By
Abstract
In this article, the statistical principal components analysis (PCA) is proposed as a method for performance comparisons of different retrieval strategies. It is shown that the PCA method can reveal implicit performance relations among retrieval systems across information needs (i.e., queries, topics). For illustration, the TREC 12 robust track data have been reevaluated by the PCA method and have been shown to reveal easily the performance relations that are hard to see with traditional techniques. Therefore, PCA promises a uniform evaluation framework that can be used for large-scale evaluation of retrieval experiments. In addition to the mean average precision (MAP) measure, relative analytic distance (RAD) is proposed as a new performance summary measure based on the same notion introduced by PCA.

1532-2890/asset/olbannerleft.gif?v=1&s=d833098325c9f1060bcbee51adf276c155608167)
1532-2890/asset/olbannercenter.gif?v=1&s=661179918edb4fa732edfd3408eb050a6ce87809)
1532-2890/asset/olbannerright.gif?v=1&s=1ef8a363944134c502cbffa1937878a71b4cc635)