Research Article
A “stereo” document representation for textual information retrieval
Article first published online: 17 FEB 2006
DOI: 10.1002/asi.20343
Copyright © 2006 Wiley Periodicals, Inc.
Issue

Journal of the American Society for Information Science and Technology
Volume 57, Issue 6, pages 768–774, April 2006
Additional Information
How to Cite
Chen, L., Zeng, J. and Tokuda, N. (2006), A “stereo” document representation for textual information retrieval. J. Am. Soc. Inf. Sci., 57: 768–774. doi: 10.1002/asi.20343
Publication History
- Issue published online: 24 MAR 2006
- Article first published online: 17 FEB 2006
- Manuscript Accepted: 23 MAR 2005
- Manuscript Revised: 7 FEB 2005
- Manuscript Received: 29 NOV 2004
Abstract
A new document representation model is presented in this paper. This model is based on the idea of representing a document by two or more pictures of the document taken from different perspectives. It is shown that by applying the stereo representation model, enhanced textual retrieval performance is achieved because the new model improves the capability of capturing individual features of the document. Experiments have been conducted on two standard corpora, TIME and ADI, using the standard term vector method and the latent semantic indexing (LSI) method based upon both the stereo representation model and the traditional representation model. Statistical t-tests on the experimental results have convincingly illustrated that these methods achieve significant improvements in retrieval performances with the stereo representation model over those with the traditional representation model.
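For readers unfamiliar with the baseline named in the abstract, the following is a minimal sketch of the standard term vector method only: each text becomes a vector of term frequencies, and documents are ranked by cosine similarity to the query. It does not implement the stereo representation model, whose construction the abstract does not specify. The function names, the whitespace tokenizer, the raw term-frequency weighting, and the three toy documents are all assumptions made for illustration; none of them comes from the article or from the TIME and ADI corpora.

```python
import math
from collections import Counter


def term_vector(text):
    """Map a text to raw term-frequency counts (lowercased whitespace tokens)."""
    return Counter(text.lower().split())


def cosine(u, v):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(weight * v[term] for term, weight in u.items() if term in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, documents):
    """Return document ids ordered by decreasing similarity to the query."""
    q = term_vector(query)
    scores = {doc_id: cosine(q, term_vector(text))
              for doc_id, text in documents.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical toy collection, not drawn from TIME or ADI.
docs = {
    "d1": "information retrieval with term vectors",
    "d2": "latent semantic indexing for retrieval",
    "d3": "stereo photography and depth perception",
}
ranking = rank("term vectors retrieval", docs)
```

Under these assumptions, `d1` shares three terms with the query, `d2` shares one, and `d3` shares none, so the ranking places them in that order. A practical system would normally replace the raw counts with a weighting scheme such as tf-idf.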
