Research Article
A model for quantitative evaluation of an end-to-end question-answering system
Article first published online: 23 APR 2007
DOI: 10.1002/asi.20560
Copyright © 2007 Wiley Periodicals, Inc., A Wiley Company
Issue

Journal of the American Society for Information Science and Technology
Volume 58, Issue 8, pages 1082–1099, June 2007
Additional Information
How to Cite
Wacholder, N., Kelly, D., Kantor, P., Rittman, R., Sun, Y., Bai, B., Small, S., Yamrom, B. and Strzalkowski, T. (2007), A model for quantitative evaluation of an end-to-end question-answering system. J. Am. Soc. Inf. Sci., 58: 1082–1099. doi: 10.1002/asi.20560
Publication History
- Issue published online: 22 MAY 2007
- Article first published online: 23 APR 2007
- Manuscript Accepted: 17 JUL 2006
- Manuscript Revised: 1 FEB 2006
- Manuscript Received: 15 MAR 2005
Abstract
We describe a procedure for quantitative evaluation of interactive question-answering systems and illustrate it with application to the High-Quality Interactive Question-Answering (HITIQA) system. Our objectives were (a) to design a method to realistically and reliably assess interactive question-answering systems by comparing the quality of reports produced using different systems, (b) to conduct a pilot test of this method, and (c) to perform a formative evaluation of the HITIQA system. Far more important than the specific information gathered from this pilot evaluation are (a) the development of a protocol for evaluating an emerging technology, (b) the creation of reusable assessment instruments, and (c) the knowledge gained in conducting the evaluation. We conclude that this method, which uses a surprisingly small number of subjects and does not rely on predetermined relevance judgments, measures the impact of system change on work produced by users. Therefore, this method can be used to compare the product of interactive systems that use different underlying technologies.
