Research Article
Automatic multidocument summarization of research abstracts: Design and user evaluation
Article first published online: 12 JUN 2007
DOI: 10.1002/asi.20618
Copyright © 2007 Wiley Periodicals, Inc., A Wiley Company
Issue

Journal of the American Society for Information Science and Technology
Volume 58, Issue 10, pages 1419–1435, August 2007
Additional Information
How to Cite
Ou, S., Khoo, C. S.G. and Goh, D. H. (2007), Automatic multidocument summarization of research abstracts: Design and user evaluation. J. Am. Soc. Inf. Sci., 58: 1419–1435. doi: 10.1002/asi.20618
Publication History
- Issue published online: 19 JUL 2007
- Article first published online: 12 JUN 2007
- Manuscript Accepted: 11 NOV 2006
- Manuscript Revised: 30 OCT 2006
- Manuscript Received: 30 DEC 2005
- Abstract
- Article
- References
- Cited By
Abstract
The purpose of this study was to develop a method for automatic construction of multidocument summaries of sets of research abstracts that may be retrieved by a digital library or search engine in response to a user query. Sociology dissertation abstracts were selected as the sample domain in this study. A variable-based framework was proposed for integrating and organizing research concepts and relationships as well as research methods and contextual relations extracted from different dissertation abstracts. Based on the framework, a new summarization method was developed, which parses the discourse structure of abstracts, extracts research concepts and relationships, integrates the information across different abstracts, and organizes and presents them in a Web-based interface. The focus of this article is on the user evaluation that was performed to assess the overall quality and usefulness of the summaries. Two types of variable-based summaries generated using the summarization method—with or without the use of a taxonomy—were compared against a sentence-based summary that lists only the research-objective sentences extracted from each abstract and another sentence-based summary generated using the MEAD system that extracts important sentences. The evaluation results indicate that the majority of sociological researchers (70%) and general users (64%) preferred the variable-based summaries generated with the use of the taxonomy.

1532-2890/asset/olbannerleft.gif?v=1&s=d833098325c9f1060bcbee51adf276c155608167)
1532-2890/asset/olbannercenter.gif?v=1&s=661179918edb4fa732edfd3408eb050a6ce87809)
1532-2890/asset/olbannerright.gif?v=1&s=1ef8a363944134c502cbffa1937878a71b4cc635)