Research Article
Design and implementation of a web mining system for organizing search engine results
Article first published online: 19 APR 2005
DOI: 10.1002/int.20086
Copyright © 2005 Wiley Periodicals, Inc.
Additional Information
How to Cite
Schenker, A., Last, M. and Kandel, A. (2005), Design and implementation of a web mining system for organizing search engine results. Int. J. Intell. Syst., 20: 607–625. doi: 10.1002/int.20086
Publication History
- Issue published online: 19 APR 2005
- Article first published online: 19 APR 2005
- Abstract
- References
- Cited By
Abstract
We present the design and implementation of a web mining system that creates a hierarchical clustering of web documents retrieved by commercial web search engines. The cluster hierarchy is produced by a novel method called the Cluster Hierarchy Construction Algorithm (CHCA) and it can be used to explore the topics of interest related to the search query and their relationships. We discuss important design issues for our system, including stemming and dimensionality reduction, as well as some implementation details. We show examples of system results, compare them with results from similar systems, and analyze the responses to a survey of the system's users. © 2005 Wiley Periodicals, Inc. Int J Int Syst 20: 607–625, 2005.

1098-111X/asset/INT_left.gif?v=1&s=c0d44ac5ce99265330169e2ac3d22da4ab6b1a5d)
1098-111X/asset/INT_centre.gif?v=1&s=e94826a6788e7bb0695867b68ca2c030d8c7a252)
1098-111X/asset/INT_right.gif?v=1&s=d4616ff123f9b0a0199cc9f89f77f112e4ce3a70)
1098-111X/asset/cover.gif?v=1&s=7f7c12f2c86265974044b2b3f9936860ffc468a0)