Mining Web Resources for Enhancing Information Retrieval
Mining Web functional dependencies for flexible information access
Article first published online: 6 SEP 2007
DOI: 10.1002/asi.20628
Copyright © 2007 Wiley Periodicals, Inc., A Wiley Company
Issue

Journal of the American Society for Information Science and Technology
Volume 58, Issue 12, pages 1805–1819, October 2007
Additional Information
How to Cite
Perugini, S., Ramakrishnan, N. (2007), Mining Web functional dependencies for flexible information access. J. Am. Soc. Inf. Sci., 58: 1805–1819. doi: 10.1002/asi.20628
Publication History
- Issue published online: 24 SEP 2007
- Article first published online: 6 SEP 2007
- Manuscript Accepted: 4 JAN 2007
- Abstract
- Article
- References
- Cited By
Abstract
We present an approach to enhancing information access through Web structure mining in contrast to traditional approaches involving usage mining. Specifically, we mine the hardwired hierarchical hyperlink structure of Web sites to identify patterns of term-term co-occurrences we call Web functional dependencies (FDs). Intuitively, a Web FD ‘x → y’ declares that all paths through a site involving a hyperlink labeled x also contain a hyperlink labeled y. The complete set of FDs satisfied by a site help characterize (flexible and expressive) interaction paradigms supported by a site, where a paradigm is the set of explorable sequences therein. We describe algorithms for mining FDs and results from mining several hierarchical Web sites and present several interface designs that can exploit such FDs to provide compelling user experiences.

1532-2890/asset/olbannerleft.gif?v=1&s=d833098325c9f1060bcbee51adf276c155608167)
1532-2890/asset/olbannercenter.gif?v=1&s=661179918edb4fa732edfd3408eb050a6ce87809)
1532-2890/asset/olbannerright.gif?v=1&s=1ef8a363944134c502cbffa1937878a71b4cc635)