Correlation-Based Web Document Clustering for Adaptive Web Interface Design

  • Web Information Systems Engineering Paper
Knowledge and Information Systems

Abstract

A major challenge for web site designers is ensuring that users can reach important web pages easily and efficiently. In this paper we present a clustering-based approach to this problem. We perform efficient and effective correlation analysis on web logs and construct clusters of web pages that reflect the co-visit behavior of web site users. We present a novel way of adapting previous clustering algorithms, originally designed for databases, to the problem domain of web page clustering, and show that our new methods can generate high-quality clusters for very large web logs where previous methods fail. Based on these clustering results, we then apply the mined clustering knowledge to the problem of adapting web interfaces to improve users' performance. We develop an automatic method for web interface adaptation that introduces index pages to minimize overall user browsing costs. The index pages provide shortcuts so that users reach their target web pages quickly, and we solve a previously open problem of how to determine an optimal number of index pages. Experiments on several realistic web log files show empirically that our approach performs better than many previous algorithms.
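To make the idea of co-visit clustering concrete, the following is a minimal sketch and not the algorithm from the paper. It assumes the web log has already been split into per-user sessions (lists of page URLs), counts how many sessions contain each pair of pages, and groups pages whose co-visit count reaches a threshold by taking connected components. The function name `covisit_clusters`, the `min_covisits` parameter, and the sample URLs are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def covisit_clusters(sessions, min_covisits=2):
    """Group pages that appear together in at least `min_covisits` sessions.

    Illustrative only: a simple threshold plus connected components,
    not the clustering method described in the paper.
    """
    # Count, for each unordered page pair, the sessions containing both pages.
    pair_counts = defaultdict(int)
    pages = set()
    for session in sessions:
        unique_pages = sorted(set(session))
        pages.update(unique_pages)
        for a, b in combinations(unique_pages, 2):
            pair_counts[(a, b)] += 1

    # Union-find over pages; link pairs whose count meets the threshold.
    parent = {p: p for p in pages}

    def find(p):
        while parent[p] != p:
            parent[p] = parent[parent[p]]  # path halving
            p = parent[p]
        return p

    for (a, b), count in pair_counts.items():
        if count >= min_covisits:
            parent[find(a)] = find(b)

    # Collect pages by their root and return clusters in a stable order.
    clusters = defaultdict(set)
    for p in pages:
        clusters[find(p)].add(p)
    return sorted((sorted(c) for c in clusters.values()), key=lambda c: c[0])


# Hypothetical sessions extracted from a web log.
sessions = [
    ["/home", "/courses", "/courses/cs101"],
    ["/courses", "/courses/cs101"],
    ["/home", "/staff", "/staff/contact"],
    ["/staff", "/staff/contact"],
]
print(covisit_clusters(sessions))
```

In this sketch each resulting cluster would be a candidate for one index page linking to its members. How many index pages to create, and how browsing cost is measured, are the questions the paper addresses and are not modeled here.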

Additional information

Received 25 November 2000 / Revised 15 March 2001 / Accepted in revised form 14 May 2001

About this article

Cite this article

Su, Z., Yang, Q., Zhang, H. et al. Correlation-Based Web Document Clustering for Adaptive Web Interface Design. Knowl Inform Sys 4, 151–167 (2002). https://doi.org/10.1007/s101150200002

  • DOI: https://doi.org/10.1007/s101150200002
