Contextual document clustering is a novel approach which uses information theoretic measures to cluster semantically related documents bound together by an implicit set of concepts or themes of narrow specificity. It facilitates cluster-based retrieval by assessing the similarity between a query and the cluster themes' probability distribution. In this paper, we assess a relevance feedback mechanism, based on query refinement, that modifies the query's probability distribution using a small number of documents that have been judged relevant to the query. We demonstrate that by providing only one relevance judgment, a performance improvement of 33% was obtained.

Original languageEnglish
Pages (from-to)1176-1184
Number of pages9
JournalInformation Processing and Management
Volume42
Issue number5
DOIs
StatePublished - 1 Sep 2006

    Scopus subject areas

  • Information Systems
  • Media Technology
  • Computer Science Applications
  • Management Science and Operations Research
  • Library and Information Sciences

    Research areas

  • Document clustering, Information retrieval, Relevance feedback

ID: 36369108