Результаты исследований: Научные публикации в периодических изданиях › статья › Рецензирование
Contextual document clustering. / Dobrynin, Vladimir; Patterson, David; Rooney, Niall.
в: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Том 2997, 01.12.2004, стр. 167-180.Результаты исследований: Научные публикации в периодических изданиях › статья › Рецензирование
}
TY - JOUR
T1 - Contextual document clustering
AU - Dobrynin, Vladimir
AU - Patterson, David
AU - Rooney, Niall
PY - 2004/12/1
Y1 - 2004/12/1
N2 - In this paper we present a novel algorithm for document clustering. This approach is based on distributional clustering where subject related words, which have a narrow context, are identified to form metatags for that subject. These contextual words form the basis for creating thematic clusters of documents. In a similar fashion to other research papers on document clustering, we analyze the quality of this approach with respect to document categorization problems and show it to outperform the information theoretic method of sequential information bottleneck.
AB - In this paper we present a novel algorithm for document clustering. This approach is based on distributional clustering where subject related words, which have a narrow context, are identified to form metatags for that subject. These contextual words form the basis for creating thematic clusters of documents. In a similar fashion to other research papers on document clustering, we analyze the quality of this approach with respect to document categorization problems and show it to outperform the information theoretic method of sequential information bottleneck.
UR - http://www.scopus.com/inward/record.url?scp=33646532191&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:33646532191
VL - 2997
SP - 167
EP - 180
JO - Lecture Notes in Computer Science
JF - Lecture Notes in Computer Science
SN - 0302-9743
ER -
ID: 36369783