A Comparison of Machine Measures of Text Document Similarity with Human Judgments |
Reviews
[Write a review of this article]
There are no reviews of this article
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
AbstractA central problem in the information sciences involves measuring the semantic similarity between text documents. Although this is fundamentally a cognitive modeling problem, existing methods have not been assessed in terms of their ability to emulate human judgments of similarity. To address this problem, we conducted a controlled psychological experiment that collected repeated similarity measures for each pair of documents in a small corpus of short news documents. We then considered...
BibTeX record
RIS record