acknowledgements aggregation algorithms alignment annotation associative author_attribution bayes chunking citeseer classics classification clustering collaboration collaborative_tagging collation collocation computer concepts context corpus data datamining data_mining dawg digital digital_libraries discretization document edition emergent_semantics emerging encyclopedia extraction feature_selection feature_sets female folksonomy functions gender genetic_criticism google hmm humanities information intertextuality keyphrase knn knowledge latent_semantic_indexing learning libraries linguistics literary_criticism lsi male mark memex metadata mining monk named_entities nearest_neighbor net neural newspapers ngrams ontology pair_project patterns perl plagiarism preservation probabilistic_indexing public publishing pubmed quality scaling science scrabble scully segmentation semantics semantic_web semiotics sequences shlomo similarity social social_tagging software tag_cloud tagging tagora term_weighting text textbase text_mining topic_models topics topoi trails training tree trie undiscovered vector_space visualization weights wikipedia winnow writing