tCorpus_docsim {corpustools}    R Documentation

Document similarity

Description

Details

Compare documents, and perform similarity-based deduplication.

compare_documents()    Compare documents
$deduplicate()         Remove duplicate documents
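
Examples

The idea behind similarity-based deduplication can be sketched in base R. This is only an illustration of the general technique, not the implementation used by corpustools: the toy corpus, the plain term-frequency weighting, the 0.9 threshold and the keep-the-first rule are all assumptions made for the example. For the actual arguments, see the help pages of compare_documents() and $deduplicate().

```r
# Sketch of similarity-based deduplication in base R.
# NOTE: illustrative only; this is not how corpustools implements it.

# Hypothetical toy corpus: doc2 is a verbatim copy of doc1.
docs <- c(
  doc1 = "the quick brown fox jumps over the lazy dog",
  doc2 = "the quick brown fox jumps over the lazy dog",
  doc3 = "an entirely different sentence about cats"
)

# Tokenize and build a document-term matrix of raw term frequencies.
tokens <- strsplit(tolower(docs), "[^a-z]+")
vocab  <- sort(unique(unlist(tokens)))
dtm <- t(vapply(
  tokens,
  function(tok) tabulate(match(tok, vocab), nbins = length(vocab)),
  integer(length(vocab))
))

# Cosine similarity between every pair of documents.
norms <- sqrt(rowSums(dtm^2))
sim   <- tcrossprod(dtm) / outer(norms, norms)

# Keep only comparisons with earlier documents (lower triangle),
# so that the first document of each duplicate pair is retained.
threshold <- 0.9  # assumed cut-off for calling two documents duplicates
earlier <- sim
earlier[upper.tri(earlier, diag = TRUE)] <- 0
is_dup <- apply(earlier >= threshold, 1, any)

deduplicated <- docs[!is_dup]
print(round(sim, 2))
print(names(deduplicated))
```

In this sketch a document is dropped when it is at least as similar as the threshold to any earlier document, which mirrors the two steps listed above: first compare documents, then remove the duplicates.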

[Package corpustools version 0.5.1 Index]