ctgov_text_similarity {ctrialsgov} | R Documentation |
Similarity Matrix
Description
Takes one or more vectors of text and returns a similarity matrix.
Usage
ctgov_text_similarity(
...,
max_terms = 10000,
tolower = TRUE,
min_df = 0,
max_df = 1
)
Arguments
... |
one or more vectors of text to search; must all be the same length |
max_terms |
maximum number of terms to consider for keywords |
tolower |
should keywords respect the case of the raw terms |
min_df |
minimum proportion of documents that a term should be present in to be included in the keywords |
max_df |
maximum proportion of documents that a term should be present in to be included in the keywords |
Value
a distance matrix
[Package ctrialsgov version 0.2.5 Index]