dtm_tfidf {udpipe} | R Documentation
Term Frequency - Inverse Document Frequency calculation
Description
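Calculates the Term Frequency - Inverse Document Frequency (tfidf) of the terms in a document-term matrix. The tfidf values are averaged per term, so the result contains one value for each term.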
Usage
dtm_tfidf(dtm)
Arguments
dtm | a document-term matrix, such as the object returned by document_term_matrix (see Examples)
Value
a vector of tfidf values, one for each term in the document-term matrix
Examples
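## Load the annotated example reviews shipped with the package
data(brussels_reviews_anno)
## Keep only the tokens tagged as nouns (xpos == "NN")
x <- subset(brussels_reviews_anno, xpos == "NN")
x <- x[, c("doc_id", "lemma")]
## Count how often each lemma occurs in each document
x <- document_term_frequencies(x)
## Build the document-term matrix from these counts
dtm <- document_term_matrix(x)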
## Calculate tfidf
tfidf <- dtm_tfidf(dtm)
hist(tfidf, breaks = "scott")
head(sort(tfidf, decreasing = TRUE))
head(sort(tfidf, decreasing = FALSE))
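
The averaging mentioned in the Description can be illustrated by hand on a small dense matrix. The sketch below is not the package's implementation: it assumes a common tfidf definition (term frequency relative to document length, inverse document frequency as a base-2 logarithm, and averaging only over the documents that contain the term). The toy matrix and the names tf, idf, cell_tfidf and tfidf_manual are made up for illustration, and dtm_tfidf may use a different weighting, so its numbers need not match.

```r
## Toy document-term matrix: 3 documents (rows) x 3 terms (columns).
## Hypothetical data, for illustration only.
toy <- matrix(c(2, 1, 0,
                0, 1, 1,
                1, 1, 0),
              nrow = 3, byrow = TRUE,
              dimnames = list(c("doc1", "doc2", "doc3"),
                              c("apple", "beer", "city")))

## Term frequency: counts divided by the length of each document
tf <- toy / rowSums(toy)

## Inverse document frequency; the base-2 logarithm is an assumption
idf <- log2(nrow(toy) / colSums(toy > 0))

## tfidf for every document/term cell
cell_tfidf <- sweep(tf, 2, idf, `*`)

## Average per term over the documents containing that term (assumption)
cell_tfidf[toy == 0] <- NA
tfidf_manual <- colMeans(cell_tfidf, na.rm = TRUE)
tfidf_manual
```

Under these assumptions a term that occurs in every document ("beer" here) gets a value of zero, while a term confined to a single document ("city") scores highest.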
[Package udpipe version 0.8.11 Index]