removeSparseTerms {tm} | R Documentation |
Remove Sparse Terms from a Term-Document Matrix
Description
Remove sparse terms from a document-term or term-document matrix.
Usage
removeSparseTerms(x, sparse)
Arguments
x |
A |
sparse |
A numeric for the maximal allowed sparsity in the range from bigger zero to smaller one. |
Value
A term-document matrix where those terms from x
are
removed which have at least a sparse
percentage of empty (i.e.,
terms occurring 0 times in a document) elements. I.e., the resulting
matrix contains only terms with a sparse factor of less than
sparse
.
Examples
data("crude")
tdm <- TermDocumentMatrix(crude)
removeSparseTerms(tdm, 0.2)
[Package tm version 0.7-13 Index]