comparison.cloud {wordcloud} | R Documentation |
Plot a comparison cloud
Description
Plot a cloud comparing the frequencies of words across documents.
Usage
comparison.cloud(term.matrix, scale=c(4,.5), max.words=300,
    random.order=FALSE, rot.per=.1,
    colors=brewer.pal(max(3,ncol(term.matrix)),"Dark2"),
    use.r.layout=FALSE, title.size=3,
    title.colors=NULL, match.colors=FALSE,
    title.bg.colors="grey90", ...)
Arguments
term.matrix |
A term frequency matrix whose rows represent words and whose columns represent documents. |
scale |
A vector of length 2 indicating the range of the size of the words. |
max.words |
Maximum number of words to be plotted. The least frequent terms are dropped. |
random.order |
Plot words in random order. If FALSE, they are plotted in decreasing order of frequency. |
rot.per |
Proportion of words plotted with a 90 degree rotation. |
colors |
Colors for the words, given in the order of the columns in term.matrix. |
use.r.layout |
If FALSE, C++ code is used for collision detection; otherwise R is used. |
title.size |
Size of document titles |
title.colors |
Colors used for document titles. See details. |
match.colors |
Logical: should document title colors match word colors? See details. |
title.bg.colors |
Colors used for the background of document titles. |
... |
Additional parameters to be passed to text (and strheight,strwidth). |
Details
Let p_{i,j} be the rate at which word i occurs in document j, and p_i be the average rate of word i across documents (\sum_j p_{i,j}/ndocs). The size of each word is mapped to its maximum deviation (max_j(p_{i,j} - p_i)), and its angular position is determined by the document where that maximum occurs.
If title.colors is not NULL, it is used for document titles and match.colors is ignored.
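The sizing rule in the first paragraph of this section can be sketched in base R. This follows the description only (per-document rates, the average rate of each word across documents, and the largest deviation from that average); it is not taken from the package source. The toy counts and the names rates, mean.rates, deviation, max.dev and doc.idx are assumptions made for illustration.

```r
# Toy term frequency matrix (hypothetical counts): rows are words, columns are documents
term.matrix <- matrix(c(6, 2,
                        2, 2,
                        2, 6),
                      nrow = 3, byrow = TRUE,
                      dimnames = list(c("economy", "people", "jobs"),
                                      c("doc1", "doc2")))

# Rate of each word within each document (each column sums to 1)
rates <- sweep(term.matrix, 2, colSums(term.matrix), "/")

# Average rate of each word across documents
mean.rates <- rowMeans(rates)

# Deviation of each word's rate from its own average, per document
deviation <- rates - mean.rates

# Largest deviation per word (drives word size) and the document where it occurs
max.dev <- apply(deviation, 1, max)
doc.idx <- apply(deviation, 1, which.max)

print(data.frame(max.dev = max.dev,
                 document = colnames(term.matrix)[doc.idx]))
```

In this toy input, "economy" deviates most in doc1 and "jobs" in doc2, so each would be drawn on its own document's side of the cloud. "people" occurs at the same rate in both documents, so its maximum deviation is zero.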
Value
Nothing. The function is called for its side effect of drawing the plot.
Examples
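if(require(tm)){
    # Load the example corpus of State of the Union addresses
    data(SOTU)
    corp <- SOTU

    # Clean the text: strip punctuation, lower-case, drop numbers and stop words
    corp <- tm_map(corp, removePunctuation)
    corp <- tm_map(corp, content_transformer(tolower))
    corp <- tm_map(corp, removeNumbers)
    corp <- tm_map(corp, function(x) removeWords(x, stopwords()))

    # Build a word-by-document count matrix and label the documents
    term.matrix <- TermDocumentMatrix(corp)
    term.matrix <- as.matrix(term.matrix)
    colnames(term.matrix) <- c("SOTU 2010", "SOTU 2011")

    # Default title colors
    comparison.cloud(term.matrix, max.words=40, random.order=FALSE)

    # Explicit title and title background colors
    comparison.cloud(term.matrix, max.words=40, random.order=FALSE,
        title.colors=c("red","blue"), title.bg.colors=c("grey40","grey70"))

    # Title colors matched to the word colors
    comparison.cloud(term.matrix, max.words=40, random.order=FALSE,
        match.colors=TRUE)
}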