poliblog {lda} | R Documentation |
A collection of political blogs with ratings.
Description
A collection of 773 political blogs in LDA format with conservative/liberal ratings.
Usage
data(poliblog.documents)
data(poliblog.vocab)
data(poliblog.ratings)
Format
poliblog.documents
and poliblog.vocab
comprise a corpus of 773 political blogs conforming to the LDA format.
poliblog.ratings
is a numeric vector of length 773 which gives
a rating of liberal (-100) or conservative (100) to each document in
the corpus.
Source
Blei, David M. and McAuliffe, John. Supervised topic models. Advances in Neural Information Processing Systems, 2008.
See Also
lda.collapsed.gibbs.sampler
for the format of the
corpus.
Examples
data(poliblog.documents)
data(poliblog.vocab)
data(poliblog.ratings)
[Package lda version 1.5.2 Index]