make.count.table {textreg}    R Documentation

Count number of times documents have a given phrase.

Description

Given a list of phrases, count how many documents each phrase appears in, and break the counts down by positively and negatively labeled documents.

Usage

make.count.table(phrases, labeling, corpus)

Arguments

phrases

List of strings

labeling

Vector of +1/0/-1 labels

corpus

A corpus object from tm package

Details

This method counts documents, not occurrences: a phrase that appears several times in one document is counted once for that document. Phrases can include wildcards and stemming notation; see grab.fragments.

Value

A data frame of statistics. per.pos is the percent of the documents containing the phrase that are positively labeled. per.tag is the percent of the positively labeled documents that contain the phrase.
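
The two percentages are easy to confuse, so here is a minimal base R sketch of how they differ. It does not call textreg. The vectors has.phrase and labeling are hypothetical toy data, and the 0-100 scale is an assumption, since the scale of the returned columns is not stated here.

```r
# Toy data (hypothetical): one entry per document.
has.phrase <- c(TRUE, TRUE, FALSE, TRUE, FALSE, FALSE)  # document contains the phrase?
labeling   <- c(1, -1, 1, 1, -1, 1)                     # +1 / -1 labels

# Documents containing the phrase (each document counted once)
n.docs <- sum(has.phrase)
# Of those, the positively labeled ones
n.pos <- sum(has.phrase & labeling == 1)

# Percent of documents with the phrase that are positively labeled
per.pos <- 100 * n.pos / n.docs
# Percent of positively labeled documents that have the phrase
per.tag <- 100 * n.pos / sum(labeling == 1)

print(c(per.pos = per.pos, per.tag = per.tag))
```

Both values share the same numerator (positively labeled documents containing the phrase) and differ only in the denominator: documents with the phrase for per.pos, positively labeled documents for per.tag.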

See Also

grab.fragments

Other textregCounting: make.phrase.matrix, phrase.count

Examples

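library( tm )
library( textreg )
data( bathtub )
# Document labels are stored in the corpus metadata
lbl = meta( bathtub )$meth.chl
# The last two phrases use the stemming/wildcard notation described in grab.fragments
make.count.table( c("bathtub","strip+", "vapor *"), lbl, bathtub )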

[Package textreg version 0.1.5 Index]