Quantitative Analysis of Textual Data

Documentation for package ‘quanteda’ version 4.0.2

Help Pages

-- A --

as.character.corpus	Coercion and checking methods for corpus objects
as.character.tokens	Coercion, checking, and combining functions for tokens objects
as.corpus	Coercion and checking methods for corpus objects
as.data.frame.kwic	Locate keywords-in-context
as.dfm	Coercion and checking functions for dfm objects
as.dictionary	Coercion and checking functions for dictionary objects
as.dictionary.data.frame	Coercion and checking functions for dictionary objects
as.fcm	Coercion and checking functions for fcm objects
as.list.tokens	Coercion, checking, and combining functions for tokens objects
as.matrix.dfm	Coerce a dfm to a matrix or data.frame
as.phrase	Declare a pattern to be a sequence of separate patterns
as.tokens	Coercion, checking, and combining functions for tokens objects
as.tokens.spacyr_parsed	Coercion, checking, and combining functions for tokens objects
as.tokens_xptr	Methods for tokens_xptr objects
as.tokens_xptr.tokens	Methods for tokens_xptr objects
as.tokens_xptr.tokens_xptr	Methods for tokens_xptr objects
as.yaml	Convert quanteda dictionary objects to the YAML format

-- B --

bootstrap_dfm

Bootstrap a dfm

-- C --

char_keep	Select or remove elements from a character vector
char_ngrams	Create n-grams and skip-grams from tokens
char_remove	Select or remove elements from a character vector
char_segment	Segment texts on a pattern match
char_select	Select or remove elements from a character vector
char_tolower	Convert the case of character objects
char_toupper	Convert the case of character objects
char_trim	Remove sentences based on their token lengths or a pattern match
char_wordstem	Stem the terms in an object
concat	Return the concatenator character from an object
concatenator	Return the concatenator character from an object
convert	Convert quanteda objects to non-quanteda formats
convert.corpus	Convert quanteda objects to non-quanteda formats
convert.dfm	Convert quanteda objects to non-quanteda formats
corpus	Construct a corpus object
corpus.character	Construct a corpus object
corpus.Corpus	Construct a corpus object
corpus.corpus	Construct a corpus object
corpus.data.frame	Construct a corpus object
corpus.kwic	Construct a corpus object
corpus_group	Combine documents in corpus by a grouping variable
corpus_reshape	Recast the document units of a corpus
corpus_sample	Randomly sample documents from a corpus
corpus_segment	Segment texts on a pattern match
corpus_subset	Extract a subset of a corpus
corpus_trim	Remove sentences based on their token lengths or a pattern match

-- D --

data_char_sampletext	A paragraph of text for testing various text-based functions
data_char_ukimmig2010	Immigration-related sections of 2010 UK party manifestos
data_corpus_inaugural	US presidential inaugural address texts
data_dfm_lbgexample	dfm from data in Table 1 of Laver, Benoit, and Garry (2003)
data_dictionary_LSD2015	Lexicoder Sentiment Dictionary (2015)
dfm	Create a document-feature matrix
dfm_compress	Recombine a dfm or fcm by combining identical dimension elements
dfm_group	Combine documents in a dfm by a grouping variable
dfm_keep	Select features from a dfm or fcm
dfm_lookup	Apply a dictionary to a dfm
dfm_match	Match the feature set of a dfm to given feature names
dfm_remove	Select features from a dfm or fcm
dfm_replace	Replace features in dfm
dfm_sample	Randomly sample documents from a dfm
dfm_select	Select features from a dfm or fcm
dfm_smooth	Weight the feature frequencies in a dfm
dfm_sort	Sort a dfm by frequency of one or more margins
dfm_subset	Extract a subset of a dfm
dfm_tfidf	Weight a dfm by _tf-idf_
dfm_tolower	Convert the case of the features of a dfm and combine
dfm_toupper	Convert the case of the features of a dfm and combine
dfm_trim	Trim a dfm using frequency threshold-based feature selection
dfm_weight	Weight the feature frequencies in a dfm
dfm_wordstem	Stem the terms in an object
dictionary	Create a dictionary
docfreq	Compute the (weighted) document frequency of a feature
docid	Get or set document names
docnames	Get or set document names
docnames<-	Get or set document names
docvars	Get or set document-level variables
docvars<-	Get or set document-level variables

-- F --

fcm	Create a feature co-occurrence matrix
fcm_compress	Recombine a dfm or fcm by combining identical dimension elements
fcm_keep	Select features from a dfm or fcm
fcm_remove	Select features from a dfm or fcm
fcm_select	Select features from a dfm or fcm
fcm_sort	Sort an fcm in alphabetical order of the features
fcm_tolower	Convert the case of the features of a dfm and combine
fcm_toupper	Convert the case of the features of a dfm and combine
featfreq	Compute the frequencies of features
featnames	Get the feature labels from a dfm

-- I --

index	Locate a pattern in a tokens object
is.collocations	Check if an object is collocations
is.corpus	Coercion and checking methods for corpus objects
is.dfm	Coercion and checking functions for dfm objects
is.dictionary	Coercion and checking functions for dictionary objects
is.fcm	Create a feature co-occurrence matrix
is.index	Locate a pattern in a tokens object
is.kwic	Locate keywords-in-context
is.phrase	Declare a pattern to be a sequence of separate patterns
is.tokens	Coercion, checking, and combining functions for tokens objects
is.tokens_xptr	Methods for tokens_xptr objects

-- K --

kwic	Locate keywords-in-context

-- M --

meta	Get or set object metadata
meta<-	Get or set object metadata

-- N --

ndoc	Count the number of documents or features
nfeat	Count the number of documents or features
nsentence	Count the number of sentences
ntoken	Count the number of tokens or types
ntype	Count the number of tokens or types

-- P --

phrase	Declare a pattern to be a sequence of separate patterns
print-method	Print methods for quanteda core objects
print-methods	Print methods for quanteda core objects
print.corpus	Print methods for quanteda core objects
print.dfm	Print methods for quanteda core objects
print.dictionary	Print methods for quanteda core objects
print.kwic	Print methods for quanteda core objects
print.tokens	Print methods for quanteda core objects

-- Q --

quanteda_options

Get or set package options for quanteda

-- S --

segid	Get or set document names
spacyr-methods	Extensions for and from spacy_parse objects
sparsity	Compute the sparsity of a document-feature matrix

-- T --

textmodels	Models for scaling and classification of textual data
textplots	Plots for textual data
textstats	Statistics for textual data
tokens	Construct a tokens object
tokens_chunk	Segment tokens object by chunks of a given size
tokens_compound	Convert token sequences into compound tokens
tokens_group	Combine documents in a tokens object by a grouping variable
tokens_keep	Select or remove tokens from a tokens object
tokens_lookup	Apply a dictionary to a tokens object
tokens_ngrams	Create n-grams and skip-grams from tokens
tokens_remove	Select or remove tokens from a tokens object
tokens_replace	Replace tokens in a tokens object
tokens_sample	Randomly sample documents from a tokens object
tokens_select	Select or remove tokens from a tokens object
tokens_skipgrams	Create n-grams and skip-grams from tokens
tokens_split	Split tokens by a separator pattern
tokens_subset	Extract a subset of a tokens
tokens_tolower	Convert the case of tokens
tokens_toupper	Convert the case of tokens
tokens_wordstem	Stem the terms in an object
tokens_xptr	Methods for tokens_xptr objects
topfeatures	Identify the most frequent features in a dfm
types	Get word types from a tokens object

-- misc --

$.corpus	Get or set document-level variables
$.dfm	Get or set document-level variables
$.tokens	Get or set document-level variables
$<-.corpus	Get or set document-level variables
$<-.dfm	Get or set document-level variables
$<-.tokens	Get or set document-level variables