Tools for Statistical Content Analysis

Documentation for package ‘tosca’ version 0.3-2

Help Pages

as.corpus.textmeta	Transform textmeta to corpus
as.meta	"meta" Component of "textmeta"-Objects
as.textmeta.corpus	Transform corpus to textmeta
cleanTexts	Data Preprocessing
clusterTopics	Cluster Analysis
deleteAndRenameDuplicates	Deletes and Renames Articles with the same ID
duplist	Creating List of Duplicates
filterCount	Subcorpus With Count Filter
filterCount.default	Subcorpus With Count Filter
filterCount.textmeta	Subcorpus With Count Filter
filterDate	Subcorpus With Date Filter
filterDate.default	Subcorpus With Date Filter
filterDate.textmeta	Subcorpus With Date Filter
filterID	Subcorpus With ID Filter
filterID.default	Subcorpus With ID Filter
filterID.textmeta	Subcorpus With ID Filter
filterWord	Subcorpus With Word Filter
filterWord.default	Subcorpus With Word Filter
filterWord.textmeta	Subcorpus With Word Filter
importance	Top Words per Topic
intruderTopics	Function to validate the fit of the LDA model
intruderWords	Function to validate the fit of the LDA model
is.duplist	Creating List of Duplicates
is.textmeta	"textmeta"-Objects
is.textmeta_tidy	Transform textmeta to an object with tidy text data
LDAgen	Function to fit LDA model
LDAprep	Create Lda-ready Dataset
makeWordlist	Counts Words in Text Corpora
mergeLDA	Preparation of Different LDAs For Clustering
mergeTextmeta	Merge Textmeta Objects
plot.textmeta	"textmeta"-Objects
plotArea	Plotting topics over time as stacked areas below plotted lines.
plotFreq	Plotting Counts of specified Wordgroups over Time (relative to Corpus)
plotHeat	Plotting Topics over Time relative to Corpus
plotScot	Plots Counts of Documents or Words over Time (relative to Corpus)
plotTopic	Plotting Counts of Topics over Time (Relative to Corpus)
plotTopicWord	Plotting Counts of Topics-Words-Combination over Time (Relative to Words)
plotWordpt	Plots Counts of Topics-Words-Combination over Time (Relative to Topics)
plotWordSub	Plotting Counts/Proportion of Words/Docs in LDA-generated Topic-Subcorpora over Time
precision	Precision and Recall
print.duplist	Creating List of Duplicates
print.textmeta	"textmeta"-Objects
print.textmeta_tidy	Transform textmeta to an object with tidy text data
readTextmeta	Read Corpora as CSV
readTextmeta.df	Read Corpora as CSV
readWhatsApp	Read WhatsApp files
readWiki	Read Pages from Wikipedia
readWikinews	Read files from Wikinews
recall	Precision and Recall
removeHTML	Removes XML/HTML Tags and Umlauts
removeUmlauts	Removes XML/HTML Tags and Umlauts
removeXML	Removes XML/HTML Tags and Umlauts
sampling	Sample Texts
showMeta	Export Readable Meta-Data of Articles.
showTexts	Exports Readable Text Lists
summary.duplist	Creating List of Duplicates
summary.textmeta	"textmeta"-Objects
textmeta	"textmeta"-Objects
tidy.textmeta	Transform textmeta to an object with tidy text data
topicCoherence	Calculating Topic Coherence
topicsInText	Coloring the words of a text corresponding to topic allocation
topTexts	Get The IDs Of The Most Representive Texts
topWords	Top Words per Topic
vprecision	Precision and Recall
vrecall	Precision and Recall