Tools for Statistical Content Analysis


[Up] [Top]

Documentation for package ‘tosca’ version 0.3-2

Help Pages

as.corpus.textmeta Transform textmeta to corpus
as.meta "meta" Component of "textmeta"-Objects
as.textmeta.corpus Transform corpus to textmeta
cleanTexts Data Preprocessing
clusterTopics Cluster Analysis
deleteAndRenameDuplicates Deletes and Renames Articles with the same ID
duplist Creating List of Duplicates
filterCount Subcorpus With Count Filter
filterCount.default Subcorpus With Count Filter
filterCount.textmeta Subcorpus With Count Filter
filterDate Subcorpus With Date Filter
filterDate.default Subcorpus With Date Filter
filterDate.textmeta Subcorpus With Date Filter
filterID Subcorpus With ID Filter
filterID.default Subcorpus With ID Filter
filterID.textmeta Subcorpus With ID Filter
filterWord Subcorpus With Word Filter
filterWord.default Subcorpus With Word Filter
filterWord.textmeta Subcorpus With Word Filter
importance Top Words per Topic
intruderTopics Function to validate the fit of the LDA model
intruderWords Function to validate the fit of the LDA model
is.duplist Creating List of Duplicates
is.textmeta "textmeta"-Objects
is.textmeta_tidy Transform textmeta to an object with tidy text data
LDAgen Function to fit LDA model
LDAprep Create Lda-ready Dataset
makeWordlist Counts Words in Text Corpora
mergeLDA Preparation of Different LDAs For Clustering
mergeTextmeta Merge Textmeta Objects
plot.textmeta "textmeta"-Objects
plotArea Plotting topics over time as stacked areas below plotted lines.
plotFreq Plotting Counts of specified Wordgroups over Time (relative to Corpus)
plotHeat Plotting Topics over Time relative to Corpus
plotScot Plots Counts of Documents or Words over Time (relative to Corpus)
plotTopic Plotting Counts of Topics over Time (Relative to Corpus)
plotTopicWord Plotting Counts of Topics-Words-Combination over Time (Relative to Words)
plotWordpt Plots Counts of Topics-Words-Combination over Time (Relative to Topics)
plotWordSub Plotting Counts/Proportion of Words/Docs in LDA-generated Topic-Subcorpora over Time
precision Precision and Recall
print.duplist Creating List of Duplicates
print.textmeta "textmeta"-Objects
print.textmeta_tidy Transform textmeta to an object with tidy text data
readTextmeta Read Corpora as CSV
readTextmeta.df Read Corpora as CSV
readWhatsApp Read WhatsApp files
readWiki Read Pages from Wikipedia
readWikinews Read files from Wikinews
recall Precision and Recall
removeHTML Removes XML/HTML Tags and Umlauts
removeUmlauts Removes XML/HTML Tags and Umlauts
removeXML Removes XML/HTML Tags and Umlauts
sampling Sample Texts
showMeta Export Readable Meta-Data of Articles.
showTexts Exports Readable Text Lists
summary.duplist Creating List of Duplicates
summary.textmeta "textmeta"-Objects
textmeta "textmeta"-Objects
tidy.textmeta Transform textmeta to an object with tidy text data
topicCoherence Calculating Topic Coherence
topicsInText Coloring the words of a text corresponding to topic allocation
topTexts Get The IDs Of The Most Representive Texts
topWords Top Words per Topic
vprecision Precision and Recall
vrecall Precision and Recall