Stylometric Multivariate Analyses


[Up] [Top]

Documentation for package ‘stylo’ version 0.7.5

Help Pages

assign.plot.colors Assign colors to samples
change.encoding Change character encoding
check.encoding Check character encoding in corpus folder
classify Machine-learning supervised classification
crossv Function to Perform Cross-Validation
define.plot.area Define area for scatterplots
delete.markup Delete HTML or XML tags
delete.stop.words Exclude stop words (e.g. pronouns, particles, etc.) from a dataset
dist.argamon Delta Distance
dist.cosine Cosine Distance
dist.delta Delta Distance
dist.eder Delta Distance
dist.entropy Entropy Distance
dist.minmax Min-Max Distance (aka Ruzicka Distance)
dist.simple Cosine Distance
dist.wurzburg Cosine Delta Distance (aka Wurzburg Distance)
galbraith Table of word frequencies (Galbraith, Rowling, Coben, Tolkien, Lewis)
gui.classify GUI for the function classify
gui.oppose GUI for the function oppose
gui.stylo GUI for stylo
imposters Authorship Verification Classifier Known as the Imposters Method
imposters.optimize Tuning Parameters for the Imposters Method
lee Table of word frequencies (Lee, Capote, Faulkner, Styron, etc.)
load.corpus Load text files
load.corpus.and.parse Load text files and perform pre-processing
make.frequency.list Make List of the Most Frequent Elements (e.g. Words)
make.ngrams Make text n-grams
make.samples Split text to samples
make.table.of.frequencies Prepare a table of (relative) word frequencies
novels A selection of 19th-century English novels
oppose Contrastive analysis of texts
parse.corpus Perform pre-processing (tokenization, n-gram extracting, etc.)
parse.pos.tags Extract POS-tags or Words from Annotated Corpora
perform.culling Exclude variables (e.g. words, n-grams) from a frequency table that are too characteristic for some samples
perform.delta Distance-based classifier
perform.impostors An Authorship Verification Classifier Known as the Impostors Method. ATTENTION: this function is obsolete; refer to a new implementation, aka the imposters() function!
perform.knn k-Nearest Neighbor classifier
perform.naivebayes Naive Bayes classifier
perform.nsc Nearest Shrunken Centroids classifier
perform.svm Support Vector Machines classifier
performance.measures Accuracy, Precision, Recall, and the F Measure
plot.sample.size Plot Classification Accuracy for Short Text Samples
rolling.classify Sequential machine-learning classification
rolling.delta Sequential stylometric analysis
samplesize.penalize Determining Minimal Sample Size for Text Classification
stylo Stylometric multidimensional analyses
stylo.default.settings Setting variables for the package stylo
stylo.network Bootstrap consensus networks, with D3 visualization
stylo.package Stylometric multidimensional analyses
stylo.pronouns List of pronouns
txt.to.features Split string of words or other countable features
txt.to.words Split text into words
txt.to.words.ext Split text into words: extended version
zeta.chisquare Compare two subcorpora using a home-brew variant of Craig's Zeta
zeta.craig Compare two subcorpora using Craig's Zeta
zeta.eder Compare two subcorpora using Eder's Zeta