lma_meta {lingmatch} | R Documentation
Calculate Text-Based Metastatistics
Description
Calculate simple descriptive statistics from text.
Usage
lma_meta(text)
Arguments
text: A character vector of texts.
Value
A data.frame with the following columns:
- characters: Total number of characters.
- syllables: Total number of syllables, as estimated by split length of 'a+[eu]*|e+a*|i+|o+[ui]*|u+|y+[aeiou]*' - 1.
- words: Total number of words (raw word count).
- unique_words: Number of unique words (binary word count).
- clauses: Number of clauses, as marked by commas, colons, semicolons, dashes, or brackets within sentences.
- sentences: Number of sentences, as marked by periods, question marks, exclamation points, or new line characters.
- words_per_clause: Average number of words per clause.
- words_per_sentence: Average number of words per sentence.
- sixltr: Number of words 6 or more characters long.
- characters_per_word: Average number of characters per word (characters / words).
- syllables_per_word: Average number of syllables per word (syllables / words).
- type_token_ratio: Ratio of unique to total words: unique_words / words.
- reading_grade: Flesch-Kincaid grade level: 0.39 * words / sentences + 11.8 * syllables / words - 15.59.
- numbers: Number of terms starting with numbers.
- punct: Number of terms starting with non-alphanumeric characters.
- periods: Number of periods.
- commas: Number of commas.
- qmarks: Number of question marks.
- exclams: Number of exclamation points.
- quotes: Number of quotation marks (single and double).
- apostrophes: Number of apostrophes, defined as any modifier letter apostrophe, or any backtick or single straight or curly quote surrounded by letters.
- brackets: Number of bracketing characters (including parentheses, and square, curly, and angle brackets).
- orgmarks: Number of characters used for organization or structuring (including dashes, forward slashes, colons, and semicolons).
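The syllables and reading_grade definitions above can be sketched in base R. This is an illustration only, not the package's code: estimate_syllables and reading_grade are hypothetical helper names, the words are assumed to be already tokenized and lowercased, and lma_meta may tokenize and handle edge cases differently, so its values need not match.

```r
# Hypothetical helpers for illustration; these are not lingmatch functions.

# Vowel-group pattern quoted in the description of the syllables column.
vowel_groups <- "a+[eu]*|e+a*|i+|o+[ui]*|u+|y+[aeiou]*"

# Estimate syllables per word as the number of pieces left after splitting
# on vowel groups, minus 1. strsplit() does not emit a trailing empty piece,
# so a word-final vowel group (as in "here") adds nothing to the count.
estimate_syllables <- function(words) {
  lengths(strsplit(tolower(words), vowel_groups)) - 1
}

# Flesch-Kincaid grade level from raw counts, as written in the
# reading_grade entry.
reading_grade <- function(words, sentences, syllables) {
  0.39 * words / sentences + 11.8 * syllables / words - 15.59
}

# Assumed tokens for the one-sentence text "Object located thither."
tokens <- c("object", "located", "thither")
per_word <- estimate_syllables(tokens)
syllables <- sum(per_word)
grade <- reading_grade(
  words = length(tokens), sentences = 1, syllables = syllables
)
```

Because the estimate depends only on vowel groups, a word made up of a single vowel group (such as "a") splits into one empty piece and comes out as 0 under this sketch; a fuller version would need to decide how to treat that case.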
Examples
# named character vector: each name labels one example text
text <- c(
  succinct = "It is here.",
  verbose = "Hear me now. I shall tell you about it. It is here. Do you hear?",
  couched = "I might be wrong, but it seems to me that it might be here.",
  bigwords = "Object located thither.",
  excited = "It's there! It's there! It's there!",
  drippy = "It's 'there', right? Not 'here'? 'there'? Are you Sure?",
  struggly = "It's here -- in that place where it is. Like... the 1st place (here)."
)
lma_meta(text)