format_check {fossilbrush}R Documentation

format_check

Description

Function to perform a series of basic formatting checks geared towards taxonomic name data. The function very simply checks for non letter characters in the taxonomic names, that species-level names contain two words, and genus-level and above names contain one word.

Usage

format_check(x, ranks, species = FALSE, species_sep = " ", verbose = TRUE)

Arguments

x

A dataframe with hierarchically organised, taxonomic information. If x only comprises the taxonomic information,

ranks

does not need to be specified, but the columns must be in order of decreasing taxonomic rank @param ranks The column names of the taxonomic data fields in x. These must be provided in order of decreasing taxonomic rank

species

A logical indicating if x contains a species column. As the data must be supplied in hierarchical order, this column will naturally be the last column in x and species-specific spell checks will be performed on this column.

species_sep

A character vector of length one specifying the genus name and specific epithet in the species column, if present

verbose

A logical determining if any flagged errors should be reported to the console

Value

A list of two lists. The first list flags the row indexes of columns whose elements contains non-letter characters. The second list flags the row indexes of columns whose elements do not contain the correct numbers of words

Examples

# load dataset
data("brachios")
# define ranks
b_ranks <- c("phylum", "class", "order", "family", "genus")
# run function
flag <- format_check(brachios, ranks = b_ranks)

[Package fossilbrush version 1.0.3 Index]