auk_split {auk} | R Documentation |
Split an eBird data file by species
Description
Given an eBird Basic Dataset (EBD) and a list of species, split the file into
multiple text files, one for each species. This function is typically used
after auk_filter()
has been applied if the resulting file is too large to
be read in all at once.
Usage
auk_split(
file,
species,
prefix,
taxonomy_version,
sep = "\t",
overwrite = FALSE
)
Arguments
file |
character; input file. |
species |
character; species to filter and split by, provided as scientific or English common names, or a mixture of both. These names must match the official eBird Taxomony (ebird_taxonomy). |
prefix |
character; a file and directory prefix. For example, if
splitting by species "A" and "B" and |
taxonomy_version |
integer; the version (i.e. year) of the taxonomy. In
most cases, this should be left empty to use the version of the taxonomy
included in the package. See |
sep |
character; the input field separator, the eBird file is tab separated by default. Must only be a single character and space delimited is not allowed since spaces appear in many of the fields. |
overwrite |
logical; overwrite output files if they already exists. |
Details
The list of species is checked against the eBird taxonomy for
validity. This taxonomy is updated once a year in August. The auk
package
includes a copy of the eBird taxonomy, current at the time of release;
however, if the EBD and auk
versions are not aligned, you may need to
explicitly specify which version of the taxonomy to use, in which case
the eBird API will be queried to get the correct version of the taxonomy.
Value
A vector of output filenames, one for each species.
See Also
Other text:
auk_clean()
,
auk_select()
Examples
## Not run:
species <- c("Canada Jay", "Cyanocitta stelleri")
# get the path to the example data included in the package
# in practice, provide path to a filtered ebd file
# e.g. f <- "data/ebd_filtered.txt
f <- system.file("extdata/ebd-sample.txt", package = "auk")
# output to a temporary directory for example
# in practice, provide the path to the output location
# e.g. prefix <- "output/ebd_"
prefix <- file.path(tempdir(), "ebd_")
species_files <- auk_split(f, species = species, prefix = prefix)
## End(Not run)