R: Find unique (taxon) occurrence records

uniqify {divvy}

R Documentation

Find unique (taxon) occurrence records

Description

Subset a dataset to unique spatial localities or locality-taxon combinations.

Usage

uniqify(dat, xy, taxVar = NULL, na.rm = TRUE)

Arguments

`dat`	A `data.frame` or `matrix` containing taxon names, coordinates, and any associated variables; or a list of such structures.
`xy`	A vector of two elements, specifying the name or numeric position of columns in `dat` containing coordinates, e.g. longitude and latitude. Coordinates for any shared sampling sites should be identical, and where sites are raster cells, coordinates are usually expected to be cell centroids.
`taxVar`	The name or numeric position of the column containing taxonomic identifications. `taxVar` must be of same class as `xy`, e.g. a numeric column position if `xy` is given as a vector of numeric positions.
`na.rm`	Should records missing information be removed? Default is yes.

Details

The na.rm argument applies to coordinate values and, if taxVar is supplied, to taxon values. If na.rm = FALSE, any NA values will be retained and treated as their own value. Note that divvy ignores any rows with missing coordinates for the subsampling functions cookies(), clustr(), and bandit().

Value

An object with the same class and columns as dat, containing the subset of rows representing unique coordinates (if only xy supplied) or unique taxon-site combinations (if taxVar is also supplied). The first record at each spatial locality is retained, or if taxVar is specified, the first record of each taxon at a locality.

Examples

# generate occurrence data
x  <- rep(1, 10)
y  <- c(rep(1, 5), 2:6)
sp <- c(rep(letters[1:3], 2),
        rep(letters[4:5], 2))
obs <- data.frame(x, y, sp)

# compare original and unique datasets:
# rows 4 and 5 removed as duplicates of rows 1 and 2, respectively
obs
uniqify(obs, taxVar = 3, xy = 1:2)

# using taxon identifications or other third variable is optional
uniqify(obs, xy = c('x', 'y'))

# caution - data outside the taxon and occurrence variables
# will be lost where associated with duplicate occurrences
obs$notes <- letters[11:20]
uniqify(obs, 1:2, 3)
# the notes 'n' and 'o' are absent in the output data

[Package divvy version 1.0.0 Index]