nest_distinct {nplyr} | R Documentation |
Subset distinct/unique rows within a nested data frame
Description
nest_distinct()
selects only unique/distinct rows in a nested data frame.
Usage
nest_distinct(.data, .nest_data, ..., .keep_all = FALSE)
Arguments
.data |
A data frame, data frame extension (e.g., a tibble), or a lazy data frame (e.g., from dbplyr or dtplyr). |
.nest_data |
A list-column containing data frames |
... |
Optional variables to use when determining uniqueness. If there are multiple rows for a given combination of inputs, only the first row will be preserved. If omitted, will use all variables. |
.keep_all |
If |
Details
nest_distinct()
is largely a wrapper for dplyr::distinct()
and maintains
the functionality of distinct()
within each nested data frame. For more
information on distinct()
, please refer to the documentation in
dplyr
.
Value
An object of the same type as .data
. Each object in the column .nest_data
will also be of the same type as the input. Each object in .nest_data
has
the following properties:
Rows are a subset of the input but appear in the same order.
Columns are not modified if
...
is empty or.keep_all
isTRUE
. Otherwise,nest_distinct()
first callsdplyr::mutate()
to create new columns within each object in.nest_data
.Groups are not modified.
Data frame attributes are preserved.
Examples
gm_nest <- gapminder::gapminder %>% tidyr::nest(country_data = -continent)
gm_nest %>% nest_distinct(country_data, country)
gm_nest %>% nest_distinct(country_data, country, year)