| nest_distinct {nplyr} | R Documentation |
Subset distinct/unique rows within a nested data frame
Description
nest_distinct() selects only unique/distinct rows in a nested data frame.
Usage
nest_distinct(.data, .nest_data, ..., .keep_all = FALSE)
Arguments
.data |
A data frame, data frame extension (e.g., a tibble), or a lazy data frame (e.g., from dbplyr or dtplyr). |
.nest_data |
A list-column containing data frames |
... |
Optional variables to use when determining uniqueness. If there are multiple rows for a given combination of inputs, only the first row will be preserved. If omitted, will use all variables. |
.keep_all |
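If TRUE, keep all variables in .nest_data. If a combination of ... is not distinct, this keeps the first row of values. |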
Details
nest_distinct() is largely a wrapper for dplyr::distinct() and maintains
the functionality of distinct() within each nested data frame. For more
information on distinct(), please refer to the documentation in
dplyr.
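As a rough illustration of the wrapper relationship described above, the sketch below applies dplyr::distinct() by hand to each element of a list-column using purrr::map(). It is an assumed equivalent for ordinary data frames, not the actual implementation of nest_distinct(); the toy data and the names df, df_nest, and manual are made up for this example.

```r
library(dplyr)
library(tidyr)
library(purrr)

# Toy data (hypothetical): two groups with repeated id values
df <- tibble(
  grp = c("a", "a", "a", "b", "b"),
  id  = c(1, 1, 2, 3, 3),
  val = c(10, 11, 20, 30, 31)
)

# Nest everything except grp into a list-column named data
df_nest <- df %>% nest(data = -grp)

# Apply distinct() to every nested data frame by hand
manual <- df_nest %>%
  mutate(data = map(data, ~ distinct(.x, id)))

# One row per id in group "a"; only the id column is kept
manual$data[[1]]
```

Under these assumptions, nest_distinct(df_nest, data, id) would be expected to give a comparable result in a single call, without writing the map() step.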
Value
An object of the same type as .data. Each object in the column .nest_data
will also be of the same type as the input. Each object in .nest_data has
the following properties:
Rows are a subset of the input but appear in the same order.
Columns are not modified if ... is empty or .keep_all is TRUE. Otherwise, nest_distinct() first calls dplyr::mutate() to create new columns within each object in .nest_data.
Groups are not modified.
Data frame attributes are preserved.
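Because the Details state that the behavior of distinct() is maintained within each nested data frame, the row and column rules can be seen on a single data frame standing in for one element of .nest_data. This is a minimal sketch that calls dplyr::distinct() directly; the data frame one and its columns are hypothetical.

```r
library(dplyr)

# Hypothetical stand-in for one nested data frame
one <- tibble(id = c(1, 1, 2), val = c(10, 11, 20))

# No variables supplied: all columns are used and kept, so all 3 rows remain
all_cols <- distinct(one)

# Only id supplied: the result keeps just the id column, one row per id
id_only <- distinct(one, id)

# .keep_all = TRUE: every column is kept and the first row per id is retained
id_keep <- distinct(one, id, .keep_all = TRUE)
```

In each case the surviving rows appear in their original order, which matches the first property listed above.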
Examples
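library(dplyr)  # provides the %>% pipe
library(nplyr)
gm_nest <- gapminder::gapminder %>% tidyr::nest(country_data = -continent)  # one nested data frame per continent
gm_nest %>% nest_distinct(country_data, country)  # one row per country within each nested data frame
gm_nest %>% nest_distinct(country_data, country, year)  # one row per country-year combination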