data_dict_group_by {madshapR} | R Documentation |
Group listed data dictionaries by specified column names
Description
Groups the data dictionary element(s) by the groups defined by the query.
This function groups both the 'Variables' and 'Categories' elements (if
the group exists under the same definition in both). This function is
analogous to running dplyr::group_by(). Each element is named using the
group values. data_dict_ungroup() reverses the effect.
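As a rough illustration of the dplyr::group_by() analogy, the sketch below groups a toy 'Variables' data frame by a 'table' column. The toy data and its values are assumptions made for illustration only, and the sketch does not reproduce the internals of data_dict_group_by().

```r
library(dplyr)

# Toy 'Variables' element; the rows and the 'table' values are made up.
variables <- data.frame(
  table = c("dataset_TOKYO", "dataset_TOKYO", "dataset_MELBOURNE"),
  name  = c("id", "age", "id"),
  stringsAsFactors = FALSE
)

# Group by 'table'; the rows are unchanged, only grouping metadata is added.
grouped <- group_by(variables, table)

group_vars(grouped)   # name of the grouping column
n_groups(grouped)     # number of distinct groups
ungroup(grouped)      # drops the grouping again
```

Grouping in this sense only attaches metadata, which is why an ungroup step can reverse it without loss.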
Usage
data_dict_group_by(data_dict, col)
Arguments
data_dict |
A list of data frame(s) representing metadata to be transformed. |
col |
Name of the column to group by. |
Details
A data dictionary contains the list of variables in a dataset and metadata
about the variables and can be associated with a dataset. A data dictionary
object is a list of data frame(s) named 'Variables' (required) and
'Categories' (if any). To be usable in any function, the data frame
'Variables' must contain at least the name column, with all unique and
non-missing entries, and the data frame 'Categories' must contain at least
the variable and name columns, with unique combinations of variable and name.
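The structural requirements above can be sketched as a small check in base R. The helper check_data_dict_structure() and the toy data dictionary are hypothetical and are not part of madshapR; the package's own validation may differ.

```r
# Hypothetical helper (not part of madshapR): checks the minimal structure
# described above for a list holding 'Variables' and, optionally, 'Categories'.
check_data_dict_structure <- function(data_dict) {
  vars <- data_dict[["Variables"]]
  # 'Variables' is required and must have a 'name' column
  if (!is.data.frame(vars) || !"name" %in% names(vars)) return(FALSE)
  # 'name' must be non-missing and unique
  if (anyNA(vars$name) || anyDuplicated(vars$name) > 0) return(FALSE)

  cats <- data_dict[["Categories"]]
  if (!is.null(cats)) {
    # 'Categories' must have both 'variable' and 'name' columns
    if (!is.data.frame(cats) ||
        !all(c("variable", "name") %in% names(cats))) return(FALSE)
    # each (variable, name) pair may appear only once
    if (anyDuplicated(cats[, c("variable", "name")]) > 0) return(FALSE)
  }
  TRUE
}

# Toy data dictionary with made-up content
toy_data_dict <- list(
  Variables  = data.frame(name = c("id", "sex")),
  Categories = data.frame(variable = c("sex", "sex"), name = c("1", "2"))
)
check_data_dict_structure(toy_data_dict)
```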
Value
A list of data frame(s) identifying a workable data dictionary structure.
See Also
dplyr::group_by(), data_dict_ungroup()
Examples
{
# Use madshapR_DEMO provided by the package.
library(madshapR)

# Create a list of data dictionaries where the column 'table' is added to
# refer to the associated dataset. The object created is not a
# data dictionary per se, but can be used as a structure which can be
# shaped into a data dictionary.
data_dict_list <- list(
  dataset_TOKYO     = madshapR_DEMO$data_dict_TOKYO,
  dataset_MELBOURNE = madshapR_DEMO$data_dict_MELBOURNE)

# Nest the list into one structure, recording each source in 'table'.
data_dict_nest <- data_dict_list_nest(data_dict_list, name_group = 'table')

# Group the nested structure by the 'table' column.
data_dict_group_by(data_dict_nest, col = "table")
}