summarizePedigrees {BGmisc} | R Documentation |
Summarize Pedigree Data
Description
This function summarizes pedigree data, including calculating summary statistics for all numeric variables, and finding the originating member for each family, maternal, and paternal line.
Usage
summarizePedigrees(
ped,
famID = "famID",
personID = "ID",
momID = "momID",
dadID = "dadID",
matID = "matID",
patID = "patID",
type = c("fathers", "mothers", "families"),
byr = NULL,
include_founder = FALSE,
founder_sort_var = NULL,
nbiggest = 5,
noldest = 5,
skip_var = NULL,
five_num_summary = FALSE,
verbose = FALSE
)
Arguments
ped |
a pedigree dataset. Needs ID, momID, and dadID columns |
famID |
character. Name of the column to be created in ped for the family ID variable |
personID |
character. Name of the column in ped for the person ID variable |
momID |
character. Name of the column in ped for the mother ID variable |
dadID |
character. Name of the column in ped for the father ID variable |
matID |
Character. Maternal line ID variable to be created and added to the pedigree |
patID |
Character. Paternal line ID variable to be created and added to the pedigree |
type |
The type of summary statistics to calculate. Options are "fathers", "mothers", and "families". |
byr |
Optional column name for birth year. |
include_founder |
Logical, if TRUE, include the founder of each line in the summary statistics. |
founder_sort_var |
The variable to sort the founders by. If NULL, the founders will be sorted by birth year (‘byr') if that’s present and by 'personID' otherwise. |
nbiggest |
The number of biggest lines to return. |
noldest |
The number of oldest lines to return. |
skip_var |
A character vector of variables to skip when calculating summary statistics. |
five_num_summary |
Logical, if TRUE, include the 5-number summary (min, Q1, median, Q3, max) in the summary statistics. |
verbose |
Logical, if TRUE, print progress messages. |
Value
A data.frame (or list) containing summary statistics for family, maternal, and paternal lines, as well as the 5 oldest and biggest lines.