randomNames {randomNames} | R Documentation |
Random Names Function
Description
Function to generate random gender and ethnicity correct first and/or last names. Names are chosen proportionally based upon their probability of appearing in a large scale database of real names. Function to generate random gender and ethnicity correct first and/or/last names. The function probabilistically samples from the embedded randomNamesData dataset to provide a realistic list of first and/or last names.
Usage
randomNames(n,
gender,
ethnicity,
which.names="both",
name.order="last.first",
name.sep=", ",
sample.with.replacement=TRUE,
return.complete.data=FALSE)
Arguments
n |
OPTIONAL. Positive integer indicating how many name to produce. Best to use when no gender or ethnicity data is provided and one simply wants |
gender |
OPTIONAL. A vector indicating the genders for the names to be calculated. The maximum of |
ethnicity |
OPTIONAL. A vector indicating the ethnicities for the names to be calculated. The maximum of
|
which.names |
OPTIONAL. One of |
name.order |
OPTIONAL. If |
name.sep |
OPTIONAL. If |
sample.with.replacement |
Boolean argument (defaults to TRUE) indicating whether sampling is done with replacement. |
return.complete.data |
Boolean argument (defaults to FALSE) indicating whether to return data including gender and ethnicity codes used for name construct. If set to TRUE, data is returned as a data.frame/data.table. |
Details
Typical use of the function is to submit a vector of genders and ethnicities to derived a gender and ethnicity representative vector of first and/or last names.
Value
Function returns a character vector containing first and/or last names.
Author(s)
Damian W. Betebenner dbetebenner@nciea.org
See Also
Examples
randomNames() ## Returns a single name in "last, first" format
randomNames(5, which.names="first") ## Returns 5 first names
randomNames(5, return.complete.data=TRUE) ## Returns entire data.table
test.df <- data.frame(GENDER=sample(0:1, 100, replace=TRUE),
ETHNICITY=sample(1:6, 100, replace=TRUE))
test.names <- randomNames(gender=test.df$GENDER,
ethnicity=test.df$ETHNICITY)
head(test.names)
ethnicities <- c("African American", "Hispanic", "Asian", "White", "Native American")
genders <- c("Female", "Male")
test.df <- data.frame(GENDER=sample(genders, 100, replace=TRUE),
ETHNICITY=sample(ethnicities, 100, replace=TRUE))
test.names <- randomNames(gender=test.df$GENDER,
ethnicity=test.df$ETHNICITY)
head(test.names)