first_names_race {predictrace} | R Documentation |
Surnames and number of people of each race with that first name
Description
A dataset containing over 167 thousands surnames and the number of people of each race with that surname. Citation for this data: Tzioumis, Konstantinos (2018) Demographic aspects of first names, Scientific Data, 5:180025 [dx.doi.org/10.1038/sdata.2018.25].
Usage
first_names_race
Format
A data frame with 4,251 rows and 8 variables:
- name
Surname
- likely_race
The most likely race based on the probability of each race
- probability_american_indian
Probability that the surname is American Indian
- probability_asian
Probability that the surname is Asian
- probability_black
Probability that the surname is Black
- probability_hispanic
Probability that the surname is Hispanic
- probability_white
Probability that the surname is White
- probability_2races
Probability that the surname is two or more races
...
Source
https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/TYJKEZ
[Package predictrace version 2.0.1 Index]