map_gender {genderBR} | R Documentation |
Map the use of Brazilian first names by gender and by state
Description
map_gender
retrieves data on the number of male or female uses of a given first name
by state from the Instituto Brasileiro de Geografia e Estatistica's 2010 Census API.
Usage
map_gender(name, gender = NULL, encoding = "ASCII//TRANSLIT")
Arguments
name |
A string with a Brazilian first name. The name can also be passed to the function
as a full name (e.g., Ana Maria de Souza). |
gender |
A string with the gender to look for. Valid inputs are |
encoding |
Encoding used to read Brazilian names and stip accents.
Defaults to |
Details
Information on the gender associated with Brazilian first names was collect in the 2010 Census (Censo Demografico de 2010, in Portuguese), in July of that year, by the Instituto Brasileiro de Demografia e Estatistica (IBGE). The surveyed population includes 190,8 million Brazilians living in all 27 states. According to the IBGE, there are more than 130,000 unique first names in this population.
Value
get_gender
returns a tbl_df, tbl, data.frame
with the following variables:
-
nome
State's name. -
uf
State's abbreviation. -
freq
Total number of persons with the name provided. -
populacao
State's total population. -
sexo
Same as thesexo
argument provided. -
prop
Persons with the name and gender provided per 100,000 inhabitants.
Note
Names with different spell (e.g., Ana and Anna, or Marcos and Markos) are considered different names. Additionally, only names with more than 20 occurrences, or more than 15 occurrences in a given state, are considered.
References
For more information on the IBGE's data, please check (in Portuguese): https://censo2010.ibge.gov.br/nomes/
See Also
Examples
## Not run:
# Map the use of the name 'Maria'
map_gender('maria')
# The function accepts full names
map_gender('Maria da Silva Santos')
# Or names in uppercase
map_gender('MARIA DA SILVA SANTOS')
# Select desired gender
map_gender('AUGUSTO ROBERTO', gender = 'm')
map_gender('John da Silva', gender = 'm')
## End(Not run)