| baseball_players1000 {npsm} | R Documentation |
Career Information for a Random Sample of 1000 Baseball Players
Description
Demographics and position information on 1000 randomly selected baseball players who debuted after 1945.
Usage
data("baseball_players1000")
Format
A data frame with 1000 observations on the following 28 variables.
playerID a character vector
birthYear a numeric vector
birthMonth a numeric vector
birthDay a numeric vector
birthCountry a character vector
birthState a character vector
nameFirst a character vector
nameLast a character vector
weight a numeric vector
height a numeric vector
bats a character vector
throws a character vector
debutYear a numeric vector
G_all a numeric vector
G_p a numeric vector
G_c a numeric vector
G_1b a numeric vector
G_2b a numeric vector
G_3b a numeric vector
G_ss a numeric vector
G_lf a numeric vector
G_cf a numeric vector
G_rf a numeric vector
G_of a numeric vector
G_dh a numeric vector
G_ph a numeric vector
G_pr a numeric vector
pitcher a logical vector
Details
A random subset of baseball players who debuted after 1945 and played in at least 160 games. Includes information on birth (date and location); height (inches) and weight (pounds); whether they bat left (L), right (R), or switch (B); and games played at each position. The variable pitcher is derived: it indicates whether the majority of a player's games were played as a pitcher (i.e., G_p/G_all > 0.5).
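The derived indicator can be sketched as follows. This is a minimal illustration, not the package's own code. It assumes that G_p counts games pitched and G_all counts total games, following the Chadwick Baseball Databank Appearances table (where G_pr counts pinch-running appearances). The data frame `toy` and its values are invented for illustration; with the package loaded, baseball_players1000 could be substituted.

```r
# Toy rows using the dataset's column names; the values are invented
# for illustration and are not taken from baseball_players1000.
toy <- data.frame(
  playerID = c("aaaaa01", "bbbbb01", "ccccc01"),
  G_all    = c(200, 350, 180),
  G_p      = c(190, 0, 90),
  stringsAsFactors = FALSE
)

# Flag players whose share of games pitched strictly exceeds one half.
toy$pitcher <- toy$G_p / toy$G_all > 0.5

print(toy)
```

Because the comparison is strict, the third toy player, with exactly half of their games pitched, is not flagged as a pitcher.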
Source
https://github.com/chadwickbureau/baseballdatabank
References
https://github.com/chadwickbureau/baseballdatabank/blob/master/readme2014.txt
Examples
data(baseball_players1000)
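# Density-scaled histogram of player weight
hist(baseball_players1000$weight, xlab = "Weight (lbs)",
     probability = TRUE, ylim = c(0, 0.02),
     main = "Histogram of Weight for 1000 Baseball Players")
# Overlay a kernel density estimate, dropping missing weights
lines(density(baseball_players1000$weight, na.rm = TRUE))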