baseball {SDAResources} | R Documentation |
baseball data
Description
Statistics on 797 baseball players, compiled by Jenifer Boshes from the rosters of all major league teams in November 2004. Missing values (for variables pball, intwalk, hbp, sacrfly; all other variables have complete data) are coded as NA.
Usage
data(baseball)
Format
This data frame contains the following columns:
- team:
team played for at the beginning of the season
- leagueid:
AL or NL
- player:
a unique identifier for each baseball player
- salary:
player salary in 2004
- pos:
primary position coded as P, C, 1B, 2B, 3B, SS, RF, LF, or CF
- gplay:
games played
- gstart:
games started
- inning:
number of innings
- putout:
number of putouts
- assist:
number of assists
- error:
errors
- dplay:
number of double plays
- pball:
number of passed balls (only applies to catchers)
- gbat:
number of games that player appeared at bat
- atbat:
number of at bats
- run:
number of runs scored
- hit:
number of hits
- secbase:
number of doubles
- thirdbase:
number of triples
- homerun:
number of home runs
- rbi:
number of runs batted in
- stolenb:
number of stolen bases
- csteal:
number of times caught stealing
- walk:
number of times walked
- strikeout:
number of strikeouts
- intwalk:
number of times intentionally walked
- hbp:
number of times hit by pitch
- sacrhit:
number of sacrifice hits
- sacrfly:
number of sacrifice flies
- gidplay:
grounded into double play
References
Forman, S. L. (2004). Baseball-reference.com—Major league statistics and information. www.baseball-reference.com (accessed November 2004).
Lohr (2021), Sampling: Design and Analysis, 3rd Edition. Boca Raton, FL: CRC Press.
Lu and Lohr (2021), R Companion for Sampling: Design and Analysis, 3rd Edition, 1st Edition. Boca Raton, FL: CRC Press.