baseball {SDAResources}R Documentation

baseball data

Description

Statistics on 797 baseball players, compiled by Jenifer Boshes from the rosters of all major league teams in November 2004. Missing values (for variables pball, intwalk, hbp, sacrfly; all other variables have complete data) are coded as NA.

Usage

data(baseball)

Format

This data frame contains the following columns:

team:

team played for at the beginning of the season

leagueid:

AL or NL

player:

a unique identifier for each baseball player

salary:

player salary in 2004

pos:

primary position coded as P, C, 1B, 2B, 3B, SS, RF, LF, or CF

gplay:

games played

gstart:

games started

inning:

number of innings

putout:

number of putouts

assist:

number of assists

error:

errors

dplay:

number of double plays

pball:

number of passed balls (only applies to catchers)

gbat:

number of games that player appeared at bat

atbat:

number of at bats

run:

number of runs scored

hit:

number of hits

secbase:

number of doubles

thirdbase:

number of triples

homerun:

number of home runs

rbi:

number of runs batted in

stolenb:

number of stolen bases

csteal:

number of times caught stealing

walk:

number of times walked

strikeout:

number of strikeouts

intwalk:

number of times intentionally walked

hbp:

number of times hit by pitch

sacrhit:

number of sacrifice hits

sacrfly:

number of sacrifice flies

gidplay:

grounded into double play

References

Forman, S. L. (2004). Baseball-reference.com—Major league statistics and information. www.baseball-reference.com (accessed November 2004).

Lohr (2021), Sampling: Design and Analysis, 3rd Edition. Boca Raton, FL: CRC Press.

Lu and Lohr (2021), R Companion for Sampling: Design and Analysis, 3rd Edition, 1st Edition. Boca Raton, FL: CRC Press.


[Package SDAResources version 0.1.1 Index]