| mcdonald {bestglm} | R Documentation |
Pollution dataset from McDonald and Schwing (1973)
Description
Regression data used to illustrate ridge regression
Usage
data("mcdonald")
Format
A data frame with 60 observations on the following 16 variables.
PRECAverage annual precipitation in inches
JANTAverage January temperature in degrees F
JULTSame for July
OVR65Percent of 1960 SMSA population aged 65 or older
POPNAverage household size
EDUCMedian school years completed by those over 22
HOUSPercent of housing units which are sound & with all facilities
DENSPopulation per sq. mile in urbanized areas, 1960
NONWPercent non-white population in urbanized areas, 1960
WWDRKPercent employed in white collar occupations
POORPercent of families with income < $3000
HCRelative hydrocarbon pollution potential
NOXSame for nitric oxides
SOxSame for sulphur dioxide
HUMIDAnnual average percent relative humidity at 1pm
MORTTotal age-adjusted mortality rate per 100,000
Details
Ridge regression example
Source
Gary C. McDonald and Richard C. Schwing (1973), Instabilities of Regression Estimates Relating Air Pollution to Mortality, Technometrics 15/3, 463-481.
Examples
data(mcdonald)
vifx(mcdonald[, -ncol(mcdonald)])