age_data {conTree}R Documentation

Age and Demographics data

Description

The data come from 9243 questionnaires filled out by shopping mall customers in the San Francisco Bay Area (Impact Resources, Inc., Columbus, OH). Here we attempt to estimate a persons age as a function of the other 13 demographic variables. For this data set age value is reported as being in one of seven intervals ⁠{13-17, 18-24, 25-34, 35-44, 45-54, 55-64, >= 65}⁠. Each persons age is randomly generated uniformly within its corresponding reported interval. For the last interval an exponential distribution was used with mean corresponding to life expectancy after reaching age 65.

Usage

age_data

Format

age_data

A list of 3 items.

xage

data frame of 8856 observations on 13 variables

yage

Randomly generated age in the range above

gbage

gradient boosting model for median age given x

Source

The Elements of Statistical Learning, Data Mining, Second Edition, by Hastie, Tibshirani, and Friedman.


[Package conTree version 0.3-1 Index]