PIMA {regclass} | R Documentation |
Pima Diabetes dataset
Description
Diabetes among women aged 21+ with Pima heritage
Usage
data("PIMA")
Format
A data frame with 392 observations on the following 8 variables.
Pregnant
a numeric vector, number of times the woman has been pregnant
Glucose
a numeric vector, plasma glucose concentration
BloodPressure
a numeric vector, diastolic blood pressure in mm Hg
BodyFat
a numeric vector, a measurement of the triceps skinfold thickness which is an indicator of body fat percentage
Insulin
a numeric vector, 2-hour serum insulin
BMI
a numeric vector, body mass index
Age
a numeric vector, years
Diabetes
a factor with levels
No
Yes
Details
Data on 768 women belonging to the Pima tribe. The purpose is to study the associations between having diabetes and various physiological characteristics. Although there are surely other factors (including genetic) that influence the chance of having diabetes, the hope is that by having women who are genetically similar (all from the Pima tribe), that these other factors are naturally accounted for.
Source
Adapted from the UCI data repository https://archive.ics.uci.edu/ml/datasets/Pima+Indians+Diabetes. A variable measuring the “diabetes pedigree function" has been omitted.