Anscombe {sur} | R Documentation |
Anscombe's Four Datasets
Description
This dataset is used to illustrate the importance of statistical display as an adjunct to summary statistics. Anscombe (1973) fabricated four different bivariate datasets such that, for all datasets, the respective X and Y means, X and Y standard deviations, and correlations, slopes, intercepts, and standard errors of estimate are equal. Accordingly, without a visual representation of these four panels, one might assume that the data values for all four datasets are the same. Scatterplots illustrate, however, the extent to which these datasets are different from one another.
Usage
Anscombe
Format
A data frame with 11 rows and 8 variables:
- x1
values of X for the first dataset
- y1
values of Y for the first dataset
- x2
values of X for the second dataset
- y2
values of Y for the second dataset
- x3
values of X for the third dataset
- y3
values of Y for the third dataset
- x4
values of X for the fourth dataset
- y4
values of Y for the fourth dataset