crime {ggbiplot} | R Documentation |
U. S. Crimes
Description
This dataset gives rates of occurrence (per 100,000 people) various serious crimes in each of the 50 U. S. states, originally from the United States Statistical Abstracts (1970). The data were analyzed by John Hartigan (1975) in his book Clustering Algorithms and were later reanalyzed by Friendly (1991).
Usage
data(crime)
Format
A data frame with 50 observations on the following 10 variables.
state
state name, a character vector
murder
a numeric vector
rape
a numeric vector
robbery
a numeric vector
assault
a numeric vector
burglary
a numeric vector
larceny
a numeric vector
auto
auto thefts, a numeric vector
st
state abbreviation, a character vector
region
region of the U.S., a factor with levels
Northeast
South
North Central
West
Source
The data are originally from the United States Statistical Abstracts (1970). This dataset also appears in the SAS/Stat Sample library, Getting Started Example for PROC PRINCOMP, https://support.sas.com/documentation/onlinedoc/stat/ex_code/131/princgs.html, from which the current copy was derived.
References
Friendly, M. (1991). SAS System for Statistical Graphics. SAS Institute.
Hartigan, J. A. (1975). Clustering Algorithms. John Wiley and Sons.
Examples
data(crime)
library(ggplot2)
crime.pca <-
crime |>
dplyr::select(where(is.numeric)) |>
prcomp(scale. = TRUE)
ggbiplot(crime.pca,
labels = crime$st ,
circle = TRUE,
varname.size = 4,
varname.color = "red") +
theme_minimal(base_size = 14)