nba_clusts {clustree} | R Documentation |
Clustered NBA positions dataset
Description
NBA positions dataset clustered using k-means with a range of values of k
Usage
nba_clusts
Format
nba_clusts
is a data.frame containing the NBA positions dataset
with additional columns holding k-means clusterings at different values of
k
and the first two principal components
-
Position - Player position
-
TurnoverPct - Turnover percentage
-
ReboundPct - Rebound percentage
-
AssistPct - Assist percentage
-
FieldGoalPct - Field goal percentage
-
K1 - K5 - Results of k-means clustering
-
PC1 - First principal component
-
PC2 - Second principal component
Source
NBA positions downloaded from https://github.com/lazappi/nba_positions.
The source dataset is available from Kaggle at https://www.kaggle.com/drgilermo/nba-players-stats/data?select=Seasons_Stats.csv and was originally scraped from Basketball Reference.
See https://github.com/lazappi/clustree/blob/master/data-raw/nba_clusts.R for details of how clustering was performed.