diving {ohenery}R Documentation

Olympic Diving Data

Description

One hundred years of Men's Olympic Platform Diving records.

Usage

data(diving)

Format

A data.frame object with 695 observations and 13 columns.

The columns are defined as follows:

Name

The participant's name.

Age

The age of the participant at the time of the Olympics. Some values missing.

Height

The height of the participant at the time of the Olympics, in centimeters. Many values missing.

Weight

The height of the participant at the time of the Olympics, in kilograms. Many values missing.

Team

The string name of the team (country) which the participant represented.

NOC

The string name of the National Olympic Committee which the participant represented. This is a three character code.

Games

The string name of the Olympic games, including a year.

Year

The integer year of the Olympics. These range from 1906 through 2016.

City

The string name of the host city.

Medal

A string of “Gold”, “Silver”, “Bronze” or NA.

EventId

A unique integer ID for each Olympics.

AthleteId

A unique integer ID for each participant.

HOST_NOC

The string name of the National Olympic Committee of the nation hosting the Olympics. This is a three character code.

Note

The author makes no guarantees regarding correctness of this data.

Please attribute this data to the upstream harvester.

Author(s)

Steven E. Pav shabbychef@gmail.com

Source

Data were collected by Randi Griffin from the website “sports-reference.com”, and staged on Kaggle at https://www.kaggle.com/heesoo37/120-years-of-olympic-history-athletes-and-results.

Examples


library(dplyr)
library(forcats)
data(diving)

fitdat <- diving %>%
  mutate(Finish=case_when(grepl('Gold',Medal)   ~ 1,
                          grepl('Silver',Medal) ~ 2,
                          grepl('Bronze',Medal) ~ 3,
                          TRUE ~ 4)) %>%
  mutate(weight=ifelse(Finish <= 3,1,0)) %>%
  mutate(cut_age=cut(coalesce(Age,22.0),c(12,19.5,21.5,22.5,25.5,99),include.lowest=TRUE)) %>%
  mutate(country=forcats::fct_relevel(forcats::fct_lump(factor(NOC),n=5),'Other')) %>%
  mutate(home_advantage=NOC==HOST_NOC)

hensm(Finish ~ cut_age + country + home_advantage,data=fitdat,weights=weight,group=EventId,ngamma=3)


[Package ohenery version 0.1.1 Index]