titanic {mlr3data} | R Documentation |
Titanic
Description
Classification data to predict the fate of passengers on the ocean liner "Titanic".
Contains 10 features and 1309 observations. Target column is "Survived"
.
Pre-processing
All column names have been changed to
snake_case
.training and test set have been joined. Observations of the test set have a missing value in the target column
"survived"
.Column '"survived"' has been re-encoded to a factor with levels '"yes"' and '"no"'.
Id column has been removed.
Passenger class
"pclass"
has been converted to an ordered factor.Features
"sex"
and"embarked"
have been converted to factors.Empty strings in
"cabin"
and"embarked"
have been encoded as missing values.
Source
titanic and https://www.kaggle.com/c/titanic/data
Examples
data("titanic", package = "mlr3data")
str(titanic)
[Package mlr3data version 0.7.0 Index]