ols_plot_cooksd_chart {olsrr}R Documentation

Cooks' D chart

Description

Chart of cook's distance to detect observations that strongly influence fitted values of the model.

Usage

ols_plot_cooksd_chart(model, type = 1, threshold = NULL, print_plot = TRUE)

Arguments

model

An object of class lm.

type

An integer between 1 and 5 selecting one of the 6 methods for computing the threshold.

threshold

Threshold for detecting outliers.

print_plot

logical; if TRUE, prints the plot else returns a plot object.

Details

Cook's distance was introduced by American statistician R Dennis Cook in 1977. It is used to identify influential data points. It depends on both the residual and leverage i.e it takes it account both the x value and y value of the observation.

Steps to compute Cook's distance:

A data point having a large cook's d indicates that the data point strongly influences the fitted values. There are several methods/formulas to compute the threshold used for detecting or classifying observations as outliers and we list them below.

where n and k stand for

Value

ols_plot_cooksd_chart returns a list containing the following components:

outliers

a data.frame with observation number and cooks distance that exceed threshold

threshold

threshold for classifying an observation as an outlier

See Also

ols_plot_cooksd_bar()

Examples

model <- lm(mpg ~ disp + hp + wt, data = mtcars)
ols_plot_cooksd_chart(model)
ols_plot_cooksd_chart(model, type = 4)
ols_plot_cooksd_chart(model, threshold = 0.2)


[Package olsrr version 0.6.0 Index]