histbin {insurancerating}R Documentation

Create a histogram with outlier bins

Description

Visualize the distribution of a single continuous variable by dividing the x axis into bins and counting the number of observations in each bin. Data points that are considered outliers can be binned together. This might be helpful to display numerical data over a very wide range of values in a compact way.

Usage

histbin(
  data,
  x,
  left = NULL,
  right = NULL,
  line = FALSE,
  bins = 30,
  fill = NULL,
  color = NULL,
  fill_outliers = "#a7d1a7"
)

Arguments

data

data.frame

x

variable name in data.frame data that should be mapped

left

numeric indicating the floor of the range

right

numeric indicating the ceiling of the range

line

show density line (default is FALSE)

bins

numeric to indicate number of bins

fill

color used to fill bars

color

color for bar lines

fill_outliers

color used to fill outlier bars

Details

Wrapper function around ggplot2::geom_histogram(). The method is based on suggestions from https://edwinth.github.io/blog/outlier-bin/.

Value

a ggplot2 object

Author(s)

Martin Haringa

Examples

histbin(MTPL2, premium)
histbin(MTPL2, premium, left = 30, right = 120, bins = 30)


[Package insurancerating version 0.7.4 Index]