R: Plot Bayesian Information Criterion as a Function of Number...

plotBIC {Ckmeans.1d.dp}

R Documentation

Plot Bayesian Information Criterion as a Function of Number of Clusters

Description

Plot Bayesian information criterion (BIC) as a function of the number of clusters obtained from optimal univariate clustering results returned from Ckmeans.1d.dp. The BIC normalized by sample size (BIC/n) is shown.

Usage

plotBIC(
  ck, xlab="Number of clusters k",
  ylab = "BIC/n", type="b",
  sub=paste("n =", length(ck$cluster)),
  main=paste("Bayesian information criterion",
             "(normalized by sample size)", sep="\n"),
  ...
)

Arguments

`ck`	an object of class `Ckmeans.1d.dp` returned by `Ckmeans.1d.dp`.
`xlab`	a character string. The x-axis label for the plot.
`ylab`	a character string. The x-axis label for the plot.
`type`	the type of plot to be drawn. See `plot`.
`main`	a character string. The title for the plot.
`sub`	a character string. The subtitle for the plot.
`...`	arguments passed to `plot` function in package graphics.

Details

The function visualizes the input data as sticks whose heights are the weights. It uses different colors to indicate optimal k-means clusters. The method to calcualte BIC based on Gaussian mixture models estimated on a univariate clustering is described in (Song and Zhong 2020).

Value

An object of class "Ckmeans.1d.dp" defined in Ckmeans.1d.dp.

Author(s)

Joe Song

References

Song M, Zhong H (2020). “Efficient weighted univariate clustering maps outstanding dysregulated genomic zones in human cancers.” Bioinformatics, 36(20), 5027–5036. doi:10.1093/bioinformatics/btaa613.

Examples

# Example: clustering data generated from a Gaussian mixture
#          model of two components
x <- rnorm(50, mean=-1, sd=0.3)
x <- append(x, rnorm(50, mean=1, sd=0.3) )
res <- Ckmeans.1d.dp(x)
plotBIC(res)

y <- (rnorm(length(x)))^2
res <- Ckmeans.1d.dp(x, y=y)
plotBIC(res)

[Package Ckmeans.1d.dp version 4.3.5 Index]