deleteNearZeroCoefficientOfVariation.mb {Coxmos}R Documentation

deleteNearZeroCoefficientOfVariation.mb

Description

Filters out variables from a dataset that exhibit a coefficient of variation below a specified threshold, ensuring the retention of variables with meaningful variability.

Usage

deleteNearZeroCoefficientOfVariation.mb(X, LIMIT = 0.1)

Arguments

X

List of numeric matrices or data.frames. Explanatory variables. Qualitative variables must be transform into binary variables.

LIMIT

Numeric. Cutoff for minimum variation. If coefficient is lesser than the limit, the variables are removed because not vary enough (default: 0.1).

Details

The deleteNearZeroCoefficientOfVariation function is a pivotal tool in data preprocessing, especially when dealing with high-dimensional datasets. The coefficient of variation (CoV) is a normalized measure of data dispersion, calculated as the ratio of the standard deviation to the mean. In many scientific investigations, variables with a low CoV might be considered as offering limited discriminative information, potentially leading to noise in subsequent statistical analyses. By setting a threshold through the LIMIT parameter, this function provides a systematic approach to identify and exclude variables that do not meet the desired variability criteria. The underlying rationale is that variables with a CoV below the set threshold might not contribute significantly to the variability of the dataset and could be redundant or even detrimental for certain analyses. The function returns a modified dataset, a list of deleted variables, and the computed coefficients of variation for each variable. This comprehensive output ensures that researchers are well-informed about the preprocessing steps and can make subsequent analytical decisions with confidence.

Value

A list of three objects. X: A list with as many blocks as X input, but with the variables filtered. variablesDeleted: A list with as many blocks as X input, with the name of the variables that have been removed. coeff_variation: A list with as many blocks as X input, with the coefficient of variation per variable.

Author(s)

Pedro Salguero Garcia. Maintainer: pedsalga@upv.edu.es

Examples

data("X_multiomic")
X <- X_multiomic
filter <- deleteNearZeroCoefficientOfVariation.mb(X, LIMIT = 0.1)


[Package Coxmos version 1.0.2 Index]