R: Best combination of normalization and imputation method

best_combination {lfproQC}

R Documentation

Best combination of normalization and imputation method

Description

This function identifies the best combinations of normalization and imputation methods for a user-supplied dataset, based on three intragroup variation measures: PCV, PEV and PMAD.

Usage

best_combination(data_input, groups)

Arguments

`data_input`	Label-free proteomics expression data as a data frame
`groups`	Group information about the input data

Details

Label-free LC-MS proteomics expression data are often affected by heterogeneity and missing values. Normalization and missing value imputation are commonly used to address these issues and make the dataset suitable for downstream analysis. This function identifies the best combination of normalization and imputation methods for the dataset, choosing from three normalization methods (vsn, loess, and rlr) and three imputation methods (knn, lls, and svd). The selection is based on three intragroup variation measures: pooled coefficient of variation (PCV), pooled estimate of variance (PEV) and pooled median absolute deviation (PMAD). The function returns the best combination according to each of PCV, PEV, and PMAD.

In addition, the user can retrieve all three normalized datasets, the nine normalized and imputed datasets, and the PCV, PEV, and PMAD result values.
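The intragroup variation measures named above can be illustrated with a minimal base R sketch. It assumes that each measure is a per-protein statistic (coefficient of variation, variance, or median absolute deviation) computed across the samples of one group and then averaged over proteins; the exact formulas used inside the package may differ. The toy data and the helper `intragroup` are hypothetical and are not part of lfproQC.

```r
# Toy log-intensity matrix: 4 proteins (rows) x 6 samples (columns).
# All names and values here are illustrative, not taken from lfproQC.
set.seed(1)
expr <- matrix(rnorm(24, mean = 20, sd = 1), nrow = 4,
               dimnames = list(paste0("P", 1:4), paste0("S", 1:6)))
grp <- factor(c("A", "A", "A", "B", "B", "B"))

# Hypothetical helper (an assumption, not the package's code):
# apply a per-protein statistic within each group, then average
# over proteins to obtain one value per group.
intragroup <- function(x, groups, stat) {
  sapply(levels(groups), function(g) {
    sub <- x[, groups == g, drop = FALSE]
    mean(apply(sub, 1, stat), na.rm = TRUE)
  })
}

cv_stat  <- function(v) sd(v, na.rm = TRUE) / mean(v, na.rm = TRUE)
var_stat <- function(v) var(v, na.rm = TRUE)
mad_stat <- function(v) mad(v, na.rm = TRUE)

pcv  <- intragroup(expr, grp, cv_stat)
pev  <- intragroup(expr, grp, var_stat)
pmad <- intragroup(expr, grp, mad_stat)

# One row per measure, one column per group.
rbind(PCV = pcv, PEV = pev, PMAD = pmad)
```

Under this reading, smaller values mean that replicates within a group agree more closely after normalization and imputation.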

Value

This function returns a list consisting of the following elements.

'Best Combinations' The best combinations based on each of PCV, PEV and PMAD for the given dataset.

'PCV Result' Values of groupwise PCV, overall PCV, PCV mean, PCV median and PCV standard deviation for all combinations.

'PEV Result' Values of groupwise PEV, overall PEV, PEV mean, PEV median and PEV standard deviation for all combinations.

'PMAD Result' Values of groupwise PMAD, overall PMAD, PMAD mean, PMAD median and PMAD standard deviation for all combinations.

'vsn_data' The dataset normalized by the 'vsn' method.

'loess_data' The dataset normalized by the 'loess' method.

'rlr_data' The dataset normalized by the 'rlr' method.

'knn_vsn_data' The dataset normalized by the 'vsn' method, with missing values imputed by the 'knn' method.

'knn_loess_data' The dataset normalized by the 'loess' method, with missing values imputed by the 'knn' method.

'knn_rlr_data' The dataset normalized by the 'rlr' method, with missing values imputed by the 'knn' method.

'lls_vsn_data' The dataset normalized by the 'vsn' method, with missing values imputed by the 'lls' method.

'lls_loess_data' The dataset normalized by the 'loess' method, with missing values imputed by the 'lls' method.

'lls_rlr_data' The dataset normalized by the 'rlr' method, with missing values imputed by the 'lls' method.

'svd_vsn_data' The dataset normalized by the 'vsn' method, with missing values imputed by the 'svd' method.

'svd_loess_data' The dataset normalized by the 'loess' method, with missing values imputed by the 'svd' method.

'svd_rlr_data' The dataset normalized by the 'rlr' method, with missing values imputed by the 'svd' method.
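The nine combined element names listed above follow the pattern imputation method, normalization method, then "data". As a small sketch in base R, they can be generated rather than typed out; the variable names `norm_methods`, `impute_methods`, `combos` and `combo_names` are illustrative only.

```r
norm_methods   <- c("vsn", "loess", "rlr")
impute_methods <- c("knn", "lls", "svd")

# Enumerate every normalization/imputation pair.
# expand.grid varies its first argument fastest, so the order
# matches the list above (knn_vsn, knn_loess, knn_rlr, lls_vsn, ...).
combos <- expand.grid(norm = norm_methods, impute = impute_methods,
                      stringsAsFactors = FALSE)

# Build "<imputation>_<normalization>_data" for each pair.
combo_names <- paste(combos$impute, combos$norm, "data", sep = "_")
combo_names
```

Given a result list from best_combination, `result[combo_names]` could then be used to pull out all nine imputed datasets at once, assuming the element names match those documented here.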

Author(s)

Dr Sudhir Srivastava ("Sudhir.Srivastava@icar.gov.in")

Kabilan S ("kabilan151414@gmail.com")

Examples


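library(lfproQC)

# Evaluate all normalization/imputation combinations on the yeast data
result <- best_combination(yeast_data, yeast_groups)
result$`Best combinations`   # best combination per evaluation measure
result$`PCV Result`          # PCV values for all combinations
result$`PMAD Result`         # PMAD values for all combinations
result$`knn_rlr_data`        # 'rlr' normalized, 'knn' imputed dataset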

[Package lfproQC version 0.1.0 Index]