R: Calculates the p-values

pauc {Biocomb}

R Documentation

Calculates the p-values

Description

This auxiliary function calculates the p-value of the statistical significance test of the difference of samples from two classes using AUC values (for each input feature). It takes as an input the results of the AUC value calculation using function compute.aucs. It can be reasonably used only for two-class problem. The results is in the form of “numeric vector” with p-values for each features.

Usage

pauc(auc,n=100,n.plus=0.5,labels=numeric(),pos=numeric())

Arguments

`auc`	a numeric vector of AUC values.
`n`	the whole number of observations for the test.
`n.plus`	the number of cases in the sample with the positive class.
`labels`	the factor with the class labels.
`pos`	the numeric vector with the level of the positive class.

Details

This auxiliary function's main job is to calculate the p-values of the statistical significance test of two samples, defined by negative and positive class labels, i.e. two-class problem. See the “Value” section to this page for more details.

Value

A returned data consists is the following:

pauc

a numeric vector with the p-values for each feature

References

David J. Hand and Robert J. Till (2001). A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems. Machine Learning 45(2), p. 171–186.

Examples

# example
data(data_test)

# class label must be factor
data_test[,ncol(data_test)]<-as.factor(data_test[,ncol(data_test)])

auc.val=compute.aucs(dattable=data_test)
vauc<-auc.val[,"AUC"]
val=levels(data_test[,ncol(data_test)])

if(length(val)==2)
{
	 pos=auc.val[,"Positive class"]
	 paucv<-pauc(auc=vauc,labels=data_test[,ncol(data_test)],pos=pos)
}else{
	 num.size=100
	 num.prop=0.5
	 paucv<-pauc(auc=vauc,n=num.size,n.plus=num.prop)
}

[Package Biocomb version 0.4 Index]