POS {propOverlap} | R Documentation |
Calculating the proportional Overlapping Scores.
Description
POS
computes the proportional overlapping scores of the given genes (features). This score measures the overlap degree between gene expression values across various classes. It produces a value lies in the interval [0,1]. A lower score denotes gene with higher discriminative power for the considered classification problem.
Usage
POS(ES, Core, Y)
Arguments
ES |
gene (feature) matrix: P, number of genes, by N, number of samples(observations). |
Core |
a |
Y |
a vector of length N for samples' class label. |
Details
For each gene, POS
computes a measure that estimates the overlapping degree between the expression intervals of different classes. For estimating the overlap, POS measure takes into account three factors: the length of the overlapping region; number of the overlapped samples (observations); the proportion of each class's overlapped samples to the total number of overlapping samples.
Value
It returns a vector of length P for ‘POS’ measures of all genes (features).
Author(s)
Osama Mahmoud ofamah@essex.ac.uk
References
Mahmoud O., Harrison A., Perperoglou A., Gul A., Khan Z., Metodiev M. and Lausen B. (2014) A feature selection method for classification within functional genomics experiments based on the proportional overlapping score. BMC Bioinformatics, 2014, 15:274.
See Also
CI.emprical
for the core interval boundaries and GMask
for the gene masks.
Examples
data(leukaemia)
Score <- POS(leukaemia[1:7129,], CI.emprical(leukaemia[1:7129,],
leukaemia[7130,]), leukaemia[7130,])
Score[1:5] #show the proportional overlapping scores for the first 5 features
summary(Score) #show the the summary of the scores of all features.