tpacForCancer {TPAC}R Documentation

Executes the TPAC method on cancer gene expression data

Description

Executes the TPAC (tissue-adjusted pathway analysis for cancer) method (tpacForCollection) on cancer gene expression data using normal tissue expression data from the Human Protein Atlas (HPA) that is included in the package as hpa.data. This HPA normal tissue data was specially processed by the HPA group as FPKM values using a pipeline similar to that employed by GDC for the TCGA data. For consistency with this HPA normal tissue data, the provided cancer.gene.expr data must be specified as FPKM+1 values. Please see the vignette for an example of calling this function using appropriately normalized TCGA gene expression data.

Usage

    tpacForCancer(cancer.gene.expr, cancer.type, gene.set.collection, 
                  min.set.size=1, max.set.size)

Arguments

cancer.gene.expr

An n x p matrix of gene expression values for n tumors of the specified tumor type and p genes. The data should be normalized as FPKM+1 values, row names should be sample ID, and column names should be Ensembl gene IDs.

cancer.type

Cancer type of the expression data. Must be one of the supported cancer types as per getSupportedCancerTypes.

gene.set.collection

List of m gene sets for which scores are computed. Each element in the list corresponds to a gene set and the list element is a vector of Ensembl IDs for genes in the set. Gene set names should be specified as list names.

min.set.size

See description of min.size in createGeneSetCollection

max.set.size

See description of max.size in createGeneSetCollection

Value

A list containing two elements:

See Also

tpac, hpa.data, tpacForCollection, getSupportedCancerTypes, createGeneSetCollection

Examples

    # Simulate Gaussian expression data for 10 genes and 10 samples
    # (Note: cancer expression should be FPKM+1 for real applications)
    cancer.gene.expr=matrix(rnorm(200), nrow=20)
    # Create arbitrary Ensembl IDs
    gene.ids = c("ENSG00000000003","ENSG00000000005","ENSG00000000419",
                 "ENSG00000000457","ENSG00000000460","ENSG00000000938",
                 "ENSG00000000971","ENSG00000001036","ENSG00000001084",
                 "ENSG00000001167")
    colnames(cancer.gene.expr) = gene.ids
    # Define a collection with two disjoint sets that span the 10 genes
    collection=list(set1=gene.ids[1:5], set2=gene.ids[6:10])    
    # Execute TPAC on both sets
    tpacForCancer(cancer.gene.expr=cancer.gene.expr, cancer.type="glioma", 
                  gene.set.collection=collection)

[Package TPAC version 0.2.0 Index]