HCC {movieROC}R Documentation

Hepatocellular carcinoma data

Description

This dataset is derived from gene expression arrays of tumor and adjacent non-tumor tissues of 62 Taiwanese cases of hepatocellular carcinoma. The complete dataset was deposited in NCBI’s Gene Expression Omnibus (GEO) and it is available through series accession number GSE37988. This dataset contains 948 from the 27,578 autosomal CpG sites screened.

Usage

data("HCC")

Format

A data frame with 124 observations on 952 variables. First 4 variables are tissue (identification number for the tissue; from 1 to 62), sex (female or male), age (age in years of the patient), and tumor (status of the tissue; nontumor or tumor). The following 948 from cg03409548 to cg20240860 are numeric variables containing the relative gene expression intensities of the corresponding gene.

Source

NCBI’s Gene Expression Omnibus (GEO) - Series accession number GSE37988 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE37988)

Examples

data(HCC)
str(HCC) # Data structure
table(HCC$tumor) # Number of non-tumor and tumor tissues

# Histograms of gene 18384097 expression intensities for non-tumor and tumor tissues
oldpar <- par(mfrow = c(2,1))
x <- subset(HCC, tumor == "nontumor")$cg18384097
y <- subset(HCC, tumor == "tumor")$cg18384097
hist(x, xlim = c(0,1), main = "Gene 18384097 in non-tumor tissues")
hist(y, xlim = c(0,1), main = "Gene 18384097 in tumor tissues")
par(oldpar)

[Package movieROC version 0.1.1 Index]