R: Pseudo bulk data generation function

generateBulk {InteRD}

R Documentation

Pseudo bulk data generation function

Description

This function generates a pseudo bulk samples by random sampled number of cells per subject.

Usage

generateBulk(eset,ct.varname,sample,disease = NULL,ct.sub,prop_mat = NULL,
nbulk = 50,samplewithRep = FALSE,low_s = 0.3,upp_s = 0.7)

Arguments

`eset`	The 'ExpressionSet' object for single cells.
`ct.varname`	Variable name for 'cell types'.
`sample`	Variable name for subject/samples.
`disease`	Indicate the health condition of subjects.
`ct.sub`	A subset of cell types that are selected to construct pseudo bulk samples. If NULL, then all cell types are used.
`prop_mat`	Manually input the cell-type proportion for pseudo bulk samples.
`nbulk`	The number of pseudo bulk samples to be constructed.
`samplewithRep`	Logical, randomly sample single cells with replacement. Default is F.
`low_s`	Lower support a for uniform distribution U[a,b].
`upp_s`	Upper support b for uniform distribution U[a,b].

Value

Pseudo bulk samples in the format of 'ExpressionSet', and the true cell-type proportions.

Examples

##read data
library(InteRD)
readRDSFromWeb<-function(ref) {readRDS(gzcon(url(ref)))}
urlremote<-"https://github.com/chencxxy28/Data/raw/main/data_InteRD/"
seger<-readRDSFromWeb(paste0(urlremote,"segerstolpe.rds"))

##generate a pseudo bulk data with two samples
set.seed(1234567)
pseudo.seger<-generateBulk(seger[["sc.eset.qc"]], ct.varname = "cluster",
sample = "sample", ct.sub = c("alpha","beta","delta","gamma"),
nbulk = 2, low_s = 0.3, upp_s = 0.7)

[Package InteRD version 0.1.1 Index]