bootstrap_null {crosstalkr}R Documentation

Bootstrap null distribution for RWR

Description

This function will generate a bootstrapped null distribution to identify signficant vertices in a PPI given a set of user-defined seed proteins. Bootstrapping is done by performing random walk with repeats repeatedly over "random" sets of seed proteins. Degree distribution of user-provided seeds is used to inform sampling.

Usage

bootstrap_null(
  seed_proteins,
  g,
  n = 1000,
  agg_int = 100,
  gamma = 0.6,
  eps = 1e-10,
  tmax = 1000,
  norm = TRUE,
  set_seed = NULL,
  cache = NULL,
  seed_name = NULL,
  ncores = 1
)

Arguments

seed_proteins

user defined seed proteins

g

igraph object

n

number of random walks with repeats to create null distribution

agg_int

number of runs before we need to aggregate the results - necessary to save memory. set at lower numbers to save even more memory.

gamma

restart probability

eps

maximum allowed difference between the computed probabilities at the steady state

tmax

the maximum number of iterations for the RWR

norm

if True, w is normalized by dividing each value by the column sum.

set_seed

integer to set random number seed - for reproducibility

cache

A filepath to a folder downloaded files should be stored

seed_name

Name to give the cached ngull distribution - must be a character string

ncores

Number of cores to use - defaults to 1. Significant speedup can be achieved by using multiple cores for computation.

Value

data frame containing mean/ standard deviation for null distribution

Examples


#g <- prep_biogrid()
#bootstrap_null(seed_proteins = c("EGFR", "KRAS"), g= g, ncores = 1, n = 10)


[Package crosstalkr version 1.0.5 Index]