R: Find non-overlapping segments based on a summary statistic

segments {BGData}

R Documentation

Find non-overlapping segments based on a summary statistic

Description

Given a summary statistic and a threshold, this function computes the number of non-overlapping segments, each defined as a discovery (i.e., statistic[i] <= threshold) +/- a gap, in the same units as bp (often base-pair position).

Usage

segments(statistic, chr, bp, threshold, gap, trim = FALSE, verbose = FALSE)

Arguments

`statistic`	A statistic (e.g., BFDR or p-values).
`chr`	A vector containing the chromosome for each value of `statistic`.
`bp`	A vector containing the base-pair positions for each value of `statistic`.
`threshold`	The threshold to determine 'significance' (e.g., `1e-5` for p-values).
`gap`	1/2 of the length of the desired segments.
`trim`	Whether to collapse segments that were artifically inflated by `gap`. Defaults to `FALSE`.
`verbose`	Whether progress updates will be posted. Defaults to `FALSE`.

Value

A data frame containing the following information:

`chr`	Chromosome
`start`	Index where segment starts within `statistic`.
`end`	Index where segment ends within `statistic`.
`length`	Length of segment.
`bpStart`	Base-pair position where segment starts.
`bpEnd`	Base-pair position where segment ends.
`bpLength`	Length of segment in base-pair positions.
`minValue`	Smallest value of `statistic` within segment.
`minValuePos`	Position of variant with the smallest value of `statistic` within segment.

Examples

library(BGData)

# Load example data
bg <- BGData:::loadExample()

# Perform GWAS
pValues <- GWAS(
    formula = FT10 ~ 1,
    data = bg,
    method = "rayOLS"
)

# Determine segments within +/- 1MB from a significant variant
segments <- segments(
    statistic = pValues[, 4],
    chr = map(bg)$chromosome,
    bp = map(bg)$base_pair_position,
    threshold = 1e-5,
    gap = 1e6,
    trim = FALSE,
    verbose = FALSE
)

[Package BGData version 2.4.1 Index]