calc_ehh {rehh} | R Documentation |
EHH and iHH computation for a given focal marker
Description
Compute Extended Haplotype Homozygosity (EHH) and integrated EHH (iHH) for a given focal marker.
Usage
calc_ehh(
haplohh,
mrk,
limhaplo = 2,
limhomohaplo = 2,
limehh = 0.05,
include_zero_values = FALSE,
include_nhaplo = FALSE,
phased = TRUE,
polarized = TRUE,
scalegap = NA,
maxgap = NA,
discard_integration_at_border = TRUE,
lower_y_bound = limehh,
interpolate = TRUE
)
Arguments
haplohh |
an object of class |
mrk |
integer representing the number of the focal marker within the haplohh object or string representing its ID/name. |
limhaplo |
if there are less than |
limhomohaplo |
if there are less than |
limehh |
limit at which EHH stops to be evaluated |
include_zero_values |
logical. If |
include_nhaplo |
logical. If |
phased |
logical. If |
polarized |
logical. |
scalegap |
scale or cap gaps larger than the specified size to the specified size (default= |
maxgap |
maximum allowed gap in bp between two markers. If exceeded, further calculation of EHH is stopped at the gap
(default= |
discard_integration_at_border |
logical. If |
lower_y_bound |
lower y boundary of the area to be integrated over (default: |
interpolate |
logical. Affects only IHH values. If |
Details
Values for allele-specific Extended Haplotype Homozygosity (EHH) are computed upstream and downstream of the focal marker for each of its alleles. These values are integrated with respect to their genomic positions to yield an 'integrated EHH' (iHH) value for each allele.
Value
The returned value is a list containing the following elements:
- mrk.name
The name/identifier of the focal marker.
- freq
A vector with the frequencies of the alleles of the focal marker.
- ehh
A data frame with EHH values for each allele of the focal marker.
- ihh
A vector with iHH (integrated EHH) values for each allele of the focal marker.
References
Gautier, M. and Naves, M. (2011). Footprints of selection in the ancestral admixture of a New World Creole cattle breed. Molecular Ecology, 20, 3128-3143.
Klassmann, A. and Gautier, M. (2020). Detecting selection using Extended Haplotype Homozygosity-based statistics on unphased or unpolarized data (preprint). https://doi.org/10.22541/au.160405572.29972398/v1
Sabeti, P.C. et al. (2002). Detecting recent positive selection in the human genome from haplotype structure. Nature, 419, 832-837.
Sabeti, P.C. et al. (2007). Genome-wide detection and characterization of positive selection in human populations. Nature, 449, 913-918.
Tang, K. and Thornton, K.R. and Stoneking, M. (2007). A New Approach for Using Genome Scans to Detect Recent Positive Selection in the Human Genome. Plos Biology, 7, e171.
Voight, B.F. and Kudaravalli, S. and Wen, X. and Pritchard, J.K. (2006). A map of recent positive selection in the human genome. Plos Biology, 4, e72.
See Also
data2haplohh
, plot.ehh
, calc_ehhs
, scan_hh
.
Examples
#example haplohh object (280 haplotypes, 1424 SNPs)
#see ?haplohh_cgu_bta12 for details
data(haplohh_cgu_bta12)
#computing EHH statistics for the marker "F1205400"
#which displays a strong signal of selection
ehh <- calc_ehh(haplohh_cgu_bta12, mrk = "F1205400")