gl.report.callrate {dartR.base} | R Documentation |
Reports summary of Call Rate for loci or individuals
Description
SNP datasets generated by DArT have missing values primarily arising from failure to call a SNP because of a mutation at one or both of the restriction enzyme recognition sites. P/A datasets (SilicoDArT) have missing values because it was not possible to call whether a sequence tag was amplified or not. This function tabulates the number of missing values as quantiles.
Usage
gl.report.callrate(
x,
method = "loc",
ind.to.list = 20,
plot.display = TRUE,
plot.theme = theme_dartR(),
plot.colors = NULL,
plot.dir = NULL,
plot.file = NULL,
bins = 50,
verbose = NULL,
...
)
Arguments
x |
Name of the genlight object containing the SNP or presence/absence (SilicoDArT) data [required]. |
method |
Specify the type of report by locus (method='loc') or individual (method='ind') [default 'loc']. |
ind.to.list |
Number of individuals to list for callrate [default 20] |
plot.display |
Specify if plot is to be displayed in the graphics window [default TRUE]. |
plot.theme |
User specified theme [default theme_dartR()]. |
plot.colors |
Vector with two color names for the borders and fill [default c("#2171B5", "#6BAED6")]. |
plot.dir |
Directory to save the plot RDS files [default as specified by the global working directory or tempdir()] |
plot.file |
Filename (minus extension) for the RDS plot file [Required for plot save] |
bins |
Number of bins to display in histograms [default 25]. |
verbose |
Verbosity: 0, silent or fatal errors; 1, begin and end; 2, progress log; 3, progress and results summary; 5, full report [default 2, unless specified using gl.set.verbosity]. |
... |
Parameters passed to function ggsave, such as width and height, when the ggplot is to be saved. |
Details
This function expects a genlight object, containing either SNP data or SilicoDArT (=presence/absence data). Callrate is summarized by locus or by individual to allow sensible decisions on thresholds for filtering taking into consideration consequential loss of data. The summary is in the form of a tabulation and plots.
To avoid issues from inadvertent use of this function in an assignment statement, the function returns the genlight object unaltered. Plot themes can be obtained from:
Plot colours can be set with gl.select.colors().
If plot.file is specified, plots are saved to the directory specified by the user, or the global default working directory set by gl.set.wd() or to the tempdir().
Value
Returns unaltered genlight object
Author(s)
Custodian: Arthur Georges – Post to https://groups.google.com/d/forum/dartr
See Also
Other matched report:
gl.report.hamming()
,
gl.report.locmetric()
,
gl.report.maf()
,
gl.report.overshoot()
,
gl.report.pa()
,
gl.report.rdepth()
,
gl.report.reproducibility()
,
gl.report.secondaries()
,
gl.report.taglength()
Examples
# SNP data
test.gl <- testset.gl[1:20,]
gl.report.callrate(test.gl)
gl.report.callrate(test.gl,method='ind')
gl.report.callrate(test.gl,method='ind',plot.file="test")
gl.report.callrate(test.gl,method='loc',by.pop=TRUE)
gl.report.callrate(test.gl,method='loc',by.pop=TRUE,plot.file="test")
# Tag P/A data
test.gs <- testset.gs[1:20,]
gl.report.callrate(test.gs)
gl.report.callrate(test.gs,method='ind')
test.gl <- testset.gl[1:20,]
gl.report.callrate(test.gl)