| genotypeDiversity {polysat} | R Documentation |
Genotype Diversity Statistics
Description
genotypeDiversity calculates diversity statistics based on
genotype frequencies, using a distance matrix to assign individuals to
genotypes. The Shannon and Simpson functions are also
available to calculate these statistics directly from a vector
of frequencies.
Usage
genotypeDiversity(genobject, samples = Samples(genobject),
loci = Loci(genobject),
d = meandistance.matrix(genobject, samples, loci,
all.distances = TRUE,
distmetric = Lynch.distance),
threshold = 0, index = Shannon, ...)
Shannon(p, base = exp(1))
Simpson(p)
Simpson.var(p)
Arguments
genobject |
An object of the class |
samples |
An optional character vector indicating a subset of samples to analyze. |
loci |
An optional character vector indicating a subset of loci to analyze. |
d |
A list such as that produced by |
threshold |
The maximum genetic distance between two samples that can be considered to be the same genotype. |
index |
The diversity index to calculate. This should be |
... |
Additional arguments to pass to |
p |
A vector of counts. |
base |
The base of the logarithm for calculating the Shannon index. This is
|
Details
genotypeDiversity runs assignClones on distance
matrices for individual loci and then for all loci, for each seperate
population. The results of assignClones are used to
calculate a vector of genotype frequencies, which is passed to
index.
Shannon calculates the Shannon index, which is:
-\sum \frac{p_i}{N}\ln(\frac{p_i}{N})
(or log base 2 or any other base, using the base argument) given
a vector p of genotype counts, where N is the sum of those counts.
Simpson calculates the Simpson index, which is:
\sum \frac{p_{i}(p_{i} - 1)}{N(N - 1)}
Simpson.var calculates the variance of the Simpson index:
\frac{4N(N-1)(N-2)\sum p_{i}^3 + 2N(N-1)\sum p_{i}^2 -
2N(N-1)(2N-3)(\sum p_{i}^2)^2}{[N(N-1)]^2}
The variance of the Simpson index can be used to calculate a confidence
interval, for example the results of Simpson plus or minus twice
the square root of the results of Simpson.var would be the 95%
confidence interval.
Value
A matrix of diversity index results, with populations in rows and
loci in columns. The final column is called "overall" and gives
the results when all loci are analyzed together.
Author(s)
Lindsay V. Clark
References
Shannon, C. E. (1948) A mathematical theory of communication. Bell System Technical Journal 27:379–423 and 623–656.
Simpson, E. H. (1949) Measurement of diversity. Nature 163:688.
Lowe, A., Harris, S. and Ashton, P. (2004) Ecological Genetics: Design, Analysis, and Application. Wiley-Blackwell.
Arnaud-Haond, S., Duarte, M., Alberto, F. and Serrao, E. A. (2007) Standardizing methods to address clonality in population studies. Molecular Ecology 16:5115–5139.
http://www.comparingpartitions.info/index.php?link=Tut4
See Also
Examples
# set up dataset
mydata <- new("genambig", samples=c("a","b","c"), loci=c("F","G"))
Genotypes(mydata, loci="F") <- list(c(115,118,124),c(115,118,124),
c(121,124))
Genotypes(mydata, loci="G") <- list(c(162,170,174),c(170,172),
c(166,180,182))
Usatnts(mydata) <- c(3,2)
# get genetic distances
mydist <- meandistance.matrix(mydata, all.distances=TRUE)
# calculate diversity under various conditions
genotypeDiversity(mydata, d=mydist)
genotypeDiversity(mydata, d=mydist, base=2)
genotypeDiversity(mydata, d=mydist, threshold=0.3)
genotypeDiversity(mydata, d=mydist, index=Simpson)
genotypeDiversity(mydata, d=mydist, index=Simpson.var)