fHill {OnomasticDiversity}R Documentation

Calculate the Hill's diversity numbers

Description

This function obtains the Hill's diversity numbers introduced by M. O. Hill. It is a method for quantifying species biodiversity that can be adapted to the context of onomastic.

Usage

fHill(x, k, n, location, lambda)

Arguments

x

dataframe of the data values for each species.

k

name of a variable which represents absolute frequency for each species.

n

name of a variable which represents total number of individuals.

location

represents the grouping element.

lambda

free parameter.

Details

For a community ii, the Hill's diversity numbers are defined by the expression J(λ)=(kSipkiλ)11λJ(\lambda) = \left(\sum \limits_{k\in S_i} p_{ki}^\lambda\right)^{\frac{1}{1-\lambda}} with the restriction λ0\lambda \geq 0 where pkip_{ki} represents the relative frequency of species kk and SiS_i are all species at the community, species richness, and λ\lambda is a free parameter. (This is equivalent to the exponential of Renyi's generalised entropy). The Renyi entropy of order λ\lambda, where λ0\lambda \geq 0 and λ1\lambda \neq 1, is defined as Hλ(X)=11λlog(i=1npiλ)\mathrm{H}_{\lambda}(X)=\frac{1}{1-\lambda} \log \left(\sum \limits_{i=1}^{n} p_{i}^{\lambda}\right) Here, XX is a discrete random variable with possible outcomes in the set A={x1,x2,,xn}\mathcal{A}=\left\{x_{1}, x_{2}, \ldots, x_{n}\right\} and corresponding probabilities piPr(X=xi)p_{i} \doteq \operatorname{Pr}\left(X=x_{i}\right) for i=1,,ni=1, \ldots, n. The logarithm is conventionally taken to be base 2, especially in the context of information theory where bits are used. If the probabilities are pi=1/np_{i}=1 / n for all i=1,,ni=1, \ldots, n, then all the Renyi entropies of the distribution are equal: Hλ(X)=logn\mathrm{H}_{\lambda}(X)=\log n. In general, for all discrete random variables X,Hλ(X)X, \mathrm{H}_{\lambda}(X) is a non-increasing function in λ\lambda..

Particular cases of λ\lambda values: λ=0,J(0)=Si\lambda = 0, J(0)=S_i, it corresponds species richness; λ=1,J(1)=eHt\lambda = 1, J(1)=e^{H_{t}}, it corresponds the exponential of Shannon's entropy; and λ=2,J(2)=DSi\lambda = 2, J(2)= D_{S_i}, it corresponds the 'inverse' Simpson index.

In onomastic context, pkip_{ki} denotes the relative frequency of surname kk in region (\approx community diversity context) ii and SiS_i are all surnames in region ii.

Value

A dataframe containing the following components:

location

represents the grouping element, for example the communities / regions.

hill

the value of the Hill's diversity index.

Author(s)

Maria Jose Ginzo Villamayor

References

Hill, M. O. (1973). Diversity and Evenness: a unifying notation and its consequences. Ecology, 54, 427–32.

See Also

fCressieRead.

Examples

data(surnamesgal14)
result = fHill (x= surnamesgal14, k="number", n="population",
location  = "muni", lambda= 0)
result

data(namesmengal16)
result = fHill (x= namesmengal16, k="number", n="population",
location  = "muni", lambda= 0)
result

data(nameswomengal16)
result = fHill (x= nameswomengal16, k="number", n="population",
location  = "muni", lambda= 0)
result

[Package OnomasticDiversity version 0.1 Index]