R: Bayes Factor for Posterior Beta Distribution

dfba_beta_bayes_factor {DFBA}

R Documentation

Bayes Factor for Posterior Beta Distribution

Description

Given a beta posterior distribution and given a prior for the variate, computes the Bayes factor for either point or interval null hypotheses.

Usage

dfba_beta_bayes_factor(a_post, b_post, method, H0, a0 = 1, b0 = 1)

Arguments

`a_post`	The first shape parameter for the posterior beta distribution. Must be positive and finite.
`b_post`	The second shape parameter for the posterior beta distribution. Must be positive and finite.
`method`	One of `"interval"` if the null hypothesis is a range on the `[0,1]` interval or `"point"` if the null hypothesis is a single number in the `[0,1]` interval
`H0`	If method="interval", then the H0 input is vector of two values, which are lower and upper limits for the null hypothesis; if method="point", then the H0 input is single number, which is the null hypothesis value
`a0`	The first shape parameter for the prior beta distribution (default is 1). Must be positive and finite.
`b0`	The second shape parameter for the prior beta distribution(default is 1). Must be positive and finite.

Details

For a binomial variate with n1 successes and n2 failures, the Bayesian analysis for the population success rate parameter \phi is distributed as a beta density function with shape parameters a_post and b_post for a_post = n1 + a0 and b_post = n2 + b0 where a0 and b0 are the shape parameters for the prior beta distribution. It is common for users to be interested in testing hypotheses about the population \phi parameter. The Bayes factor is useful to assess if either the null or the alternative hypothesis are credible.

There are two types of null hypotheses – an interval null hypothesis and a point null hypothesis. For example, an interval null hypothesis might be \phi \le .5 with the alternative hypothesis being \phi > .5, whereas a point null hypothesis might be \phi = .5 with the alternative being \phi \ne .5. It is conventional to call the null hypothesis H_0 and to call the alternative hypothesis H_1. For frequentist null hypothesis testing, H_0 is assumed to be true, to see if this assumption is likely or not. With the frequentist approach the null hypothesis cannot be proved since it was assumed in the first place. With frequentist statistics, H_0 is thus either retained as assumed or it is rejected. Unlike the frequentist approach, Bayesian hypothesis testing does not assume either H_0 or H_1; it instead assumes a prior distribution for the population parameter \phi, and based on this assumption arrives at a posterior distribution for the parameter given the data of n1 and n2 for the binomial outcomes.

There are two related Bayes factors - BF10 and BF01 where BF01 = 1/BF10. When BF10 > 1, there is more support for the alternative hypothesis, whereas when BF01 > 1, there is more support for the null hypothesis. Thus, in Bayesian hypothesis testing it is possible to build support for either H_0 or H_1. In essence, the Bayes factor is a measure of the relative strength of evidence. There is no standard guideline for recommending a decision about the prevailing hypothesis, but several statisticians have suggested criteria. Jeffreys (1961) suggested that BF > 10 was strong and BF > 100 was decisive; Kass and Raffrey (1995) suggested that BF > 20 was strong and BF > 150 was decisive. Chechile (2020) argued from a decision-theory framework for a third option for the user to decide not to decide if the prevailing Bayes factor is not sufficiently large. From this decision-making perspective, Chechile (2020) suggested that BF > 19 was a good bet - too good to disregard, BF > 99 was a strong bet - irresponsible to avoid, and BF > 20,001 was virtually certain. Chechile also pointed out that despite the Bayes factor value there is often some probability, however small, for either hypothesis. Ultimately, each academic discipline has to set the standard for their field for the strength of evidence. Yet even when the Bayes factor is below the user's threshold for making claims about the hypotheses, the value of the Bayes factor from one study can be nonetheless valuable to other researchers and might be combined via a product rule in a meta-analysis. Thus, the value of the Bayes factor has a descriptive utility.

The Bayes factor BF10 for an interval null is the ratio of the posterior odds of H_1 to H_0 divided by the prior odds of H_1 to H_0. Also, the converse Bayes factor BF01 is the ratio of posterior odds of H_0 to H_1 divided by the prior odds of H_0 to H_1; hence BF01 = 1/BF10. If there is no change in the odds ratio as a function of new data being collected, then BF10 = BF01 = 1. But, if evidence is more likely for one of the hypotheses, then either BF10 or BF01 will be greater than 1.

The population parameter \phi is distributed on the continuous interval [0,1]. The prior and posterior beta distribution are probability density displays. Importantly, this means that no point has a nonzero probability density, even as the probability mass for any mathematical point is zero. For this reason, all point null hypotheses have a probability measure of zero, but can have a probability density that can be different for prior and posterior distributions. There still is a meaningful Bayes factor for a point hypothesis. As described in Chechile (2020),

BF10 = [p(H_1|D)/p(H_1)][p(H_0)/p(H_0|D)]

where D denotes the data. The first term in this equation is 1/1 = 1. But the second term is of the form 0/0, which appears to undefined. However, by using L'Hospital's rule, it can be proved that the term p(H_0)/p(H_0|D) is the ratio of prior probability density at the null point divided by the posterior probability density. This method for finding the Bayes factor for a point is called the Savage-Dickey method because of the separate contributions from both of those statisticians (Dickey & Lientz, 1970).

Value

A list containing the following components:

`method`	The string of either `"interval"` or `"point"` corresponding to the type of null hypothesis tested
`a_post`	The value for the posterior beta first shape parameter
`b_post`	The value for the posterior beta second shape parameter
`a0`	The first shape parameter for the prior beta distribution
`b0`	The second shape parameter for the prior beta distribution
`BF10`	The Bayes factor for the alternative over the null hypothesis
`BF01`	The Bayes factor for the null over the alternative hypothesis
`null_hypothesis`	The value for the null hypothesis when `method = "point"`
`H0lower`	The lower limit of the null hypothesis when `method = "interval"`
`H0upper`	The upper limit of the null hypothesis when `method = "interval"`
`dpriorH0`	The prior probability density for the null point when `method = "point"`
`dpostH0`	The posterior probability density for the null point when `method = "point"`
`pH0`	The prior probability for the null hypothesis when `method = "interval"`
`pH1`	The prior probability for the alternative hypothesis when `method = "interval"`
`postH0`	The posterior probability for the null hypothesis when `method = "interval"`
`postH1`	The posterior probability for the alternative hypothesis when `method = "interval"`

References

Chechile, R. A. (2020). Bayesian Statistics for Experimental Scientists: A General Introduction Using Distribution-Free Methods. MIT Press.

Dickey, J. M., & Lientz, B. P. (1970). The weighted likelihood ratio, sharp hypotheses about chance, the order of a Markov chain. The Annals of Mathematical Statistics, 41, 214-226.

Jeffreys, H. (1961). Theory of Probability (3rd ed.). Oxford: Oxford University Press.

Kass, R. E., & Rafftery, A. E. (1995). Bayes factors. Journal of the American Statistical Association, 90, 773-795.

Examples

## Examples with the default uniform prior
dfba_beta_bayes_factor(a_post = 17,
                       b_post = 5,
                       method = "interval",
                       H0 = c(0, .5)
                       )
dfba_beta_bayes_factor(a_post = 377,
                       b_post = 123,
                       method = "point",
                       H0 = .75)

# An example with the Jeffreys prior
dfba_beta_bayes_factor(a_post = 377.5,
                       b_post = 123.5,
                       method = "point",
                       H0 = .75,
                       a0 = .5,
                       b0 = .5
                       )


dfba_beta_bayes_factor(a_post = 273,
                       b_post = 278,
                       method = "interval",
                       H0 = c(.4975,
                              .5025)
                       )

[Package DFBA version 0.1.0 Index]