journals {multibridge}R Documentation

Prevalence of Statistical Reporting Errors

Description

This data set, "journals" provides a summary of statistical reporting errors (i.e., inconsistencies between reported test statistic and reported p-value) of 16,695 research articles reporting results from null hypothesis significance testing (NHST). The selected articles were published in eight major journals in psychology between 1985 to 2013:

In total, Nuijten et al. (2016) recomputed 258,105 p-values with the R software package statcheck which extracts statistics from articles and recomputes the p-values. The anonymized dataset and the data documentation was openly available on the Open Science Framework (https://osf.io/d3ukb/; https://osf.io/c6ap2/).

Usage

data(journals)

Format

A data.frame with 8 rows and 14 variables:

Variable Name Description
journal The journal name a research article was published in.
articles_downloaded The number of articles downloaded per journal.
articles_with_NHST The number of articles with NHST results.
perc_articles_with_NHST The percentage of all downloaded articles that had NHST results.
nr_NHST The total number of NHST results.
mean_nr_NHST_per_article_with_NHST The mean number of NHST results per article that had at least one NHST result.
mean_nr_NHST_per_article_all_included The mean number of NHST results in all downloaded articles.
errors The total number of errors.
dec_errors The total number of decision errors (i.e., an error that may have changed the statistical conclusion of the result).
perc_errors The percentage of all results that was an error.
perc_dec_errors The percentage of all results that was a decision error.
perc_articles_with_errors The percentage of all articles that had at least one error.
perc_articles_with_dec_errors The percentage of all articles that had at least one error.
APAfactor APA factor: number of detected NHST results / total number of detected p values.

References

Nuijten MB, Hartgerink CH, van Assen MA, Epskamp S, Wicherts JM (2016). “The prevalence of statistical reporting errors in psychology (1985–2013).” Behavior Research Methods, 48, 1205–1226.

Examples

data(journals)
# Prior specification 
# We assign a uniform Beta distribution on each binomial probability
a <- rep(1, 8)  
b <- rep(1, 8)  

x <- journals$errors 
n  <- journals$nr_NHST
factor_levels <- levels(journals$journal)

# restricted hypothesis
Hr1 <- c('JAP , PS , JCCP , PLOS , DP , FP , JEPG < JPSP')
out <- binom_bf_informed(x=x, n=n, Hr=Hr1, a=a, b=b, 
factor_levels=factor_levels, niter = 2e3)

summary(out)

[Package multibridge version 1.2.0 Index]