journals {multibridge} | R Documentation |
Prevalence of Statistical Reporting Errors
Description
This data set, "journals" provides a summary of statistical reporting errors (i.e., inconsistencies between reported test statistic and reported p-value) of 16,695 research articles reporting results from null hypothesis significance testing (NHST). The selected articles were published in eight major journals in psychology between 1985 to 2013:
-
Developmental Psychology (DP)
-
Frontiers in Psychology (FP)
-
Journal of Applied Psychology (JAP)
-
Journal of Consulting and Clinical Psychology (JCCP)
-
Journal of Experimental Psychology: General (JEPG)
-
Journal of Personality and Social Psychology (JPSP)
-
Public Library of Science (PLoS)
-
Psychological Science (PS)
In total, Nuijten et al. (2016) recomputed 258,105 p-values with the R
software
package statcheck
which extracts statistics from articles and recomputes the p-values.
The anonymized dataset and the data documentation was openly available on the
Open Science Framework (https://osf.io/d3ukb/; https://osf.io/c6ap2/).
Usage
data(journals)
Format
A data.frame
with 8 rows and 14 variables:
Variable Name | Description |
journal | The journal name a research article was published in. |
articles_downloaded | The number of articles downloaded per journal. |
articles_with_NHST | The number of articles with NHST results. |
perc_articles_with_NHST | The percentage of all downloaded articles that had NHST results. |
nr_NHST | The total number of NHST results. |
mean_nr_NHST_per_article_with_NHST | The mean number of NHST results per article that had at least one NHST result. |
mean_nr_NHST_per_article_all_included | The mean number of NHST results in all downloaded articles. |
errors | The total number of errors. |
dec_errors | The total number of decision errors (i.e., an error that may have changed the statistical conclusion of the result). |
perc_errors | The percentage of all results that was an error. |
perc_dec_errors | The percentage of all results that was a decision error. |
perc_articles_with_errors | The percentage of all articles that had at least one error. |
perc_articles_with_dec_errors | The percentage of all articles that had at least one error. |
APAfactor | APA factor: number of detected NHST results / total number of detected p values. |
References
Nuijten MB, Hartgerink CH, van Assen MA, Epskamp S, Wicherts JM (2016). “The prevalence of statistical reporting errors in psychology (1985–2013).” Behavior Research Methods, 48, 1205–1226.
Examples
data(journals)
# Prior specification
# We assign a uniform Beta distribution on each binomial probability
a <- rep(1, 8)
b <- rep(1, 8)
x <- journals$errors
n <- journals$nr_NHST
factor_levels <- levels(journals$journal)
# restricted hypothesis
Hr1 <- c('JAP , PS , JCCP , PLOS , DP , FP , JEPG < JPSP')
out <- binom_bf_informed(x=x, n=n, Hr=Hr1, a=a, b=b,
factor_levels=factor_levels, niter = 2e3)
summary(out)