IRTest_Cont {IRTest} | R Documentation |
Item and ability parameters estimation for continuous response items
Description
This function estimates IRT item and ability parameters when all items are scored continuously. Based on Bock & Aitkin's (1981) marginal maximum likelihood and EM algorithm (EM-MML), this function provides several latent distribution estimation algorithms which could free the normality assumption on the latent variable. If the normality assumption is violated, application of these latent distribution estimation methods could reflect non-normal characteristics of the unknown true latent distribution, thereby providing more accurate parameter estimates (Li, 2021; Woods & Lin, 2009; Woods & Thissen, 2006).
Usage
IRTest_Cont(
data,
range = c(-6, 6),
q = 121,
initialitem = NULL,
ability_method = "EAP",
latent_dist = "Normal",
max_iter = 200,
threshold = 1e-04,
bandwidth = "SJ-ste",
h = NULL
)
Arguments
data |
A matrix or data frame of item responses where responses are coded as 0 or 1. Rows and columns indicate examinees and items, respectively. |
range |
Range of the latent variable to be considered in the quadrature scheme.
The default is from |
q |
A numeric value that represents the number of quadrature points. The default value is 121. |
initialitem |
A matrix of initial item parameter values for starting the estimation algorithm. The default value is |
ability_method |
The ability parameter estimation method.
The available options are Expected a posteriori ( |
latent_dist |
A character string that determines latent distribution estimation method.
Insert |
max_iter |
A numeric value that determines the maximum number of iterations in the EM-MML. The default value is 200. |
threshold |
A numeric value that determines the threshold of EM-MML convergence. A maximum item parameter change is monitored and compared with the threshold. The default value is 0.0001. |
bandwidth |
A character value that can be used if |
h |
A natural number less than or equal to 10 if |
Details
-
The probability of a response
u=x
, where0<u<1
(see Martinez, 2023) -
P(u=x | a, b, \nu) = \frac{1}{B(\mu\nu, \,\nu(1-\mu))} u^{\mu\nu-1} (1-u)^{\nu(1-\mu)-1}
where
\mu = \frac{e^{a(\theta -b)}}{1+e^{a(\theta -b)}}
. - Latent distribution estimation methods
-
1) Empirical histogram method
P(\theta=X_k)=A(X_k)
where
k=1, 2, ..., q
,X_k
is the location of thek
th quadrature point, andA(X_k)
is a value of probability mass function evaluated atX_k
. Empirical histogram method thus hasq-1
parameters.2) Two-component Gaussian mixture distribution
P(\theta=X)=\pi \phi(X; \mu_1, \sigma_1)+(1-\pi) \phi(X; \mu_2, \sigma_2)
where
\phi(X; \mu, \sigma)
is the value of a Gaussian component with mean\mu
and standard deviation\sigma
evaluated atX
.3) Davidian curve method
P(\theta=X)=\left\{\sum_{\lambda=0}^{h}{{m}_{\lambda}{X}^{\lambda}}\right\}^{2}\phi(X; 0, 1)
where
h
corresponds to the argumenth
and determines the degree of the polynomial.4) Kernel density estimation method
P(\theta=X)=\frac{1}{Nh}\sum_{j=1}^{N}{K\left(\frac{X-\theta_j}{h}\right)}
where
N
is the number of examinees,\theta_j
isj
th examinee's ability parameter,h
is the bandwidth which corresponds to the argumentbandwidth
, andK( \cdot )
is a kernel function. The Gaussian kernel is used in this function.5) Log-linear smoothing method
P(\theta=X_{q})=\exp{\left(\beta_{0}+\sum_{m=1}^{h}{\beta_{m}X_{q}^{m}}\right)}
where
h
is the hyper parameter which determines the smoothness of the density, and\theta
can take totalQ
finite values (X_1, \dots ,X_q, \dots, X_Q
).
Value
This function returns a list
of several objects:
par_est |
The item parameter estimates. |
se |
The asymptotic standard errors for item parameter estimates. |
fk |
The estimated frequencies of examinees at quadrature points. |
iter |
The number of EM-MML iterations elapsed for the convergence. |
quad |
The location of quadrature points. |
diff |
The final value of the monitored maximum item parameter change. |
Ak |
The estimated discrete latent distribution. It is discrete (i.e., probability mass function) by the quadrature scheme. |
Pk |
The posterior probabilities of examinees at quadrature points. |
theta |
The estimated ability parameter values. If |
theta_se |
Standard error of ability estimates. The asymptotic standard errors for |
logL |
The deviance (i.e., -2logL). |
density_par |
The estimated density parameters. |
Options |
A replication of input arguments and other information. |
Author(s)
Seewoo Li cu@yonsei.ac.kr
References
Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46(4), 443-459.
Casabianca, J. M., & Lewis, C. (2015). IRT item parameter recovery with marginal maximum likelihood estimation using loglinear smoothing models. Journal of Educational and Behavioral Statistics, 40(6), 547-578.
Li, S. (2021). Using a two-component normal mixture distribution as a latent distribution in estimating parameters of item response models. Journal of Educational Evaluation, 34(4), 759-789.
Li, S. (2022). The effect of estimating latent distribution using kernel density estimation method on the accuracy and efficiency of parameter estimation of item response models [Master's thesis, Yonsei University, Seoul]. Yonsei University Library.
Martinez, A. J. (2023). Beta item factor analysis for asymmetric, bounded, and continuous item response data. OSF. DOI:10.31234/osf.io/tp8sx.
Mislevy, R. J. (1984). Estimating latent distributions. Psychometrika, 49(3), 359-381.
Mislevy, R. J., & Bock, R. D. (1985). Implementation of the EM algorithm in the estimation of item parameters: The BILOG computer program. In D. J. Weiss (Ed.). Proceedings of the 1982 item response theory and computerized adaptive testing conference (pp. 189-202). University of Minnesota, Department of Psychology, Computerized Adaptive Testing Conference.
Woods, C. M., & Lin, N. (2009). Item response theory with estimation of the latent density using Davidian curves. Applied Psychological Measurement, 33(2), 102-117.
Woods, C. M., & Thissen, D. (2006). Item response theory with estimation of the latent population distribution using spline-based densities. Psychometrika, 71(2), 281-301.
Examples
# Generating a continuous item response data
data <- DataGeneration(N = 1000, nitem_C = 10)$data_C
# Analysis
M1 <- IRTest_Cont(data, max_iter = 3) # increase `max_iter` in real analyses.