bandwidth {SNSequate} | R Documentation |
Automatic selection of the bandwidth parameter h
Description
This functions implements the minimization of the combined penalty function
described by Holland and Thayer (1989); Von Davier et al, (2004). It returns
the optimal value of h
for kernel continuization, according to the above
mentioned criteria. Different types of kernels (others than the gaussian) are accepted.
Usage
bandwidth(scores, kert, degree, design, Kp = 1, scores2, degreeXA, degreeYA,
J, K, L, wx, wy, w, r=NULL)
Arguments
Note that depending on the specified equating design, not all arguments are necessary as detailed below.
scores |
If the "EG" design is specified, a vector containing the raw sample frequencies coming from one group taking the test. If the "SG" design is specified, a matrix containing the (joint) bivariate sample
frequencies for If the "CB" design is specified, a two column matrix containing the observed scores
of the sample taking test If either the "NEAT_CB" or "NEAT_PSE" design is selected, a two column matrix containing
the observed scores on test |
kert |
A character string giving the type of kernel to be used for continuization.
Current options include " |
degree |
Either a number or vector indicating the number of power moments to be fitted to the marginal distributions, or the number or cross moments to be fitted to the joint distributions, respectively. For the "EG" design it will be a number (see Details). |
design |
A character string indicating the equating design (one of "EG", "SG", "CB", "NEAT_CE", "NEAT_PSE") |
Kp |
A number which acts as a weight for the second term in the combined penalization function used
to obtain |
scores2 |
Only used for the "CB", "NEAT_CE" and "NEAT_PSE" designs. See the description of
|
degreeXA |
A vector indicating the number of power moments to be fitted to the marginal distributions
|
degreeYA |
Only used for the "NEAT_CE" and "NEAT_PSE" designs (see the description for
|
J |
The number of possible |
K |
The number of possible |
L |
The number of possible |
wx |
A number that satisfies |
wy |
A number that satisfies |
w |
A number that satisfies |
r |
Score probabilities. |
Details
To automatically select h
, the function minimizes
where , and
. The terms
and
are such that
acts as a smoothness penalty term that avoids rapid
fluctuations in the approximated density (see Chapter 10 in Von Davier, 2011 for more details). The
term corresponds to the
Kp
argument of the bandwidth
function. The
values are assumed to be estimated by polynomial loglinear models of specific
degree
, which come from a call to loglin.smooth
.
Value
A number which is the optimal value of h
.
Author(s)
Jorge Gonzalez jorge.gonzalez@mat.uc.cl
References
Gonzalez, J. (2014). SNSequate: Standard and Nonstandard Statistical Models and Methods for Test Equating. Journal of Statistical Software, 59(7), 1-30.
Von Davier, A., Holland, P., and Thayer, D. (2004). The Kernel Method of Test Equating. New York, NY: Springer-Verlag.
A. von Davier (Ed.) (2011). Statistical Models for Equating, Scaling, and Linking. New York: Springer
See Also
Examples
#Example: The "Standard" column and firsts two rows of Table 10.1 in
#Chapter 10 of Von Davier 2011
data(Math20EG)
hx.logis<-bandwidth(scores=Math20EG[,1],kert="logis",degree=2,design="EG")$h
hx.unif<-bandwidth(scores=Math20EG[,1],kert="unif",degree=2,design="EG")$h
hx.gauss<-bandwidth(scores=Math20EG[,1],kert="gauss",degree=2,design="EG")$h
hy.logis<-bandwidth(scores=Math20EG[,2],kert="logis",degree=3,design="EG")$h
hy.unif<-bandwidth(scores=Math20EG[,2],kert="unif",degree=3,design="EG")$h
hy.gauss<-bandwidth(scores=Math20EG[,2],kert="gauss",degree=3,design="EG")$h
partialTable10.1<-rbind(c(hx.logis,hx.unif,hx.gauss),
c(hy.logis,hy.unif,hy.gauss))
dimnames(partialTable10.1)<-list(c("h.x","h.y"),c("Logistic","Uniform","Gaussian"))
partialTable10.1