difGenLord {difR} | R Documentation |
Generalized Lord's chi-squared DIF method
Description
Performs DIF detection among multiple groups using generalized Lord's chi-squared method.
Usage
difGenLord(Data, group, focal.names, model, c = NULL, engine = "ltm",
discr = 1, irtParam = NULL, nrFocal = 2, same.scale = TRUE, anchor = NULL,
alpha = 0.05, purify = FALSE, nrIter = 10, p.adjust.method = NULL,
save.output = FALSE, output = c("out", "default"))
## S3 method for class 'GenLord'
print(x, ...)
## S3 method for class 'GenLord'
plot(x, plot = "lordStat", item = 1, pch = 8,
number = TRUE, col = "red", colIC = rep("black",
length(x$focal.names)+1), ltyIC = 1:(length(x$focal.names)
+ 1), save.plot = FALSE, save.options = c("plot", "default", "pdf"),
ref.name = NULL, ...)
Arguments
Data |
numeric: either the data matrix only, or the data matrix plus the vector of group membership. See Details. |
group |
numeric or character: either the vector of group membership or the column indicator (within |
focal.names |
numeric or character vector indicating the levels of |
model |
character: the IRT model to be fitted (either |
c |
optional numeric value or vector giving the values of the constrained pseudo-guessing parameters. See Details. |
engine |
character: the engine for estimating the 1PL model, either |
discr |
either |
irtParam |
matrix with 2J rows (where J is the number of items) and at most 9 columns containing item parameters estimates. See Details. |
nrFocal |
numeric: the number of focal groups (default is 2). |
same.scale |
logical: are the item parameters of the |
anchor |
either |
alpha |
numeric: significance level (default is 0.05). |
purify |
logical: should the method be used iteratively to purify the set of anchor items? (default is FALSE). |
nrIter |
numeric: the maximal number of iterations in the item purification process (default is 10). |
p.adjust.method |
either |
save.output |
logical: should the output be saved into a text file? (Default is |
output |
character: a vector of two components. The first component is the name of the output file, the second component is either the file path or |
x |
the result from a |
plot |
character: the type of plot, either |
item |
numeric or character: either the number or the name of the item for which ICC curves are plotted. Used only when |
pch , col |
type of usual |
number |
logical: should the item number identification be printed (default is |
colIC , ltyIC |
vectors of elements of the usual |
save.plot |
logical: should the plot be saved into a separate file? (default is |
save.options |
character: a vector of three components. The first component is the name of the output file, the second component is either the file path or |
ref.name |
either |
... |
other generic parameters for the |
Details
The generalized Lord's chi-squared method (Kim, Cohen and Park, 1995), also referred to as Qj statistic, allows for detecting uniform or non-uniform
differential item functioning among multiple groups by setting an appropriate item response model. The input can be of two kinds: either by displaying
the full data, the group membership, the focal groups and the model, or by giving the item parameter estimates (with the option irtParam
).
Both can be supplied, but in this case only the parameters in irtParam
are used for computing generalized Lord's chi-squared statistic.
The Data
is a matrix whose rows correspond to the subjects and columns to the items.
In addition, Data
can hold the vector of group membership. If so, group
indicates the column of Data
which corresponds to the group membership, either by specifying its name or by giving the column number. Otherwise, group
must be a vector of same length as nrow(Data)
.
Missing values are allowed for item responses (not for group membership) but must be coded as NA
values. They are discarded for item parameter estimation.
The vector of group membership must hold at least three different values, either as numeric or character. The focal groups are defined by the values of the argument focal.names
.
If the model is not the 1PL model, or if engine
is equal to "ltm"
, the selected IRT model is fitted using marginal maximum likelihood by means of the functions from the ltm
package (Rizopoulos, 2006). Otherwise, the 1PL model is fitted as a generalized linear mixed model, by means of the glmer
function of the lme4
package (Bates and Maechler, 2009).
With the "1PL"
model and the "ltm"
engine, the common discrimination parameter is set equal to 1 by default. It is possible to fix another value through the argumentdiscr
. Alternatively, this common discrimination parameter can be estimated (though not returned) by fixing discr
to
NULL
.
The 3PL model can be fitted either unconstrained (by setting c
to NULL
) or by fixing the pseudo-guessing values. In the latter case, the argument c
is either a numeric vector of same length of the number of items, with one value per item pseudo-guessing parameter, or a single value which is duplicated for all the items. If c
is different from NULL
then the 3PL model is always fitted (whatever the value of model
).
The irtParam
matrix has a number of rows equal to the number of groups (reference and focal ones) times the number of items J. The first J rows refer to the item parameter estimates in the reference group, while the next sets of J rows correspond to the same items in each of
the focal groups. The number of columns depends on the selected IRT model: 2 for the 1PL model, 5 for the 2PL model, 6 for the constrained 3PL model and 9 for the unconstrained 3PL model. The columns of irtParam
have to follow the same structure as the output of itemParEst
command (the latter can actually be used to create the irtParam
matrix). The number of focal groups has to be specified with argument nrFocal
(default value is 2).
In addition to the matrix of parameter estimates, one has to specify whether items in the focal groups were rescaled to those of the reference group. If not, rescaling is performed by equal means anchoring (Cook and Eignor, 1991). Argument same.scale
is used for this choice (default option is TRUE
and assumes therefore that the parameters are already placed on a same scale).
The threshold (or cut-score) for classifying items as DIF is computed as the quantile of the chi-squared distribution with lower-tail probability of one minus alpha
and p degrees of freedom. The value of p is the product of the number of focal groups by the number of item parameters to be tested (1 for the 1PL model, 2 for the 2PL model or the constrained 3PL model, and 3 for the unconstrained 3PL model).
Item purification can be performed by setting purify
to TRUE
. In this case, the purification occurs in the equal means anchoring process: items detected as DIF are iteratively removed from the set of items used for equal means anchoring, and the procedure is repeated until either the same items
are identified twice as functioning differently, or when nrIter
iterations have been performed. In the latter case a warning message is printed. See Candell and Drasgow (1988) for further details.
Adjustment for multiple comparisons is possible with the argument p.adjust.method
. The latter must be an acronym of one of the available adjustment methods of the p.adjust
function. According to Kim and Oshima (2013), Holm and Benjamini-Hochberg adjustments (set respectively by "Holm"
and "BH"
) perform best for DIF purposes. See p.adjust
function for further details. Note that item purification is performed on original statistics and p-values; in case of adjustment for multiple comparisons this is performed after item purification.
A pre-specified set of anchor items can be provided through the anchor
argument. It must be a vector of either item names (which must match exactly the column names of Data
argument) or integer values (specifying the column numbers for item identification). In case anchor items are provided, they are used to rescale the item parameters on a common metric. None of the anchor items are tested for DIF: the output separates anchor items and tested items and DIF results are returned only for the latter. Note also that item purification is not activated when anchor items are provided (even if purify
is set to TRUE
). By default it is NULL
so that no anchor item is specified. If item parameters are provided thorugh the irtParam
argument and if they are on the same scale (i.e. if same.scale
is TRUE
), then anchor items are not used (even if they are specified).
The output of the difGenLord
, as displayed by the print.GenLord
function, can be stored in a text file provided that save.output
is set to TRUE
(the default value FALSE
does not execute the storage). In this case, the name of the text file must be given as a character string into the first component
of the output
argument (default name is "out"
), and the path for saving the text file can be given through the second component of output
. The
default value is "default"
, meaning that the file will be saved in the current working directory. Any other path can be specified as a character string: see the
Examples section for an illustration.
Two types of plots are available. The first one is obtained by setting plot="lordStat"
and it is the default option. The chi-squared statistics are displayed
on the Y axis, for each item. The detection threshold is displayed by a horizontal line, and items flagged as DIF are printed with the color defined by argument col
.
By default, items are spotted with their number identification (number=TRUE
); otherwise they are simply drawn as dots whose form is given by the option pch
.
The other type of plot is obtained by setting plot="itemCurve"
. In this case, the fitted ICC curves are displayed for one specific item set by the argument
item
. The latter argument can hold either the name of the item or its number identification. The item parameters are extracted from the itemParFinal
matrix
if the output argument purification
is TRUE
, otherwise from the itemParInit
matrix and after a rescaling of the item parameters using the
itemRescale
command. A legend is displayed in the upper left corner of the plot. The colors and types of traits for these curves are defined by means of
the arguments colIC
and ltyIC
respectively. These are set as vectors of length 2, the first element for the reference group and the second for the focal group.
Finally, the ref.name
argument permits to display the name if the reference group (instead of "Reference") in the legend.
Both types of plots can be stored in a figure file, either in PDF or JPEG format. Fixing save.plot
to TRUE
allows this process. The figure is defined through
the components of save.options
. The first two components perform similarly as those of the output
argument. The third component is the figure format, with
allowed values "pdf"
(default) for PDF file and "jpeg"
for JPEG file.
Value
A list of class "GenLord" with the following arguments:
genLordChi |
the values of the generalized Lord's chi-squared statistics. |
p.value |
the vector of p-values for the generalized Lord's chi-square statistics. |
alpha |
the value of |
thr |
the threshold (cut-score) for DIF detection. |
df |
the degrees of freedom of the asymptotic null distribution of the statistics. |
DIFitems |
either the column indicators of the items which were detected as DIF items, or "No DIF item detected". |
p.adjust.method |
the value of the |
adjusted.p |
either |
purification |
the value of |
nrPur |
the number of iterations in the item purification process. Returned only if |
difPur |
a binary matrix with one row per iteration in the item purification process and one column per item. Zeros and ones in the i-th
row refer to items which were classified respectively as non-DIF and DIF items at the (i-1)-th step. The first row corresponds to the initial
classification of the items. Returned only if |
convergence |
logical indicating whether the iterative item purification process stopped before the maximal number |
model |
the value of |
c |
The value of the |
engine |
The value of the |
discr |
the value of the |
itemParInit |
the matrix of initial parameter estimates, with the same format as |
itemParFinal |
the matrix of final parameter estimates, with the same format as |
estPar |
a logical value indicating whether the item parameters were estimated ( |
names |
the names of the items. |
anchor.names |
the value of the |
focal.names |
the value of the |
save.output |
the value of the |
output |
the value of the |
Author(s)
Sebastien Beland
Collectif pour le Developpement et les Applications en Mesure et Evaluation (Cdame)
Universite du Quebec a Montreal
sebastien.beland.1@hotmail.com, http://www.cdame.uqam.ca/
David Magis
Department of Psychology, University of Liege
Research Group of Quantitative Psychology and Individual Differences, KU Leuven
David.Magis@uliege.be, http://ppw.kuleuven.be/okp/home/
Gilles Raiche
Collectif pour le Developpement et les Applications en Mesure et Evaluation (Cdame)
Universite du Quebec a Montreal
raiche.gilles@uqam.ca, http://www.cdame.uqam.ca/
References
Bates, D. and Maechler, M. (2009). lme4: Linear mixed-effects models using S4 classes. R package version 0.999375-31. http://CRAN.R-project.org/package=lme4
Candell, G.L. and Drasgow, F. (1988). An iterative procedure for linking metrics and assessing item bias in item response theory. Applied Psychological Measurement, 12, 253-260.
Cook, L. L. and Eignor, D. R. (1991). An NCME instructional module on IRT equating methods. Educational Measurement: Issues and Practice, 10, 37-45.
Kim, S.-H., Cohen, A.S. and Park, T.-H. (1995). Detection of differential item functioning in multiple groups. Journal of Educational Measurement, 32, 261-276. doi: 10.1111/j.1745-3984.1995.tb00466.x
Kim, J., and Oshima, T. C. (2013). Effect of multiple testing adjustment in differential item functioning detection. Educational and Psychological Measurement, 73, 458–470. doi: 10.1177/0013164412467033
Magis, D., Beland, S., Tuerlinckx, F. and De Boeck, P. (2010). A general framework and an R package for the detection of dichotomous differential item functioning. Behavior Research Methods, 42, 847-862. doi: 10.3758/BRM.42.3.847
Rizopoulos, D. (2006). ltm: An R package for latent variable modelling and item response theory analyses. Journal of Statistical Software, 17, 1–25. doi: 10.18637/jss.v017.i05
See Also
Examples
## Not run:
# Loading of the verbal data
data(verbal)
attach(verbal)
# Creating four groups according to gender ("Man" or "Woman") and trait
# anger score ("Low" or "High")
group <- rep("WomanLow",nrow(verbal))
group[Anger>20 & Gender==0] <- "WomanHigh"
group[Anger<=20 & Gender==1] <- "ManLow"
group[Anger>20 & Gender==1] <- "ManHigh"
# New data set
Verbal <- cbind(verbal[,1:24], group)
# Reference group: "WomanLow"
names <- c("WomanHigh", "ManLow", "ManHigh")
# Three equivalent settings of the data matrix and the group membership
# 1PL model, "ltm" engine
r <- difGenLord(Verbal, group = 25, focal.names = names, model = "1PL")
difGenLord(Verbal, group = "group", focal.name = names, model = "1PL")
difGenLord(Verbal[,1:24], group = Verbal[,25], focal.names = names, model = "1PL")
# 1PL model, "ltm" engine, estimated common discrimination
r <- difGenLord(Verbal, group = 25, focal.names = names, model = "1PL", discr = NULL)
# 1PL model, "lme4" engine
difGenLord(Verbal, group = "group", focal.name = names, model = "1PL", engine = "lme4")
# With items 1 to 5 set as anchor items
difGenLord(Verbal, group = 25, focal.names = names, model = "1PL", anchor = 1:5)
# Multiple comparisons adjustment using Benjamini-Hochberg method
difGenLord(Verbal, group = 25, focal.names = names, model = "1PL", p.adjust.method = "BH")
# With item purification
difGenLord(Verbal, group = 25, focal.names = names, model = "1PL", purify = TRUE)
# Saving the output into the "GLresults.txt" file (and default path)
r <- difGenLord(Verbal, group = 25, focal.names = names, model = "1PL",
save.output = TRUE, output = c("GLresults", "default"))
# Splitting the data into the four subsets according to "group"
data0<-data1<-data2<-data3<-NULL
for (i in 1:nrow(verbal)){
if (group[i]=="WomanLow") data0<-rbind(data0,as.numeric(verbal[i,1:24]))
if (group[i]=="WomanHigh") data1<-rbind(data1,as.numeric(verbal[i,1:24]))
if (group[i]=="ManLow") data2<-rbind(data2,as.numeric(verbal[i,1:24]))
if (group[i]=="ManHigh") data3<-rbind(data3,as.numeric(verbal[i,1:24]))
}
# Estimation of the item parameters (1PL model)
m0.1PL<-itemParEst(data0, model = "1PL")
m1.1PL<-itemParEst(data1, model = "1PL")
m2.1PL<-itemParEst(data2, model = "1PL")
m3.1PL<-itemParEst(data3, model = "1PL")
# Merging the item parameters WITHOUT rescaling
irt.noscale<-rbind(m0.1PL,m1.1PL,m2.1PL,m3.1PL)
rownames(irt.noscale)<-rep(colnames(verbal[,1:24]),4)
# Merging the item parameters WITH rescaling
irt.scale<-rbind(m0.1PL, itemRescale(m0.1PL,m1.1PL),
itemRescale(m0.1PL,m2.1PL) ,itemRescale(m0.1PL,m3.1PL))
rownames(irt.scale)<-rep(colnames(verbal[,1:24]),4)
# Equivalent calculations
difGenLord(irtParam = irt.noscale, nrFocal = 3, same.scale = FALSE)
difGenLord(irtParam = irt.scale, nrFocal = 3, same.scale = TRUE)
# With item purification
difGenLord(irtParam = irt.noscale, nrFocal = 3, same.scale = FALSE, purify = TRUE)
# Graphical devices
plot(r)
plot(r, plot = "itemCurve", item = 1)
plot(r, plot = "itemCurve", item = 6)
plot(r, plot = "itemCurve", item = 6, ref.name = "WomanHigh")
# Plotting results and saving it in a PDF figure
plot(r, save.plot = TRUE, save.options = c("plot", "default", "pdf"))
# Changing the path, JPEG figure
path <- "c:/Program Files/"
plot(r, save.plot = TRUE, save.options = c("plot", path, "jpeg"))
## End(Not run)