RRTCS-package {RRTCS} | R Documentation |
Randomized Response Techniques for Complex Surveys
Description
The aim of this package is to calculate point and interval estimation for linear parameters with data obtained from randomized response surveys. Twenty one RR methods are implemented for complex surveys:
- Randomized response procedures to estimate parameters of a qualitative stigmatizing characteristic: Christofides model, Devore model, Forced-Response model, Horvitz model, Horvitz model with unknown B, Kuk model, Mangat model, Mangat model with unknown B, Mangat-Singh model, Mangat-Singh-Singh model, Mangat-Singh-Singh model with unknown B, Singh-Joarder model, SoberanisCruz model and Warner model.
- Randomized response procedures to estimate parameters of a quantitative stigmatizing characteristic: BarLev model, Chaudhuri-Christofides model, Diana-Perri-1 model, Diana-Perri-2 model, Eichhorn-Hayre model, Eriksson model and Saha model.
Using the usual notation in survey sampling, we consider a finite population , consisting of
different elements.
Let
be the value of the sensitive aspect under study for the
th population element. Our aim is to estimate the finite population total
of the variable of interest
or the population mean
. If we can estimate the
proportion of the population presenting a certain stigmatized behaviour
, the variable
takes the value 1 if
(the group with the stigmatized behaviour) and the value zero otherwise. Some qualitative models use an innocuous or related attribute
whose
population proportion can be known or unknown.
Assume that a sample is chosen according to a general design
with inclusion probabilities
.
In order to include a wide variety of RR procedures, we consider the unified approach given by Arnab (1994). The interviews of individuals in the sample
are conducted in accordance with the RR model. For each
the RR induces a random response
(denoted scrambled response) so that the revised
randomized response
(Chaudhuri and Christofides, 2013) is an unbiased estimation of
. Then, an unbiased estimator for the population total of
the sensitive characteristic
is given by
The variance of this estimator is given by:
where is the variance of
under the randomized device and
is the design-variance of the Horvitz Thompson estimator
of
values.
This variance is estimated by:
where varies with the RR device and the estimation of the design-variance,
, is obtained using Deville's method
(Deville, 1993).
The confidence interval at % level is given by
where denotes the
% quantile of a standard normal distribution.
Similarly, an unbiased estimator for the population mean is given by
and an unbiased estimator for its variance is calculated as:
In cases where the population size is unknown, we consider Hàjek-type estimators for the mean:
and Taylor-series linearization variance estimation of the ratio (Wolter, 2007) is used.
In qualitative models, the values and
for
are described in each model.
In some quantitative models, the values and
for
are calculated in a general form (Arcos et al, 2015) as follows:
The randomized response given by the person is
with and where
and
are scramble variables whose distributions are assumed to be known. We denote by
and
respectively the mean and standard deviation of the variable
.
The transformed variable is
its variance is
where
and the estimated variance is
Some of the quantitative techniques considered can be viewed as particular cases of the above described procedure. Other models are described in the respective function.
Alternatively, the variance can be estimated using certain resampling methods.
Author(s)
Beatriz Cobo Rodríguez, Department of Statistics and Operations Research. University of Granada beacr@ugr.es
María del Mar Rueda García, Department of Statistics and Operations Research. University of Granada mrueda@ugr.es
Antonio Arcos Cebrián, Department of Statistics and Operations Research. University of Granada arcos@ugr.es
Maintainer: Beatriz Cobo Rodríguez beacr@ugr.es
References
Arcos, A., Rueda, M., Singh, S. (2015). A generalized approach to randomised response for quantitative variables. Quality and Quantity 49, 1239-1256.
Arnab, R. (1994). Non-negative variance estimator in randomized response surveys. Comm. Stat. Theo. Math. 23, 1743-1752.
Chaudhuri, A., Christofides, T.C. (2013). Indirect Questioning in Sample Surveys Springer-Verlag Berlin Heidelberg.
Deville, J.C. (1993). Estimation de la variance pour les enquêtes en deux phases. Manuscript, INSEE, Paris.
Wolter, K.M. (2007). Introduction to Variance Estimation. 2nd Edition. Springer.