ss4HHSp {samplesize4surveys} | R Documentation |
Sample Sizes for Household Surveys in Two-Stages for Estimating Single Proportions
Description
This function computes a grid of possible sample sizes for estimating single proportions under two-stage sampling designs.
Usage
ss4HHSp(N, M, r, b, rho, P, delta, conf, m)
Arguments
N |
The population size. |
M |
Number of clusters in the population. |
r |
Percentage of people within the subpopulation of interest. |
b |
Average household size (number of members). |
rho |
The Intraclass Correlation Coefficient. |
P |
The value of the estimated proportion. |
delta |
The maximun margin of error that can be allowed for the estimation. |
conf |
The statistical confidence. By default |
m |
(vector) Number of households selected within PSU. |
Details
In two-stage (2S) sampling, the design effect is defined by
Where is defined as the intraclass correlation coefficient,
is the average sample size of units selected inside each cluster.
The relationship of the full sample size of the two stage design (2S) with the
simple random sample (SI) design is given by
Value
This function returns a grid of possible sample sizes. The first column represent the design effect, the second column is the number of clusters to be selected, the third column is the number of units to be selected inside the clusters, and finally, the last column indicates the full sample size induced by this particular strategy.
Author(s)
Hugo Andres Gutierrez Rojas <hagutierrezro at gmail.com>
References
Gutierrez, H. A. (2009), Estrategias de muestreo: Diseno de encuestas y estimacion de parametros. Editorial Universidad Santo Tomas
See Also
Examples
ss4HHSp(N = 50000000, M = 3000, r = 1, b = 3.5,
rho = 0.034, P = 0.05, delta = 0.05, conf = 0.95,
m = c(5:15))
##################################
# Example with BigCity data #
# Sample size for the estimation #
# of the unemployment rate #
##################################
library(TeachingSampling)
data(BigCity)
BigCity1 <- BigCity[!is.na(BigCity$Employment), ]
summary(BigCity1$Employment)
BigCity1$Unemp <- Domains(BigCity1$Employment)[, 1]
BigCity1$Active <- Domains(BigCity1$Employment)[, 1] +
Domains(BigCity1$Employment)[, 3]
N <- nrow(BigCity)
M <- length(unique(BigCity$PSU))
r <- sum(BigCity1$Active)/N
b <- N/length(unique(BigCity$HHID))
rho <- ICC(BigCity1$Unemp, BigCity1$PSU)$ICC
P <- sum(BigCity1$Unemp)/sum(BigCity1$Active)
delta <- 0.05
conf <- 0.95
m <- c(5:15)
ss4HHSp(N, M, r, b, rho, P, delta, conf, m)