R: Sample Data for Small Area Estimation using Averaging Pseudo...

dataVill {saePseudo}

R Documentation

Sample Data for Small Area Estimation using Averaging Pseudo Area Level Model

Dataset to simulate Small Area Estimation using Averaging Pseudo Area Level Model This data is generated by these following steps:

Generate population data consisting Area1 (province), Area2 (region), Area3 (sub-district), Area4 (village), and Unit. The auxiliary variabels are generated by Uniform distribution with (x1 ~ U(40, 100)) and Normal distribution with (x2 ~ N(70, 5)). The coefficient parameters are set as \beta_{0} = 0.5, \beta_{1} = 0.2, and \beta_{2} = 0.2
Calculate y_{k} = \beta_{0}+\beta_{1}*x1_{k}+\beta_{2}*x_2{k}
Generate number of sample with simple random sampling with replacement
Calculate ydir_area4 = \frac{\Sigma{y_{k}}}{n}, vardir_area4 = \frac{\Sigma{(y_{k}-\frac{\Sigma{y_{k}}}{n})^2}}{n-1}, and auxiliary variable X1 = \frac{\Sigma{x1_{k}}}{n}
Calculate N (number of population) based on the initial population generated for each Area4 selected as a sample
Area1, Area2, Area3, Area4, ydir_area4, vardir_area4, X1, and N are combined in a dataframe called dataVill.

dataVill

A data frame with 83 observations on the following 8 variables:

[Package saePseudo version 0.1.0 Index]