| EAimp {modi} | R Documentation |
Epidemic Algorithm for imputation of multivariate outliers in incomplete survey data.
Description
After running EAdet an imputation of the detected outliers with
EAimp may be run.
Usage
EAimp(
data,
weights,
outind,
reach = "max",
transmission.function = "root",
power = ncol(data),
distance.type = "euclidean",
duration = 5,
maxl = 5,
kdon = 1,
monitor = FALSE,
threshold = FALSE,
deterministic = TRUE,
fixedprop = 0
)
Arguments
data |
a data frame or matrix with the data. |
weights |
a vector of positive sampling weights. |
outind |
a logical vector with component |
reach |
reach of the threshold function (usually set to the maximum
distance to a nearest neighbour, see internal function |
transmission.function |
form of the transmission function of distance d:
|
power |
sets |
distance.type |
distance type in function |
duration |
the duration of the detection epidemic. |
maxl |
maximum number of steps without infection. |
kdon |
the number of donors that should be infected before imputation. |
monitor |
if |
threshold |
Infect all remaining points with infection probability above
the threshold |
deterministic |
if |
fixedprop |
if |
Details
EAimp uses the distances calculated in EAdet (actually the
counterprobabilities, which are stored in a global data set) and starts an
epidemic at each observation to be imputed until donors for the missing values
are infected. Then a donor is selected randomly.
Value
EAimp returns a list with two components: parameters and
imputed.data.
parameters contains the following elements:
sample.sizeNumber of observations
number.of.variablesNumber of variables
n.complete.recordsNumber of records without missing values
n.usable.recordsNumber of records with less than half of values missing (unusable observations are discarded)
durationDuration of epidemic
reachTransmission distance (
d0)thresholdInput parameter
deterministicInput parameter
computation.timeElapsed computation time
imputed.data contains the imputed data.
Author(s)
Beat Hulliger
References
Béguin, C. and Hulliger, B. (2004) Multivariate outlier detection in incomplete survey data: the epidemic algorithm and transformed rank correlations, JRSS-A, 167, Part 2, pp. 275-294.
See Also
EAdet for outlier detection with the Epidemic Algorithm.
Examples
data(bushfirem, bushfire.weights)
det.res <- EAdet(bushfirem, bushfire.weights)
imp.res <- EAimp(bushfirem, bushfire.weights, outind = det.res$outind, kdon = 3)
print(imp.res$output)