suda2 {sdcMicro} | R Documentation |
Suda2: Detecting Special Uniques
Description
SUDA risk measure for data from (stratified) simple random sampling.
Usage
suda2(obj, ...)
Arguments
obj | a |
... | see arguments below |
Details
SUDA2 is a recursive algorithm for finding Minimal Sample Uniques (MSUs). The algorithm generates all possible subsets of the defined categorical key variables and scans each subset for value patterns that occur only once. The fewer variables needed to make an observation unique, the higher the risk of that observation.
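The notion of a minimal sample unique can be illustrated with a small brute-force sketch in base R. This is not the recursive SUDA2 search that suda2() performs, and it computes no scores; the toy data and the helper names is_unique and find_msus are hypothetical and used only for illustration.

```r
# Toy data: four records, three categorical key variables (all hypothetical).
toy <- data.frame(
  region = c("A", "A", "B", "B"),
  sex    = c("m", "f", "m", "m"),
  age    = c("young", "old", "old", "old"),
  stringsAsFactors = FALSE
)

# TRUE for each row whose value combination on `vars` occurs exactly once.
is_unique <- function(data, vars) {
  key <- do.call(paste, c(unname(as.list(data[vars])), sep = "\r"))
  ave(seq_along(key), key, FUN = length) == 1
}

# Brute-force search for the minimal sample uniques of one row: variable
# subsets on which the row is unique and no proper subset is also unique.
find_msus <- function(data, row, key_vars = names(data)) {
  msus <- list()
  for (k in seq_along(key_vars)) {
    for (s in combn(key_vars, k, simplify = FALSE)) {
      # A superset of an already found MSU is unique but not minimal.
      has_smaller <- any(vapply(msus, function(m) all(m %in% s), logical(1)))
      if (!has_smaller && is_unique(data, s)[row]) {
        msus[[length(msus) + 1]] <- s
      }
    }
  }
  msus
}

find_msus(toy, 1)  # MSUs of row 1: "age" and c("region", "sex")
find_msus(toy, 3)  # empty list: rows 3 and 4 are identical
```

Row 1 is already unique on a single variable (age), so in the sense described above it is riskier than a row that only becomes unique on two or more variables. Subsets are visited in order of increasing size, so any unique subset that contains no previously found MSU is itself minimal.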
Value
A modified sdcMicroObj object or the following list
- ContributionPercent: The contribution of each key variable to the SUDA score, calculated for each row.
- score: The SUDA score.
- disscore: The DIS-SUDA score.
- attribute_contributions: a data.frame showing how much of the total risk is contributed by each variable. This information is stored in the following two variables:
  - variable: containing the name of the variable
  - contribution: contains how much risk a variable contributes to the total risk.
- attribute_level_contributions: returns the risks of each attribute level as a data.frame with the following three columns:
  - variable: the variable name
  - attribute: holding relevant level-codes
  - contribution: contains the risk of this level within the variable.
Note
In versions later than 5.0.2, the computation of SUDA scores has changed and by default follows the description in the original paper by Elliot et al.
Author(s)
Alexander Kowarik and Bernhard Meindl (based on the C++ code from the Organisation for Economic Co-operation and Development).
For the C++ code: This work is being supported by the International Household Survey Network and funded by a DGF Grant provided by the World Bank to the PARIS21 Secretariat at the Organisation for Economic Co-operation and Development (OECD). This work builds on previous work which is elsewhere acknowledged.
References
C. J. Skinner; M. J. Elliot (20xx) A Measure of Disclosure Risk for Microdata. Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 64 (4), pp 855–867.
M. J. Elliot, A. Manning, K. Mayes, J. Gurd and M. Bane (20xx) SUDA: A Program for Detecting Special Uniques, Using DIS to Modify the Classification of Special Uniques
Anna M. Manning, David J. Haglin, John A. Keane (2008) A recursive search algorithm for statistical disclosure assessment. Data Min Knowl Disc 16:165–196
Templ, M. Statistical Disclosure Control for Microdata: Methods and Applications in R. Springer International Publishing, 287 pages, 2017. ISBN 978-3-319-50272-4. doi:10.1007/978-3-319-50272-4