dmean {synthesizer}R Documentation

Summarization of location and spread between synthetic and real data

Description

For each numerical variable in the two datasets, compute the relative difference between the mean (standard deviation) of the real data and the mean (standard deviation) of the synthetic data. The summary is the average of these relative differences over all numerical variables.

Usage

dmean(synth, real, tol = 1e-08, ...)

dsd(synth, real, tol = 1e-08, ...)

Arguments

synth

[data.frame] Synthetic data

real

[data.frame] Real data

tol

[numeric] Nonnegative tolerance. If the absolute mean (standard deviation) of a variable is smaller than tol, it is considered zero. In that case the absolute difference instead of the absolute relative difference is computed.

...

Arguments passed to mean. e.g. use trim=c(0.01,0.99) for mean estimation that is less sensitive to outliers.

Note

Real and synthetic data are expected to have the same column names, orders, and data types.

See Also

Other measures: dcor(), pmse(), qa()

Other measures: dcor(), pmse(), qa()

Examples


dmean(cars, cars) # 0
dmean(synthesize(cars), cars)

dsd(cars, cars) # 0
dsd(synthesize(cars), cars)



[Package synthesizer version 0.2.0 Index]