logst {relevance} | R Documentation |
Started Logarithmic Transformation
Description
Transforms the data by a log10 transformation, modifying small and zero observations such that the transformation yields finite values.
Usage
logst(data, calib=data, threshold=NULL, mult = 1)
Arguments
data |
a vector or matrix of data, which is to be transformed |
calib |
a vector or matrix of data used to calibrate the
transformation(s),
i.e., to determine the constant |
threshold |
constant c that determines the transformation, possibly a vector with a value for each variable. |
mult |
a tuning constant affecting the transformation of small values, see Details |
Details
Small values are determined by the threshold c. If not given by the
argument threshold
, then it is determined by the quartiles
q_1
and q_3
of the non-zero data as those
smaller than c=q_1 / (q_3/q_1)^{mult}
.
The rationale is that for lognormal data, this constant identifies
2 percent of the data as small.
Beyond this limit, the transformation continues linear with the
derivative of the log curve at this point. See code for the formula.
The function chooses log10 rather than natural logs because they can be backtransformed relatively easily in the mind.
Value
the transformed data. The value c needed for the transformation is
returned as attr(.,"threshold")
.
Note
The names of the function alludes to Tudey's idea of "started logs".
Author(s)
Werner A. Stahel, ETH Zurich
Examples
dd <- c(seq(0,1,0.1),5*10^rnorm(100,0,0.2))
dd <- sort(dd)
r.dl <- logst(dd)
plot(dd, r.dl, type="l")
abline(v=attr(r.dl,"threshold"),lty=2)