oa.design {DoE.base} | R Documentation |
Function for accessing orthogonal arrays
Description
Function for accessing orthogonal arrays, allowing limited optimal allocation of columns
Usage
oa.design(ID=NULL, nruns=NULL, nfactors=NULL, nlevels=NULL,
factor.names = if (!is.null(nfactors)) {
if (nfactors <= 50) Letters[1:nfactors]
else paste("F", 1:nfactors, sep = "")}
else NULL,
columns="order",
replications=1, repeat.only=FALSE,
randomize=TRUE, seed=NULL, min.residual.df=0, levordold = FALSE)
origin(ID)
Arguments
ID |
orthogonal array to be used; must be given as the name without quotes
(e.g. |
nruns |
minimum number of runs to be used,
can be omitted if obvious from |
nfactors |
number of factors;
only needed if |
nlevels |
number(s) of levels, vector with |
factor.names |
a character vector of |
columns |
EITHER |
replications |
the number of replications of the array,
the setting of |
repeat.only |
default |
randomize |
logical indicating whether the run order is to be randomized ? |
seed |
integer seed for the random number generator |
min.residual.df |
minimum number of residual degrees of freedom; |
levordold |
logical indicating whether or not old (=pre version 0.27)
level ordering should be used; |
Details
Package DoE.base is described in Groemping (2018).
The paper also has detailed material on function oa.design
.
Function oa.design
assigns factors to the columns of orthogonal arrays that are
available within package DoE.base or are provided by the user.
The available arrays and their properties are listed in the
data frame oacat
and can be systematically searched for using function show.oas
.
The design names also indicate the number of runs and the numbers of factors
for each number of levels, e.g. L18.3.6.6.1
is an 18 run design with six factors in 3 levels
(3.6
) and one factor in 6 levels (6.1
).
oa
is the S3 class used for orthogonal arrays. Objects of class oa
should at least have
the attribute origin
, an attribute comment
should be used for additional information.
Users can define their own orthogonal arrays and hand them to oa.design
with parameter ID
.
Requirements for the arrays:
Factor levels must be coded as numbers from 1 to number of levels.
The array must be of classes
oa
andmatrix
(If your array is a matrix namedfoo
, you can simply assign it classoa
by the commandclass(foo) <- c("oa","matrix")
, see also last example.)The array should have an attribute
origin
.The array can have an attribute
comment
;
this should be used for mentioning specific properties, e.g. for theL18.2.1.3.7
that the interaction of the first two factors can be estimated.
Users are encouraged to send additional arrays to the package maintainer.
The requirements for these are the same as listed above, with attribute origin
being a MUST in this case. (See the last example for how to assign an attribute.)
For relatively small important applications, creation of a tailor-made array of class oa
can
be attempted with package DoE.MIParray, which uses mixed integer optimization for creating
a design from scratch (see Groemping and Fontana 2019 for the algorithm behind that approach).
The data frame oacat
lists the orthogonal arrays from Warren Kuhfelds
collection of “parent” and “child” arrays. The parent arrays,
plus a few additional arrays, are directly exported from the package namespace.
The child arrays from Kuhfelds collection can be constructed from these, using
the replacement instructions provided in the variable lineage
of oacat
.
The last example below indicates how
a child array can be created manually, and compares this to the automatically created array.
(A lot more than just the child arrays could be obtained from these arrays
by implementing a functionality similar to the market research macros available in SAS; presumably,
this topic will not be addressed soon, as it will involve a substantial amount of work.)
Furthermore, there are stronger arrays
(at least resolution IV) in the catalogue oacat3
. Since version 1.1,
function oa.design
uses the stronger arrays, where possible.
If no specific orthogonal array is specified and function oa.design
does not
find an orthogonal array that meets the specified requirements,
oa.design
returns a full factorial, replicated for enough residual degrees of freedom, if necessary.
If oa.design
has not found an array smaller than the full factorial, it is
absolutely possibly that a smaller array does exist nevertheless. It may be worth
while checking with oacat whether an appropriate smaller array can be found by
combining some of the parent arrays listed there (looking for a design with a few
factors in 5 runs, you may e.g. call oacat[oacat$n5>0,]$name
in order to see the
names of more promising candidate arrays for combination, or you may also want to look
up arrays with n25>0
subsequently.
With version 0.9-18 of the package,
the possibility for an automatic allocation of columns for improved design
performance was implemented. With version 0.23, this approach has been sped
up and extended to properly cover relative projection aberration according to
Groemping (2011) with and without step (b) (see below) (the previous choice
"maxGR.min34"
was modified and renamed to "minRelProjAberr"
).
Because of performance reasons, and because of a lack of a clear best default,
optimum column allocation is not switched on per default.
However, with the default column order from left to right,
the package always issues a warning to remind users that an automatic unoptimized design
can be quite far from ideal.
If optimization is activated, the first step is selection of an array,
either explicitly by the user (option ID
) or automatically (unoptimized)
according to the required combination of factors. Within that array, the following
choices for the column
option are on offer:
- "order"
the default choice; allocates factors from left to right, which is what most software does (but what is not necessarily good, see also the example section)
"min3"
recommended, if
"min34"
is not affordable; aliasing between main effects and 2-factor interactions is kept to a minimal degree, minimizing the number of generalized words of length 3 according to Xu and Wu (2001)- "min3.rela"
the same approach is taken, but with relative number of generalized words according to Groemping (2011)
- "min34"
recommended, if affordable; beware the time demand; this requests that the number of words of generalized length 4 is also minimized.
- "min34.rela"
again takes the same approach, but with relative number of generalized words according to Groemping (2011)
- "minRPFT"
minimizes the relative projection frequency table, applying the approach according to Groemping (2011) without step (b) (see next entry).
- "minRelProjAberr"
applies minimum relative projection aberration according to Groemping (2011) ((a): maximize generalized resolution, (b): minimize total relative number of shortest words, (c) rank designs according to relative projection frequency table (obtainable with P3.3 or P4.4, depending on resolution) and (d) resolve ties by looking at absolute number of length 4 words in case of resolution III).
WARNING: Usually, it is recommended to investigate the properties of a design automatically
created by function oa.design
before starting experimentation.
While all designs can estimate main effects in the absence of interactions,
the presence of interactions may render some designs useless or even dangerous.
Deliberate choice of columns different from the default may improve a design
(see example section)!
Mathematical comment on the expansion example:
There are 720 different ways to expand the unique L18.3.6.6.1
into an L18.2.1.3.7
, depending on which row of the replacement design
nest.des
is assigned to which level of the 6 level factor;
for qualitative factors,
60 of these are potentially non-isomorphic (divide 720 by the 2 * 3! ways
of permuting levels within a factor; there are
more possibly different arrays for quantitative 3 level factors, since arbitrary
relabelling of the levels is no longer isomorphic).
According to Eric Schoen (personal communication), for this particular case,
all the resulting children
are isomorphic to each other and are also isomorphic to the Taguchi L18.
To see isomorphism of two designs is not easy; in the example, nest.des
has been prepared such that it is easy to see isomorphism of the resulting
child to the Taguchi L18
: L18
is reproduced by
assigning the first row of nest.des
to level 1 etc.,
except for a swap of columns G
and H
.
Value
oa.design
returns a data frame of S3 class design
with attributes attached.
In the data frame itself, the experimental factors are all stored as R factors.
For factors with 2 levels, contr.FrF2
contrasts (-1 / +1) are used.
For factors with more than 2 numerical levels, polynomial contrasts are used
(i.e. analyses will per default use orthogonal polynomials).
For factors with more than 2 categorical levels, the default contrasts are used.
Future versions will most likely allow more user control about the type of
contrasts to be used.
The desnum
and run.order
attributes of class design
are
as usual. In the design.info
attribute, the following elements are specific for
this type of designs:
type |
is |
nlevels |
vector containing the number of levels for each factor |
generating.oa |
contains information on the generating orthogonal array, |
selected.columns |
contains information, which column of the orthogonal array underlies which factor, |
origin |
contains the respective attribute of the orthogonal array, |
comment |
contains the respective attribute of the orthogonal array, |
residual.df |
contains the requested residual degrees of freedom for a main effects model. |
Other information is generic, like documented for class design
.
Function origin
returns the origin attribute of the orthogonal array ID
,
functions comment
and "comment<-"
from package base
return and set the comment attribute.
Warning
Since version 1.1 of the package, strength 3 arrays are automatically used, if available.
This changes the behavior of function oa.design
for situations for requests with a
combination of nruns
and nlevels
for which a strength 3 array exists in oacat3
.
If the old behavior is required for reproducing a previously-created array, it is
possible to set oacat3
to NULL
by the command assignInNamespace("oacat3", NULL, pos="package:DoE.base")
;
this temporary replacement of oacat3
with NULL
remains in effect
for the current R session; detaching it (with namespace unloading) and reloading is possible but can also go wrong;
therefore, it is recommended to only use the above technique if you are prepared to restart the
R session before using the original version of oacat3
.
Since R version 3.6.0, the behavior of function sample
has changed
(correction of a biased previous behavior that should not be relevant for the randomization of designs).
For reproducing a randomized design that was produced with an earlier R version,
please follow the steps described with the argument seed
.
Note
This package is still under development. Suggestions and bug reports are welcome.
Author(s)
Ulrike Groemping
References
Groemping, U. (2011). Relative projection frequency tables for orthogonal arrays. Report 1/2011, Reports in Mathematics, Physics and Chemistry http://www1.bht-berlin.de/FB_II/reports/welcome.htm, Department II, Berliner Hochschule fuer Technik (formerly Beuth University of Applied Sciences), Berlin.
Groemping, U. (2018). R Package DoE.base for Factorial Designs. Journal of Statistical Software 85(5), 1–41.
Hedayat, A.S., Sloane, N.J.A. and Stufken, J. (1999) Orthogonal Arrays: Theory and Applications, Springer, New York.
Kuhfeld, W. (2009). Orthogonal arrays. Website courtesy of SAS Institute https://support.sas.com/techsup/technote/ts723b.pdf and references therein.
Schoen, E. (2009). All orthogonal arrays with 18 runs. Quality and Reliability Engineering International 25, 467–480.
Xu, H.-Q. and Wu, C.F.J. (2001). Generalized minimum aberration for asymmetrical fractional factorial designs. Annals of Statistics 29, 1066–1077.
See Also
See Also FrF2
, fac.design
, pb
Examples
## smallest available array for 6 factors with 3 levels each
oa.design(nfactors=6, nlevels=3)
## level combination for which only a full factorial is (currently) found
oa.design(nlevels=c(4,3,3,2))
## array requested via factor.names
oa.design(factor.names=list(one=c("a","b","c"), two=c(125,275),
three=c("old","new"), four=c(-1,1), five=c("min","medium","max")))
## array requested via character factor.names and nlevels
## (with a little German lesson for one two three four five)
oa.design(factor.names=c("eins","zwei","drei","vier","fuenf"), nlevels=c(2,2,2,3,7))
## array requested via explicit name, Taguchi L18
oa.design(ID=L18)
## array requested via explicit name, with column selection
oa.design(ID=L18.3.6.6.1,columns=c(2,3,7))
## array requested with nruns, not very reasonable
oa.design(nruns=12, nfactors=3, nlevels=2)
## array requested with min.residual.df
oa.design(nfactors=3, nlevels=2, min.residual.df=12)
## examples showing alias structures and their improvment with option columns
plan <- oa.design(nfactors=6,nlevels=3)
plan
## generalized word length pattern
length3(plan)
## length3 (first element of GWP) can be slightly improved by columns="min3"
plan <- oa.design(nfactors=6,nlevels=3,columns="min3")
summary(plan) ## the first 3-level column of the array is not used
length3(plan)
plan <- oa.design(nlevels=c(2,2,2,6))
length3(plan)
plan.opt <- oa.design(nlevels=c(2,2,2,6),columns="min3") ## substantial improvement
length3(plan.opt)
length4(plan.opt)
## visualize practical relevance of improvement:
## for optimal plan, all 3-dimensional projections are full factorials
plot(plan, select=1:3)
plot(plan, select=c(1,2,4))
plot(plan, select=c(1,3,4))
plot(plan, select=2:4)
plot(plan.opt, select=1:3)
plot(plan.opt, select=c(1,2,4))
plot(plan.opt, select=c(1,3,4))
plot(plan.opt, select=2:4)
## The last example:
## generate an orthogonal array equivalent to Taguchi's L18
## by combining L18.3.6.6.1 with a full factorial in 2 and 3 levels
show.oas(nruns=18, parents.only=FALSE)
## lineage entry leads the way:
## start from L18.3.6.6.1
## insert L6.2.1.3.1 for the 6 level factor
## prepare the parent
parent.des <- L18.3.6.6.1
colnames(parent.des) <- c(Letters[3:9])
## new columns will become A and B
## 6-level design can be created by fac.design or expand.grid or cbind
nest.des <- as.matrix(expand.grid(1:3,1:2))[c(1:3,5,6,4),c(2,1)]
## want first column to change most slowly
## want resulting design to be easily transformable into Taguchi L18
## see mathematical comments in section Details
colnames(nest.des) <- c("A","B")
## do the expansion (see mathematical comments in section Details)
## using function expansive.replace
L18.2.1.3.7.manual <- expansive.replace(parent.des, nest.des)[,c(7:8,1:6)]
L18.2.1.3.7.manual <- L18.2.1.3.7.manual[ord(L18.2.1.3.7.manual),] ## sort array
rownames(L18.2.1.3.7.manual) <- 1:18
## (ordering is not necessary, just **tidy**)
## prepare for using it with function oa.design
## note: function expansive.replace creates a matrix of class "oa"
## rearranging the columns removed that class and makes it necessary
## to add the class again for using the array in DoE.base
attr(L18.2.1.3.7.manual, "origin") <-
c(show.oas(name="L18.2.1.3.7", parents.only=FALSE,show=0)$lineage,
"unconventional order")
class(L18.2.1.3.7.manual) <- c("oa", "matrix")
comment(L18.2.1.3.7.manual) <- "Interaction of first two factors estimable"
## indicates that first two factors are full factorial from 6-level factor
origin(L18.2.1.3.7.manual)
comment(L18.2.1.3.7.manual)
L18 ## Taguchi array
L18.2.1.3.7.manual ## manually expanded array
oa.design(L18.2.1.3.7, randomize=FALSE)
## automatically expanded array
P3.3(L18.2.1.3.7.manual) ## length 3 pattern of 3 factor projections
## this also identifies the array as isomorphic to L18
## according to Schoen 2009
## the array can now be used in oa.design, like the built-in arrays
oa.design(ID=L18.2.1.3.7.manual,nfactors=7,nlevels=3)