intervalPVA {imageData} | R Documentation |
Selects a subset of variables observed within a specified time interval using Principal Variable Analysis (PVA)
Description
Principal Variable Analysis (PVA) (Cummings, 2007) selects a subset from a set of the variables such that the variables in the subset are as uncorrelated as possible, in an effort to ensure that all aspects of the variation in the data are covered. Here, all observations in a specified time interval are used for calculation the correlations on which the selection is based.
Usage
intervalPVA(responses, data, times.factor = "Days", start.time, end.time,
nvarselect = NULL, p.variance = 1, include = NULL,
plot = TRUE, ...)
Arguments
responses |
A |
data |
A |
times.factor |
A |
start.time |
A |
end.time |
A |
nvarselect |
A |
p.variance |
A |
include |
A |
plot |
A |
... |
allows passing of arguments to other functions. |
Details
The variable that is most correlated with the other variables is selected first for inclusion. The partial correlation for each of the remaining variables, given the first selected variable, is calculated and the most correlated of these variables is selects for inclusion next. Then the partial correlations are adjust for the second included variables. This process is repeated until the specified criteria have been satisfied. The possibilities are to:
the default (
nvarselect = NULL
andp.variance = 1
) select all variables in increasing order of amount of information they provide;select exactly
nvarselect
variables;select just enough variables, up to a maximum of
nvarselect
variables, to explain at leastp.variance
*100 per cent of the total variance.
Value
A data.frame
giving the results of the variable selection.
It will contain the columns Variable
, Selected
,
h.partial
, Added.Propn
and Cumulative.Propn
.
Author(s)
Chris Brien
References
Cumming, J. A. and D. A. Wood (2007) Dimension reduction via principal variables. Computational Statistics and Data Analysis, 52, 550–565.
See Also
Examples
data(exampleData)
responses <- c("Area","Area.SV","Area.TV", "Image.Biomass", "Max.Height","Centre.Mass",
"Density", "Compactness.TV", "Compactness.SV")
results <- intervalPVA(responses, longi.dat,
start.time = "31", end.time = "31",
p.variance=0.9, plot = FALSE)