| cmnorm {mnorm} | R Documentation |
Parameters of conditional multivariate normal distribution
Description
This function calculates mean (expectation) and covariance matrix of conditional multivariate normal distribution.
Usage
cmnorm(
mean,
sigma,
given_ind,
given_x,
dependent_ind = numeric(),
is_validation = TRUE,
is_names = TRUE,
control = NULL,
n_cores = 1L
)
Arguments
mean |
numeric vector representing expectation of multivariate normal vector (distribution). |
sigma |
positively defined numeric matrix representing covariance matrix of multivariate normal vector (distribution). |
given_ind |
numeric vector representing indexes of multivariate
normal vector which are conditioned at values given by
|
given_x |
numeric vector which |
dependent_ind |
numeric vector representing indexes of unconditional elements (components) of multivariate normal vector. |
is_validation |
logical value indicating whether input
arguments should be validated. Set it to |
is_names |
logical value indicating whether output
values should have row and column names. Set it to |
control |
a list of control parameters. See Details. |
n_cores |
positive integer representing the number of CPU cores
used for parallel computing. Currently it is not recommended to set
|
Details
Consider m-dimensional multivariate normal vector
X=(X_{1},...,X_{m})^{T}~\sim N(\mu,\Sigma), where E(X)=\mu and
Cov(X)=\Sigma are expectation (mean) and covariance matrix
respectively.
Let's denote vectors of indexes of conditioned and unconditioned elements of X
by I_{g} and I_{d} respectively. By x^{(g)} denote
deterministic (column) vector of given values of X_{I_{g}}. The
function calculates expected value and covariance matrix of conditioned
multivariate normal vector X_{I_{d}} | X_{I_{g}} = x^{(g)}. For example
if I_{g}=(1, 3) and x^{(g)}=(-1, 1) then I_{d}=(2, 4, 5)
so the function calculates:
\mu_{c}=E\left(\left(X_{2}, X_{4}, X_{5}\right) | X_{1} = -1, X_{3} = 1\right)
\Sigma_{c}=Cov\left(\left(X_{2}, X_{4}, X_{5}\right) |
X_{1} = -1, X_{3} = 1\right)
In general case:
\mu_{c} = E\left(X_{I_{d}} | X_{I_{g}} = x^{(g)}\right) =
\mu_{I_{d}} +
\left(x^{(g)} - \mu_{I_{g}}\right)
\left(\Sigma_{(I_{d}, I_{g})}
\Sigma_{(I_{g}, I_{g})}^{-1}\right)^{T}
\Sigma_{c} = Cov\left(X_{I_{d}} | X_{I_{g}} = x^{(g)}\right) =
\Sigma_{(I_{d}, I_{d})} -
\Sigma_{(I_{d}, I_{g})}
\Sigma_{(I_{g}, I_{g})}^{-1}
\Sigma_{(I_{g}, I_{d})}
Note that \Sigma_{(A, B)}, where A,B\in\{d, g\},
is a submatrix of \Sigma generated by intersection of I_{A}
rows and I_{B} columns of \Sigma.
Below there is a correspondence between aforementioned theoretical (mathematical) notations and function arguments:
-
mean-\mu. -
sigma-\Sigma. -
given_ind-I_{g}. -
given_x-x^{(g)}. -
dependent_ind-I_{d}
Moreover \Sigma_{(I_{g}, I_{d})} is a theoretical (mathematical)
notation for sigma[given_ind, dependent_ind]. Similarly \mu_{g}
represents mean[given_ind].
By default dependent_ind contains all indexes that are not
in given_ind. It is possible to omit and duplicate indexes of
dependent_ind. But at least single index should be provided for
given_ind without any duplicates. Also dependent_ind and
given_ind should not have the same elements. Moreover given_ind
should not be of the same length as mean so at least one component
should be unconditioned.
If given_x is a vector then (if possible) it will be treated as
a matrix with the number of columns equal to the length of mean.
Currently control has no input arguments intended for
the users. This argument is used for some internal purposes
of the package.
Value
This function returns an object of class "mnorm_cmnorm".
An object of class "mnorm_cmnorm" is a list containing the
following components:
-
mean- conditional mean. -
sigma- conditional covariance matrix. -
sigma_d- covariance matrix of unconditioned elements. -
sigma_g- covariance matrix of conditioned elements. -
sigma_dg- matrix of covariances between unconditioned and conditioned elements. -
s12s22- equals to the matrix product ofsigma_dgandsolve(sigma_g).
Note that mean corresponds to \mu_{c} while sigma
represents \Sigma_{c}. Moreover sigma_d is
\Sigma_{I_{d}, I_{d}}, sigma_g is \Sigma_{I_{g}, I_{g}}
and sigma_dg is \Sigma_{I_{d}, I_{g}}.
Since \Sigma_{c} do not depend on
X^{(g)} the output sigma does not depend on given_x.
In particular output sigma remains the same independent of whether
given_x is a matrix or vector. Oppositely if given_x is
a matrix then output mean is a matrix which rows correspond
to conditional means associated with given values provided by corresponding
rows of given_x.
The order of elements of output mean and output sigma depends
on the order of dependet_ind elements that is ascending by default.
The order of given_ind elements does not matter. But, please, check
that the order of given_ind match the order of given values i.e.
the order of given_x columns.
Examples
# Consider multivariate normal vector:
# X = (X1, X2, X3, X4, X5) ~ N(mean, sigma)
# Prepare multivariate normal vector parameters
# expected value
mean <- c(-2, -1, 0, 1, 2)
n_dim <- length(mean)
# correlation matrix
cor <- c( 1, 0.1, 0.2, 0.3, 0.4,
0.1, 1, -0.1, -0.2, -0.3,
0.2, -0.1, 1, 0.3, 0.2,
0.3, -0.2, 0.3, 1, -0.05,
0.4, -0.3, 0.2, -0.05, 1)
cor <- matrix(cor, ncol = n_dim, nrow = n_dim, byrow = TRUE)
# covariance matrix
sd_mat <- diag(c(1, 1.5, 2, 2.5, 3))
sigma <- sd_mat %*% cor %*% t(sd_mat)
# Estimate parameters of conditional distribution i.e.
# when the first and the third components of X are conditioned:
# (X2, X4, X5 | X1 = -1, X3 = 1)
given_ind <- c(1, 3)
given_x <- c(-1, 1)
par <- cmnorm(mean = mean, sigma = sigma,
given_ind = given_ind,
given_x = given_x)
# E(X2, X4, X5 | X1 = -1, X3 = 1)
par$mean
# Cov(X2, X4, X5 | X1 = -1, X3 = 1)
par$sigma
# Additionally calculate E(X2, X4, X5 | X1 = 2, X3 = 3)
given_x_mat <- rbind(given_x, c(2, 3))
par1 <- cmnorm(mean = mean, sigma = sigma,
given_ind = given_ind,
given_x = given_x_mat)
par1$mean
# Duplicates and omitted indexes are allowed for dependent_ind
# For given_ind duplicates are not allowed
# Let's calculate conditional parameters for (X5, X2, X5 | X1 = -1, X3 = 1):
dependent_ind <- c(5, 2, 5)
par2 <- cmnorm(mean = mean, sigma = sigma,
given_ind = given_ind,
given_x = given_x,
dependent_ind = dependent_ind)
# E(X5, X2, X5 | X1 = -1, X3 = 1)
par2$mean
# Cov(X5, X2, X5 | X1 = -1, X3 = 1)
par2$sigma