valstat {EMAR} | R Documentation |
Validation statistics for model assessment
Description
This function helps users to generate 16 fit indices for model assessment based on independent/validation data set. In addition to empirical models, the function valstat()
can generate fit indices for AI-based models such as artificial neural network, supervise vector machine, etc.
Usage
valstat(obs.y, pred.y, p)
Arguments
obs.y |
observed values from the independent/validation data |
pred.y |
predicted values from the model |
p |
number of parameters in the model. This is needed to compute the 'criteria-based indices' and adjusted coefficient of determination. Users could enter any value for AI-based models with an unknown number of parameters (p) and assess their models using the indices that are invariant of p. See the section on note. |
Value
n: number of observation in the validation data, SSR: residual sum of squares, TRE: total relative error, Bias: mean bias, MRB: mean relative bias, MAB: mean absolute bias, MAPE: mean absolute percentage error, MSE: mean squared error, RMSE: root mean squared error, Percent.RMSE: percentage root mean squared error, R2: coefficient of determination, R2adj: adjusted coefficient of determination, APC: Amemiya's prediction criterion, logL: Log-likelihood, AIC: Akaike information criterion, AICc: corrected Akaike information criterion, BIC: Bayesian information criterion, HQC: Hannan-Quin information criterion.
Note
The lower the better for the SSR, TRE, Bias, MRB, MAB, MAPE, MSE, RMSE, Percent.RMSE, APC, logL, AIC, AICc, BIC and HQC indices. The higher the better for R2 and R2adj indices. Users can choose which indices to use to evaluate their models from the output. Given the difficulty of determining the number of parameters (p) in AI-based models, users might consider using error-based indices, and coefficients of determination (R2).
Author(s)
Ogana F.N. and Corral-Rivas S.
See Also
fitstat()
, which gives the fit indices of the model based on the fitting data
Examples
library(EMAR)
# fitting data
Age <- 1:50
Yield <- exp(6.5 - 39.5/Age)
dat <- data.frame(Age, Yield)
# fit the model to the fitting data
Eq01 <- lm(Yield ~ Age, data=dat)
# independent/validation data
test_data <- data.frame(Age=1:50, Yield=2.5*Age^1.4)
# predict with the model i.e. Eq01, using the independent/validation data
test_data$pred.Yield <- predict(Eq01, test_data)
# Evaluate the quality of the prediction using the 'valstat' function.
# You need the observed and predicted values. Specify the number of parameters in the model.
valstat(test_data$Yield, test_data$pred.Yield, 2)