gw.glm.mc.test {lctools} | R Documentation |
Significance test for the spatial variation of the Generalised Geographically Weighted Regression local parameter estimates
Description
This function provides one approach for testing the significance of the spatial variation of the local parameter estimates resulted in by fitting a Generalised Geographically Weighted Regression (GGWR) model. The approach consists of a Monte Carlo simulation according to which: a) the data are spatially reallocated in a random way; b) GGWR models fit for the original and simulated spatial data sets; c) the variance of each variable for the original and simulated sets is then calculated; d) a pseudo p-value for each variable V
is calculated as p = (1+C)/(1+M)
where C
is the number of cases in which the simulated data sets generated variances of the local parameter estimates of the variable V
that were as extreme as the observed local parameter estimates variance of the variable in question and M
is the number of permutations. If p <= 0.05
it can be argued that the spatial variation of the local parameters estimates for a variable V
is statistically significant. For this approach, a minimum of 19 simulations is required.
Usage
gw.glm.mc.test(Nsim = 19, formula, family, dframe, bw, kernel, coords)
Arguments
Nsim |
a positive integer that defines the number of the simulation's iterations |
formula |
the local model to be fitted using the same syntax used in the glm function in R. This is a sting that is passed to the sub-models' |
family |
a description of the error distribution and link function to be used in the local model as in the |
dframe |
a numeric data frame of at least two suitable variables (one dependent and one independent) |
bw |
a positive number that may be an integer in the case of an "adaptive kernel" or a real in the case of a "fixed kernel". In the first case the integer denotes the number of nearest neighbours, whereas in the latter case the real number refers to the bandwidth (in meters if the coordinates provided are Cartesian). This argument can be also the result of a bandwidth selection algorithm such as those available in the function |
kernel |
the kernel to be used in the regression. Options are "adaptive" or "fixed". The weighting scheme used here is defined by the bi-square function |
coords |
a numeric matrix or data frame of two columns giving the X,Y coordinates of the observations |
Details
For 0.05 level of significance in social sciences, a minimum number of 19 simulations (Nsim >= 19) is required. We recommend at least 99 and at best 999 iterations.
Value
Returns a list of the simulated values, the observed the pseudo p-value of significance
var.lpest.obs |
a vector with the variances of the observed local parameter estimates for each variable in the model. |
var.SIM |
a matrix with the variance of the simulated local parameter estimates for each variable in the model |
var.SIM.c |
a matrix with the number of cases in which the simulated data set generated variances of the local parameter estimates of a variable |
pseudo.p |
a vector of pseudo p-values for all the parameters in the model (constant and variables). |
Warning
Large datasets may take way too long to perform this test.
Note
This function will be developed along with gw.glm.
Author(s)
Stamatis Kalogirou <stamatis.science@gmail.com>
References
Kalogirou, S. (2016) Destination Choice of Athenians: an application of geographically weighted versions of standard and zero inflated Poisson spatial interaction models, Geographical Analysis, 48(2),pp. 191-230. DOI: 10.1111/gean.12092 https://onlinelibrary.wiley.com/doi/abs/10.1111/gean.12092