tfr.predict.subnat {bayesTFR} | R Documentation |
Generates posterior trajectories of the total fertility rate for subregions of given countries, using the Scale-AR(1) method.
tfr.predict.subnat(countries, my.tfr.file,
sim.dir = file.path(getwd(), "bayesTFR.output"),
end.year = 2100, start.year = NULL, output.dir = NULL,
nr.traj = NULL, seed = NULL, min.tfr = 0.5,
ar.pars = c(mu = 1, rho = 0.92464, sigma = 0.04522),
save.as.ascii = 0, verbose = TRUE)
countries |
Vector of numerical country codes or country names. |
my.tfr.file |
Tab-separated ASCII file containing the subnational TFR data. See Details for more information on its format. |
sim.dir |
Simulation directory with the national projections generated using |
end.year |
End year of the projections. |
start.year |
Start year of the projections. By default, projections start at the same time point as the national projections. |
output.dir |
Directory into which the resulting prediction objects and the trajectories are stored. See below for details. |
nr.traj |
Number of trajectories to be generated. If |
seed |
Seed of the random number generator. If NULL no seed is set. It can be used to generate reproducible projections. |
min.tfr |
Lower bound on TFR. |
ar.pars |
List containing the parameter estimates of the AR(1) process. It must have elements called |
save.as.ascii |
Either a number determining how many trajectories should be converted into an ASCII file, or “all” in which case all trajectories are converted. By default no conversion is performed. |
verbose |
Logical switching log messages on and off. |
The function implements the methodology described in Sevcikova et al (2017). Given a set of national bayesTFR projections, it applies the Scale-AR(1) model to each national trajectory and each subregion of given countries which yield subnational TFR projections.
The file on subnational data passed in my.tfr.file
has to have a column “country_code” with numerical values corresponding to countries given in the argument countries
, and column “reg_code” giving the numerical identifier of each subregion. Column “name” should be used for subregion name, and column “country” for country name. An optional column “include_code” can be used to eliminate entries from processing. Entries with values of 1 or 2 will be included, all others will be ignored. Column “last.observed” can be used to define which time period contains the last observed data point (given as integer, e.g. year in the middle of the time period). Remaining columns define the time periods, e.g. “2000-2005”, “2005-2010”. The package contains an example of such dataset, see Example below.
Argument output.dir
gives a location on disk where results of the function should be stored. If it is NULL
(default), results are stored in the same directory as the national projections. In both cases a subdirectory called “subnat” is created in which each country has its own subfolder with the country code in its name. Each such subfolder contains the same type of outputs as in the national case generated using tfr.predict
, most importantly a directory “predictions” with trajectories for each region.
A list of objects of class bayesTFR.prediction
. The name of each element includes its country code. Not all elements of the class bayesTFR.prediction
are available. For example, no mcmc.set
is attached to these objects. Thus, not all functions that work with bayesTFR.prediction
can be applied to these results.
Even though the resulting object contains subnational results, the names of its elements are the same as in the national case. This allows to apply the same functions on both objects (subnational and national). However, it means that sometimes the meaning of the elements or function arguments does not match the subnational context. For example, various functions expect the argument country
. When a subnational object is passed to such a function, country
means a subregion.
Hana Sevcikova
Hana Sevcikova, Adrian E. Raftery, Patrick Gerland (2017). Probabilistic Projection of Subnational Total Fertility Rates. arXiv:1701.01787, https://arxiv.org/abs/1701.01787.
get.regtfr.prediction
, tfr.predict
# View the example data
my.subtfr.file <- file.path(find.package("bayesTFR"), 'extdata', 'subnational_tfr_template.txt')
subtfr <- read.delim(my.subtfr.file, check.names=FALSE)
head(subtfr)
# Directory with national projections (contains 30 trajectories for each country)
nat.dir <- file.path(find.package("bayesTFR"), "ex-data", "bayesTFR.output")
# Subnational projections for Australia and Canada
subnat.dir <- tempfile()
preds <- tfr.predict.subnat(c(36, 124), my.tfr.file=my.subtfr.file,
sim.dir=nat.dir, output.dir=subnat.dir, start.year=2013)
names(preds)
get.countries.table(preds[["36"]])
summary(preds[["36"]], "Queensland")
tfr.trajectories.plot(preds[["36"]], "Queensland")
# plot subnational and national TFR in one plot
nat.pred <- get.tfr.prediction(nat.dir)
tfr.trajectories.plot(preds[["36"]], 186, pi=80, half.child.variant=FALSE)
tfr.trajectories.plot(nat.pred, "Australia", half.child.variant=FALSE,
add=TRUE, col=rep("darkgreen", 5), nr.traj=0, show.legend=FALSE)
legend("topright", c("regional TFR", "national TFR"), col=c("red", "darkgreen"),
lty=1, bty='n')
# Retrieve trajectories
trajs.Alberta <- get.tfr.trajectories(preds[["124"]], "Alberta")
summary(t(trajs.Alberta))
# cleanup
unlink(subnat.dir)
# See more examples in ?get.regtfr.prediction