read_abs {readabs} | R Documentation |
Download, extract, and tidy ABS time series spreadsheets
Description
read_abs() downloads ABS time series spreadsheets, extracts the data from
those spreadsheets, and tidies the data. The result is a single data frame
(tibble) containing the tidied data.
Usage
read_abs(
  cat_no = NULL,
  tables = "all",
  series_id = NULL,
  path = Sys.getenv("R_READABS_PATH", unset = tempdir()),
  metadata = TRUE,
  show_progress_bars = TRUE,
  retain_files = TRUE,
  check_local = TRUE,
  release_date = "latest"
)
read_abs_series(series_id, ...)
Arguments
cat_no |
ABS catalogue number, as a string, including the extension. For example, "6202.0". |
tables |
numeric. Time series tables in |
series_id |
(optional) character. Supply an ABS unique time series
identifier (such as "A2325807L") to get only that series.
This is an alternative to specifying |
path |
Local directory in which downloaded ABS time series
spreadsheets should be stored. By default, |
metadata |
logical. If |
show_progress_bars |
TRUE by default. If set to FALSE, progress bars will not be shown when ABS spreadsheets are downloading. |
retain_files |
when TRUE (the default), the spreadsheets downloaded
from the ABS website will be saved in the directory specified with |
check_local |
If |
release_date |
Either |
... |
Arguments to |
Details
read_abs_series() is a wrapper around read_abs(), with series_id as the
first argument.
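The wrapper pattern described above can be sketched as follows. This is only an illustration of how a function with series_id first can forward everything else through `...`; it is not the package's source. `stand_in_read_abs()` and `stand_in_series()` are hypothetical names, and the stand-in only reports the arguments it receives so that the sketch runs without downloading anything.

```r
# Hypothetical stand-in for read_abs(): it downloads nothing and simply
# returns the arguments it was called with.
stand_in_read_abs <- function(cat_no = NULL, tables = "all",
                              series_id = NULL, ...) {
  list(cat_no = cat_no, tables = tables, series_id = series_id,
       extra = list(...))
}

# Hypothetical wrapper in the style of read_abs_series(): series_id comes
# first and every other argument is forwarded unchanged.
stand_in_series <- function(series_id, ...) {
  stand_in_read_abs(series_id = series_id, ...)
}

result <- stand_in_series(c("A2325806K", "A2325807L"), metadata = FALSE)
```

Here result$series_id holds the two IDs, result$cat_no stays NULL, and metadata = FALSE arrives untouched in result$extra.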
read_abs() downloads spreadsheet(s) from the ABS containing time series
data. These files need to be saved somewhere on your disk. This local
directory can be controlled using the path argument to read_abs(). If the
path argument is not set, read_abs() will store the files in the directory
set in the "R_READABS_PATH" environment variable. If this variable isn't
set, files will be saved in a temporary directory.
To check the value of the "R_READABS_PATH" variable, run
Sys.getenv("R_READABS_PATH"). You can set the value of this variable for a
single session using Sys.setenv(R_READABS_PATH = <path>). If you would like
to change this variable for all future R sessions, edit your .Renviron file
and add a R_READABS_PATH = <path> line. The easiest way to edit this file is
using usethis::edit_r_environ().
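As a minimal sketch of the lookup described above, the code below resolves the directory with the same expression used for the default path argument, first with the variable unset and then with it set for the session. `resolve_path()` is a hypothetical helper and "abs_data" is an illustrative folder name; neither is part of readabs.

```r
# Hypothetical helper: the same expression as the default `path` argument.
resolve_path <- function() {
  Sys.getenv("R_READABS_PATH", unset = tempdir())
}

# Remember any existing value so it can be restored afterwards.
old_value <- Sys.getenv("R_READABS_PATH", unset = NA)

# With the variable unset, the fallback is the session's temporary directory.
Sys.unsetenv("R_READABS_PATH")
default_dir <- resolve_path()

# Setting the variable changes the result for the rest of the session.
demo_dir <- file.path(tempdir(), "abs_data")  # illustrative location only
Sys.setenv(R_READABS_PATH = demo_dir)
session_dir <- resolve_path()

# Restore the environment to its original state.
if (is.na(old_value)) {
  Sys.unsetenv("R_READABS_PATH")
} else {
  Sys.setenv(R_READABS_PATH = old_value)
}
```

A value passed directly through the path argument would take the place of this lookup, since the lookup is only the argument's default.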
Certain corporate networks restrict your ability to download files in an R
session. On some of these networks, the "wininet" method must be used when
downloading files. You can specify the method used to download files by
setting the "R_READABS_DL_METHOD" environment variable. For example, the
following code sets the environment variable for your current session:
Sys.setenv("R_READABS_DL_METHOD" = "wininet")
You can add this call to your .Rprofile to have the setting persist across
sessions.
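A short sketch of setting the download method, suitable for the current session or for a startup file. It assumes you only want "wininet" on Windows, where that download method is available; the platform guard and the variable names `use_wininet` and `dl_method` are illustrative additions, not something readabs requires.

```r
# "wininet" is a Windows-only download method, so only request it there.
use_wininet <- identical(.Platform$OS.type, "windows")

if (use_wininet) {
  Sys.setenv(R_READABS_DL_METHOD = "wininet")
}

# Check what the session now holds; an empty string means the variable
# is not set.
dl_method <- Sys.getenv("R_READABS_DL_METHOD")
```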
The release_date argument allows you to download table(s) from a release
other than the latest. This is useful for examining revisions to time
series, or for obtaining the version of a series that was available on a
given date. Note that you cannot supply more than one date to release_date.
Note also that any dates prior to mid-2019 (the exact date varies by series)
will fail.
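The single-date restriction can be checked before any download is attempted. The sketch below uses a hypothetical helper, `check_release_date()`, which is not part of readabs. It assumes dates are supplied as "YYYY-MM-DD" strings, as in the Examples, and it does not test the mid-2019 limit because that cutoff varies by series.

```r
# Hypothetical pre-flight check for a release_date value.
check_release_date <- function(release_date) {
  # Only one value may be supplied.
  if (!is.character(release_date) || length(release_date) != 1) {
    stop("`release_date` must be a single string", call. = FALSE)
  }
  # "latest" is the default and needs no further checks.
  if (identical(release_date, "latest")) {
    return(release_date)
  }
  # Assumed format: "YYYY-MM-DD"; as.Date() returns NA if parsing fails.
  parsed <- as.Date(release_date, format = "%Y-%m-%d")
  if (is.na(parsed)) {
    stop("`release_date` must be \"latest\" or a \"YYYY-MM-DD\" date",
         call. = FALSE)
  }
  release_date
}

checked <- check_release_date("2019-09-01")

## Not run:
# wpi_t1_sep2019 <- read_abs("6345.0", tables = "1", release_date = checked)
## End(Not run)
```

To compare revisions, one approach is to call read_abs() once per release date and compare the resulting data frames.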
Value
A data frame (tibble) containing the tidied data from the ABS time series table(s).
Examples
# Download and tidy all time series spreadsheets
# from the Wage Price Index (6345.0)
## Not run:
wpi <- read_abs("6345.0")
## End(Not run)
# Download table 1 from the Wage Price Index
## Not run:
wpi_t1 <- read_abs("6345.0", tables = "1")
## End(Not run)
# Or table 1 as in the Sep 2019 release of the WPI:
## Not run:
wpi_t1_sep2019 <- read_abs("6345.0", tables = "1", release_date = "2019-09-01")
## End(Not run)
# Or tables 1 and 2a from the WPI
## Not run:
wpi_t1_t2a <- read_abs("6345.0", tables = c("1", "2a"))
## End(Not run)
# Get two specific time series, based on their time series IDs
## Not run:
cpi <- read_abs(series_id = c("A2325806K", "A2325807L"))
## End(Not run)
# Get the same two series using the `read_abs_series()` wrapper function
## Not run:
cpi <- read_abs_series(c("A2325806K", "A2325807L"))
## End(Not run)