nth {dplyr} | R Documentation |
Extract the first, last, or nth value from a vector
Description
These are useful helpers for extracting a single value from a vector. They are guaranteed to return a meaningful value, even when the input is shorter than expected. You can also provide an optional secondary vector that defines the ordering.
Usage
nth(x, n, order_by = NULL, default = NULL, na_rm = FALSE)
first(x, order_by = NULL, default = NULL, na_rm = FALSE)
last(x, order_by = NULL, default = NULL, na_rm = FALSE)
Arguments
x |
A vector |
n |
For |
order_by |
An optional vector the same size as |
default |
A default value to use if the position does not exist in If If supplied, this must be a single value, which will be cast to the type of
When |
na_rm |
Should missing values in |
Details
For most vector types, first(x)
, last(x)
, and nth(x, n)
work like
x[[1]]
, x[[length(x)]
, and x[[n]]
, respectively. The primary exception
is data frames, where they instead retrieve rows, i.e. x[1, ]
, x[nrow(x), ]
, and x[n, ]
. This is consistent with the tidyverse/vctrs principle which
treats data frames as a vector of rows, rather than a vector of columns.
Value
If x
is a list, a single element from that list. Otherwise, a vector the
same type as x
with size 1.
Examples
x <- 1:10
y <- 10:1
first(x)
last(y)
nth(x, 1)
nth(x, 5)
nth(x, -2)
# `first()` and `last()` are often useful in `summarise()`
df <- tibble(x = x, y = y)
df %>%
summarise(
across(x:y, first, .names = "{col}_first"),
y_last = last(y)
)
# Selecting a position that is out of bounds returns a default value
nth(x, 11)
nth(x, 0)
# This out of bounds behavior also applies to empty vectors
first(integer())
# You can customize the default value with `default`
nth(x, 11, default = -1L)
first(integer(), default = 0L)
# `order_by` provides optional ordering
last(x)
last(x, order_by = y)
# `na_rm` removes missing values before extracting the value
z <- c(NA, NA, 1, 3, NA, 5, NA)
first(z)
first(z, na_rm = TRUE)
last(z, na_rm = TRUE)
nth(z, 3, na_rm = TRUE)
# For data frames, these select entire rows
df <- tibble(a = 1:5, b = 6:10)
first(df)
nth(df, 4)