getNeighborhood {NAIR} | R Documentation |
Identify Cells or Clones in a Neighborhood Around a Target Sequence
Description
Given Adaptive Immune Receptor Repertoire Sequencing (AIRR-Seq) data and a target receptor sequence that is present within the data, identifies a "neighborhood" comprised of cells/clones with receptor sequences sufficiently similar to the target sequence.
Usage
getNeighborhood(
data,
seq_col,
target_seq,
dist_type = "hamming",
max_dist = 1
)
Arguments
data |
A data frame containing the AIRR-Seq data. |
seq_col |
Specifies the column of |
target_seq |
A character string containing the target receptor sequence. Must be a receptor sequence possessed by one of the clones/cells in the AIRR-Seq data. |
dist_type |
Specifies the function used to quantify the similarity between receptor
sequences. The similarity between two sequences determines their pairwise
distance, with greater similarity corresponding to shorter distance. Valid
options are |
max_dist |
Determines whether each cell/clone belongs to the neighborhood based on its
receptor sequence's distance from the target sequence. The distance is based
on the |
Value
A data frame containing the rows of data
corresponding to the
cells/clones in the neighborhood.
If no cell/clone in the AIRR-Seq data possesses the target sequence as its
receptor sequence, then a value of NULL
is returned.
Author(s)
Brian Neal (Brian.Neal@ucsf.edu)
References
Hai Yang, Jason Cham, Brian Neal, Zenghua Fan, Tao He and Li Zhang. (2023). NAIR: Network Analysis of Immune Repertoire. Frontiers in Immunology, vol. 14. doi: 10.3389/fimmu.2023.1181825
Examples
set.seed(42)
toy_data <- simulateToyData(sample_size = 500)
# Get neighborhood around first clone sequence
nbd <-
getNeighborhood(
toy_data,
seq_col = "CloneSeq",
target_seq = "GGGGGGGAATTGG"
)
head(nbd)