readFastqDb {alakazam} | R Documentation |
Load sequencing quality scores from a FASTQ file
Description
readFastqDb
adds the sequencing quality scores to a data.frame
from a FASTQ file. Matching is done by 'sequence_id'.
Usage
readFastqDb(
data,
fastq_file,
quality_offset = -33,
header = c("presto", "asis"),
sequence_id = "sequence_id",
sequence = "sequence",
sequence_alignment = "sequence_alignment",
v_cigar = "v_cigar",
d_cigar = "d_cigar",
j_cigar = "j_cigar",
np1_length = "np1_length",
np2_length = "np2_length",
v_sequence_end = "v_sequence_end",
d_sequence_end = "d_sequence_end",
style = c("num", "ascii", "both"),
quality_sequence = FALSE
)
Arguments
data |
|
fastq_file |
path to the fastq file |
quality_offset |
offset value to be used by ape::read.fastq. It is the value to be added to the quality scores (the default -33 applies to the Sanger format and should work for most recent FASTQ files). |
header |
FASTQ file header format; one of |
sequence_id |
column in |
sequence |
column in |
sequence_alignment |
column in |
v_cigar |
column in |
d_cigar |
column in |
j_cigar |
column in |
np1_length |
column in |
np2_length |
column in |
v_sequence_end |
column in |
d_sequence_end |
column in |
style |
how the sequencing quality should be returned;
one of |
quality_sequence |
specify |
Value
Modified data
with additional fields:
-
quality_alignment
: A character vector with ASCII Phred scores forsequence_alignment
. -
quality_alignment_num
: A character vector, with comma separated numerical quality values for each position insequence_alignment
. -
quality
: A character vector with ASCII Phred scores forsequence
. -
quality_num
: A character vector, with comma separated numerical quality values for each position insequence
.
See Also
maskPositionsByQuality and getPositionQuality
Examples
db <- airr::read_rearrangement(system.file("extdata", "example_quality.tsv", package="alakazam"))
fastq_file <- system.file("extdata", "example_quality.fastq", package="alakazam")
db <- readFastqDb(db, fastq_file, quality_offset=-33)