| bed_cluster {valr} | R Documentation |
Cluster neighboring intervals.
Description
The output .id column can be used in downstream grouping operations. Default
max_dist = 0 means that both overlapping and book-ended intervals will be
clustered.
Usage
bed_cluster(x, max_dist = 0)
Arguments
x |
|
max_dist |
maximum distance between clustered intervals. |
Details
input tbls are grouped by chrom by default, and additional
groups can be added using dplyr::group_by(). For example,
grouping by strand will constrain analyses to the same strand. To
compare opposing strands across two tbls, strands on the y tbl can
first be inverted using flip_strands().
Value
ivl_df with .id column specifying sets of clustered intervals.
See Also
https://bedtools.readthedocs.io/en/latest/content/tools/cluster.html
Other single set operations:
bed_complement(),
bed_flank(),
bed_genomecov(),
bed_merge(),
bed_partition(),
bed_shift(),
bed_slop()
Examples
x <- tibble::tribble(
~chrom, ~start, ~end,
"chr1", 100, 200,
"chr1", 180, 250,
"chr1", 250, 500,
"chr1", 501, 1000,
"chr2", 1, 100,
"chr2", 150, 200
)
bed_cluster(x)
# glyph illustrating clustering of overlapping and book-ended intervals
x <- tibble::tribble(
~chrom, ~start, ~end,
"chr1", 1, 10,
"chr1", 5, 20,
"chr1", 30, 40,
"chr1", 40, 50,
"chr1", 80, 90
)
bed_glyph(bed_cluster(x), label = ".id")