scoring {sigminer} | R Documentation |
Score Copy Number Profile
Description
Returns quantification of copy number profile and events including tandem duplication and Chromothripisis etc. Only copy number data from autosome is used here. Some of the quantification methods are rough, you use at your risk. You should do some extra work to check the result scores.
Usage
scoring(object, TD_size_cutoff = c(1000, 1e+05, 2e+06), TD_cn_cutoff = Inf)
Arguments
object |
a object of CopyNumber. |
TD_size_cutoff |
a length-3 numeric vector used to specify the start, midpoint, end segment size for determining tandem duplication size range, midpoint is used to split TD into short TD and long TD. Default is 1Kb to 100Kb for short TD, 100Kb to 2Mb for long TD. |
TD_cn_cutoff |
a number defining the maximum copy number of TD,
default is |
Value
a data.table
with following scores:
cnaBurden: CNA burden representing the altered genomic fraction as previously reported.
cnaLoad: CNA load representing the quantity of copy number alteration.
MACN: mean altered copy number (MACN) reflecting the property of altered copy number segments, calculated as
MACN = \frac{\sum_{i} CN_i}{N_{cnv}}
where
CN_i
is the copy number of altered segmenti
,N_{cnv}
is the number of CNV.weightedMACN: same as MACN but weighted with segment length.
MACN_{weighted} = \frac{\sum_{i} (CN_i \times L_{i})}{ \sum_{i} L_{i} }
where
L_{i}
is the length of altered copy number segmenti
.Ploidy: ploidy, the formula is same as
weightedMACN
but using all copy number segments instead of altered copy number segments.TDP_pnas: tandem duplication phenotype score from
https://www.pnas.org/doi/10.1073/pnas.1520010113
, the thresholdk
in reference is omitted.TDP = - \frac{\sum_{chr} |TD_{obs}-TD_{exp}|}{TD_{total}}
where
TD_{total}
is the number of TD,TD_{obs}
andTD_exp
are observed number of TD and expected number of TD for each chromosome.TDP: tandem duplication score used defined by our group work, TD represents segment with copy number greater than 2.
TD = \frac{TD_{total}}{\sum_{chr} |TD_{obs}-TD_{exp}|+1}
sTDP: TDP score for short TD.
lTDP: TDP score for long TD.
TDP_size : TDP region size (Mb).
sTDP_size: sTDP region size (Mb).
lTDP_size: lTDP region size(Mb).
Chromoth_state: chromothripsis state score, according to reference doi:10.1016/j.cell.2013.02.023, chromothripsis frequently leads to massive loss of segments on the affected chromosome with segmental losses being interspersed with regions displaying normal (disomic) copy-number (e.g., copy-number states oscillating between copy-number = 1 and copy-number = 2), form tens to hundreds of locally clustered DNA rearrangements. Most of methods use both SV and CNV to infer chromothripsis, here we roughly quantify it with
\sum_{chr}{N_{OsCN}^2}
where
N_{OsCN}
is the number of oscillating copy number pattern "2-1-2" for each chromosome.
Examples
# Load copy number object
load(system.file("extdata", "toy_copynumber.RData",
package = "sigminer", mustWork = TRUE
))
d <- scoring(cn)
d
d2 <- scoring(cn, TD_cn_cutoff = 4L)
d2