demo_dataset {LncFinder} | R Documentation |
A demo of dataset
Description
This dataset contains the features of 20 lncRNA sequences and 20 protein-coding sequences.
Usage
data(demo_dataset)
Format
A data frame with 40 rows and 20 variables:
- Label
the class of the sequences
- ORF.Max.Len
the length of the longest ORF
- ORF.Max.Cov
the coverage of the longest ORF
- Seq.lnc.Dist
Log-Distance.lncRNA
- Seq.pct.Dist
Log-Distance.protein-coding transcripts
- Seq.Dist.Ratio
Distance-Ratio.sequence
- Signal.Peak
Signal as 1/3 position
- SNR
Signal to noise ratio
- Signal.Min
the minimum value of the top 10% power spectrum
- Signal.Q1
the quantile Q1 of the top 10% power spectrum
- Signal.Q2
the quantile Q2 of the top 10% power spectrum
- Signal.Max
the maximum value of the top 10% power spectrum
- Dot_lnc.dist
Log-Distance.acguD.lncRNA
- Dot_pct.dist
Log-Distance.acguD.protein-coding transcripts
- Dot_Dist.Ratio
Distance-Ratio.acguD
- SS.lnc.dist
Log-Distance.acgu-ACGU.lncRNA
- SS.pct.dist
Log-Distance.acgu-ACGU.protein-coding transcripts
- SS.Dist.Ratio
Distance-Ratio.acgu-ACGU
- MFE
Minimum free energy
- UP.PCT
Percentage of Unpair-Pair
Source
Sequences are selected from GENCODE.