classification {BASiNET}R Documentation

Performs the classification methodology using complex network theory

Description

Given two distinct data sets, one of mRNA and one of lncRNA. The classification of the data is done from the structure of the networks formed by the sequences. After this is done classifying with the J48 classifier and randomForest. Can be also created in the current directory a file of type arff called' result 'with all values so that it can be used later. There is also the graphic parameter that when TRUE generates graphs based on the results of each measure. Using the J48 classifier it is possible to generate a tree based on the dataset and then save this tree so that it can be used to predict other RNA sequences

Usage

classification(mRNA, lncRNA, word = 3, step = 1, sncRNA, graphic,
  classifier = c("J48", "RF"), load, save)

Arguments

mRNA

Directory where the file .FASTA lies with the mRNA sequences

lncRNA

Directory where the file .FASTA lies with the lncRNA sequences

word

Integer that defines the size of the word to parse. By default the word parameter is set to 3

step

Integer that determines the distance that will be traversed in the sequences for creating a new connection. By default the step parameter is set to 1

sncRNA

Directory where the file .FASTA lies with the sncRNA sequences (OPTIONAL)

graphic

Parameter of the logical type, TRUE or FALSE for graphics generation. As default graphic gets FALSE

classifier

Character Parameter. By default the classifier is J48, but the user can choose to use randomForest by configuring as classifier = "RF". The prediction with a model passed by the param load only works with the classifier J48.

load

When defined this parameter will be loaded the file which is the model previously saved in the current directory with the name entered in this parameter. No file is loaded by default

save

when set, this parameter saves a .arff file with the results of the features in the current directory and also saves the tree created by the J48 classifier so that it can be used to predict RNA sequences. This parameter sets the file name. No file is created by default

Value

Results with cross-validation or the prediction result

Author(s)

Eric Augusto Ito

Examples


 # Classification - cross validation
 library(BASiNET)
 arqSeqMRNA <- system.file("extdata", "sequences2.fasta", package = "BASiNET")
 arqSeqLNCRNA <- system.file("extdata", "sequences.fasta", package = "BASiNET")
 classification(mRNA=arqSeqMRNA,lncRNA=arqSeqLNCRNA)
 classification(mRNA=arqSeqMRNA,lncRNA=arqSeqLNCRNA, save="example") #Save Tree to Predict Sequences
 # Prediction
 mRNApredict <- system.file("extdata", "sequences2-predict.fasta", package = "BASiNET")
 lncRNApredict <- system.file("extdata", "sequences-predict.fasta", package = "BASiNET")
 modelPredict <- system.file("extdata", "modelPredict.dat", package = "BASiNET")
 classification(mRNApredict,lncRNApredict,load=modelPredict)


[Package BASiNET version 0.0.5 Index]