ecoli {MoTBFs} | R Documentation |
Data set Ecoli: Protein Localization Sites
Description
This data set contains information of Escherichia coli. It is a bacterium of the genus Escherichia that is commonly found in the lower intestine of warm-blooded organism.
Format
A data frame with 336 rows, 8 variables and the class.
Details
- Sequence Name
Accession number for the SWISS-PROT database.
- mcg
McGeoch's method for signal sequence recognition.
- gvh
Von Heijne's method for signal sequence recognition.
- lip
Von Heijne's Signal Peptidase II consensus sequence score. Binary attribute.
- chg
Presence of charge on N-terminus of predicted lipoproteins. Binary attribute.
- aac
Score of discriminant analysis of the amino acid content of outer membrane and periplasmic proteins.
- alm1
Score of the ALOM membrane spanning region prediction program.
- alm2
Score of ALOM program after excluding putative cleavable signal regions from the sequence.
- Class
Class variable. 8 possibles states.
Source
http://archive.ics.uci.edu/ml/datasets/Ecoli