amino.acid.sequence.1972 {cluster.datasets}R Documentation

Hartigan (1975) Amino Acid Sequence for Vertibrates

Description

The table defines the position of amino acids for Cytochrome-c. This is Table 13.4 in Chapter 13 of Hartigan (1975) on page 240.

Usage

data(amino.accid.sequence.1972)

Format

A data frame with 17 observations on the following 37 variables.

species

a character vector for the species names

p.1

a factor for position 1 with levels I V

p.2

a factor for position 2 with levels A E

p.3

a factor for position 3 with levels I T V

p.4

a factor for position 4 with levels I T V

p.5

a factor for position 5 with levels M Q

p.6

a factor for position 6 with levels A S

p.7

a factor for position 7 with levels C V

p.8

a factor for position 8 with levels K N

p.9

a factor for position 9 with levels T V

p.10

a factor for position 10 with levels H N S W Y

p.11

a factor for position 11 with levels F I

p.12

a factor for position 12 with levels A E P Q V

p.13

a factor for position 13 with levels F Y

p.14

a factor for position 14 with levels S T

p.15

a factor for position 15 with levels A D E

p.16

a factor for position 16 with levels N S

p.17

a factor for position 17 with levels I T V

p.18

a factor for position 18 with levels G K N Q

p.19

a factor for position 19 with levels E N Q

p.20

a factor for position 20 with levels D E

p.21

a factor for position 21 with levels M R

p.22

a factor for position 22 with levels E I

p.23

a factor for position 23 with levels I V

p.24

a factor for position 24 with levels T V

p.25

a factor for position 25 with levels I L

p.26

a factor for position 26 with levels K S

p.27

a factor for position 27 with levels K

p.28

a factor for position 28 with levels A D E G K S T

p.29

a factor for position 29 with levels A E Q T V

p.30

a factor for position 30 with levels D N

p.31

a factor for position 31 with levels I V

p.32

a factor for position 32 with levels D E K Q S

p.33

a factor for position 33 with levels A K T

p.34

a factor for position 34 with levels A C T

p.35

a factor for position 35 with levels A K N S

p.36

a factor for position 36 with levels - A E K S

Details

The factor levels across the 36 positions common. Hartigan uses the reduced mutation algorithm with this data set.

Source

Dickerson, R. E. (1972). The structure and history of an ancient problem, Scientific American, 222(4), 58-72.

SPAETH2 Cluster Analysis Datasets http://people.sc.fsu.edu/~jburkardt/datasets/spaeth2/spaeth2.html

References

Hartigan, J. A. (1975). Clustering Algorithms, John Wiley, New York.

Examples

data(amino.acid.sequence.1972)

[Package cluster.datasets version 1.0-1 Index]