PCs_1000G {PCAmatchR}R Documentation

First 20 principal components of 2504 individuals from the 1000 Genome Project

Description

A sample dataset containing information about population, gender, and the first 20 principal components calculated from 2504 individuals in the Phase 3 data release of the 1000 Genomes Project. The principal component analysis was conducted using PLINK.

Usage

PCs_1000G

Format

A data frame with 2504 rows and 24 variables:

sample

sample ID number

pop

three letter designation of 1000 Genomes reference population

super_pop

three letter designation of 1000 Genomes reference super population

gender

gender of individual

PC1

principal component 1

PC2

principal component 2

PC3

principal component 3

PC4

principal component 4

PC5

principal component 5

PC6

principal component 6

PC7

principal component 7

PC8

principal component 8

PC9

principal component 9

PC10

principal component 10

PC11

principal component 11

PC12

principal component 12

PC13

principal component 13

PC14

principal component 14

PC15

principal component 15

PC16

principal component 16

PC17

principal component 17

PC18

principal component 18

PC19

principal component 19

PC20

principal component 20

Source

https://www.internationalgenome.org

Examples

head(PCs_1000G)
genome_PC <- PCs_1000G
# Create PCs
           PC <- as.data.frame(genome_PC[,c(1,5:24)])
           head(PC)


[Package PCAmatchR version 0.3.3 Index]