R: First 20 principal components of 2504 individuals from the...

PCs_1000G {PCAmatchR}

R Documentation

First 20 principal components of 2504 individuals from the 1000 Genomes Project

Description

A sample dataset containing information about population, gender, and the first 20 principal components calculated from 2504 individuals in the Phase 3 data release of the 1000 Genomes Project. The principal component analysis was conducted using PLINK.

Usage

PCs_1000G

Format

A data frame with 2504 rows and 24 variables:

sample: sample ID number
pop: three-letter designation of 1000 Genomes reference population
super_pop: three-letter designation of 1000 Genomes reference super population
gender: gender of individual
PC1: principal component 1
PC2: principal component 2
PC3: principal component 3
PC4: principal component 4
PC5: principal component 5
PC6: principal component 6
PC7: principal component 7
PC8: principal component 8
PC9: principal component 9
PC10: principal component 10
PC11: principal component 11
PC12: principal component 12
PC13: principal component 13
PC14: principal component 14
PC15: principal component 15
PC16: principal component 16
PC17: principal component 17
PC18: principal component 18
PC19: principal component 19
PC20: principal component 20
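
The column layout above can also be addressed by name rather than by position, which guards against mistakes if the column order ever differs. The following is a minimal sketch in base R; `select_pcs` is a hypothetical helper, not part of PCAmatchR, and it assumes the data frame uses the column names listed above.

```r
# Hypothetical helper (not part of PCAmatchR): keep the sample ID and the
# first n_pcs principal component columns, selecting them by name.
select_pcs <- function(df, n_pcs = 20) {
  pc_cols <- paste0("PC", seq_len(n_pcs))   # "PC1", "PC2", ..., "PC20"
  needed  <- c("sample", pc_cols)

  # Fail early with a readable message if a documented column is absent
  missing_cols <- setdiff(needed, names(df))
  if (length(missing_cols) > 0) {
    stop("Missing columns: ", paste(missing_cols, collapse = ", "))
  }

  df[, needed, drop = FALSE]
}
```

With PCAmatchR loaded, `select_pcs(PCs_1000G)` would be expected to return 2504 rows and 21 columns (the sample ID plus PC1 to PC20), provided the column names match the list above.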

Source

https://www.internationalgenome.org

Examples

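library(PCAmatchR)

# Preview the first rows of the dataset
head(PCs_1000G)
genome_PC <- PCs_1000G

# Keep the sample ID (column 1) and the 20 principal components (columns 5 to 24)
PC <- as.data.frame(genome_PC[, c(1, 5:24)])
head(PC)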
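
Because the dataset carries population labels alongside the principal components, a quick per-group summary can help check that the components separate the reference populations. This is a sketch only, using base R's `aggregate`; `pc_means_by_group` is a hypothetical helper, not part of PCAmatchR, and the default column names are assumed to match the Format section.

```r
# Hypothetical helper (not part of PCAmatchR): mean of selected principal
# components within each level of a grouping column.
pc_means_by_group <- function(df, group = "super_pop", pcs = c("PC1", "PC2")) {
  # df[pcs] and df[group] are both data frames, which is the form
  # aggregate() expects for its data and `by` arguments
  aggregate(df[pcs], by = df[group], FUN = mean)
}
```

With PCAmatchR loaded, `pc_means_by_group(PCs_1000G)` would be expected to return one row per super population, with the mean of PC1 and PC2 in each.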

[Package PCAmatchR version 0.3.3 Index]