PCs_1000G {PCAmatchR} | R Documentation |
First 20 principal components of 2504 individuals from the 1000 Genome Project
Description
A sample dataset containing information about population, gender, and the first 20 principal components calculated from 2504 individuals in the Phase 3 data release of the 1000 Genomes Project. The principal component analysis was conducted using PLINK.
Usage
PCs_1000G
Format
A data frame with 2504 rows and 24 variables:
- sample
sample ID number
- pop
three letter designation of 1000 Genomes reference population
- super_pop
three letter designation of 1000 Genomes reference super population
- gender
gender of individual
- PC1
principal component 1
- PC2
principal component 2
- PC3
principal component 3
- PC4
principal component 4
- PC5
principal component 5
- PC6
principal component 6
- PC7
principal component 7
- PC8
principal component 8
- PC9
principal component 9
- PC10
principal component 10
- PC11
principal component 11
- PC12
principal component 12
- PC13
principal component 13
- PC14
principal component 14
- PC15
principal component 15
- PC16
principal component 16
- PC17
principal component 17
- PC18
principal component 18
- PC19
principal component 19
- PC20
principal component 20
Source
https://www.internationalgenome.org
Examples
head(PCs_1000G)
genome_PC <- PCs_1000G
# Create PCs
PC <- as.data.frame(genome_PC[,c(1,5:24)])
head(PC)