getAccessions {geneHummus} | R Documentation |
Get the acessions ids and the organism for each protein identifier
Description
The getAccessions
function parses the protein page for each identifier
and extracts the accession id (usually referred as XP accession in the RefSeq
database) and the organism given by the scientific name.
The accessions_by_spp
and accessions_from_spp
functions are
convenient filters for further cleaning of getAccessions
by giving
the total number of XP accessions per species or extracting the XP
accessions for a given species, respectively.
Usage
getAccessions(protein_ids)
Arguments
protein_ids |
A string vector containing protein identifiers. |
Value
A data.frame
of protein ids including columns:
accession
organism
Author(s)
Jose V. Die
See Also
accessions_by_spp
to summarize the total number of
accession proteins per species.
accessions_from_spp
to filter the accession ids for
a given species
Examples
prot_ids <- c("593705262", "1379669790", "357520645", "1150156484")
getAccessions(prot_ids)
[Package geneHummus version 1.0.11 Index]