dataset_cui2vec_embeddings {clinspacy}    R Documentation

Cui2vec concept embeddings

Description

This dataset contains Unified Medical Language System (UMLS) concept embeddings from Andrew Beam's cui2vec R package. Each concept is represented by a 500-dimensional embedding vector.

Usage

dataset_cui2vec_embeddings()

Format

A data frame with 109053 rows and 501 variables:

cui

A Unified Medical Language System (UMLS) Concept Unique Identifier (CUI)

emb_001

Component 1 of the concept embedding vector

emb_002

Component 2 of the concept embedding vector

...

and so on...

emb_500

Component 500 of the concept embedding vector
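
To illustrate this layout, the sketch below builds a small toy data frame with the same column naming scheme and compares two concepts by cosine similarity. Everything in it is an assumption made for illustration: the toy has 3 embedding columns instead of 500, the values are made up, the CUIs are placeholder identifiers rather than real concepts, and cosine_similarity() is a hypothetical helper that is not part of clinspacy.

```r
# Toy stand-in for the real data frame: same column naming scheme,
# but only 3 embedding columns, made-up values, and placeholder CUIs.
toy <- data.frame(
  cui     = c("C0000001", "C0000002", "C0000003"),
  emb_001 = c(1, 2, 0),
  emb_002 = c(0, 0, 1),
  emb_003 = c(0, 0, 0),
  stringsAsFactors = FALSE
)

# Select the embedding columns by their "emb_" prefix and collect them
# into a numeric matrix with one row per CUI.
emb_cols <- grep("^emb_", names(toy), value = TRUE)
emb <- as.matrix(toy[, emb_cols])
rownames(emb) <- toy$cui

# Hypothetical helper (not part of clinspacy): cosine similarity of two vectors.
cosine_similarity <- function(a, b) {
  sum(a * b) / (sqrt(sum(a^2)) * sqrt(sum(b^2)))
}

# Compare the first concept with each of the other two.
sim_12 <- cosine_similarity(emb["C0000001", ], emb["C0000002", ])
sim_13 <- cosine_similarity(emb["C0000001", ], emb["C0000003", ])
```

The same prefix-based column selection should carry over to the full data frame, since its embedding columns are named emb_001 through emb_500.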

Details

This dataset cannot be accessed until it has been downloaded, which happens the first time you run clinspacy_init() after installing the package.
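
A minimal sketch of that order of operations is shown below. The wrapper load_cui2vec() is a hypothetical name introduced only for this example, and it assumes that calling clinspacy_init() with its default arguments is appropriate for your setup.

```r
# Hypothetical wrapper (not part of clinspacy): initialise the package,
# then return the embeddings as a data frame.
load_cui2vec <- function() {
  if (!requireNamespace("clinspacy", quietly = TRUE)) {
    stop("The clinspacy package is not installed.")
  }
  # The first run after installation is when the dataset gets downloaded.
  clinspacy::clinspacy_init()
  clinspacy::dataset_cui2vec_embeddings()
}

# Left commented out because the first call may download data:
# cui2vec <- load_cui2vec()
# dim(cui2vec)
```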

Citation

Beam, A.L., Kompa, B., Schmaltz, A., Fried, I., Griffin, W., Palmer, N.P., Shi, X., Cai, T., and Kohane, I.S., 2019. Clinical Concept Embeddings Learned from Massive Sources of Multimodal Medical Data. arXiv preprint arXiv:1804.01486.

License

This data is made available under a CC BY 4.0 license. The only change made to the original dataset is the renaming of columns.

Value

Returns the cui2vec UMLS embeddings as a data frame.

Source

https://figshare.com/s/00d69861786cd0156d81


[Package clinspacy version 1.0.2 Index]