mergeLDA {tosca} | R Documentation |
Preparation of Different LDAs For Clustering
Description
Merges different lda-results to one matrix, including only the words which appears in all lda-results.
Usage
mergeLDA(x)
Arguments
x |
A list of lda results. |
Details
The function is useful for merging lda-results prior to a cluster analysis with clusterTopics
.
Value
A matrix including all topics from all lda-results. The number of rows is the number of topics, the number of columns is the number of words which appear in all results.
Examples
texts <- list(A="Give a Man a Fish, and You Feed Him for a Day.
Teach a Man To Fish, and You Feed Him for a Lifetime",
B="So Long, and Thanks for All the Fish",
C="A very able manipulative mathematician, Fisher enjoys a real mastery
in evaluating complicated multiple integrals.")
corpus <- textmeta(meta=data.frame(id=c("A", "B", "C", "D"),
title=c("Fishing", "Don't panic!", "Sir Ronald", "Berlin"),
date=c("1885-01-02", "1979-03-04", "1951-05-06", "1967-06-02"),
additionalVariable=1:4, stringsAsFactors=FALSE), text=texts)
corpus <- cleanTexts(corpus)
wordlist <- makeWordlist(corpus$text)
ldaPrep <- LDAprep(text=corpus$text, vocab=wordlist$words)
LDA1 <- LDAgen(documents=ldaPrep, K = 3L, vocab=wordlist$words, num.words=3)
LDA2 <- LDAgen(documents=ldaPrep, K = 3L, vocab=wordlist$words, num.words=3)
mergeLDA(list(LDA1=LDA1, LDA2=LDA2))
[Package tosca version 0.3-2 Index]