WordCountAna {SensoMineR} | R Documentation |
Word-Count based methods Analysis (WordCountAna)
Description
Sensory methods as labelled sorting task, check-all-that-apply (CATA), ultra-flash profiling (UFP) and open-ended questions can be used to collect free-text descriptions of products through word-count based methods. A data frame with rows-products and columns-panellists is considered for the analysis. WordCountAna performs a multiple factor analysis for contingency tables keeping all the information in the comparison of the products. The identification of the consensual words which have the same meaning for most of the panellists eases the interpretation of the word-count based methods and solves the problems arising from the large diversity of vocabulary as the different meanings possibly associated to a same word. A test, based on resampling techniques, allows for assessing the significance of the consensus.
Usage
WordCountAna (base, sep.word = NULL, ncp = Inf, nb.panel = 3, nb.simul = 500,
proba = 0.05, graph = TRUE, axes = c(1,2))
Arguments
base |
a data frame with n rows (products) and p columns (panellists). Each cell corresponds to a free-text description used to describe a product by a panellist |
sep.word |
a string with all the characters which correspond to separator of words (by default, NULL and is considered equal to "; (),?./:'!$=+;<>[]@-") |
ncp |
number of dimensions kept in the results and to compute the within-inertia |
nb.panel |
minimum number of panellists who used the same word in order to define consensual words (by default 3) |
nb.simul |
number of bootstrap simulations (by default 500) |
proba |
significance threshold considered to define consensual words (by default 0.05) |
graph |
boolean, if TRUE a graph is displayed |
axes |
a length 2 vector specifying the components to plot |
Value
mfact |
a list of matrices containing all the results for multiple factor analysis for contingency tables |
dist.words |
a matrix containing the results for distinct words (number of times that used and number of panellists that pronounced) |
centroids |
a matrix containing the coordinates of the centroids of distinct-words |
cons |
a matrix containing the results of bootstrap resampling for distinct-words pronounced by at least "nb.panel" panellists (number of times that used, number of panellists that pronounced and the significance of the consensus) |
cons.words |
a vector of consensual words assessed by bootstrap resampling |
Returns the products factor map, panellists factor map, distinct-words factor map and consensual words factor map.
Author(s)
Belchin Kostov badriyan@clinic.ub.es, Francois Husson francois.husson@institut-agro.fr, Monica Becue-Bertaut
References
Becue-Bertaut, M. and Pages, J. (2008). Multiple factor analysis and clustering of a mixture of quantitative, categorical and frequency data. Computational Statistice and Data Analysis, 52, 3255-3268. Kostov, B., Becue-Bertaut, M. and Husson, F. (2012). Multiple Factor Analysis for Contingency Tables in FactoMineR Package. The R journal Kostov, B., Becue-Bertaut, M., Husson, F., Pages, J., Cadoret, M., Torrens, J. and Urpi, P. (2012). A tool for detecting words with consensual meaning in verbalization tasks. 11th Sensometrics Conference, July 10-13, 2012, Rennes, France.
See Also
textual
, MFA
, plot.WordCountAna
Examples
data(perfume)
res<-WordCountAna(base=perfume,sep.word=";")