| PLMIX-package {PLMIX} | R Documentation |
Bayesian Analysis of Finite Mixtures of Plackett-Luce Models for Partial Rankings/Orderings
Description
The PLMIX package for R provides functions to fit and analyze finite mixtures of Plackett-Luce models for partial top rankings/orderings within the Bayesian framework. It provides MAP point estimates via EM algorithm and posterior MCMC simulations via Gibbs Sampling. It also fits MLE as a special case of the noninformative Bayesian analysis with vague priors.
In addition to inferential techniques, the package assists other fundamental phases of a model-based analysis for partial rankings/orderings, by including functions for data manipulation, simulation, descriptive summary, model selection and goodness-of-fit evaluation.
Specific S3 classes and methods are also supplied to enhance the usability and foster exchange with other packages. Finally, to address the issue of computationally demanding procedures typical in ranking data analysis, PLMIX takes advantage of a hybrid code linking the R environment with the C++ programming language.
Details
The Plackett-Luce model is one of the most popular and frequently applied parametric distributions to analyze partial top rankings/orderings of a finite set of items. The present package allows to account for unobserved sample heterogeneity of partially ranked data with a model-based analysis relying on Bayesian finite mixtures of Plackett-Luce models. The package provides a suite of functions that covers the fundamental phases of a model-based analysis:
Ranking data manipulation
binary_group_indBinary group membership matrix from the mixture component labels.
freq_to_unitFrom the frequency distribution to the dataset of individual orderings/rankings.
make_completeRandom completion of partial orderings/rankings data.
make_partialCensoring of complete orderings/rankings data.
rank_ord_switchFrom rankings to orderings and vice-versa.
unit_to_freqFrom the dataset of individual orderings/rankings to the frequency distribution.
Ranking data simulation
rPLMIXRandom sample from a finite mixture of Plackett-Luce models.
Ranking data description
paired_comparisonsPaired comparison frequencies.
rank_summariesSummary statistics of partial ranking/ordering data.
Model estimation
gibbsPLMIXBayesian analysis with MCMC posterior simulation via Gibbs sampling.
label_switchPLMIXLabel switching adjustment of the Gibbs sampling simulations.
likPLMIXLikelihood evaluation for a mixture of Plackett-Luce models.
loglikPLMIXLog-likelihood evaluation for a mixture of Plackett-Luce models.
mapPLMIXMAP estimation via EM algorithm.
mapPLMIX_multistartMAP estimation via EM algorithm with multiple starting values.
Class coercion and membership
as.top_orderingCoercion into top-ordering datasets.
gsPLMIX_to_mcmcFrom the Gibbs sampling simulation to an MCMC class object.
is.top_orderingTest for the consistency of input data with a top-ordering dataset.
S3 class methods
plot.gsPLMIXPlot of the Gibbs sampling simulations.
plot.mpPLMIXPlot of the MAP estimates.
print.gsPLMIXPrint of the Gibbs sampling simulations.
print.mpPLMIXPrint of the MAP estimation algorithm.
summary.gsPLMIXSummary of the Gibbs sampling procedure.
summary.mpPLMIXSummary of the MAP estimation.
Model selection
bicPLMIXBIC value for the MLE of a mixture of Plackett-Luce models.
selectPLMIXBayesian model selection criteria.
Model assessment
ppcheckPLMIXPosterior predictive diagnostics.
ppcheckPLMIX_condPosterior predictive diagnostics conditionally on the number of ranked items.
Datasets
d_apaAmerican Psychological Association Data (partial orderings).
d_carconfCar Configurator Data (partial orderings).
d_dublinwestDublin West Data (partial orderings).
d_gamingGaming Platforms Data (complete orderings).
d_germanGerman Sample Data (complete orderings).
d_nascarNASCAR Data (partial orderings).
d_occupOccupation Data (complete orderings).
d_riceRice Voting Data (partial orderings).
Data have to be supplied as an object of class matrix, where missing positions/items are denoted with zero entries and Rank = 1 indicates the most-liked alternative. For a more efficient implementation of the methods, partial sequences with a single missing entry should be preliminarily filled in, as they correspond to complete rankings/orderings. In the present setting, ties are not allowed. Some quantities frequently recalled in the manual are the following:
NSample size.
KNumber of possible items.
GNumber of mixture components.
LSize of the final posterior MCMC sample (after burn-in phase).
Author(s)
Cristina Mollica and Luca Tardella
Maintainer: Cristina Mollica <cristina.mollica@uniroma1.it>
References
Mollica, C. and Tardella, L. (2017). Bayesian Plackett-Luce mixture models for partially ranked data. Psychometrika, 82(2), pages 442–458, ISSN: 0033-3123, http://dx.doi.org/10.1007/s11336-016-9530-0.
Mollica, C. and Tardella, L. (2014). Epitope profiling via mixture modeling for ranked data. Statistics in Medicine, 33(21), pages 3738–3758, ISSN: 0277-6715, http://onlinelibrary.wiley.com/doi/10.1002/sim.6224/full.