mondrian {Mondrian} | R Documentation |
Computes and displays the relative occurrence and co-occurence of events
Description
This function allows representing in a single graph the relative occurrence and co-occurrence of events measured in a sample. As examples, the package was applied to describe the occurrence and co-occurrence of different species of bacterial or viral symbionts infecting arthropods at the individual level. The graphics allows determining the prevalence of each symbiont and the patterns of multiple infections (i.e. how different symbionts share or not the same individual hosts). The graphical output recalls Mondrian’s paintings.
Usage
mondrian(data, labels = colnames(data), xlab = "", ylab = "", main = "",
col = NULL , pop = NULL, indiv = FALSE, ...)
Arguments
data |
a data frame or a matrix containing the infection profile of each individual for each symbiont. Rows corresponds to individuals and columns to the symbionts. |
labels |
a vector containing the symbionts name. By default, labels are the columns name of |
xlab |
a string of characters indicating the x-axis label. |
ylab |
a string of characters indicating the y-axis label. |
main |
a string of characters indicating the main title of the displayed output. This string is followed by the number of individuals to constitute the actual main title. |
col |
a vector containing as many colors as symbionts. See |
pop |
a number index indicating the number of the column containing the population's factor in |
indiv |
a logical to draw horizontal lines separating each individual profile. |
... |
Details
By default, rectangles' colors come from the Set1
palette defined by the brewer.pal
function of the RColorBrewer
package. This palette
automatically proposes distinct colors (more than 2 and less than 10 values).
Value
The mondrian
function yields two outputs:
- a plot representing the occurence and co-occurence of symbionts within individuals, recalling famous Mondrian paintings.
Individuals are represented by horizontal lines, and each column represents one symbiont.
Horizontal lines (representing individuals) are reordered to optimize the reading.
Uninfected individuals are always represented in white.
For each column, the coloured area corresponds to the total prevalence of this symbiont.
When coloured areas are present in different columns, this indicates co-infection.
If individuals belong to different populations (pop
is not null), the graphical output is a layout with one display for each sub-population
and one display where all individuals are considered belonging to a unique population.
- the distinct profiles of multiple infection and their prevalence (considering all symbionts in data
) in the whole population and in each sub-population (when pop
is not null).
See Also
Examples
data(endosymbiont_1pop)
out1 <- mondrian(endosymbiont_1pop)
out1
## 110 011 010
## 0.6 0.2 0.2
data(endosymbiont_3pop)
out2 <- mondrian(endosymbiont_3pop, pop = 1)
out2
## $AlxT
## 01100
## 1
## $MalD
## 10111 10011 01101 00100
## 0.1 0.1 0.1 0.7
## $MslCh
## 10111 10011 10010 10001 00110 00010
## 0.1 0.1 0.4 0.1 0.1 0.2
## $pop
## 10111 10011 10010 10001 01101 01100
## 0.06666667 0.06666667 0.13333333 0.03333333 0.03333333 0.33333333
## 00110 00100 00010
## 0.03333333 0.23333333 0.06666667
names(out2)
## [1] "AlxT" "MalD" "MslCh" "pop"