The R package chickn implements the Chromatogram Hierarchical Compressive K-means with Nystrom approximation clustering approach. It is designed to cluster a large collection of high-resolution mass spectrometry signals (chromatographic profiles) using a compressed version of the data (a.k.a. a data sketch). Data compression follows the guidelines for Nystrom approximation provided by Wang et al. (2019) and uses the sketching operator from Keriven et al. (2018). The Filebacked Big Matrix (FBM) class from the bigstatsr package is used to store and manipulate matrices that cannot be loaded into memory.
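
To make the notion of a data sketch more concrete, the following is a minimal, language-neutral illustration written in Python rather than R. It is not the chickn API and does not reproduce the package's implementation. It assumes the common form of a sketching operator as an empirical average of random Fourier features, with frequencies drawn from a Gaussian distribution. The function names (`draw_frequencies`, `sketch`, `merge_sketches`), the `sigma` parameter and the sign convention in the exponent are all hypothetical choices made for this example.

```python
import cmath
import random


def draw_frequencies(m, dim, sigma=1.0, seed=0):
    """Draw m random frequency vectors of length dim.

    Assumption for this sketch: frequencies are i.i.d. Gaussian with
    standard deviation 1 / sigma. Other sampling laws are possible.
    """
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0 / sigma) for _ in range(dim)] for _ in range(m)]


def sketch(data, frequencies):
    """Compress a dataset into one complex number per frequency.

    Each component is the average of exp(-i * <w, x>) over all points x,
    so the output size depends on the number of frequencies only,
    not on the number of data points.
    """
    n = len(data)
    z = []
    for w in frequencies:
        total = 0j
        for x in data:
            phase = sum(wi * xi for wi, xi in zip(w, x))
            total += cmath.exp(-1j * phase)
        z.append(total / n)
    return z


def merge_sketches(z_a, n_a, z_b, n_b):
    """Combine the sketches of two chunks of n_a and n_b points.

    Because the sketch is an average, the sketch of the full dataset is
    the size-weighted mean of the chunk sketches.
    """
    return [(n_a * a + n_b * b) / (n_a + n_b) for a, b in zip(z_a, z_b)]
```

Because the sketch is a plain average, it can be accumulated chunk by chunk and merged afterwards, which is the kind of access pattern that suits data kept on disk rather than in memory.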


Olga Permiakova, Romain Guibert, Thomas Burger


