MATISSE: a method for improved single cell segmentation in imaging mass cytometry

Baars, Matthijs J. D.; Sinha, Neeraj; Amini, Mojtaba; Pieterman-Bos, Annelies; van Dam, Stephanie; Ganpat, Maroussia M. P.; Laclé, Miangela M.; Oldenburg, Bas; Vercoulen, Yvonne

doi:10.1186/s12915-021-01043-y

Methodology article
Open access
Published: 11 May 2021

Matthijs J. D. Baars¹,
Neeraj Sinha¹ (contributed equally),
Mojtaba Amini¹ (contributed equally),
Annelies Pieterman-Bos¹,
Stephanie van Dam¹,²,
Maroussia M. P. Ganpat¹,
Miangela M. Laclé³,
Bas Oldenburg⁴ &
Yvonne Vercoulen¹ (ORCID: orcid.org/0000-0002-5060-2603)

BMC Biology volume 19, Article number: 99 (2021)

A Publisher Correction to this article was published on 18 June 2021

Abstract

Background

Visualizing and quantifying cellular heterogeneity is of central importance to study tissue complexity, development, and physiology and has a vital role in understanding pathologies. Mass spectrometry-based methods including imaging mass cytometry (IMC) have in recent years emerged as powerful approaches for assessing cellular heterogeneity in tissues. IMC is an innovative multiplex imaging method that combines imaging using up to 40 metal conjugated antibodies and provides distributions of protein markers in tissues with a resolution of 1 μm² area. However, resolving the output signals of individual cells within the tissue sample, i.e., single cell segmentation, remains challenging. To address this problem, we developed MATISSE (iMaging mAss cyTometry mIcroscopy Single cell SegmEntation), a method that combines high-resolution fluorescence microscopy with the multiplex capability of IMC into a single workflow to achieve improved segmentation over the current state-of-the-art.

Results

MATISSE results in improved quality and quantity of segmented cells when compared to IMC-only segmentation in sections of heterogeneous tissues. Additionally, MATISSE enables more complete and accurate identification of epithelial cells, fibroblasts, and infiltrating immune cells in densely packed cellular areas in tissue sections. MATISSE has been designed based on commonly used open-access tools and regular fluorescence microscopy, allowing easy implementation by labs using multiplex IMC into their analysis methods.

Conclusion

MATISSE allows segmentation of densely packed cellular areas and provides a qualitative and quantitative improvement when compared to IMC-based segmentation. We expect that implementing MATISSE into tissue section analysis pipelines will yield improved cell segmentation and enable more accurate analysis of the tissue microenvironment in epithelial tissue pathologies, such as autoimmunity and cancer.

Background

Multiplex imaging technologies have revolutionized our ability to study cellular heterogeneity in tissues. These methods allow visualization of spatial organization of the tissue and quantification of different cell types. Moreover, these methods can yield precise views of cell-to-cell differences, quantify differences in signaling status, map signaling network topologies, and lead to important mechanistic insights. Visualizing and quantifying cellular heterogeneity in tissue samples is of increasing importance in many areas of biology, most notably in cancer research and oncology, where we now appreciate that the heterogeneous tumor microenvironment has profound implications for study, diagnosis, prognosis, and treatment of cancer [1].

The most commonly used multiplex imaging technologies fall into two main categories: microscopy-based multiplex immunohistochemistry (IHC) methods [2, 3] and mass spectrometry-based methods [4], including imaging mass cytometry (IMC) [5]. IMC uses metal-conjugated antibodies to label specific protein markers in a given tissue section, followed by laser ablation, which allows for analysis of 1 μm² of tissue area at a time. Cytometry time-of-flight (CyTOF) mass spectrometry is then used to analyze metal isotope distribution as a readout for protein markers. Unlike fluorescence-based microscopy imaging methods, where only 4 or 5 markers can be labeled and visualized simultaneously, mass spectrometry-based imaging methods can provide simultaneous labeling and readout of approximately 40 markers. However, one of the main challenges for mass spectrometry-based imaging methods, such as IMC, is the difficulty of single cell segmentation, i.e., distinguishing signals coming from individual cells. Currently available pipelines, such as an ilastik [6] and CellProfiler [7] extension that allows segmentation of IMC data [8], use membrane, cytosolic, and nuclear markers for single cell segmentation. These strategies are limited by the intrinsic 1 μm pixel size of IMC, making it difficult to discriminate cells in densely packed areas, such as epithelial layers or immune cell infiltrates. This can often result in erroneous interpretation of nuclear and membrane signals, either by merging multiple cells into one event or by fragmenting single cells into multiple events, causing inaccuracies in these analyses.

While IMC resolution is limited by a fixed 1 μm pixel size, fluorescent microscopy allows acquisition at variable resolutions well below 1 μm pixel size. Therefore, to provide a solution for the single cell segmentation problem in IMC, we have developed MATISSE (iMaging mAss cyTometry mIcroscopy Single cell SegmEntation, Fig. 1a), a method that combines the use of IMC and fluorescent microscopy imaging into a single workflow. More specifically, we designed MATISSE to use a multiplex IMC antibody panel containing membrane, cytoplasm, and nuclear markers, as well as fluorescent nuclear DAPI and DNA intercalator labeling of the same tissue region for improved segmentation. The MATISSE tissue analysis pipeline begins with staining tissue sections with metal isotope-conjugated antibodies, DNA intercalator, and DAPI, followed by fluorescence microscopy and IMC. The data obtained from the two imaging techniques are then aligned using nuclear staining, and pixel probability maps for membranes and nuclei are calculated based on IMC and DAPI data, respectively. These probabilities are combined into a single segmentation map that is representative of the cells in the tissue section. We have developed MATISSE based on existing technologies and used open-access tools to create scripts for automated alignment of the IMC and immunofluorescence (IF) datasets, as our goal was to produce a method that can be readily implemented by other research laboratories (see Additional file 1: MATISSE_MANUAL).

Results

To test and validate the performance of the MATISSE pipeline, we benchmarked it against the current standard in the field, the IMC-only segmentation pipeline (IMC) [8]. We used colorectal biopsy sections from patients with different stages of inflammatory bowel disease (IBD) as our tissue of choice, given the established heterogeneity of IBD clinical phenotypes and challenges this poses when diagnosing and treating patients [11]. Recent studies have revealed a rewiring of both intracellular signaling and cellular interactions between intestinal epithelium, stroma, and immune cells in IBD patients [12], and significant colonic epithelial cell diversity [13], further highlighting the need for more accurate strategies for quantitative analysis at a single cell level, which would be ultimately applicable to multiple different types of tissues.

Here, we prepared the tissue sections according to the procedure described in the “Methods” section (see also Additional file 2: Table 1, Additional file 3: Fig. S1A), and we performed pixel classification using the machine learning tool Ilastik [6]. Experienced users were tasked with generating training data for nuclear and membrane markers. For the nuclear fluorescent DAPI signal specifically, Fiji [14] was used to generate annotations for training. Primary (nuclei) and secondary (cells) objects were identified with CellProfiler [7] based on the probability maps generated by Ilastik (for a detailed overview of the full procedure, see the “Methods” section and Additional file 1: MATISSE_MANUAL). We observed that incorporating fluorescent microscopy images based on DAPI nuclear staining into the MATISSE workflow resulted in superior visual and signal intensity-based separation of nuclei in dense areas (Fig. 1b, Additional file 3: Fig. S1B). Next, we assessed segmentation maps generated by both segmentation methods. The predicted cell outlines differed between IMC and MATISSE methods in colon (Fig. 1c, d), small intestine, and more distinct tissues such as skin, liver, and non-small cell lung cancer (Additional file 3: Fig. S1C). Quantitative analysis emphasized improved cell segmentation by MATISSE, which identified significantly higher numbers of cells in all regions of interest (ROIs) (Fig. 2a, IMC mean 2086 ± 611 S.D., MATISSE mean 2783 ± 622 S.D.), and an improved recall score at different rates of overlap (intersection over union) between predicted outlines and annotated ground truth (Fig. 2b–d, Additional file 3: Fig. S2B) [15]. Of note, DNA IMC signal intensity in single cells between images was similar (Additional file 3: Fig. S2C). Moreover, MATISSE displayed less cell fragmentation, as shown by a comparative analysis of cell density (Fig. 1e), decreased fraction of fragmentation events (Fig. 2e, Additional file 3: Fig. S2A), and improved edge intersection score (Fig. 2f), indicating that the increased cell number was not caused by erroneous fragmentation of cells. Together, this comparative analysis showed that MATISSE resulted in both a superior quality of segmentation and identification of a larger number of cells in the tissue.

Given the differences in numbers and segmentation quality of identified cells, we next set out to examine which cell types or tissue regions were differently segmented and thus most impacted by an improved segmentation pipeline. Clustering analysis was performed on all single cell events of all included ROIs combined to assess identified cell types, resulting in 26 clusters represented in a t-SNE plot (Fig. 3a, see Additional file 4: Table 2). Comparison of the number of cells identified in each cluster showed that specific clusters were affected by the method of segmentation in multiple ROIs (Fig. 3b, Additional file 3: Fig. S3A), confirming that improved segmentation leads to differences in quality and quantity for downstream analysis of the data. Multiple clusters displayed differences in cell numbers, including clusters with low membrane signal in IMC, and clusters displaying clear positive signal in multiple channels, indicating that signal intensity of a specific population did not bias the observed differences in segmentation (Fig. 3c). The 6 clusters with largest increase of cell numbers in MATISSE versus IMC included fibroblasts (clusters 1 and 7), epithelial cells (clusters 2, 11, 23), myeloid cells and intra-epithelial lymphocytes (clusters 11 and 23), and negative cells expressing no significant levels of stained markers (cluster 5). Next, we visualized all single cells at their spatial location in the tissue, color-coded by cluster number (Fig. 3d, Additional file 3: Fig. S3B). Focusing on the 6 clusters displaying the largest increase in cell numbers using MATISSE showed localization throughout the tissue, as expected, with clusters 2 and 23 locating in the epithelial layer, clusters 1, 7, and 11 in the basal membrane just below the epithelium, and clusters 1, 5, and 11 in the lamina propria (Fig. 3e, Additional file 3: Fig. S3C). The 6 clusters that showed lower cell numbers in MATISSE versus IMC, and 7 clusters with equal numbers of cells in both segmentation methods, were analyzed in a similar fashion (Additional file 3: Fig. S3C). This highlighted that cells identified with both methods can occur at similar spatial locations but appear more often fragmented (Additional file 3: Fig. S3D) or differently clustered in several examples using IMC-based segmentation compared to MATISSE.

Discussion

We show that the segmentation maps between IMC-based and MATISSE methods displayed major differences. The differences were most pronounced for specific cell populations, such as epithelial cells, fibroblasts, and specific immune cells where using higher resolution data in MATISSE facilitated improved annotation of separate nuclei and, consequently, superior training and segmentation. Rendering improved segmentation maps with MATISSE led to clear quantitative and qualitative changes in downstream analysis. Moreover, MATISSE demonstrated improved segmentation in the small intestine, which is similar to colon, but also in tissues such as skin and liver and in samples from non-small cell lung cancer. Of note, the panel and training in this study were designed for analysis of colon tissue and still showed reasonable performance. Future studies examining whether targeted training would even further improve performance in a range of tissues are warranted. For optimal accessibility, MATISSE has been developed based on existing technologies and open-access tools and can therefore be readily applied to different tissues and IMC antibody panels.

Conclusions

Taken together, MATISSE allowed segmentation of cellular areas such as the colonic mucosa, and keratinocyte layer of the skin, and showed a qualitative and quantitative difference in the outcome of analysis compared to IMC-based segmentation. Going forward, we expect that implementing MATISSE into tissue section analysis pipelines of colorectal samples, and beyond, will yield improved cell segmentation and enable more accurate analysis of the tissue microenvironment in tissue pathologies, such as autoimmunity and cancer.

Methods

Patients

Historical formalin-fixed paraffin embedded (FFPE) tissue blocks of colonic biopsies from patients with inflammatory bowel disease (IBD) were collected and included. Informed consent was obtained from all patients. Ten tissue sections were included from 3 patients with IBD, of 3 separate timepoints of biopsy per patient. The study was approved by the medical ethical board of the UMC Utrecht (METC protocol #11-050/E, and biobank protocols #18-676). Furthermore, TMA sections of rest-material, including intestine, liver, and non-small cell lung cancer, were included according to the no-objection-agreement, approved by the UMC Utrecht biobank committee (protocol #18-222).

Antibodies and reagents

For a comprehensive list of antibodies, compounds, and kits, see Supplementary Table 1.

Sample preparation

Tissue sections of 4 μm thickness were cut and placed on a glass slide (brand, specifics).

Slides were baked for 1 h at 60 °C. Samples were deparaffinized in xylene twice for 10 min, followed by rehydration in a graded series of ethanol (100% 10 min, 95% 5 min, 80% 5 min, 70% 5 min), washed in Milli Q water for 3 min, and finally in TBST (TBS containing 0.1% Tween) for 10 min. Antigen retrieval was performed for 30 min at 96 °C in Tris-EDTA (10 mM Tris, 1 mM EDTA) pH 9, followed by a cool-down period of 10 min at room temperature. Samples were incubated in TBST for 10 min. Tissue sections were encircled using a PAP pen. Blocking was performed with TBST containing 3% BSA and FC-block 1:100 for 1 h at room temperature. Metal-conjugated antibodies were diluted in TBST containing 0.5% BSA according to the dilutions stated in Supplementary Table 1. Staining was performed in a humidified chamber overnight at 4 °C. Samples were then washed in TBST twice for 5 min and in TBS for 10 min, followed by incubation with 300 times diluted DNA intercalator Ir193 (Fluidigm) and 1000 times diluted DAPI in PBS for 1 h at room temperature. Slides were then washed twice with ddH₂O for 5 min. Samples were mounted in 90% glycerol and covered with a coverslip for microscopy.

Fluorescent microscopy imaging

Slides were imaged on a Zeiss CellObserver using a 20× dry objective (0.75 NA, 420150-9900). A Colibri 7 was used as light source, in combination with a Zeiss 90 HE filter set. The system was equipped with a Hamamatsu Orca Flash4.0 V2+ camera (C11440-22CU). Images were acquired in a tiled Z-stack format with 10% overlap between tiles and 9 Z-slices using ZEN software (2.3). ZEN was also used to export imaging data to individual 16-bit tiff tiles. Z-stacks were converted to single in-focus images using the Extended Depth of Field plugin in Fiji at highest quality settings [14, 17]. Tile images were stitched using the MIST algorithm in Fiji [18].

Mass cytometry imaging

After microscopy, samples were unmounted by dipping and washing in ddH₂O. Samples were stained with toluidine blue for 5 min at RT, washed for 3 min with ddH₂O, and dried. Mass cytometry imaging was performed on a Hyperion (Fluidigm) laser ablation module, coupled to a Helios (Fluidigm) mass cytometer. Tuning was performed according to manufacturer instructions. Laser ablation frequency was set to 200 Hz. Data files were converted to 32-bit tiff files using the imctools library (https://github.com/BodenmillerGroup/imctools).

Histone H3 imaging typically shows nuclear localization, but with a staining pattern that differs from that of the DNA intercalators (Ir193 and DAPI), because Histone H3 is detected by antibody staining and the intercalators bind a different target. Therefore, we used both intercalators for registration (see below).

Registration

Fluorescent microscopy images of DAPI were registered to mass cytometry images of the DNA-intercalator signal, while retaining the resolution of the fluorescent images. Registration was performed using key points generated using the MOPS algorithm [19]. Images were transformed using the landmark correspondence command in Fiji (https://imagej.net/Landmark_Correspondences). The method for transformation used was moving least squares and transformation class similarity.
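As a rough illustration of what a similarity-class transformation between matched key points involves, the sketch below fits one global scale, rotation, and translation to landmark pairs by least squares. This is a simplification and an assumption: the pipeline described here uses Fiji's landmark correspondence command with moving least squares, which weights landmarks locally rather than fitting a single global transform. The function names and the landmark coordinates are hypothetical.

```python
import cmath


def fit_similarity(source, target):
    """Least-squares similarity transform w = a * z + b between 2D landmarks.

    Points are (x, y) tuples treated as complex numbers, so `a` encodes
    scale and rotation and `b` encodes translation. Assumes at least two
    distinct source points.
    """
    z = [complex(x, y) for x, y in source]
    w = [complex(x, y) for x, y in target]
    z_mean = sum(z) / len(z)
    w_mean = sum(w) / len(w)
    num = sum((wi - w_mean) * (zi - z_mean).conjugate() for zi, wi in zip(z, w))
    den = sum(abs(zi - z_mean) ** 2 for zi in z)
    a = num / den
    b = w_mean - a * z_mean
    return a, b


def apply_similarity(a, b, point):
    """Map one (x, y) point with the fitted transform."""
    mapped = a * complex(*point) + b
    return mapped.real, mapped.imag


# Hypothetical landmark pairs: target = source scaled by 2,
# rotated by 90 degrees, and shifted by (1, 1).
source = [(0, 0), (1, 0), (0, 1), (1, 1)]
target = [(1, 1), (1, 3), (-1, 1), (-1, 3)]
a, b = fit_similarity(source, target)
scale, rotation = abs(a), cmath.phase(a)
```

Here `scale` recovers the magnification between the two images and `rotation` the angle in radians. A similarity class preserves shape, which fits two images of the same tissue section that differ mainly in pixel size and orientation.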

Profile plot

The plot profile function in Fiji [14] was used to determine signal intensity along lines. Images of DAPI and DNA-intercalator were co-registered, but analysis was performed at original resolution. Intensity values were normalized per line and marker.
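The per-line, per-marker normalization can be sketched as follows. The text does not state which normalization was applied; min-max scaling to the range 0 to 1 is an assumption made here for illustration, and `normalize_profile` and the example values are hypothetical.

```python
def normalize_profile(values):
    """Min-max scale one intensity profile to the range [0, 1].

    Assumption: the source only says values were normalized per line and
    marker; min-max scaling is one plausible choice, not a confirmed one.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        # A flat profile carries no contrast; map it to zeros.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical profiles sampled along the same line, one per marker.
profiles = {
    "DAPI": [120, 480, 900, 300, 120],
    "DNA-intercalator": [2.0, 3.0, 6.0, 4.0, 2.0],
}
normalized = {marker: normalize_profile(p) for marker, p in profiles.items()}
```

Scaling each marker separately puts signals with very different dynamic ranges (16-bit fluorescence versus ion counts) on one axis, so the sharpness of the transitions between nuclei can be compared directly.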

Probability map generation mass cytometry images

Before training on imaging mass cytometry data, signal intensity was manually scaled and converted to a 16-bit range. Annotations were generated by experienced users using Ilastik [6] (1.3.3). The following channels were selected for machine learning: Pan-Keratin, E-Cadherin, αSMA, Histone H3, 193Ir, LMNB1, Ki-67, CD3, CD4, CD8a, CD14, CD16, CD20, CD45, CD45RO, and CD68.

Training data was generated for 5 classes, namely non-epithelial cellular membranes, non-epithelial nuclei, epithelial cytoplasm/membrane, epithelial nuclei, and background. Training and probability map generation was performed in Ilastik using only IMC data as input. In Ilastik all features with sigma between 0.3 and 1.6 were selected. Probability maps were saved to individual 32-bit tiff files for each class.

Probability map generation fluorescent microscopy images

Training data was generated on a random subset of 100 tile regions of fluorescent imaging data. Annotations were made by 4 experienced users for classes nuclei, edges of nuclei, and background, using Fiji [14] based only on DAPI signal. No contrast adjustments were made. Feature images and morphological filters of the raw DAPI signal were made using FeatureJ [20] and MorphoLibJ [21] respectively and added as channels to the imaging data before training in Ilastik. Features: Laplacian (σ 0.7, 1, 1.6, 2, 3), Hessian-smallest (σ 3, 5), Hessian-largest (σ 1, 2), Structure-largest (σ 1, 2), Gaussian (σ 0.7, 1.6, 2, 3.5). Morphology filters: Opening (σ 1, 2, 3, 5), Internal Gradient (σ 1, 3, 5), White Top Hat (σ 8, 10, 15, 20), Edges (σ 1, 2). In Ilastik, only the Gaussian smoothing feature with sigma 1.0 was selected, since all feature images are included in the input data. Probability maps were generated for stitched tile-scan images. Probability maps were saved to individual 32-bit tiff files for each class. Ilastik was used in headless mode on a high-performance computer cluster using 8 cores and 100 GB of memory.

Single cell segmentation

Segmentation was performed using CellProfiler [7] (v3.1.9). The pipeline can be found in the online methods. For both segmentation approaches, cells were identified first by identification of individual nuclei and second by expansion of these nuclei to the full extent of the cells. The segmentation map was stored in a 16-bit tiff format.

For IMC-based segmentation, only the probability maps generated using the IMC-data were used. For MATISSE segmentation, identification of nuclei was based on the probability maps generated using the fluorescent images of DAPI, at high resolution. The identified nuclei were next downscaled to the resolution of the IMC data and expanded to the full extent of the cells using the membrane probability generated using only the IMC data.
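The downscaling step for MATISSE can be pictured with the minimal sketch below, which reduces a high-resolution nuclear label map to a coarser grid. The actual pipeline performs this step in CellProfiler. The integer scale factor, the majority-vote rule, and the function name `downscale_labels` are all assumptions for illustration.

```python
from collections import Counter


def downscale_labels(labels, factor):
    """Downscale a 2D label map by an integer factor using a majority vote.

    Each output pixel takes the most frequent label among the
    factor x factor input pixels it covers (0 = background).
    Rows or columns that do not fill a complete block are dropped.
    """
    rows, cols = len(labels), len(labels[0])
    out = []
    for r in range(0, rows - rows % factor, factor):
        out_row = []
        for c in range(0, cols - cols % factor, factor):
            block = [labels[r + dr][c + dc]
                     for dr in range(factor) for dc in range(factor)]
            out_row.append(Counter(block).most_common(1)[0][0])
        out.append(out_row)
    return out


# Hypothetical 4 x 4 high-resolution label map with three nuclei.
high_res = [
    [1, 1, 0, 0],
    [1, 0, 0, 2],
    [0, 0, 2, 2],
    [0, 3, 2, 2],
]
low_res = downscale_labels(high_res, 2)
```

In this toy example, nucleus 3 occupies a single high-resolution pixel and disappears after downscaling, which shows why nuclei are identified at high resolution before being brought to the IMC grid rather than after.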

Segmentation score

Manual annotations for ground truth

Trained experts were asked to manually annotate individual nuclei in a subset of 30 images of nuclear staining (100 × 100 μm). These images were a composite of both fluorescent and IMC data, of DAPI and DNA-intercalator respectively, after co-registration, at high resolution. In total 2642 nuclei were annotated. The annotations were converted to a binary mask and downscaled to the resolution of IMC data.

Segmentation score calculation

For calculation of a recall score, first the intersection over union (IOU) was calculated. The IOU is determined for all manually annotated nuclei that overlap with any nuclear outline generated by either segmentation pipeline separately. IOU is calculated as the surface area of the intersection area between nuclear outlines, divided by the surface area of the union of both nuclear outlines. For each manual event only the interaction with highest IOU was taken for recall calculation, in case of multiple identified overlaps [15]. Recall is calculated as the number of true positive events, divided by the sum of number of true positive events and number of false negative events. Specifically, this recall is calculated at different required IOU thresholds, ranging from 0.5 to 1.0 with increments of 0.05.
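The recall calculation described above can be sketched as follows. Objects are represented as sets of pixel coordinates. Counting a match when the IOU is greater than or equal to the threshold is an assumption, since the text does not say whether the comparison is strict. Function names and example objects are hypothetical.

```python
def iou(a, b):
    """Intersection over union of two objects given as sets of (row, col) pixels."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def recall_at_thresholds(truth, predicted, thresholds):
    """Recall per IOU threshold; each ground-truth object keeps its best IOU.

    Ground-truth objects whose best IOU falls below the threshold
    (including those with no overlap at all) count as false negatives.
    """
    best = [max((iou(t, p) for p in predicted), default=0.0) for t in truth]
    return {
        thr: sum(score >= thr for score in best) / len(best)
        for thr in thresholds
    }


# Thresholds 0.50, 0.55, ..., 1.00 as described in the text.
thresholds = [round(0.5 + 0.05 * i, 2) for i in range(11)]

# Hypothetical annotations: one partly matched, one perfectly matched, one missed.
truth = [{(0, 0), (0, 1), (1, 0), (1, 1)}, {(5, 5), (5, 6)}, {(9, 9)}]
predicted = [{(0, 0), (0, 1), (1, 0)}, {(5, 5), (5, 6)}]
recall = recall_at_thresholds(truth, predicted, thresholds)
```

Sweeping the threshold shows how recall decays as the required overlap becomes stricter: here the first nucleus (IOU 0.75) counts as detected up to a threshold of 0.75 and as missed above it.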

The proportion of split events is calculated as the fraction of ground truth nuclear annotations that overlap with multiple nuclear events from either segmentation pipeline, over the total number of events that have any overlap. Overlap is defined as covering at least 20% of the surface area of the ground truth object.
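A minimal sketch of the split-event proportion is given below. It reads the denominator as the number of ground truth annotations with at least one qualifying overlap, which is an interpretation of the description rather than a confirmed detail. `split_fraction` and the example objects are hypothetical.

```python
def split_fraction(truth, predicted, min_overlap=0.2):
    """Fraction of overlapped ground-truth objects hit by more than one prediction.

    An overlap counts only if it covers at least `min_overlap` of the
    ground-truth object's area. Objects are sets of (row, col) pixels.
    """
    overlapped = 0
    split = 0
    for t in truth:
        hits = sum(len(t & p) / len(t) >= min_overlap for p in predicted)
        if hits >= 1:
            overlapped += 1
        if hits >= 2:
            split += 1
    return split / overlapped if overlapped else 0.0


# Hypothetical example: nucleus 1 is cut in half, nucleus 2 is found once
# (a 10% sliver is ignored), nucleus 3 is missed entirely.
truth = [
    {(0, c) for c in range(10)},
    {(5, c) for c in range(10)},
    {(9, 0)},
]
predicted = [
    {(0, c) for c in range(5)},
    {(0, c) for c in range(5, 10)},
    {(5, c) for c in range(9)},
    {(5, 9)},
]
fraction = split_fraction(truth, predicted)
```

The 20% floor keeps small boundary slivers from neighboring objects from being counted as a split.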

A probability score was calculated for intersection of cell boundaries, derived from segmentation maps from automated pipelines, with manually annotated nuclei [22]. This was performed for both IMC-based and MATISSE segmentation methods.

For calculation of fragmentation per phenocluster, we calculated the proportion of events generated by either segmentation pipeline that was identified as part of a fragmentation event, out of the total number of events that overlap for at least 20% with any ground truth annotation. A fragmentation event is defined as at least two events overlapping with a single ground truth annotation, each for at least 20% of the ground truth surface area.
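The per-phenocluster fragmentation measure differs from the split proportion in that it counts segmented events rather than ground truth annotations. A minimal sketch, assuming each event carries its phenocluster number and that overlap is measured against the ground truth area (names and data are hypothetical):

```python
from collections import defaultdict


def fragmentation_per_cluster(truth, events, min_overlap=0.2):
    """Per-cluster share of overlapping events that belong to a fragmentation.

    `truth` is a list of pixel sets; `events` is a list of
    (cluster_id, pixel_set) pairs from one segmentation pipeline.
    """
    # For every ground-truth object, list the events covering >= 20% of it.
    hits = [
        [i for i, (_, px) in enumerate(events)
         if len(t & px) / len(t) >= min_overlap]
        for t in truth
    ]
    overlapping = {i for h in hits for i in h}
    fragmented = {i for h in hits if len(h) >= 2 for i in h}

    totals, frags = defaultdict(int), defaultdict(int)
    for i in overlapping:
        cluster = events[i][0]
        totals[cluster] += 1
        frags[cluster] += int(i in fragmented)
    return {c: frags[c] / totals[c] for c in totals}


# Hypothetical example: two cluster-1 events split one nucleus, a third
# cluster-1 event matches another nucleus, a cluster-2 event overlaps nothing.
truth = [{(0, c) for c in range(10)}, {(5, c) for c in range(10)}]
events = [
    (1, {(0, c) for c in range(5)}),
    (1, {(0, c) for c in range(5, 10)}),
    (1, {(5, c) for c in range(10)}),
    (2, {(20, 20)}),
]
per_cluster = fragmentation_per_cluster(truth, events)
```

Events that overlap no annotation, such as the cluster-2 event here, are left out of both numerator and denominator, so clusters outside the annotated regions do not appear in the result.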

Single cell data generation

Single cell data was generated in R (v4.0) [23] by extracting pixel intensities from unscaled 32-bit images for all channels for all cells represented in the segmentation maps for both IMC-only and MATISSE methods.
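The extraction step can be illustrated with the sketch below, which summarizes one channel as a mean intensity per segmented cell. The study performed this step in R; Python is used here only for illustration. Using the mean as the summary follows the "mean expression per cell" mentioned under Clustering, and the function name and toy arrays are hypothetical.

```python
from collections import defaultdict


def mean_intensity_per_cell(label_map, channel):
    """Mean pixel intensity of one channel for every labelled cell.

    `label_map` and `channel` are 2D lists of equal shape;
    label 0 is treated as background and skipped.
    """
    sums, counts = defaultdict(float), defaultdict(int)
    for label_row, value_row in zip(label_map, channel):
        for label, value in zip(label_row, value_row):
            if label != 0:
                sums[label] += value
                counts[label] += 1
    return {label: sums[label] / counts[label] for label in sums}


# Hypothetical 2 x 3 segmentation map and matching unscaled channel image.
label_map = [[1, 1, 0],
             [2, 2, 2]]
channel = [[2.0, 4.0, 9.0],
           [1.0, 2.0, 3.0]]
per_cell = mean_intensity_per_cell(label_map, channel)
```

Running the same function with the IMC-only and the MATISSE segmentation maps on identical channel images yields the two single cell tables that are compared downstream.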

Spatial analysis

Segmentation maps were converted to polygons in R using packages sf [9] and stars [24]. Distances between neighboring cells were calculated using the RANN package [10] based on centroids of the cells determined with the sf package. A radius of 10 μm was used to count the number of direct neighbors for each cell and used as a measure for density.
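The density measure can be sketched as a count of centroids within a fixed radius. The study used the RANN package in R, which relies on a tree-based nearest neighbor search; the brute-force pairwise loop below is a simple stand-in that gives the same counts on small inputs. Treating a distance of exactly 10 μm as inside the radius is an assumption, and the coordinates are hypothetical.

```python
import math


def neighbor_counts(centroids, radius=10.0):
    """Count, for each centroid, how many other centroids lie within `radius`.

    `centroids` is a list of (x, y) positions in micrometers.
    """
    counts = [0] * len(centroids)
    for i, (xi, yi) in enumerate(centroids):
        for j in range(i + 1, len(centroids)):
            xj, yj = centroids[j]
            if math.hypot(xi - xj, yi - yj) <= radius:
                counts[i] += 1
                counts[j] += 1
    return counts


# Hypothetical centroids: three cells close together and one isolated cell.
centroids = [(0.0, 0.0), (6.0, 8.0), (3.0, 4.0), (50.0, 50.0)]
density = neighbor_counts(centroids)
```

The pairwise loop scales quadratically with the number of cells, so for full ROIs with thousands of cells a spatial index, as used by RANN, is the practical choice.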

Clustering

Single cell clusters were generated with the Rphenograph package [16], based on the mean expression per cell of the markers αSMA, CD14, CD16, CD20, CD3, CD4, CD45, CD45RO, CD68, CD8a, E-Cadherin, FOXP3, IL-17α, Pan-Keratin, and TCRγδ. This clustering was performed using pooled single cell data generated by both segmentation methods. The number of nearest neighbors was kept at the default value of 30 for clustering. tSNE was performed using the Rtsne package [25,26,27], again using pooled data, with settings initial dimensions 50, perplexity 30, and theta 0.5. Single cell data was log1p transformed for clustering and tSNE.

Phenocluster spatial representation

Polygons representing cell outlines were plotted and given a random color fill corresponding to their assigned phenocluster number. Clusters with a large difference in number of cells across segmentation methods were selected by taking the top 6 with the highest or lowest ratio. Clusters assumed to be equally represented in both segmentation methods were selected based on a ratio between 0.9 and 1.1. Random regions were generated by selecting a region of 100 by 100 μm from 4 randomly selected ROIs.
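The cluster selection can be sketched as a ranking by the ratio of cell counts between methods. The direction of the ratio (MATISSE over IMC) is an assumption, since the text does not state it, and the function name and counts are hypothetical.

```python
def select_clusters(counts_matisse, counts_imc, top_n=6, equal_range=(0.9, 1.1)):
    """Group clusters by their MATISSE/IMC cell-count ratio.

    Returns the `top_n` clusters with the lowest ratio, the `top_n` with the
    highest ratio, and all clusters whose ratio lies within `equal_range`.
    Assumes every cluster has a nonzero IMC count.
    """
    ratios = {c: counts_matisse[c] / counts_imc[c] for c in counts_matisse}
    ranked = sorted(ratios, key=ratios.get)
    low, high = equal_range
    return {
        "lowest": ranked[:top_n],
        "highest": ranked[-top_n:][::-1],
        "equal": [c for c in ranked if low <= ratios[c] <= high],
    }


# Hypothetical cell counts per phenocluster for the two methods.
counts_matisse = {1: 300, 2: 150, 3: 100, 4: 50, 5: 210}
counts_imc = {1: 100, 2: 100, 3: 100, 4: 100, 5: 200}
groups = select_clusters(counts_matisse, counts_imc, top_n=1)
```

With the default `top_n=6`, this corresponds to the top 6 highest and lowest ratios and the 0.9 to 1.1 band described above.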

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its supplementary information files and publicly available repositories:

Datasets: Image and other processed data are publicly available on Zenodo, doi: 10.5281/zenodo.4727873 (https://zenodo.org/record/4727873).

Scripts: https://github.com/VercoulenLab/MATISSE-Pipeline

Change history

18 June 2021
A Correction to this paper has been published: https://doi.org/10.1186/s12915-021-01065-6

References

1. Binnewies M, Roberts EW, Kersten K, Chan V, Fearon DF, Merad M, et al. Understanding the tumor immune microenvironment (TIME) for effective therapy. Nat Med. 2018;24(5):541–50. https://doi.org/10.1038/s41591-018-0014-x.
2. Tsujikawa T, Kumar S, Borkar RN, Azimi V, Thibault G, Chang YH, et al. Quantitative multiplex immunohistochemistry reveals myeloid-inflamed tumor-immune complexity associated with poor prognosis. Cell Rep. 2017;19(1):203–17. https://doi.org/10.1016/j.celrep.2017.03.037.
3. Amaria RN, Reddy SM, Tawbi HA, Davies MA, Ross MI, Glitza IC, et al. Neoadjuvant immune checkpoint blockade in high-risk resectable melanoma. Nat Med. 2018;24(11):1649–54. https://doi.org/10.1038/s41591-018-0197-1.
4. Angelo M, Bendall SC, Finck R, Hale MB, Hitzman C, Borowsky AD, et al. Multiplexed ion beam imaging of human breast tumors. Nat Med. 2014;20(4):436–42. https://doi.org/10.1038/nm.3488.
5. Giesen C, Wang HA, Schapiro D, Zivanovic N, Jacobs A, Hattendorf B, et al. Highly multiplexed imaging of tumor tissues with subcellular resolution by mass cytometry. Nat Methods. 2014;11(4):417–22. https://doi.org/10.1038/nmeth.2869.
6. Sommer CS, Köthe U, Hamprecht FA. Ilastik: interactive learning and segmentation toolkit. In: Eighth IEEE international symposium on biomedical imaging (ISBI) proceedings; 2011. p. 230–3.
7. Carpenter AE, Jones TR, Lamprecht MR, Clarke C, Kang IH, Friman O, et al. CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 2006;7(10):R100. https://doi.org/10.1186/gb-2006-7-10-r100.
8. Schapiro D, Jackson HW, Raghuraman S, Fischer JR, Zanotelli VRT, Schulz D, et al. histoCAT: analysis of cell phenotypes and interactions in multiplex image cytometry data. Nat Methods. 2017;14(9):873–6. https://doi.org/10.1038/nmeth.4391.
9. Pebesma E. Simple features for R: standardized support for spatial vector data. R J. 2018;2(1):439–46.
10. Arya S, Mount D, Kemp SE, Jefferis G. RANN: fast nearest neighbour search (wraps ANN library) using L2 metric; 2019.
11. Graham DB, Xavier RJ. Pathway paradigms revealed from the genetics of inflammatory bowel disease. Nature. 2020;578(7796):527–39. https://doi.org/10.1038/s41586-020-2025-2.
12. Smillie CS, Biton M, Ordovas-Montanes J, Sullivan KM, Burgin G, Graham DB, et al. Intra- and inter-cellular rewiring of the human colon during ulcerative colitis. Cell. 2019;178(3):714–30.e22. https://doi.org/10.1016/j.cell.2019.06.029.
13. Parikh K, Antanaviciute A, Fawkner-Corbett D, Jagielowicz M, Aulicino A, Lagerholm C, et al. Colonic epithelial cell diversity in health and inflammatory bowel disease. Nature. 2019;567(7746):49–55. https://doi.org/10.1038/s41586-019-0992-y.
14. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, et al. Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012;9(7):676–82. https://doi.org/10.1038/nmeth.2019.
15. Caicedo JC, Roth J, Goodman A, Becker T, Karhohs KW, Broisin M, et al. Evaluation of deep learning strategies for nucleus segmentation in fluorescence images. Cytometry A. 2019;95(9):952–65. https://doi.org/10.1002/cyto.a.23863.
16. Chen H. Rphenograph: R implementation of the phenograph algorithm. 0.99.1 ed; 2015.
17. Forster B, Van De Ville D, Berent J, Sage D, Unser M. Complex wavelets for extended depth-of-field: a new method for the fusion of multichannel microscopy images. Microsc Res Tech. 2004;65(1-2):33–42. https://doi.org/10.1002/jemt.20092.
18. Chalfoun J, Majurski M, Blattner T, Bhadriraju K, Keyrouz W, Bajcsy P, et al. MIST: accurate and scalable microscopy image stitching tool with stage modeling and error minimization. Sci Rep. 2017;7(1):4988. https://doi.org/10.1038/s41598-017-04567-y.
19. Brown M, Szeliski R, Winder S. Multi-image matching using multi-scale oriented patches, vol. 511; 2005. p. 510–7.
20. Meijering E. FeatureJ; 2002.
21. Legland D, Arganda-Carreras I, Andrey P. MorphoLibJ: integrated library and plugins for mathematical morphology with ImageJ. Bioinformatics. 2016;32(22):3532–4. https://doi.org/10.1093/bioinformatics/btw413.
22. Schuffler PJ, Schapiro D, Giesen C, Wang HA, Bodenmiller B, Buhmann JM. Automatic single cell segmentation on highly multiplexed tissue images. Cytometry A. 2015;87(10):936–42. https://doi.org/10.1002/cyto.a.22702.
23. R Core Team. R: a language and environment for statistical computing. 4.0 ed; 2019.
24. Pebesma E. Stars: spatiotemporal arrays, raster and vector data cubes. 0.4-1 ed; 2020.
25. Krijthe JH. Rtsne: T-distributed stochastic neighbor embedding using Barnes-Hut implementation; 2015.
26. van der Maaten LJP. Accelerating t-SNE using tree-based algorithms. J Mach Learn Res. 2014;15:3221–45.
27. van der Maaten LJP, Hinton GE. Visualizing high-dimensional data using t-SNE. J Mach Learn Res. 2008;9:2579–605.


Acknowledgements

The authors would like to thank Iris Langerak (CMM, UMCU), Danielle Krijgsman, and Domenico Castigliego (Pathology, UMCU) for experimental assistance, Eelco Brand (Gastroenterology, UMCU) for help selecting patient samples, Utrecht Bioinformatics Core (UBEC) for advice, and division LAB ICT for infrastructural support.

Funding

This work has been supported by a grant from cancergenomicscenter.nl NWO Gravitation 024.001.028 to Y.V. Y.V. received a Parental Leave Grant from Life Science Editors for editing support from Milka Kostic.

Author information

Neeraj Sinha and Mojtaba Amini contributed equally to this work.

Authors and Affiliations

Molecular Cancer Research, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, 3584 CX Utrecht, The Netherlands
Matthijs J. D. Baars, Neeraj Sinha, Mojtaba Amini, Annelies Pieterman-Bos, Stephanie van Dam, Maroussia M. P. Ganpat & Yvonne Vercoulen
Oncode Institute, Utrecht, The Netherlands
Stephanie van Dam
Department of Pathology, University Medical Center Utrecht, Utrecht University, 3584 CX Utrecht, The Netherlands
Miangela M. Laclé
Department of Gastroenterology and Hepatology, University Medical Center Utrecht, Utrecht University, 3584 CX Utrecht, The Netherlands
Bas Oldenburg

Authors

Matthijs J. D. Baars
Neeraj Sinha
Mojtaba Amini
Annelies Pieterman-Bos
Stephanie van Dam
Maroussia M. P. Ganpat
Miangela M. Laclé
Bas Oldenburg
Yvonne Vercoulen

Contributions

MJD.B. and Y.V. conceived and designed the study, interpreted the data, and drafted the manuscript. MJD.B., N.S., M.A., A.P-B., S.v.D., and MMP.G. contributed to data acquisition and analysis. M.L. and B.O. contributed to the experimental design and provided tissue samples. All authors revised the manuscript, and all authors read and approved the final manuscript.

Corresponding author

Correspondence to Yvonne Vercoulen.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the medical ethical board of the UMC Utrecht (METC protocol #11-050/E, and biobank protocol #18-676). Informed consent was obtained from all patients for participation.

Consent for publication

Not applicable.

Competing interests

The authors have no competing interests to declare.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original version of this article was revised: a typesetting mistake in the processing of Figure 1 was corrected.

Supplementary Information

Additional file 1.

MATISSE_MANUAL: an interactive PDF outlining all steps of the computational methods, to allow smooth implementation of MATISSE. All required code is publicly available on GitHub: https://github.com/VercoulenLab/MATISSE-Pipeline.

Additional file 2: Supplementary Table 1.

Antibody panel resources

Additional file 3: Figure S1.

a) Representative examples of IMC images, b) nuclear staining profiles of DAPI versus Ir193, and c) predicted cell outlines of different tissues.

Figure S2. a) Representative example of overlap between manual annotations and predictions, b) recall scores calculated for different tissues, and c) Ir193 signal intensity across all analyzed images.

Figure S3. Comparison of IMC and MATISSE performance per phenocluster. a) Cell numbers identified per phenocluster across all analyzed images, b) representative examples of cell outlines, density, and phenoclusters, c) representative examples of cells colored by specific phenoclusters, and d) fragmentation events per phenocluster.

Additional file 4: Supplementary Table 2.

Phenocluster names.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.


About this article

Cite this article

Baars, M.J.D., Sinha, N., Amini, M. et al. MATISSE: a method for improved single cell segmentation in imaging mass cytometry. BMC Biol 19, 99 (2021). https://doi.org/10.1186/s12915-021-01043-y


Received: 24 July 2020
Accepted: 30 April 2021
Published: 11 May 2021
DOI: https://doi.org/10.1186/s12915-021-01043-y
