Combined Detection and Segmentation of Overlapping Erythrocytes in Microscopy Images Using Morphological Image Processing

Portuondo-Mallet, Lariza M.; Chinea-Valdés, Lyanett; Orozco-Morales, Rubén; Lorenzo-Ginori, Juan V.; Portuondo-Mallet, Lariza M.; Chinea-Valdés, Lyanett; Orozco-Morales, Rubén; Lorenzo-Ginori, Juan V.

doi:10.13053/cys-26-4-3893

Services on Demand

Journal

Article

Indicators

Cited by SciELO
Access statistics

Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Comp. y Sist. vol.26 n.4 Ciudad de México Oct./Dec. 2022 Epub Mar 17, 2023

https://doi.org/10.13053/cys-26-4-3893

Articles

Combined Detection and Segmentation of Overlapping Erythrocytes in Microscopy Images Using Morphological Image Processing

Lariza M. Portuondo-Mallet¹²^*

Lyanett Chinea-Valdés¹

Rubén Orozco-Morales³

Juan V. Lorenzo-Ginori¹

¹Universidad Central “Marta Abreu” de Las Villas, Centro de Investigaciones de la Informática, Cuba. juanl@uclv.edu.cu.

²Universidad de Oriente, Centro de Estudios de Neurociencias, Procesamiento de Imágenes y Señales (CENPIS), Cuba.

³Universidad Central “Marta Abreu” de Las Villas, Centro de Estudio de Métodos Computacionales y Numéricos en la Ingeniería (CEMNI), Cuba. rorozco@uclv.edu.cu.

Abstract:

Segmentation of clusters of erythrocytes into their constituent single cells is a procedure needed in various biomedical applications related to microscopy images. This task is part of the general problem of splitting clumps of objects in images which continues being an open research topic in the Image Processing field. This work presents a unified morphological method to detect and segment clusters of erythrocytes in microscopy images, and proposes two main contributions. The first one is to formulate and evaluate a method to detect clusters as connected components in binary images, obtained from a previous coarse segmentation, which is not capable of further dividing a cluster into its constituent cells. Secondly, to propose the best alternative to split the clusters into their constituent individual cells after evaluating three algorithms based in the combination of the transforms: H-maxima, weighted external distance and marker-controlled watershed. Evaluation of the proposed cluster detection methods was made in terms of standard measures of effectiveness. Segmentation accuracy was evaluated comparing the segmented objects obtained to a manually segmented ground truth, by means of the Jaccard index. Then the Friedman test allowed validating the advantages of the proposed method in comparison to the other alternatives studied here.

Keywords: Image segmentation; clusters splitting; watersheds; distance transform

1 Introduction

1.1 General Background

Segmentation of clusters of overlapping or touching objects in binary images into their single components has been addressed in a variety of practical situations and continues being an open research topic in Image Processing.

Examples can be found for the case of two-dimensional gel electrophoresis overlapping spots [³⁶], segmentation of rocks in images with application to mining industry [⁴] and rock particles in general for their recognition [³⁹].

Other examples are applications related to nanotechnology [⁴³] and to agriculture and food [³], general automated size analysis in multi-flash imaging [²¹] as well as numerous applications in the biomedical field, among which segmentation of overlapped or touching erythrocytes in microscopy images, to which this work is devoted, is an important example.

A classification of the segmentation methods used in a specific biomedical application is presented in [¹⁹] where various approaches like methods based in concave point detection, blob detection, clustering and morphological processing are recognized and discussed.

Other examples are splitting of clumped or overlapped cells based on template matching strategy [⁷] and a method called Recursive Water Flow (RWF) [⁸] for cell splitting in histological images. The problem of segmenting touching cells in a 3D framework is addressed in [²³].

Segmentation of histopathological images including overlapped or touching cells was addressed in [¹³] using deep learning algorithms and spatial relationships. Splitting of 3D cell clusters for the case of volumetric confocal images is presented in [¹⁵].

A combined method for overlapped cell detection and segmentation based in features obtained from the skeleton and the contour of the cells is showed in [¹⁶]. A semi-automatic approach for detection and segmentation of cell nuclei based on graph-cuts and Laplacian of Gaussian (LoG) filtering is proposed in [¹].

A method based on concave points extraction through polygonal approximation and ellipse fitting bubbles with average distance deviation criterion and two constraint conditions was addressed in [⁴⁵]. Reference [⁴⁴] employed a modified version of curvature scale space method to extract corner points and then recognize the concave points by evaluating angular changes.

These concave points and the centroid points are then used to characterize the structure of the cell clump and to construct the split line by using the corresponding splitting strategy. Other approach proposed recently to split overlapped cells based on elliptical shape models appers in [²⁹].

Various approaches to segment clusters in images from the Papanicolaou test are presented in [³¹, ³², ³³, ³⁸] and other diverse microscopy image applications using methods not based in mathematical morphology were reported in [²⁸, ²⁷, ³⁴, ⁴¹].

The method proposed in this work uses an approach to segment clusters based in morphological image processing techniques and under such view, we will comment about methods of this kind in more detail. A method to split cell clumps based in the use of different morphological scales after iterative erosion to find cell-specific markers is developed in [³⁷].

In spite of the good results they obtained, the authors point out that at the time of their publication a comprehensive benchmark using a database of cell clumps or clumped objects was not available. It is worth to notice, however, that to our knowledge such benchmark does not exist yet.

A morphological method is presented in [¹⁷] based in the use of an adaptive H-minima transform together with an external distance and marker-controlled watershed transform to segment cell clusters, with good results in terms of percentages of correctly segmented clusters.

Reference [¹⁸] followed this line of work and it was introduced there a parameterization using an ellipsoidal modeling of contours to perform a more appropriate analysis. The authors expressed their results there in terms of percentages of correctly split clumps.

Various alternatives of the use of markers considering minima imposition were studied in [⁶] where a relative equivalence was found between different approaches, to represent the markers used to control the watershed transform in order to split the clumps.

Other morphological approach using the watershed transform complemented with a corner detection algorithm appears in [²⁶]. The classical watershed and distance transforms are used in [⁴⁰], specifically to segment chromosomes showing overlapping.

An improved ultimate erosion process (UECS) together with an edge-to-marker association is proposed in [³⁰] to separate the overlapping convex objects in electron micrographs.

In this work, the authors used a noise-robust measure of convexity (or concavity) based on the sensitivity to the coarseness of digital grids as the stopping criterion for erosion. The missing contours of the occluded particles are inferred using a Gaussian mixture model on B-splines.

In reference [⁴²] the gradient-barrier watershed algorithm is proposed, in which the gradient in the overlapping region is used directly as the barrier to the water flow. A cluster segmentation method based in the use of structural features and morphological image processing is showed in [²⁰], again obtaining high accuracy in terms of performance measures (sensitivity, specificity) of the cluster detection process as well as accuracy of segmentation.

A review of the use of mathematical morphology techniques in malaria studies, which includes the segmentation of overlapped cells is presented in [²⁵]. Reference [²²] presents a method based in a watershed algorithm that iteratively identifies markers, considering a set of different h values in the H-minima transform.

This method showed good results, but it is oriented to the specific case of wide-field fluorescence microscopy images and requires calculating a fair gradient map from the original image as well as defining heuristically some parameters.

Recently, deep learning algorithms, in particular convolutional neural networks (CNN) have been also applied in medical image analysis [²⁴]. The fully convolutional neural network U-Net [³⁵] has significantly influenced the field of cell segmentation.

This network model was designed to work with few training images and to obtain accurate segmentation. In [²] deep learning was applied to predict cell nuclei and combined with thresholding and watershed transform to segment different types of cells.

Their approach was developed only for fluorescent images with stained cytoplasm. A modified version of U-Net called MultiResUnet is proposed in [¹⁴] and obtained better results than using the classical U-Net.

In reference [¹²] is proposed a method called BubCNN which employs a Faster region-based CNN (RCNN) detector module to locate bubbles and a shape regression CNN to predict bubble shape parameters.

A great future can be foreseen for deep learning based models in this kind of applications, however training deep networks tends to be computationally expensive and might require large numbers of annotated data, which is a time-consuming process. This implies that other conventional image processing techniques like those presented here can be still a valuable choice for the task addressed in this work.

1.2 Unified Framework for Detection and Segmentation of Clusters

We introduce in this work a unified method oriented to segment with high effectiveness clusters having up to medium complexity, which means roughly less than 30% overlapping which could be considered to allow a useful individual cell analysis after splitting. We mention also that erythrocytes consist usually in round-like objects of moderately variable sizes.

The algorithm used to segment the clusters operates by means of a combination of the conventional distance transform, the H-maxima transform, morphological operations and a weighted external distance transform combined with marker-controlled watershed segmentation, as will be described in detail later. This allowed using the information obtained during the clusters detection to facilitate their subsequent segmentation (split).

The method presented here showed a high effectiveness in detecting the clusters in terms of performance measures like sensitivity, specificity, accuracy, precision and F-measure, as well as a high segmentation accuracy. The latter was measured in terms of the Jaccard index obtained when comparing the computer-segmented objects to a manually segmented ground truth.

2 Materials and Methods

2.1 Images Dataset

The whole detection and segmentation process begins with a coarse segmentation, which produces a binary image in which the touching or overlapping erythrocytes remain as connected components. The binary images used in our experiments contain clusters of various sizes and were obtained through coarse segmentation of microscopy images, which correspond to mice peripheral blood smears stained with Giemsa.

Other components of the blood smears as leukocytes and platelets were eliminated from the image using image processing techniques, not described here as our interest resides in the segmentation of the remaining clusters of erythrocytes.

A Zuzi microscope model 148 was used to acquire the images, equipped with a plan-achromatic lens having 1.25 numerical aperture and a 0.5 magnification of the camera adaptor, with a 319CU digital camera of 3.2 megapixel and 8-bit RGB uncompressed output, obtaining a resolution of 2048 × 1536 pixels. The objective power used was 100× with immersion oil, obtaining a total magnification of 50×, which results roughly in around 140 pixels per cell diameter for the images employed in the experimental work.

The images were saved in.tiff (tagged image file) format. Then, the images were segmented by thresholding to obtain the set of binary image containing independent, single objects as well as clusters of various sizes and complexity.

Other steps in this process included conversion to grayscale prior to thresholding and then, morphological area-opening filters are used to remove items smaller than a red blood cell and to fill the holes left after thresholding. We stress the fact that this primary ”coarse” segmentation is not of concern to this research and its role was only to obtain images containing appropriate clusters to perform the experimental work.

The dataset created consists of 43 images containing in total 4265 binary objects, 1081 of which can be considered as clusters and 3184 as individual cells. Fig. 1 shows an original image and its corresponding binary image after coarse segmentation and Fig. 2 exhibits four examples of connected regions forming clusters.

Fig. 1 Microscopy image and coarse segmentation (a) original image (b) coarse segmentation from image (a)

Fig. 2 Connected regions forming clusters

2.2 Detection of the Clusters Contained in the Binary Images

To detect the clusters contained in the binary images that were obtained as described in the previous section, we followed a method that uses both the conventional (inner) distance transform, the external distance transform (EDT ) and a weighted version of it (WEDT ) as well as the H-maxima transform and some morphological processing operations, in a process described in detail in what follows.

This approach was used because it produces the inner markers needed afterward in the splitting process. The distance transform DT(A) described in [⁹] is defined in the following manner: for any point x in A, DT(A)(x) is the distance from x to the complement of A:

DT(A)(x)=min⁡{d(x,y),y∈AC}. (1)

To calculate DT(A)x, firstly the binary image from the coarse segmentation, which has one-valued foreground pixels, is complemented. Then DT(A)x is calculated as the distance from each zero-valued pixel to the nearest one-value pixel.

As the inner distance transform is applied here to the complement of a connected component, its result is a grayscale image exhibiting its highest intensity in a point or patch, which is in general a regional maximum, located farthest from the background. This process is depicted in Fig. 3.

Fig. 3 (a) Binary image from overlapping cells. (b) Complemented binary image. (c) Distance transform

The eventual appearance of spurious maxima will be addressed later. To define the external distance transform, consider the set B of pixels in the background (binary level 0) of the binary image under analysis.

Then for any point x∈B, EDT(B)(x) is the distance from x to the nearest pixel pertaining to a marker point (binary level 1), usually taken as a regional maximum as described in the previous paragraph:

EDT(B)(x)=min⁡{d(x,y)y∈BC}. (2)

The proposed methodology followed a sequence of steps to determine whether a connected component in the binary image (obtained from the previous coarse segmentation) corresponds to a cluster or to a single object and then, split those that are considered as clusters. These steps were:

Labelling the connected components and calculating the inner distance transform map for each one.
Obtaining the valid regional maxima of the distance transform (DT ) for each binary object present in the image).
Classify as clusters all the objects having more than one of these maxima.
Build the skeleton by influence zones (SKIZ) [⁹] which correspond to the regional maxima for each cluster, using the weighted EDT (WEDT ), which is the EDT with its values divided (weighted) by a factor obtained during the selection of the valid regional maxima described in the next section.
Segment the clusters into their constituent components by means of the marker controlled watershed transform [⁹], using the SKIZ lines as external markers and the regional maxima as inner markers.

When building the EDT map in setp 4 to obtain the SKIZ, the distances from a background pixel to each regional maximum were weighted by a coefficient, which depends on the magnitude (height) of the regional maximum, previously normalized to the interval [0, 1]. Segmentation by means of the watershed transform followed the previous steps.

We point out, however, that obtaining valid regional maxima corresponding to the clustered binary objects is not a trivial task. The clusters may have a moderately irregular contour, and therefore several spurious maxima can appear after calculating the distance transform.

These spurious maxima are usually deemed as noise and can lead to over segmentation when used as markers for segmenting using the marker-controlled watershed transform.

2.3 Determining the Valid Regional Maxima in DT(A)(x)

In this work, three methods were applied and compared in order to determine the valid regional maxima present in the binarized clusters.

— Method 1: Iterative H-maxima transform. This method apply iteratively the H-maxima transform to the distance transform map of each complemented binary objects and afterwards counting the number of remaining regional maxima.
— Method 2: Morphological filtering. This method has the purpose of transforming the set of spurious regional maxima formed around the center of a single (and perhaps part of a cluster) object into one valid, unique maximum.
In this case, an alternating open-close sequential filter [⁹] with two stages and a disk structuring element is applied to the distance transform map. Then, the algorithm extracts the regional maxima and the magnitude (height) of these resulting maxima is considered representative of that of the individual merged maxima.
— Method 3: Radon transform. This method is described in [¹¹] where the Radon transform and morphological operations are used to find the markers for the erythrocytes.

A detailed description of these methods is presented in the next section.

2.4 Detailed Description of the Methods Used to Detect Clusters

The H-minima and H-maxima transforms are powerful tools to suppress undesired minima or maxima in a grayscale image.

In this case, we applied the H-maxima transform to the distance transform map corresponding to the complemented binary image, obtained from the coarse segmentation step. The H-maxima transform HMAX is defined in [⁹] as:

HMAXh,D(f)=fΔD(f−h), (3)

where ΔD is the morphological operation of geodesic reconstruction, f is the intensity image, h is a height parameter and D is the structuring element. The HMAX transform removes any intensity dome in the image having height less than h and decreases the height of the other domes by h. Calculation of HMAX tends to eliminate successively the spurious maxima of different heights as the parameter h increases by iterative steps. Once the spurious maxima are eliminated or merged, if the parameter h continues increasing, at some moment the regional maxima pertaining two adjacent clustered objects will also merge.

This fact is used in as stop criterion in [¹⁷], where the dual H-minima transform is used in an analogous way. The algorithm in this reference goes back one step to keep isolated the regional minima pertaining to different adjacent merged objects.

However, increasing h in small steps until merging the maxima from adjacent objects implies in our case an unnecessary computing burden, because actually there is only the need to suppress the spurious maxima, which will occur after only some few steps.

In order to find a practical solution to this problem, experimental work with a large number of diverse clusters was performed, testing the results of iterations increasing the parameter h.

It was found experimentally that the number of maxima stabilizes in the desired value after at least five successive iterations in practically all cases, without further decrements in the number of maxima until the merging phenomenon previously mentioned occurs.

This determined the use as stop criterion for the iterative H-maxima transform the constancy of the number of detected regional maxima during five successive iterations. If after this convergence more than one maximum remain present in a connected component being analyzed, it is possible to say that we are in presence of a cluster, given that a single erythrocyte would show only one maximum.

Then, the maxima obtained for the different components in the image are saved. These maxima will be used later as internal markers to be used in the watershed segmentation, together with the last height value obtained from the HMAX transform, which will be also used for separating the clusters into individual objects. On the basis of the previous discussion, three methods to detect clusters were implemented and compared, whose algorithms are summarized as follows:

2.4.1 Method 1: Iterative H-maxima Transform for Detecting Clusters

Perform the coarse segmentation of the image using a standard method and label the resulting binary connected components, which can be either single objects or clusters.
For each labeled object i do:
- a) Compute the distance transform (Euclidean) on the complement of the ith connected component and normalize the obtained grayscale image Dmap to the range [0,1].
- b) Count the number of regional maxima in Dmap for each labeled connected component; let this number be N.
- c) Guess an initial parameter value h=0.01.
- d) While N>1, successive calculations of the HMAX transform incrementing h in small steps (experimentally set to 0.05) begins until the calculated number N of regional maxima repeats its value a number of times, reaching a count heuristically set to five, or N reaches the value 1.
- This was the criterion of convergence for the calculation of the number of maxima and the suppression of spurious extrema. Here in each iteration the new value of N is saved and compared with the previous one, to allow counting the number of repeats of it. Every time N changes, the counter is reset to one and counting re-starts.
- e) If N>1 after convergence, the labeled binary object is classified as a cluster and the algorithm, as will be seen, calls the method SplitClusterWEDT in order to split it. This function would receive two additional parameters, these are the final calculation of the H-maxima transform, which contains the information about the heights of its regional maxima, as well as the regional maxima map, stored respectively in Hmap and RegMax, that were obtained in step (d).

The pseudo code illustrates this algorithm for the Iterative H-maxima transform method. Fig. 4 shows a binary object corresponding to a cluster of 2 erythrocytes and the regional maxima obtained for it during its processing. Notice that in this case N=9 initially and at the end of the algorithm run N=2 as it should be.

Fig. 4 (a) Regional maxima superimposed to the distance transform map in Fig. 2, notice the presence of multiple spurious maxima. (b) Final regional maxima after the iterative search, where the spurious maxima have been merged into two single ones, as expected

2.4.2 Method 2: Morphological Filtering

This method applies a morphological approach to detect clusters and extracting markers for both the clusters and the single cells. The steps are as follows:

Perform the coarse segmentation of the original image in the same way as in Method 1.
Determine the distance transform (Euclidean) of the complement of this binary image and then normalize it. Let be Idt the resulting image.
Compute a two-stages open-close alternating sequential filtering (ASF), using a disk structuring element g with radius 1 and 2 in the first and second filtering stages respectively, in order to eliminate the spurious maxima. We call the resulting image Ioc. The general expression for this filtering process is:

ASFCO,g2(f)=((((f∘g)•g)∘2g)•2g). (4)

For which in this case f is the Idt image. Here ∘ and • mean respectively morphological opening and closing.
4. Determine the regional maxima on Ioc and call the resulting image Irm.
5. For each labeled connected component present in the binary image:
- a) Compute a logical AND operation between the binary image of the connected component and Irm. We call the resulting image Imark.
- b) Count the numbers of regional maxima on Imark with the aid of labeling the connected components contained in it.
- c) If the number calculated in (b) is greater than one the object is classified as a cluster and its division is carried out using the SplitClusterWEDT method, which receive as arguments the binary image of the cluster, the regional maxima map of the cluster (Imark) and the distance transform image after the open close filtering (Ioc). In other cases, the object pertaining this connected component is classified as a single erythrocyte.

2.4.3 Method 3: Radon Transform (RT)

This method uses the Radon transform to find the markers for the cells as described in [¹¹]. The search for markers is performed based on the ability of the RT to detect shape parameters and their behavior with circular structures.

The circular structure edge was determined previously in order to apply the direct RT and after that the sinogram projections were filtered using a matched filter having a horseshoe-shaped impulse response. This filter was used to enhance the projections of all circular structures with radius r, which is computed from the median cell area in each image.

Then, an image with peaks close to the circular structures centers is obtained by means of the inverse RT. After this, a threshold is applied which is calculated by means of histogram analysis of the reconstructed grayscale image.

Finally, a morphological closing was performed in order to identify the final markers of each cell. Once the image containing the markers is obtained, we proceed to determine which connected components within the coarse segmentation image can be considered as clusters for their subsequent division by means of the SplitClusterWEDT method.

Similarly, to the previous method, for each connected component of the binary image obtained by means of the coarse segmentation, a logical AND operation of it with the whole markers image is performed to obtain the final markers that correspond to the specific connected component that is being analyzed. The resulting markers are labeled and if their number is greater than one the corresponding object is considered as a cluster.

The SplitClusterWEDT algorithm needs three arguments, which in this case are the binary image of the cluster, the markers corresponding to this cluster and the normalized distance transform of the logical complement of the cluster binary image.

As a final comment concerning the last step in the previous descriptions, e.g. calling the method to split the clusters, we emphasize the fact that aside from the cited SplitClusterWEDT method, splitting by means of the classical marker controlled watershed transform as well as using the EDT were also tested and compared, as described in the following section.

2.5 Segmentation of Clusters Into Their Constituent Objects

The algorithm devoted to segment the connected components identified as clusters into their constituent parts takes three inputs. The first one C is the binary image of the cluster. The second parameter RegMax is the binary image of the valid regional maxima identified during cluster C detection.

The third one Hmap depends upon the clusters detection method employed. The output of this algorithm is the binary image Cseg of the cluster, divided into its constituent components.

The algorithm begins by labeling and counting the connected components of the regional maxima contained in RegMax and setting their values in the variables LRM and Num respectively. Then follows a loop having as many iterations as regional maxima are present in C.

This loop starts initializing a binary matrix S to zero and then setting to one the elements of S whose positions match with the elements labeled i in the LRM matrix. The described loop can be implemented instead through vector operations for the sake of computational efficiency.

Then the algorithm computes an element-wise multiplication (Hadamard product) between matrices S and Hmap to obtain a new matrix called HeightRM, whose values correspond, for Method 1, to those of the h-maxima transform in the region occupied by each regional maximum and are zero in the rest of the matrix. In this case, for methods 2 and 3 the values of the distance transform are used instead of the h-maxima transform values.

Then for each regional maximum its height value called divfact, is used to weight (divide) the EDT value associated to this maximum.

A three-dimensional array called dtarray is built in which its ith level is a matrix that contains the weighted EDT (WEDT ), which is the EDT with its values divided (weighted) by divfact.

The reasoning behind this procedure is that the WEDT value calculated in some specific point tends to be lower for a larger height of the maximum and viceversa.

This fact determines that the SKIZ lines tend to separate from higher maxima and come closer to lower maxima, and this leads to a better location of the SKIZ lines (equal distance) to segment clustered objects having different size.

The remaining i values give rise to matrices corresponding to each regional maximum, each one of them with its respective weight. The algorithm saves in dtarray the WEDT for each regional maximum in a cluster.

Then it computes the global WEDT map taking in each coordinate point of the image plane, the minimum value of the WEDT, calculated for all i saved in dtarray and saving it in im4ws matrix.

Then the marker controlled watershed transform is applied to this matrix to obtain the SKIZ lines which will be used to segment the binary cluster C.

Fig. 5 shows a block diagram illustrating the described algorithm, the pseudo code for it is shown above. Two alternative methods were compared with the proposed algorithm in other to explore their accuracy.

Fig. 5 Block diagram of the algorithm to segment the clusters using the Weighted External Distance Transform

These methods were the marker controlled watershed transform using the inner distance transform (CW ) and the marker controlled watershed transform using the external distance transform (EDT ).

The combination of the three methods implemented for the detection of clusters (Iterative H-maxima transform, Morphological filtering and Radon transform) with the three methods to split them into their constituent objects form nine combined methods.

Fig. 6 shows the result of the segmentation using the Iterative H-maxima method to detect markers and the three ways to split the cluster: the inner distance transform, the external distance transform and the proposed weighted external distance transform.

Fig. 6 Watershed lines in the segmentation result after detecting the clusters by means of iterative H-maxima algorithm. (a) Ground truth. (b) Using inner distance transform. (c) Using external distance transform. (d) Using the weighted external distance transform

In this figure, we can notice the difference in terms of the watershed lines. In (a) the ground truth lines (b) broken lines can be observed, in (c) the line is somewhat displaced from the right position and in (d) the splitting line appears in a right place.

2.6 Evaluating the Effectiveness of Clusters Detection

A comparison between the three methods to detect clusters allowed determining the most appropriate alternative.

This comparison considered the detection of clusters in terms of true positives (TP) or clusters classified as such, false positives (FP) single objects classified as clusters, true negatives (TN) single objects correctly classified, and false negatives (FN) as clusters classified as single objects. From these data, the indexes of effectiveness: sensitivity, specificity, accuracy, F-measure and precision were calculated.

These measures are defined as follows:

Sensitivity=TPTP+FN, (5)

Specificity=TNTN+FP, (6)

Accuracy=TP+TNTP+TN+FP+FN, (7)

F-measure=2TP2TP+FP+FN, (8)

Precision=TPTP+FP. (9)

2.7 Evaluating the Segmentation Accuracy

The segmentation accuracy was tested using a ground truth composed by 500 binary clusters obtained from a first coarse segmentation, from which a careful, manually segmented version was built by digitally drawing an appropriate straight line between the vertices of the concavities that appear just at the points where the overlapping region of the roundish erythrocytes begin, as shown in Fig. 6a.

These clusters comprised two to eight single touching or overlapping objects with low to moderately different shapes, sizes and spatial orientations, up to 1220 single objects. The metric used to evaluate the accuracy of the segmentation was the Jaccard similarity index [¹⁰], which measures the coincidence between the segmentation result and the ground truth and is defined as:

J(A,B)=|A∩B||A∪B|, 0≤J≤1, (10)

where A and B are the binary sets to be compared and |∗| means the cardinality of sets. A result J=1 means perfect coincidence between the binary images while J=0 indicates total lack of coincidence. In our case, A would be the manually segmented object and B the object obtained from the automated segmentation method.

The analysis and interpretation of the results when evaluating the Jaccard coefficients was performed applying statistical tests.

We compared the nine methods using the Friedman’s non-parametric rank test with a Bergmann and Hommel’s correction for the post-hoc analysis. These tests were computed using the public R scmamp package [⁵].

3 Results and Discussion

Fig. 7 shows two segmentation results using the HmaxWEDT method. Here the Iterative H-maxima transform is used to detect clusters and extract the inner markers and the weighted external distance transform (WEDT) to split the clusters into their constituent objects.

Fig. 7 (a) Binary image with two clusters (b) Segmentation result using the HmaxWEDT method

We stress the fact that this combination of methods obtained the best results. The effectiveness in the detection of clusters was measured in terms of sensitivity and specificity. We analyzed 43 images containing 4265 binary objects. Table 1 shows the indexes of effectiveness in the detection of clusters, for the three methods analyzed: Iterative H-maxima transform, Morphological filtering and Radon transform.

Table 1 Indexes of effectiveness in the detection of clusters

Indexes	Iterative Hmax	Morph Filtering	Radon Transform
TP	1057	1068	945
TN	3182	3181	3187
FP	2	3	0
FN	24	13	133
Sensitivity	97.78%	98.8%	87%
Speciﬁcity	99.94%	99.91%	100%
F-measure	98.79%	99.26%	93.43%
Accuracy	99.39%	99.62%	96.88%
Precision	99.81%	98.72%	100%

The numbers in the tables were rounded to two decimal places. Table 2 shows the descriptive statistics of the Jaccard coefficients calculated for the nine methods analyzed, for which 1220 objects were used.

Table 2 Descriptive statistics of the Jaccard coefﬁcient for the nine methods

Method	Mean	Median	St.Dev.	Max	Min
HmaxCW	0.946	0.948	0.01	0.965	0.853
HmaxEDT	0.985	0.991	0.021	1	0.749
*HmaxWEDT*	0.993	0.996	0.01	1	0.892
MorphCW	0.94	0.948	0.057	0.965	0.332
MorphEDT	0.977	0.991	0.055	1	0.157
MorphWEDT	0.987	0.994	0.042	1	0.393
RadonCW	0.946	0.948	0.017	0.965	0.561
RadonEDT	0.98	0.99	0.04	1	0.258
RadonWEDT	0.987	0.994	0.034	1	0.408

Here Hmax, Morph and Radon stand for the Iterative H-maxima transform, Morphological filtering and Radon transform respectively, and CW, EDT and WEDT for the classical watershed transform, external distance transform and weighted external distance transform.

This table shows that the method HmaxWEDT exhibited better results than the others, in terms of mean, median and standard deviation. Similar results were obtained with the other methods when using the WEDT.

The Friedman test found statistically significant differences in results among the compared algorithms with a p-value of 2.2e-16 (test statistic = 6234.5). Then, the Bergmann and Hommel post-hoc procedure was carried out in order to find which combination of methods showed a statistically significant difference.

As a further description in order to have a better understanding of the possible similarities and differences among the tested algorithms, we plotted and show in Fig. 8 a critical difference plot with the corrected p-value and α=0.05.

Fig. 8 Cross-comparison for the nine algorithms tested using the Friedman test and the Bergmann and Hommel post hoc correction. Groups of methods that are not significantly different appear connected by a horizontal line

In this plot, each algorithm is placed on an axis according to its average ranking. Then, those algorithms that do not show significant differences are grouped together using a horizontal line. The rankings in the plot assume that larger values have a poorer rank.

In our case, the plot shows that, in general, the HmaxWEDT combination method ranked significantly better than the other combined algorithms, showing as well statistically significant differences in comparison with the others.

Another representation of the results of this test is shown in Fig. 9, where in this graph each node represents an algorithm and shows its name and the computed Friedman’s test statistic.

Fig. 9 Friedman test with the Bergmann and Hommel post hoc correction for the nine algorithms tested. Groups of methods that are not significantly different appear as connected nodes

A node with a filled background in green indicates the best ranked algorithm after this comparison. Lines between nodes indicate that the differences between connected algorithms are not found to be significant for α = 0.05, according to the Bergmann-Hommel post-hoc procedure.

There are no significant differences between the algorithms HmaxCW, MorphCW and RadonCW for which their mean ranks are very similar.

These three algorithms in spite of the way they use to detect the markers for the objects -using the methods Iterative H-maxima transform, morphological approach and Radon transform respectively- have in common the way used to split the clusters, e.g. using the classical watershed transform.

The same occurs with pairs MorphEDT and RadonEDT which use the external distance transform; and MorphWEDT and RadonWEDT which use the proposed weighted external distance transform.

3.1 Comparative Study

In order to make a comparative assessment of our proposed method, experimental results were compared to other state-of-art methods cited in the present article.

In spite that these works do not use the same database or even the same type of cells, the global results may provide an idea about how the figures obtained in the experiments reported in this article compare with those obtained in other works in this field.

In reference [¹⁸] the results are expressed in terms of percentages of correctly segmented clusters obtaining a 96.43% accuracy on cervical and breast cancer images.

The accuracy results obtained in our works are higher compared with this reference in spite that the image are from different types of cells. Reference [²⁰] showed their results in terms of performance measures of overlapped cells detection as well as accuracy of splitting.

They achieved 97.4% accuracy in the overlapped cells detection on the test set. In our work, we obtain better results in the cluster detection process achieving 99.39% and 99.69% accuracy with the Iterative H-maxima transform and Morphological filtering methods respectively.

Reference [⁴⁴] obtained high results in terms of sensitivity, precision and F-measure where true positive (TP) is the number of correctly split objects. Three datasets were used to evaluate the performance of the method and average values of sensitivity = 98.29%, precision = 99.02% and F-measure = 98.65% were obtained.

In our work, the TP is the number of objects classified correctly as clusters, and in this sense, we obtained 99.81% precision and 98.79% F-measure by the Iterative H-maxima transform method as well as 99.72% precision and 99.26% F-measure by the Morphological filtering method which are slightly better.

3.2 Runtime Analysis

This study was carried out using MATLAB (2016a version) on a computer with an Intel Core i3-2310M processor clocked at 2.10 GHz and with 4 GB of RAM and 64 bits Windows 10 Pro operating system. To reduce the computational load, the binary image obtained from the coarse segmentation was resized to resolutions of 1024 × 768 pixels.

Table 3 shows for one resized binary image the total of connected components (CC) and the indexes of TP, TN, FP and FN detected by the three methods. For this image, the Iterative H-maxima transform and Morphological filtering obtained the same results.

Table 3 Detection of cell clusters for one image Table 4. Mean runtime for each method (in seconds)

Method	CC	TP	TN	FP	FN
Iterative H-maxima	117	27	89	0	1
Morph ﬁltering	117	27	89	0	1
Radon transform	117	23	89	23	5

These two methods exhibited the best results obtaining the TP and consequently the best results in terms of sensitivity, F-measure and accuracy showed earlier in Table 1.

Table 4 shows a comparison of the mean running times for the three clusters detection algorithms combined with the three methods used to split the clusters in their constituent parts.

Table 4 Mean runtime for each method (in seconds)

Method	Radon	Morph Filtering	Iterative H-maxima
Classical Watershed transform (CW)	8.66	1.82	46.41
External distance transform (EDT)	8.59	2.11	46.86
Weighted external distance transform (WEDT)	23.11	18.87	62.73

The morphological filtering method combined with the three cluster-splitting methods showed the best performance in terms of speed, which is a very important factor when analyzing large numbers of images.

The Iterative H-maxima method was the most time consuming. This result is a consequence of the need to perform a number of iterations calculating the H-maxima transform, which has a relatively high computational cost, in order to obtain the appropriate h values.

The same occurs with the WEDT method used to split the clusters. In this case, each detected cluster is to be analyzed to compute the weighted distance transform, which has a higher computational cost.

4 Conclusion

This research explored various alternatives to detect and split connected components in binary images, which appear in segmentation processes of microscopy images having touching or overlapping erythrocytes. The scope of this approach was constrained to blood smear images containing erythrocytes having moderate differences in size as well as a moderate degree of overlapping.

Three methods to detect connected components associated to clusters, named Iterative H-maxima transform, Morphological filtering and Radon transform were used, as well as three methods to split these connected components in their constituent parts, named in this case external distance transform (EDT ), the classical watershed transform (CW ) and the weighted distance transform (WEDT ) which result in nine possible combinations.

References

1. Al-Kofahi, Y., Lassoued, W., Lee, W., Roysam, B. (2010). Improved automatic detection and segmentation of cell nuclei in histopathology images. IEEE Transactions on Biomedical Engineering, Vol. 57, No. 4, pp. 841–852. DOI: 10.1109/TBME.2009.2035102. [ Links ]

2. Al-Kofahi, Y., Zaltsman, A., Graves, R., Marshall, W., Mirabela, R. (2018). A deep learning-based algorithm for 2-D cell segmentation in microscopy images. BMC Bioinformatics, Vol. 19, No. 365. DOI: 10.1186/s12859-018-2375-z. [ Links ]

3. Brosnan, T., Sun, D. W. (2002). Inspection and grading of agricultural and food products by computer vision systems—a review. Computers and Electronics in Agriculture, Vol. 36, No. 2-3, pp. 193–213. DOI: 10.1016/S0168-1699(02)00101-1. [ Links ]

4. Cabello, E., Sánchez, M., Delgado, J. (2002). A New Approach to Identify Big Rocks with Applications to the Mining Industry. Real-Time Imaging, Vol. 8, No. 1, pp. 1–9. DOI: 10.1006/rtim.2000.0255. [ Links ]

5. Calvo, B., Santafé Rodrigo, G. (2016). scmamp: Statistical comparison of multiple algorithms in multiple problems. The R Journal, Vol. 8, No. 1. [ Links ]

6. Chinea-Valdés, L., Lorenzo-Ginori, J. (2011). Evaluation of distance transform based alternatives for image segmentation of overlapping objects. Scientific Conference on Computer Science and Informatics. [ Links ]

7. Díaz, G. (2008). Automatic clump splitting for cell quantification in microscopical images. In Progress in Pattern Recognition, Image Analysis and Applications Lecture Notes in Computer Science, Vol. 4756. pp. 763–772. [ Links ]

8. Dorfer, M., Mattes, J. (2016). Recursive water flow: A shape decomposition approach for cell clump splitting. IEEE 13th International Symposium on Biomedical Imaging (ISBI), pp. 811–815. DOI: 10.1109/ISBI.2016.7493390. [ Links ]

9. Dougherty, E. R., Lotufo, R. A. (2003). Hands-on Morphological Image Processing. SPIE. DOI: 10.1117/3.501104. [ Links ]

10. Ge, F., Wang, S., Liu, T. (2007). New benchmark for image segmentation evaluation. Journal of Electronic Imaging, Vol. 16, No. 3. DOI: 10.1117/1.2762250. [ Links ]

11. González-Betancourt, A., Rodríguez Ribalta, P., Meneses Marcel, A., Sifontes Rodríguez, S., Lorenzo Ginori, J. V., Orozco Morales, R. (2016). Automated marker identification using the Radon transform for watershed segmentation. IET Image Processing, Vol. 11, No. 3, pp. 183–189. [ Links ]

12. Haas, T., Schubert, C., Eickhoff, M., Pfeifer, H. (2020). BubCNN: Bubble detection using faster RCNN and shape regression network. Chemical Engineering Science, Vol. 216. DOI: 10.1016/j.ces.2019.115467. [ Links ]

13. Hatipoglu, N., Bilgin, G. (2017). Cell segmentation in histopathological images with deep learning algorithms by utilizing spatial relationships. Medical; Biological Engineering; Computing, Vol. 55, No. 10, pp. 1829–1848. DOI: 10.1007/s11517-017-1630-1. [ Links ]

14. Ibtehaz, N., Rahman, M. S. (2020). MultiResUNet : Rethinking the u-net architecture for multimodal biomedical image segmentation. Neural Networks, Vol. 121, pp. 74–87. DOI: 10.1016/j.neunet.2019.08.025. [ Links ]

15. Indhumathi, C., Cai, Y. Y., Guan, Y. Q., Opas, M. (2011). An automatic segmentation algorithm for 3D cell cluster splitting using volumetric confocal images. Journal of Microscopy, Vol. 243, No. 1, pp. 60–76. DOI: 10.1111/j.1365-2818.2010.0382.x. [ Links ]

16. Jie, D., Jing-feng, L., Jing-yu, Y. (2010). Combined technologies in analysis of overlapping cells. 2010 International Conference on E-Business and E-Government, IEEE, pp. 1608–1612. DOI: 10.1109/icee.2010.407. [ Links ]

17. Jierong, C., Rajapakse, J. C. (2009). Segmentation of Clustered Nuclei With Shape Markers and Marking Function. Biomedical Engineering, IEEE Transactions on, Vol. 56, No. 3, pp. 741–748. [ Links ]

18. Jung, C., Kim, C. (2010). Segmenting clustered nuclei using H-minima transform-based marker extraction and contour parameterization. Biomedical Engineering, IEEE Transactions on, Vol. 57, No. 10, pp. 2600–2604. [ Links ]

19. Jung, C., Kim, C. (2014). Impact of the accuracy of automatic segmentation of cell nuclei clusters on classification of thyroid follicular lesions. Cytometry Part A, Vol. 85, No. 8, pp. 709–718. DOI: 10.1002/cyto.a.22467. [ Links ]

20. Khodadadi, V., Fatemizadeh, E., Setarehdan, S. K. (2015). Overlapped cells separation algorithm based on morphological system using distance minimums in microscopic images. 2015 22nd Iranian Conference on Biomedical Engineering (ICBME), IEEE, pp. 263–268. [ Links ]

21. Koh, T., Miles, N., Morgan, S., Hayes-Gill, B. (2007). Image segmentation of overlapping particles in automatic size analysis using multi-flash imaging. IEEE Workshop on Applications of Computer Vision (WAC V '07), IEEE. DOI: 10.1109/wacv.2007.37. [ Links ]

22. Koyuncu, C. F., Akhan, E., Ersahin, T., Cetin-Atalay, R., Gunduz-Demir, C. (2016). Iterative h-minima-based marker-controlled watershed for cell nucleus segmentation. Cytometry Part A, Vol. 89, No. 4, pp. 338–349. DOI: 10.1002/cyto.a.22824. [ Links ]

23. Li, G., Liu, T., Nie, J., Guo, L., Chen, J., Zhu, J., Xia, W., mara, A., Holley, S., Wong, S. (2008). Segmentation of touching cell nuclei using gradient flow tracking. Wiley, Vol. 231, No. 1, pp. 47–58. DOI: 10.1111/j.1365-2818.2008.02016.x. [ Links ]

24. Litjens, G., Kooi, T., Bejnordi, B. E., Setio Adiyoso, A. A., Ciompi, F., Ghafoorian, M., van der Laak, J. A., van Ginneken, B., Sánchez, C. I. (2017). A survey on deep learning in medical image analysis. Medical Image Analysis, Vol. 42, pp. 60–88. DOI: 10.1016/j.media.2017.07.005. [ Links ]

25. Loddo, A., Ruberto, C. D., Kocher, M. (2018). Recent advances of malaria parasites detection systems based on mathematical morphology. Sensors, Vol. 18, No. 2, pp. 513. DOI: 10.3390/s18020513. [ Links ]

26. Nasr-Isfahani, S., Mirsafian, A., Masoudi-Nejad, A. (2008). A New Approach for Touching Cells Segmentation. Vol. 1, pp. 816–820. [ Links ]

27. Neves, J. C., Castro, H., Tomás, A., Coimbra, M., Proença, H. (2014). Detection and separation of overlapping cells based on contour concavity for Leishmania images. Cytometry Part A, Vol. 85, No. 6, pp. 491–500. [ Links ]

28. Nguyen, N. T., Duong, A. D., Vu, H. Q. (2011). Cell splitting with high degree of overlapping in peripheral blood smear. International Journal of Computer Theory and Engineering, pp. 473–478. DOI: 10.7763/ijcte.2011.v3.352. [ Links ]

29. Panagiotakis, C., Argyros, A. (2020). Region-based fitting of overlapping ellipses and its application to cells segmentation. Image and Vision Computing, Vol. 93. DOI: 10.1016/j.imavis.2019.09.001. [ Links ]

30. Park, C., Huang, J. Z., Ji, J. X., Ding, Y. (2013). Segmentation, inference and classification of partially overlapping nanoparticles. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, No. 3. DOI: 10.1109/tpami.2012.163. [ Links ]

31. Phoulady, H. A., Goldgof, D. B., Hall, L. O., Mouton, P. R. (2016). A new approach to detect and segment overlapping cells in multi-layer cervical cell volume images. IEEE 13th International Symposium on Biomedical Imaging (ISBI), pp. 201–204. DOI: 10.1109/isbi.2016.7493244. [ Links ]

32. Plissiti, M. E., Nikou, C. (2012). Overlapping cell nuclei segmentation using a spatially adaptive active physical model. Image Processing, IEEE Transactions on, Vol. 21, No. 11, pp. 4568–4580. DOI: 10.1109/tip.2012.2206041. [ Links ]

33. Plissiti, M. E., Vrigkas, M., Nikou, C. (2015). Segmentation of cell clusters in Pap smear images using intensity variation between superpixels. International Conference on Systems, Signals and Image Processing (IWSSIP), IEEE, pp. 184–187. [ Links ]

34. Romero Rondón, M. F., Sanabria Rosas, L. M., Bautista Rozo, L. X., Mendoza Castellanos, A. (2016). Algoritmo para la detección de glóbulos rojos superpuestos en imágenes microscópicas de extendidos de sangre periférica. DYNA, Vol. 83, No. 198, pp. 187–194. DOI: 10.15446/dyna.v83n198.47177. [ Links ]

35. Ronneberger, O., Fischer, P., Brox, T. (2015). U-Net: Convolutional networks for biomedical image segmentation. Lecture Notes in Computer Science, Springer International Publishing, pp. 234–241. DOI: 10.1007/978-3-319-24574-4_28. [ Links ]

36. Savelonas, M., Maroulis, D., Mylona, E. (2009). Segmentation of two-dimensional gel electrophoresis images containing overlapping spots. 9th International Conference on Information Technology and Applications in Biomedicine, pp. 1–4. [ Links ]

37. Schmitt, O., Hasse, M. (2009). Morphological multiscale decomposition of connected regions with emphasis on cell clusters. Computer Vision and Image Understanding, Vol. 113, No. 2, pp. 188–201. DOI: 10.1016/j.cviu.2008.08.011. [ Links ]

38. Song, Y., Tan, E. L., Jiang, X., Cheng, J. Z., Ni, D., Chen, S., Lei, B., Wang, T. (2017). Accurate cervical cell segmentation from overlapping clumps in pap smear images. IEEE Transactions on Medical Imaging, Vol. 36, No. 1, pp. 288–300. DOI: 10.1109/tmi.2016.2606380. [ Links ]

39. Wang, W. (2008). Rock particle image segmentation and systems. Pattern Recognition Techniques, Technology and Applications. I-Tech, Vienna, Austria, pp. 197–226. DOI: 10.5772/6242. [ Links ]

40. Wenzhong, Y., Xiaohui, F. (2010). A watershed based segmentation method for overlapping chromosome images. 2010 Second International Workshop on Education Technology and Computer Science, IEEE. DOI: 10.1109/etcs.2010.107. [ Links ]

41. Xu, W., Sang, N. (2015). Urine sediment overlapped cells segmentation based on hough transform and geometrical feature. International Symposium on Bioelectronics and Bioinformatics (ISBB), IEEE, pp. 211–214. [ Links ]

42. Yang, H., Ahuja, N. (2014). Automatic segmentation of granular objects in images: Combining local density clustering and gradient-barrier watershed. Pattern Recognition, Vol. 47, No. 6, pp. 2266–2279. DOI: 10.1016/j.patcog.2013.11.004. [ Links ]

43. Zafari, S., Eerola, T., Sampo, J., Kälviäinen, H., Haario, H. (2015). Segmentation of partially overlapping nanoparticles using concave points. Advances in Visual Computing, Springer International Publishing, pp. 187–197. DOI: 10.1007/978-3-319-27857-5_17. [ Links ]

44. Zhang, Q., Wang, J., Liu, Z., Zhang, D. (2020). A structure-aware splitting framework for separating cell clumps in biomedical images. Signal Processing, Vol. 168. DOI: 10.1016/j.sigpro.2019.107331. [ Links ]

45. Zhang, W. H., Jiang, X., Liu, Y. M. (2012). A method for recognizing overlapping elliptical bubbles in bubble image. Pattern Recognition Letters, Vol. 33, No. 12, pp. 1543–1548. DOI: 10.1016/j.patrec.2012.03.027. [ Links ]

Received: January 14, 2021; Accepted: March 16, 2022

^* Corresponding author: Lariza M. Portuondo-Mallet, e-mail: lportuondo@uo.edu.cu

This is an open-access article distributed under the terms of the Creative Commons Attribution License