Skip to main content

A fast forward 3D connection algorithm for mitochondria and synapse segmentations from serial EM images

Abstract

Background

It is becoming increasingly clear that the quantification of mitochondria and synapses is of great significance to understand the function of biological nervous systems. Electron microscopy (EM), with the necessary resolution in three directions, is the only available imaging method to look closely into these issues. Therefore, estimating the number of mitochondria and synapses from the serial EM images is coming into prominence. Since previous studies have achieved preferable 2D segmentation performance, it holds great promise to obtain the 3D connection relationship from the 2D segmentation results.

Results

In this paper, we improve upon Matlab’s function bwconncomp and propose a fast forward 3D connection algorithm for mitochondria and synapse segmentations from serial EM images. To benchmark the performance of the proposed method, two EM datasets with the annotated ground truth are produced for mitochondria and synapses, respectively. Experimental results show that the proposed method can achieve the preferable connection performance that closely matches the ground truth. Moreover, it greatly reduces the computational burden and alleviates the memory requirements compared with the function bwconncomp.

Conclusions

The proposed method can be deemed as an effective strategy to obtain the 3D connection relationship from serial mitochondria and synapse segmentations. It is helpful to accurately and quickly quantify the statistics of the numbers, volumes, surface areas, and lengths, which will greatly facilitate the data analysis of neurobiology research.

Peer Review reports

Background

Recently, due to the rapid development of neuroscience, considerable attention has been paid for the statistics of the numbers, volumes, surface areas, and lengths of sub-cellular structures in the brain, allowing neuroscientists to compare these objects in healthy animals and those with degenerative brain diseases [1]. It acts as an important role in studying the sub-cellular structures and their implied behaviors of nervous systems [2]. Among these structures, mitochondria and synapses are of particular interest to neuroscience [3]. Indeed, mitochondria, known as the powerhouse of the cell, have been proven to carry out all types of important cellular functions by producing the overwhelming majority of cellular adenosine triphosphate (ATP) [4]. Meanwhile, they also take substantial responsibility for the regulation of cellular life and death, as well as disease states. For example, mitochondrial dysfunction has been directly linked to the aging process, which is the largest single risk factor for Alzheimer Disease [5, 6]. In addition, synapses, known as the information transmitters, permit a neuron to pass an electrical or chemical signal to another neuron in the mammalian nervous system. Mounting evidence also indicates that synaptic plasticity has a close link with learning and memory. To be specific, sensory experience, motor learning and aging are found to induce alterations in presynaptic axon boutons and postsynaptic dendritic spines [7, 8]. Consequently, the quantification of mitochondria and synapses is of great significance to the prevention and treatment of brain diseases.

The above researches provide the motivation for looking closely into these issues in a nervous system, and image analysis techniques as an important approach are widely adopted. For image acquisition, electron microscopy (EM), with sufficiently high resolution on the nanoscale, can provide not only the details of intra-cellular structures but also the synapses and gap junctions. In particular, focused ion beam scanning electron microscopy (FIB-SEM) [9] can provide an isotropic resolution up to 5 nm in three directions, and automated tape-collecting ultramicrotome scanning electron microscopy (ATUM-SEM) [10] can offer an anisotropic voxel (4 nm× 4 nm× 30 nm) with a low resolution in the z direction. However, the FIB-SEM technique is limited to a small volume while the ATUM-SEM technique does not suffer from the limitation and can be applied to a large volume for the statistics and analysis [11]. To extract the invaluable structural information (mitochondria and synapses) from the serial EM images, substantial effort has been recently put into developing specialized algorithms for accurate segmentation of mitochondria and synapses. For some representative results, Lucchi et al. [12] clustered groups of similar voxels into regularly spaced supervoxels and incorporated mitochondrial shape features to an automated graph partitioning scheme for segmentation. On this base, they [13] introduced context-based features and modelled mitochondrial membranes for improvement. Staffler et al. [14] presented a method for automated detection of synapses, which focused on classifying borders between neuronal processes as synaptic or non-synaptic. Neila et al. [3] proposed an automated approach for both mitochondria and synapses that involved anisotropy-aware regularization via conditional random field inference and surface smoothing techniques to improve the segmentation and visualization. Moreover, due to the powerful representation capability of deep neural network, Xiao et al. [15] put forward a fusion fully convolutional network for mitochondrial segmentation and a fully connected conditional random field to optimize the segmentation results. Santurkar et al. [16] took a compositional approach to segment synapses by training lighter networks to model the simpler marginal distributions of membranes, clefts and vesicles.

Although above approaches have demonstrated preferable segmentation performance, it seems that the connection mode from serial binary segmentation results has not received enough attention in the anisotropic case. A recent research, Neila et al. [3] adopted an approximate, simple solution under the assumption that each connected component of the segmentation is one structure, and they also pointed out that estimating the number of structures from the segmentation results is still an open problem with plenty of ongoing research. Consequently, our aim in this paper is to develop a fast and effective 3D connection algorithm based on these segmentation results. The main contributions can be roughly grouped in two different directions.

  • Methods: Since the shapes of mitochondria and synapses have great differences in EM images, we propose a new similarity indicator that takes the shape into consideration. It could accurately measure the probability that the segmentation results belong to the same 3D mitochondrion or 3D synapse. In addition, we propose a forward connection mode that can effectively handle the problems of split and merge. This connection mode can be generalized to serial detection results, but not limited to serial segmentation results.

  • Data: We benchmark the performance of the proposed method against a previously unreleased corpus of manually annotated data. The corpus consists of two EM datasets acquired by the ATUM-SEM technique and the ground truth of mitochondria and synapses are manually annotated by 2-3 independent labelers, respectively. Both the EM datasets and manual annotations are released to the community providing a valuable tool for benchmarking.

The subsequent sections of this study present the detailed information about the datasets, the proposed connection algorithm, the experimental results, the meaningful discussions and conclusions.

Materials

In this paper, the biological specimens were selected from mouse cortex and the ATUM-SEM technique was adopted for image acquisition. The datasets were collected from a water bath using a custom designed tape-collection conveyor belt in the Institute of Neuroscience, Chinese Academy of Sciences, where several slices with thicknesses of more or less 50 nm were cut automatically. Next, these sections were imaged through SEM (Zeiss Supra55) in the Institute of Automation, Chinese Academy of Sciences, where the pixel size was set at 2 nm and the dwell time was set at 2 μs. Since the datasets acquired by the ATUM-SEM technique were unregistered, the image registration method applied in [17] was adopted in this paper. After registration, two ATUM-SEM datasets are used to construct the corresponding databases for mitochondria and synapses, respectively.

Mitochondria dataset

The mitochondria dataset consists of 31 slices with a resolution of 2×2×50nm3/voxel, and each slice has a size of 8416×8624 [18]. The ground truth were prepared via the hand segmentation outlining the mitochondrial membrane by two labelers with cross validation. A total of 473 mitochondria including the incomplete ones were annotated with the plugin TrakEm2 in software ImageJ [19]. Figure 1a-b present the acquired images and annotated mitochondria in the adjacent slices, where different mitochondria are represented by different colors. To accelerate the neurobiology research, we share the mitochondria dataset and the annotated ground truth publicly available in the websiteFootnote 1.

Fig. 1
figure 1

a and b: adjacent ATUM-SEM images with the mitochondria annotated by different colors; c and d: adjacent ATUM-SEM images with the annotated synapses

Synapse dataset

Since the synapses are more sparsely distributed than the mitochondria in the biological tissue, we need a larger volume for statistics and analysis. Here, the synapse dataset consists of 178 slices with the resolution 2×2×50nm3/voxel and size 8576×7616 [15]. The ground truth were prepared via the hand segmentation outlining the synaptic junctions by three labelers with cross validation. A total of 1230 synapses were annotated by different colors. Figure 1c-d present the acquired images and annotated synapses in the adjacent slices. Also, the synapse dataset and the annotated ground truth are provided publicly available in the websiteFootnote 2.

It is worthwhile to emphasize that creating such two databases requires a considerable amount of human effort, and it is a considerably time-consuming process which also justifies that previous endeavors on computerized segmentation are of great significance for the neurobiology research.

Methods

As mentioned above, the EM images with high resolution will inevitably produce large data even at small neural circuit. From a practical point of view, it is time consuming and memory consuming to directly measure the similarity of segmentations in the adjacent slices. Therefore, we propose a fast coarse-to-fine connection algorithm instead. The main motivation is given by the following axiom.

Axiom 1: Let s1 and s2 be the segmentations, and d1 and d2 be the regions satisfying s1d1 and s2d2. Then if \(d_{1}\bigcap d_{2}=\emptyset \), \(s_{1}\bigcap s_{2}=\emptyset \).

This axiom indicates that we can use the bounding boxes containing the segmentations for screening to reduce the computation cost. The proposed connection algorithm is divided into four steps: coarse screening, validation, fine connection and skip connection. The detailed procedures are summarized in the following subsections.

Coarse screening

On basis of the 2D segmentation results, we first obtain the corresponding bounding boxes by the Matlab’s function regionprops. Assume that there are n slices and each slice has ki segmentations i=1,2,,n, and the pth segmentation in the ith slice is denoted by matrix \(s^{i}_{p}\) and its bounding box is denoted by coordinate vector \(X^{i}_{p}\), the coarse similarity \(c_{pq}^{i}\) of segmentations \(s^{i}_{p}\) and \(s^{i+1}_{q}\) is measured by the Intersection-over-Union (IoU) of bounding boxes \(X^{i}_{p}\) and \(X^{i + 1}_{q}\):

$$ c_{pq}^{i}:=\text{IoU}\left(X^{i}_{p},X^{i+1}_{q}\right)=\frac{A\left({X^{i}_{p}}\cap X^{i+1}_{q}\right)}{A\left({X^{i}_{p}}\cup X^{i + 1}_{q}\right)}. $$
(1)

Here, \(A\left ({X^{i}_{p}}\cap X^{i+1}_{q}\right)\) and \(A\left ({X^{i}_{p}}\cup X^{i+1}_{q}\right)\) denote the areas of the intersection and union of \(X^{i}_{p}\) and \(X^{i + 1}_{q}\), respectively. Since the intersection is empty with high probability when \(X^{i}_{p}\) and \(X^{i+1}_{q}\) belong to the different 3D structures, it is clear that \(c_{pq}^{i},~p=1,\cdots,k_{i},~q=1,\cdots,k_{i+1}\) are almost zeros. Therefore, we can use a sparse matrix Ci to denote the coarse connection relation between the ith slice and the i+1th slice as follows:

$$ C^{i} = \left({\begin{array}{cccc} {c_{11}^{i}}&{c_{12}^{i}}& \cdots &{c_{1k_{i+1}}^{i}}\\ {c_{21}^{i}}&{c_{22}^{i}}& \cdots &{c_{2k_{i+1}}^{i}}\\ \vdots & \vdots & \ddots & \vdots \\ {c_{k_{i}1}^{i}}&{c_{k_{i}2}^{i}}& \cdots &{c_{k_{i}k_{i+1}}^{i}} \end{array}}\right),~ i=1,2,\cdots,n-1. $$
(2)

According to axiom 1, the position information provided by the bounding box is only a necessary condition, it may produce superfluous connections. As illustrated in Fig. 2c, the connection matrix cannot accurately reflect the connection relationship between the segmentations in Fig. 2a-b. Note that a larger similarity \(c_{pq}^{i}\) means a higher probability that the segmentations \(s^{i}_{p}\) and \(s^{i+1}_{q}\) belong to the same 3D structure. Two thresholds 0≤TlTh≤1 are adopted to judge whether the \(s^{i}_{p}\) and \(s^{i+1}_{q}\) are connected. Three cases are listed as below:

  1. (1)

    If \(c_{pq}^{i}\in [T_{h},1]\), there exists a connection between the \(s^{i}_{p}\) and \(s^{i+1}_{q}\);

    Fig. 2
    figure 2

    A schematic diagram of coarse-to-fine similarity measure. a and b present the original images in the adjacent slices, where the segmentation results are outlined by red curves and the corresponding bounding boxes are depicted by green boxes. c and d present the coarse connection matrix and the updated matrix

  2. (2)

    If \(c_{pq}^{i}\in [0, T_{l})\), there exists no connection between the \(s^{i}_{p}\) and \(s^{i+1}_{q}\). Set \(c_{pq}^{i}=0\);

  3. (3)

    If \(c_{pq}^{i}\in [T_{l}, T_{h})\), there may be a connection between the \(s^{i}_{p}\) and \(s^{i+1}_{q}\), which needs to be validated as marked in red in Fig. 2c (Tl=0.01 and Th=0.4 are used).

Validation

In this subsection, we utilize the segmentation information to validate the similarities in case (3). According to the coordinate vectors \(X^{i}_{p}\) and \(X^{i + 1}_{q}\), we can find a minimum image domain I that contains the corresponding segmentations, namely, two binary images denoted by \(I^{i}_{p}\) and \(I^{i + 1}_{q}\), respectively. On this base, we update the similarity \(c_{pq}^{i}\) of the segmentations \(s^{i}_{p}\) and \(s^{i+1}_{q}\) by considering the invariable position term \(P\left (s^{i}_{p},s^{i + 1}_{q}\right)\) and the variational shape term \(S\left (s^{i}_{p},s^{i + 1}_{q}\right)\):

$$ c_{pq}^{i} = \frac{P^{2}\left(s^{i}_{p},s^{i + 1}_{q}\right)+\lambda\cdot S^{2}\left(s^{i}_{p},s^{i + 1}_{q}\right)} {1+\lambda}. $$
(3)

Here, λ≥0 is a regularization parameter for balance. \(P\left (s^{i}_{p},s^{i + 1}_{q}\right)\) characterizes the invariance of segmentations \(s^{i}_{p}\) and \(s^{i+1}_{q}\), and is defined by the IoU of the corresponding binary images \(I^{i}_{p}\) and \(I^{i + 1}_{q}\):

$$ P\left(s^{i}_{p},s^{i + 1}_{q}\right)=\text{IoU}\left(I^{i}_{p},I^{i+1}_{q}\right). $$
(4)

In contrast, \(S\left (s^{i}_{p},s^{i + 1}_{q}\right)\) characterizes the variability of segmentations \(s^{i}_{p}\) and \(s^{i+1}_{q}\). Assume that the essential transformation \(h: s^{i}_{p}\longrightarrow s^{i + 1}_{q}\) has the form h=hαhβ, hα(x)=αx (scaling), and hβ(x)=x+β (translation) [20]. Then given a set of H transformations \(\mathcal {H}=\{h_{1}, h_{2}, \cdot \cdot \cdot, h_{H}\}\), \(S\left (s^{i}_{p},s^{i + 1}_{q}\right)\) is defined by:

$$\begin{array}{*{20}l} S\left(s^{i}_{p},s^{i + 1}_{q}\right)=\max\left\{\left.P\left(h_{k}\left(s^{i}_{p}\right),s^{i + 1}_{q}\right)\right|k=1,2,\cdots,H\right\}. \end{array} $$
(5)

After updating all the similarities in case (3), we can obtain new connection matrices Ci,i=1,2,,n−1. For example, the similarities marked in red in Fig. 2c are validated and the updated matrix is shown in Fig. 2d. It is clear that the updated matrix effectively eliminates the false connections. Then, another threshold Ts[0,Th) is adopted to determine the fine connection matrices:

$$ B^{i}=C^{i}>T_{s}, i=1,2,\cdots,n-1. $$
(6)

Fine connection

For each binary connection matrices Bi,i=1,2,,n−1, the sum of the pth row \(R_{p}^{i}~(p=1,2,\cdots, k_{i})\) implies that \(s_{p}^{i}\) connects with \(R_{p}^{i}\) segmentations in the i+1th slice, and the sum of the qth column \(N_{q}^{i+1}~\left (q=1,2,\cdots, k_{i+1}\right)\) implies that \(N_{q}^{i+1}\) segmentations in the ith slice connects with \(s_{q}^{i+1}\). Based on this fact, we propose a forward connection mode instead of the iterative bidirectional connection mode in [21]. The details are divided into three steps.

Firstly, we assign several categories to each segmentation according to the \(R_{p}^{i}\) and \(N_{q}^{i+1}\) as illustrated in Fig. 3a. These categories include One-to-one (O), Start (S), End (E), Split1 (S1), Merge1 (M1), Split2 (S2) and Merge2 (M2). For clarity of presentation, we provide a simplified matrix of Bi in Table 1. The five general connection cases are listed as follows: (1) since \(R_{1}^{i}=N_{1}^{i+1}=1\), we assign O to the first segmentation in the ith slice; (2) since \(R_{2}^{i}=0\), we assign E to the second segmentation in the ith slice; (3) since \(N_{4}^{i+1}=0\), we assign S to the fourth segmentation in the i+1th slice; (4) since \(R_{3}^{i}=2\) and \(N_{2}^{i+1}=N_{3}^{i+1}=1\), we assign S1 to the third segmentation in the ith slice and S2 to the second and the third segmentations in the i+1th slice; (5) since \(R_{4}^{i}=R_{5}^{i}=1\) and \(N_{5}^{i+1}=2\), we assign M1 to the fourth and the fifth segmentations in the ith slice and M2 to the fifth segmentation in the i+1th slice. Moreover, S is assigned to each segmentation in the first slice and E is assigned to each segmentation in the final slice. It should be noted that each segmentation may have two or more categories as shown in Fig. 3a.

Fig. 3
figure 3

A schematic diagram of the proposed connection mode. a assigning several categories to each segmentation; b assigning an initial label to each segmentation; c reassigning the same label to the segmentations in case of split and merge

Table 1 A simplified matrix of Bi with five general connection cases

Secondly, we assign an initial label to each segmentation according to the categories as illustrated in Fig. 3b. Specifically, we first denote the segmentation sets with category O, with category S, with category E, with category S1, with category M1, with category S2 and with category M2 in the ith slice as Oi, Si, Ei, \(S_{1}^{i}\), \(M_{1}^{i}\), \(S_{2}^{i}\) and \(M_{2}^{i}\), respectively. Then we begin with each segmentation \(s\in S^{i}\bigcup S_{2}^{i}\bigcup M_{2}^{i}, i=1,2,\cdots n,\) and assign a unique label j to it. If sOi, find the connected segmentation s1 in the i+1 slice by Bi and assign the same label j to s1. Set s=s1,i=i+1, and repeat above steps until sOi.

Thirdly, we reassign the same label to the segmentations in case of split and merge as illustrated in Fig. 3c. Specifically, for each segmentation \(s_{1} \in M_{1}^{i} \bigcup S_{1}^{i}, i=1,2,\cdots n-1\), we first obtain the label j1 of s1 and the label j2 of the connected segmentation \(s_{2}\in M_{2}^{i+1} \bigcup S_{2}^{i+1}\) in the i+1 slice by Bi. Then, we reassign the label min{j1,j2} to these segmentations with labels j1 and j2.

Remark 1

By this connection mode, we assign different labels to these segmentations. When λ=0, \(\phantom {\dot {i}\!}T_{s}=T_{d_{2}}=0\), the coarse-to-fine connection method has the same performance as the Matlab’s function bwconncomp, which judges whether the segmentation results are connected by the specified connectivity for the connected components.

Skip connection

The matrices Bi,i=1,2,,n−1, only characterize the connection relationship in the adjacent slices. However, it is usually hard to prevent wrinkle and damage from sample preparation and imaging in practice. Additionally, a minority of objects are difficult to be identified because they sometimes do not exhibit their typical characteristics on a certain slice. Therefore, the connection relationship in the skipped slices should also be considered. Based on the above considerations, we first calculate the coarse connection matrices as:

$$ C^{i2} = \left({\begin{array}{cccc} {c_{11}^{i2}}&{c_{12}^{i2}}& \cdots &{c_{1k_{i+2}}^{i2}}\\ {c_{21}^{i2}}&{c_{22}^{i2}}& \cdots &{c_{2k_{i+2}}^{i2}}\\ \vdots & \vdots & \ddots & \vdots \\ {c_{k_{i}1}^{i2}}&{c_{k_{i}2}^{i2}}& \cdots &{c_{k_{i}k_{i+2}}^{i2}} \end{array}} \right), i=1,2,\cdots,n-2, $$
(7)

where the \(c_{pq}^{i2}, p=1,2,\cdots,k_{i},~q=1,2,\cdots,k_{i+2}\) is the coarse similarity of \(s^{i}_{p}\) and \(s^{i+2}_{q}\). Then, we focus on each pair of segmentations \(s^{i}_{p}\in E^{i}\) and \(s^{i+2}_{q}\in S^{i+2}\). If \(c_{pq}^{i2}>0\), update the \(c_{pq}^{i2}\) by formula (3) and judge the connectivity by threshold Ts. If \(c_{pq}^{i2}>T_{s}\), reassign a new label min{j1,j2} to the segmentations with labels j1 and j2, where j1 and j2 are the labels of \(s^{i}_{p}\) and \(s^{i+2}_{q}\), respectively.

The proposed method is sketched in the following Algorithm 1. By using the proposed algorithm, we divide the whole segmentations into several disjoint sets, which satisfies that the segmentations in the same set belong to the same 3D object while these in the different sets belong to different 3D objects.

Results

In this section, we conduct several experiments to evaluate the performance of the proposed algorithm on the above mentioned datasets. The connection capacity is measured by two fundamental performance indicators, split error and merge error [22]. Here, the split error means that a true 3D object is regarded as several 3D objects, which often occurs when large objects sometimes split into two or more connected components. In contrast, the merge error means that several true 3D objects are regarded as a whole 3D object and it will occur when a group of structures close to each other often merge in a single connected component [3].

Performance comparison

The parameters in the proposed algorithm have a huge impact on the final connection performance. For example, a larger Tl tends to produce split errors and a smaller Th tends to produce merge errors. To guarantee a better connection performance, a relatively small threshold Tl=0.01 and a relatively large threshold Th=0.4 are chosen for coarse screening although it will take more time for verification. The choice of parameter λ depends on the size of the segmentation results. In the mitochondria experiments, since the diameter of mitochondria is commonly between 0.75 and 3 μm [23], the segmentation results that belong to the same 3D mitochondrion usually have large overlap in the adjacent slices. Then λ is set from 0 to 1 with step size 0.1 to satisfy that the position term in (3) is the dominant contribution for the similarity measure. In the synapse experiments, the mean cleft width of wild-type synapses is 22±0.5 nm between the pre- and postsynaptic neurons [24, 25]. Due to the offsets and differences in the adjacent slices, the segmentation results that belong to the same 3D synapse usually have small or even no overlaps. Then λ is set from 0 to 10 with step size 1 to satisfy that the shape term in (3) is the dominant contribution. Meanwhile, we adjust the threshold Ts from 0 to 0.1 with step size 0.01. The number of split errors, the number of merge errors and the number of total errors of the proposed method at varying thresholds λ and Ts are illustrated in Fig. 4a-b, c-d and e-f, respectively, where the results of the function bwconncomp are also presented as baseline for comparison. Several useful conclusions can be drawn from Fig. 4.

Fig. 4
figure 4

a and b the number of split errors of the function bwconncomp and the proposed method at varying thresholds λ and Ts; c and d the number of merge errors; e and f the number of total errors

Firstly, the step ‘skip connection’ in the proposed method is capable of reducing the number of split errors. As mentioned in Remark 1, the function bwconncomp is a special case of the proposed method. Specifically, adding the step ‘skip connection’ to the function bwconncomp has the same performance as the proposed method at λ=0 and Ts=0. As shown in Fig. 4a-b, the step ‘skip connection’ can greatly reduce the split errors compared with the function bwconncomp. That is because the function bwconncomp only finds the connected components in the adjacent slices and splits a 3D object into two connected components when the information is missing. In contrast, our approach avoids this shortcoming by the step ‘skip connection’.

Secondly, the suitable parametersλ andTs play an important role in the final connection performance. The regularization parameter λ is used for controlling the significance of the shape term in (3). Usually, a large λ is prone to produce merge errors while a small λ usually leads to split errors. The suitable choice of λ plays an important role in the balance. As shown in Fig. 4a-b, the choice λ>0 significantly reduces the number of split errors compared with λ=0 (function bwconncomp), especially in the synapse dataset. It demonstrates the superiority of the proposed similarity indicator. In addition, as another threshold Ts for determining the fine connection, a large Ts tends to produce split errors while a small Ts tends to produce merge errors. Similarly, compared with Ts=0 (function bwconncomp) in Fig. 4c-d, the choice Ts>0 greatly reduces the number of merge errors. Taking these factors into consideration, the suggested parameters are λ(0.4,0.6),Ts(0.02,0.03) for the mitochondria dataset and λ(1,3),Ts(0.03,0.05) for the synapse dataset, respectively as shown in Fig. 4e-f.

Thirdly, the proposed method achieves the near-human performance in obtaining the 3D connection relationship. Given the suggested parameters, there are only three split errors on the mitochondria dataset, and the optimal case without split error and merge error is achieved on the synapse dataset. Note that the proposed method achieves the optimal performance for a wide range of thresholds in Fig. 4e-f. The robustness is demonstrated.

To have a visual presentation, we provide the specific connection results of the proposed method on the mitochondria dataset and the synapse dataset in Fig. 5, where these segmentations that belong to the same 3D object are described by the same colors. It is clear that the proposed connection algorithm can effectively handle the problems of split and merge (Fig. 5).

Fig. 5
figure 5

a and b the specific connection results of the proposed method

Running time comparison

As mentioned in the previous subsection, the choices of Tl and Th are not only related to the connection performance but also the time consumption. In this subsection, we first preset the optimal parameters Ts=0.03, λ=0.5 for the mitochondria experiments and Ts=0.03, λ=2 for the synapse experiments. Then, the threshold Tl from 0 to 0.1 with step size 0.01 and threshold Th from 0.2 to 0.4 with step size 0.02 are adjusted to estimate the time consumption. Figure 6a-b present the time consumption of the function bwconncomp and the proposed method at varying thresholds for comparison. Meanwhile, the corresponding connection performance is also provided for referencing in Fig. 6c-d. Several useful conclusions can be drawn from Fig. 6.

Fig. 6
figure 6

a and b the time consumption of the function bwconncomp and the proposed method at varying thresholds; c and d the corresponding connection performance

Firstly, the suitable parametersTh andTl reduce the time consumption as well as keep the preferable performance. From Fig. 6, it is obvious that a larger Th and a smaller Tl will take more time for computation on the both datasets. Note that the number of errors does not decrease when Th≥0.34,Tl≤0.01 for the mitochondria dataset, and Th≥0.2,Tl≤0.07 for the synapse dataset, the suggested parameter interval are Th(0.34,0.4),Tl(0,0.01) and Th(0.2,0.3),Tl(0,0.07), respectively. For a detailed representation, the time consumption of the function bwconncomp and the proposed method at several suggested parameters are provided in Table 2.

Table 2 The time consumption of the function bwconncomp and the proposed method at several suggested parameters

Secondly, the proposed method is more computationally efficient than the functionbwconncomp. From Table 2, we can see that the proposed method only needs an average of 300s to obtain the connection results while the function bwconncomp needs 398s on the mitochondria dataset and 1815s on the synapse dataset. It is because the size of synapse dataset is 6 times larger than that of mitochondria dataset and the function bwconncomp obtains the connected components by handling with each pixel. However, the proposed method uses the thresholds Tl and Th for coarse screening and only computes the region of interest instead of the whole image, which reduces the computation cost.

Memory comparison

In this subsection, we present the memory requirement of the function bwconncomp and the proposed method in Table 3. It can be seen that the memory requirement of the proposed method is less than 1/6 of the function bwconncomp on the mitochondria dataset and approximately 1/10 of the function bwconncomp on the synapse dataset, respectively. It is mainly because the input of function bwconncomp must be the total serial segmentation results, i.e., the memory requirement is closely related to the data size. In contrast, the proposed method only needs to read two images repeatedly for calculating the connection matrices. The memory requirement is determined by these sparse connection matrices, which is independent of the data size. It indicates that the proposed method does not suffer from the common problem “Out of Memory” caused by large dataset.

Table 3 The memory requirement of the function bwconncomp and the proposed method

Information sharing

All codes are written in Matlab Version R2016b (Math Works, Inc.). The mitochondria experiments are performed on a personal computer with an i7-4790 MQ 3.60 GHz Intel processor, 32 GB RAM and Windows operating system. Since the error “Out of Memory” happens when the function bwconncomp is tested on the synapse dataset, the synapse experiments are performed on a public server with an i7-4820 MQ 2.00 GHz Intel processor, 512 GB RAM and Windows operating system. The codes are available onlineFootnote 3.

Discussion

This present research is primarily motivated by the need to accurately obtain the statistics of mitochondria and synapses from serial EM images, which helps the neuroscientists to quickly quantify these objects in healthy and diseased animals. Since previous researches have achieved preferable performance on 2D segmentations [3, 15, 16, 18, 26], this paper has proposed a fast and effective method to obtain the 3D connection relation. To validate the effectiveness of the proposed method, we produce two ATUM-SEM datasets with the annotated ground truth for mitochondria and synapses, respectively. Experimental results show the superiority of the proposed method on the connection performance, running time and memory requirement over the function bwconncomp.

Because the proposed method only depends on the shapes and distribution of the objects, it indicates that the proposed method can also achieve a robust 3D connection performance for other subcellular structures, such as the endoplasmic reticulum, Golgi apparatus, and microtubules. Because all these structures are sparsely distributed in the EM images, relatively smaller parameters Tl=0.01,Th=0.2 can be adopted for reducing the time consumption. Meanwhile, the shapes of endoplasmic reticulum and microtubules are narrow as the synapses, and the shapes of Golgi apparatus are elliptical as the mitochondria. The suggested parameters can be λ=2,Ts=0.03 for the endoplasmic reticulum and microtubules, and λ=0.5,Ts=0.02 for the Golgi apparatus.

Despite the promising results of the proposed method, several problems still deserve further research. The most important concern is that the connection performance usually relies heavily on the segmentation results. However, previous segmentation algorithms [3, 15, 16, 18, 26] almost focus on the lower pixel-wise prediction error (pixel error). Unfortunately, pixel error considers only whether or not a given pixel is correctly classified as the object, without concern for the ultimate effect on the connection performance. For example, expanding, shrinking or translating the object between two slices would not cause splits or mergers, but incur a large pixel error. Further, while a gap of even a single pixel in the object between two slices would cause a merge error, it might only incur a very small pixel error [27]. Thus, further research on segmentation algorithms should take other indicators such as the rand error [28], warping error [29] into consideration. In addition, a good connection method is expected to be more robust to different segmentation results. The generalization performance of the proposed method should be further validated on the results obtained by state-of-the-art segmentation algorithms. Future research will focus on using the 3D connection relation for optimizing the local misleading segmentation. As another concern, the effectiveness of the proposed method may owe to the characteristic that both the mitochondria and synapses are sparsely distributed in EM images. More future investigations along the present line will validate the generalization performance of the proposed method on the dense neuron segmentations. In addition, since it will yield more split errors and merge errors when the number of segments is large, some normalized benchmarks like “rand index” will be added for the split-merge error analysis.

Conclusion

To obtain the 3D connection relationship from serial mitochondria and synapse segmentations, this paper proposes a fast forward 3D connection algorithm, which can be deemed as a generalization of existing Matlab’s function bwconncomp. The proposed method can achieve the connection performance that matches the ground truth closely. Meanwhile, it can significantly reduce the computational burden and alleviate the memory requirements compared with the function bwconncomp. It means that our approach can help neuroscientists to accurately and quickly obtain the meaningful statistics of mitochondria and synapses, which will greatly facilitate the data analysis of neurobiology research. To our knowledge, such method is the first work in this topic.

Notes

  1. http://95.163.198.142/MiRA/mitochondria31/

  2. http://95.163.198.142/MiRA/synapse178/

  3. https://github.com/WeifuLi/A-Fast-Forward-Connection-Algorithm

Abbreviations

ATP:

Adenosine triphosphate

ATUM-SEM:

Automated tape-collecting ultramicrotome scanning electron microscopy

EM:

Electron microscopy

FIB-SEM:

Focused ion beam scanning electron microscopy

IoU:

Intersection-over-Union

References

  1. Yang Y, Liu D, Huang W, Deng J, Sun Y, Zuo Y, Poo MM. Selective synaptic remodeling of amygdalocortical connections associated with fear memory. Nat Neurosci. 2016; 19(10):1348–55.

    Article  CAS  Google Scholar 

  2. Liu T, Jones C, Seyedhosseini M, Tasdizen T. A modular hierarchical approach to 3d electron microscopy image segmentation. J Neurosci Methods. 2014; 226(8):88–102.

    Article  Google Scholar 

  3. Márquez NP, Baumela L, González-Soriano J, Rodríguez JR, Defelipe J, Merchán-Pérez A. A fast method for the segmentation of synaptic junctions and mitochondria in serial electron microscopic images of the brain. Neuroinformatics. 2016; 14(2):1–16.

    Google Scholar 

  4. Mcbride HM, Neuspiel M, Wasiak S. Mitochondria: More than just a powerhouse. Curr Biol. 2006; 16(14):551–60.

    Article  Google Scholar 

  5. Roychaudhuri R, Yang M, Hoshi MM, Teplow DB. Amyloid β-protein assembly and alzheimer disease. J Biol Chem. 2009; 284(8):4749–53.

    Article  CAS  Google Scholar 

  6. Floudas CS, Um N, Kamboh MI, Barmada MM, Visweswaran S. Identifying genetic interactions associated with late-onset alzheimer’s disease. BioData Min. 2014; 7(1):35.

    Article  Google Scholar 

  7. Hofer SB, Mrsicflogel TD, Bonhoeffer T, Hubener M. Experience leaves a lasting structural trace in cortical circuits. Nature. 2009; 457(7227):313–7.

    Article  CAS  Google Scholar 

  8. Xie Q, Chen X, Deng H, Liu D, Sun Y, Zhou X, Yang Y, Han H. An automated pipeline for bouton, spine, and synapse detection of in vivo two-photon images. Biodata Min. 2017; 10(1):40.

    Article  Google Scholar 

  9. Knott G, Marchman H, Wall D, Lich B. Serial section scanning electron microscopy of adult brain tissue using focused ion beam milling. J Neurosci. 2008; 28(12):2959–64.

    Article  CAS  Google Scholar 

  10. Hayworth KJ, Kasthuri N, Schalek R. Automating the collection of ultrathin serial sections for large volume tem reconstructions. Microsc Microanal. 2006; 12:86–7.

    Article  Google Scholar 

  11. Briggman KL, Bock DD. Volume electron microscopy for neuronal circuit reconstruction. Curr Opin Neurobiol. 2012; 22(1):154–61.

    Article  CAS  Google Scholar 

  12. Lucchi A, Smith K, Achanta R, Knott G, Fua P. Supervoxel-based segmentation of mitochondria in em image stacks with learned shape features. IEEE Trans Med Imaging. 2012; 31(2):474–86.

    Article  Google Scholar 

  13. Lucchi A, Becker C, Neila PM, Fua P. Exploiting enclosing membranes and contextual cues for mitochondria segmentation. In: Medical Image Computing and Computer Assisted Intervention. Cambridge: Springer: 2014. p. 65–72.

    Google Scholar 

  14. Staffler B, Berning M, Boergens KM, Gour A, Smagt PVD, Helmstaedter M. Synem, automated synapse detection for connectomics. Elife. 2017; 6:26414.

    Article  Google Scholar 

  15. Xiao C, Li W, Chen X, Han H, Q X. An effective fully deep convolutional neural network for mitochondria segmentation based on atum-sem. In: SPIE Medical Imaging. Orlando: SPIE: 2018. p. 105740.

    Google Scholar 

  16. Santurkar S, Budden D, Matveev A, Berlin H, Saribekyan H, Meirovitch Y, Shavit N. Toward streaming synapse detection with compositional convnets. arXiv preprint arXiv:1702.07386. 2017.

  17. Li X, Ji G, Chen X, Ding W, Sun L, Xu W, Han H, Sun F. Large scale three-dimensional reconstruction of an entire caenorhabditis elegans larva using autocuts-sem. J Struct Biol. 2017; 20(2):87–96.

    Article  Google Scholar 

  18. Li W, Deng H, Rao Q, Xie Q, Chen X, Han H. An automated pipeline for mitochondrial segmentation on atum-sem stacks. J Bioinforma Comput Biol. 2017; 15(3):1750015.

    Article  CAS  Google Scholar 

  19. Schmid B, Schindelin J, Cardona A, Longair M, Heisenberg M. A high-level 3d visualization api for java and imagej. BMC Bioinforma. 2010; 11:1–7.

    Article  Google Scholar 

  20. Li H, Wei Y, Li L, Chen CLP. Hierarchical feature extraction with local neural response for image recognition. IEEE Trans Cybern. 2013; 43(2):412–24.

    Article  Google Scholar 

  21. Hu J, Xiao C, Shen L, Xie Q, Chen X, Han H. Automatical detecting and connecting the mitochondria from the serial em images. In: IEEE International Conference on Mechatronics and Automation. Takamatsu: IEEE: 2017. p. 1632–7.

    Google Scholar 

  22. Berning M, Boergens KM, Helmstaedter M. Segem: Efficient image analysis for high-resolution connectomics. Neuron. 2015; 87(6):1193–206.

    Article  CAS  Google Scholar 

  23. Wiemerslage L, Lee D. Quantification of mitochondrial morphology in neurites of dopaminergic neurons using multiple parameters. J Neurosci Methods. 2016; 262:56–65.

    Article  CAS  Google Scholar 

  24. Kandel ER, Schwartz JH, Jessell TM. Principles of neural science, 4th edn. McGraw-Hill. xli. 2000; 50(6):823–39.

    Google Scholar 

  25. Arce KPD, Schrod N, Metzbower SWR, Allgeyer E, Kong GKW, Tang A, Krupp AJ, Stein V, Liu X, Bewersdorf J. Topographic mapping of the synaptic cleft into adhesive nanodomains. Neuron. 2015; 88(6):1165–72.

    Article  Google Scholar 

  26. Xiao C, Li W, Deng H, Chen X, Yang Y, Xie Q, Han H. Effective automated pipeline for 3d reconstruction of synapses based on deep learning. BMC Bioinforma. 2018; 19(1):263–80.

    Article  Google Scholar 

  27. Argandacarreras I, Turaga SC, Berger DR, Ciresan D, Giusti A, Gambardella LM, Schmidhuber J, Laptev D, Dwivedi S, Buhmann JM. Crowdsourcing the creation of image segmentation algorithms for connectomics. Front Neuroanat. 2015; 9:142.

    Google Scholar 

  28. Unnikrishnan R, Pantofaru C, Hebert M. Toward objective evaluation of image segmentation algorithms. IEEE Trans Pattern Anal Mach Intell. 2007; 29(6):929–44.

    Article  Google Scholar 

  29. Jain V, Bollmann B, Richardson M, Berger DR. Boundary learning by optimization with topological constraints. In: Computer Vision and Pattern Recognition. San Francisco: IEEE: 2010. p. 2488–95.

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank Dr. Yu Kong, Yang Yang and Danqian Liu (Institute of Neuroscience, CAS) for their assistance with preparation and sectioning of the sample. Also, we would like to thank Mr. Lixin Wei and his colleagues (Institute of Automation, CAS) for the Zeiss Supra55 SEM and technical support.

Funding

The financial support of National Natural Science Foundation of China (NO. 61673381, NO. 11771130, NO. 61871177, NO. 41501392, NO. 61701497), Science and Technology Development Fund of Macau (044/2015/A2), Scientific Instrument Developing Project of Chinese Academy of Sciences (NO. YZ201671), Bureau of International Cooperation, Chinese Academy of Sciences (NO. 153D31KYSB20170059), and Special Program of Beijing Municipal Science & Technology Commission (NO. Z161100000216146) is appreciated.

Availability of data and materials

The data and source code in this paper are available online.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Qiwei Xie or Hua Han.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional information

Author’s contributions

Conceived and designed the experiments: WL, JL, Performed the experiments: WL, CX, HD, Analyzed the data: WL, QX, HH, Contributed materials: QX, HH.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, W., Liu, J., Xiao, C. et al. A fast forward 3D connection algorithm for mitochondria and synapse segmentations from serial EM images. BioData Mining 11, 24 (2018). https://doi.org/10.1186/s13040-018-0183-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13040-018-0183-7

Keywords