Ais: streamlining segmentation of cryo-electron tomography datasets

Mart G.F. Last; Leoni Abendstein; Lenard M. Voortman; Thomas H. Sharp

doi:10.7554/eLife.98552.1

eLife assessment

This work describes a new software platform for machine-learning-based segmentation of and particle-picking in cryo-electron tomograms. The program and its corresponding online database of trained models will allow experimentalists to conveniently test different models and share their results with others. The paper provides solid evidence that the software will be valuable to the community.

https://doi.org/10.7554/eLife.98552.1.sa3

Significance of findings

valuable: Findings that have theoretical or practical implications for a subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

solid: Methods, data and analyses broadly support the claims with only minor weaknesses

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Segmentation is a critical data processing step in many applications of cryo-electron tomography. Downstream analyses, such as subtomogram averaging, are often based on segmentation results, and are thus critically dependent on the availability of open-source software for accurate as well as high-throughput tomogram segmentation. There is a need for more user-friendly, flexible and comprehensive segmentation software that offers an insightful overview of all steps involved in preparing automated segmentations. Here, we present Ais: a dedicated tomogram segmentation package that is geared towards both high performance and accessibility, available at github.com/bionanopatterning/Ais. In this report, we demonstrate two common processing steps that can be greatly accelerated with Ais: particle picking for subtomogram averaging, and generating many-feature models of cellular architecture based on in situ tomography data. Featuring comprehensive annotation, segmentation, and rendering functionality, as well as an open repository for trained models at aiscryoet.org, we hope that Ais will help accelerate research and dissemination of data involving cryoET.

Introduction

Segmentation is a key step in processing cryo-electron tomography (cryoET) datasets that entails identifying specific structures of interest within the volumetric data and marking them as distinct features. This process forms the basis of many subsequent analysis steps, including particle picking for subtomogram averaging and the generation of 3D models that help visualize the ultrastructure of the sample.

Although it is a common task, the currently available software packages for tomogram segmentation often leave room for improvement in either scope, accessibility, or open availability of the source code. Some popular programs, such as Amira™ (Thermo Fisher Scientific), are not free to use, while other more general purpose EM data processing suites offer limited functionality in terms of visualization and user interaction. Specifically, we found that a deficit existed of software that is easy to use, competitively performing, freely available, and dedicated to segmentation of cryoET datasets for downstream processing.

In this report we present Ais, an open-source tool that is designed to enable any cryoET user – whether experienced with software and segmentation or a novice – to quickly and accurately segment their cryoET data in a largely automated fashion. Ais was designed to have an intuitive and straightforward user interface in order to make it accessible to a broad audience, while a library of various neural network architectures, a system of configurable interactions between different models, a streamlined workflow that facilitates rapid fine-tuning of neural network predictions, and built-in particle picking and volume rendering functionalities enable users to quickly prepare highly reusable models for specific segmentation tasks, as well as publication-quality figures.

To demonstrate the use of Ais, we outline its use in two such tasks: first, to automate the particle picking step of a subtomogram averaging workflow, and secondly for the generation of rich three-dimensional models of cellular architecture, with ten distinct cellular components, based on cryoET datasets acquired on cellular samples.

Results and discussion

The first step in image segmentation using convolutional neural networks (CNNs) is to manually annotate a subset of the data for use as a training dataset. Ais facilitates this step by providing a simple interface for browsing data, drawing overlays, and selecting boxes to use as training data (Fig. 1a, also Fig. S1-S5). Multiple features, such as membranes, microtubules, ribosomes, and phosphate crystals, can be segmented and edited at the same time across multiple datasets (even hundreds). These annotations are then extracted and used as ground truth labels upon which to condition neural networks, with which one can automatically annotate the same or any other dataset (Fig. 1b). Segmentation in Ais is performed on-the-fly and can achieve interactive framerates, depending on the size of the datasets and models that are used. With a little experience, users can generate a training dataset and then train, apply, and asses the quality of a model within a few minutes (Fig. 1c), including on desktop or laptop Windows and Linux systems with fairly low-end GPUs (e.g., we often use an NVIDIA T1000).

An overview of the user interface and functionalities.
a) The interface for annotation of datasets. In this example, a tomographic slice has been annotated with various features – a detailed explanation follows in Fig. 5. b) After annotation, multiple neural networks (or ‘models’) are set up and trained on the aforemention ed annotations. The models can then be used to segment the various distinct features. In this example, double membrane vesicles (DMVs), single membrane vesicles, ribosomes, intermediate filaments, mitochondrial granules, and molecular pores in the DMVs are segmented. c) After training or downloading the required models and exporting segmented volumes, the resulting segmentation are immediately available within the software for 3d rendering and inspection. d) The Ais model repository at aiscryoet.org facilitates sharing and reuse of trained models. After validation, submitted models can be freely downloaded by anyone. e) Additional information, such as the pixel size and the filtering applied to the training data, is displayed alongside all models in the repository, in order to help a user identify whether a model is suited to segment their datasets.

Many cryoET datasets look alike, especially for cellular samples. A model prepared by one user to segment, for example, ribosomes in a dataset with a pixel size of 10 Å, might also be adequate for another user’s ribosome segmentation at 12 Å per pixel. To facilitate this sort of reuse and sharing of models, we launched an open model repository at aiscryoet.org where users can freely upload and download successfully trained models (Fig. 1d). Models that pass screening become public, are labelled with relevant metadata (Fig. 1e), and can be downloaded in a format that allows for direct use in Ais. Thus, users can skip the annotation and training steps of the segmentation workflow. To kickstart the repository, all 27 models that are presented in this article have been uploaded to it.

Our software is not the first to address the challenge of segmenting cryoET datasets; established suites such as EMAN2¹, MIB², SuRVOS³, or QuPath⁴ also provide some or most of the functionality that is available in Ais. Each comes equipped with one or various choices of classifier architectures to use, and many more designs for neural networks for semantic image segmentation can be found in the literature. Therefore, as well as creating a package geared specifically towards ease of use and fast results, we also wanted to include functionality that enables a user to quickly compare different models in order to facilitate determining which models are best suited for a particular segmentation task. The software thus includes a library of a number of well-performing models, including adaptations of single-model convolutional neural network architectures such as InceptionNet⁵, ResNet⁶, various UNets⁷, VGGNet⁸, and the default model available in EMAN2¹, as well as the more complex generative-adversarial network Pix2pix⁹. This library can also be extended by copying any Python file that adheres to a minimal template into the corresponding directory of the project.

A library of neural network architectures supports varied applications

To illustrate how useful it can be to rapidly test various models before selecting one that is well suited for the segmentation of any particular feature, we used six different models for the segmentation of three distinct features within the same tomogram, and analyzed the results (Table 1). We used a cryoET dataset that we have previously acquired¹⁰, which contained liposomes with membrane bound Immunoglobulin G3 (IgG3) antibodies that form an elevated Fragment crystallizable (Fc) platform, prepared on a lacey carbon substrate. The features of interest for segmentation were the membranes, antibody platforms, and carbon support film (Fig. 2a).

Comparison of some of the default models available in Ais.
^a The computational cost is only roughly proportional to the number of model parameters, which is reported in the software. The specifics of the model architecture affect the processing speed more significantly. ^b Time required to process one 511×720 pixel sized tomographic slice. ^c The loss, calculated as the binary cross-entropy (bce) of predicted and original annotation, is a (rough) metric of how well a model performs (see Methods). From left to right, the columns list the losses of the membrane, carbon, and antibody platform models, respectively. ^d Unlike the other models, Pix2 pix is not trained to minimize the bce loss but uses a different loss function instead. The bce loss values shown here were computed after training and may not be entirely comparable.

A comparison of different neural networks for tomogram segmentation.
a) A representative example of the manual segmentation used to prepare training datasets. Membranes are annotated in cyan, carbon film in yellow, and antibody platforms in magenta. For the antibody training set, we used annotations prepared in multiple slices of the same tomogram, but for the carbon and membrane training set the slice shown here comprised all the training data. b) A tomographic slice from a different tomogram that contains the same features of interest, also showing membrane bound antibodies with elevated Fc platforms that are adjacent to carbon (magenta arrowheads). c) Results of segmentation of membranes (top; cyan), carbon (middle; yellow), and antibody platforms (bottom; magenta), with the six different neural networks.

After training, we applied the models to a different tomogram than that used to generate the training data (Fig. 2b), so that there was no overlap between the training and testing datasets. Next, we compared the training times, relative loss values, and quality of the segmentations.

Based on the model losses (the loss is a metric of how well the model predictions match the ground truth labels), VGGNet was best suited for the segmentation of membranes, while UNet performed best on the carbon support film and antibody platforms. However, the loss values do not capture model performance in the same way as human judgement (Fig. 2c). For the antibody platform models, the model that would be expected to be one of the worst based on the loss values, Pix2pix, actually generates segmentations that are well-suited for the downstream processing tasks. Pix2pix appears to predict both fewer false negatives and fewer false positives than the lowest-loss model, UNet, which occasionally labels sections of membrane and support film as antibody platform. Moreover, since Pix2pix is a relatively large model, it might also be improved further by increasing the number of training epochs.

When taking the training and processing speeds in to account as well as the segmentation results, there is no overall best model. We therefore included multiple well-performing model architectures in the final library, in order to allow users to select from these models to find one that works well for their specific datasets. Although it is not necessary to screen different models and users may simply opt to use the default model architecture (VGGNet), these results thus show that it can be useful to test different models in order to identify one that is best.

Fine-tuning segmentation results with model interactions

Although the above results go some way towards distinguishing the three different structures, they also show demonstrate a common limitation encountered in automated tomogram segmentation: some features are assigned a high segmentation value by multiple of the networks, leading to ambiguity in the results. For example, the InceptionNet and ResNet antibody platform models falsely label edges of the carbon film.

To further improve the segmentation results, we decided to implement a system of proximity-based ‘model interactions’ of two types, colocalization and avoidance (Fig. 3a), using which the output of one model can be adjusted based on the output of other models. In a colocalization interaction, the predictions of one model (the child) are suppressed wherever the prediction value of another model (the parent) is below some threshold. In an avoidance interaction, suppression occurs wherever the parent model’s prediction value is above a threshold.

Model interactions can significantly increase segmentation accuracy.
a) An overview of the settings available in the ‘Models’ menu in Ais. Three models: 1) ‘membrane’ (red), 2) ‘carbon’ (white), and 3) ‘antibody platforms’ (green) are active, with each showing a different section of the model settings: the training menu (1), prediction parameters (2), and the interactions menu (3). b) A section of a tomographic slice is segmented by two models, carbon (white; *parent* model) and membrane (red; *child* model), with the membrane model showing a clear false positive prediction on an edge of the carbon film (panel ‘without interactions’). By configuring an avoidance interaction between the membrane model that is conditional upon the carbon model’ s prediction, this false positive is avoided (panel ‘with interactions’). c) By setting up multiple model interactions, inaccurate predictions by the ‘antibody platforms’ model are suppressed. In this example, the membrane model avoids carbon while the antibody model is set to colocalize with the membrane model. d) 3D renders (see Methods) of the same dataset as used in Fig. 2 processed three ways: without any interactions (left), using model competition only (middle), or by using model competition as well as multiple model interactions (right).

These interactions are implemented as follows: first, a binary mask is generated by thresholding the parent model’s predictions. Next, the mask is then dilated using a circular kernel with a radius R, a parameter that we call the interaction radius. Finally, the child model’s prediction values are multiplied with this mask.

Besides these specific interactions between two models, the software also enables pitching multiple models against one another in what we call ‘model competition’. Models can be set to ‘emit’ and/or ‘absorb’ competition from other models. On a pixel-by-pixel basis, all models that absorb competition are suppressed whenever their prediction value for that pixel is lower than that of any of the emitting models.

With the help of these model interactions it is possible to suppress common erroneous segmentation results. For example, an interaction like ‘absorbing membrane model avoids emitting carbon model with R = 10 nm’ is effective at suppressing the prediction of edges of the carbon film as being membranes (Fig. 3b). Another straightforward example of the utility of membrane interactions is the segmentation of membrane-bound particles. By defining the following two interactions: ‘antibody platform model avoids membrane model with R = 10 nm’ followed by ‘antibody platform model colocalizes with membrane model with R = 30 nm’, the Fc-platforms formed by IgG3 at a distance of ∼22 nm from the membrane are retained, while false positive labelling of features such as the membrane or carbon is suppressed (Fig. 3c).

By conditionally combining and editing the prediction results of multiple neural networks, model interactions can thus be helpful in fine-tuning segmentations to be better suited for downstream applications. To illustrate this, we generated a comparison of segmentation results using i) no interactions, ii) model competition only, and iii) model competition as well as model interactions (Fig. 3d), which demonstrates the degree to which false positives can be reduced by the use of model interactions (although at times at the expense of increasing the rate of false negatives).

Automating particle picking for subtomogram averaging

Protein structure determination by cryoET requires the careful selection of many subtomograms (i.e., sub-volumes of a tomogram that all contain the same structure of interest), and aligning and averaging these to generate a 3D reprojection of the structure of interest with a significantly increased signal to noise ratio. The process of selecting these sub-volumes is called ‘particle picking’, and can be done either manually or in an automated fashion. Much time can be saved by automating particle picking based on segmentations. In many cases, though, segmentation results are not readily usable for particle picking, as they can often introduce numerous false positives. This is particularly the case with complex, feature-rich datasets such as those obtained within cells, where the structures of interest can visually appear highly similar to other structures that are also found in the data, or when the structures of interest are located close to other features and are therefore hard to isolate. An example of this latter case is the challenge of picking membrane-bound particles.

Recently, we have used cryoET and subtomogram averaging to determine the structures of membrane bound IgG3 platforms and of IgG3 interacting with the human complement system component 1 (C1) on the surface of lipid vesicles¹⁰. The reconstructions of the antibody platforms alone and of the antibody-C1 complex were prepared using 1193 and 2561 manually selected subtomograms, extracted from 55 and 101 tomograms, respectively. Manual picking of structures of interest, although very precise, is very time consuming and in this particular case took approximately 20 hours of work.

To demonstrate the utility of our software for particle picking, we re-analyzed these same datasets, this time using Ais to automate the picking of the two structures: antibody platforms and antibody-C1 complexes. For the antibody platforms, we used the same models and model interactions as described above, while we trained an additional neural network to identify C1 complexes for the segmentation of the antibody-C1 complexes. To prepare a training dataset for this latter model, we opened all 101 tomograms in Ais, and browsed the data to select and annotate slices where one or multiple antibody-C1 complexes were clearly visible (Fig. 4a). The training dataset thus consisted of samples taken from multiple different tomograms; the annotation and data selection in this case took around 1 hour of work.

Automated particle picking for sub-tomogram averaging of antibody complexes.
a) Manually prepared annotations used to train a neural network to recognize antibody platforms (top) or antibody-C1 complexes (bottom). b) Segmentation results as visualized within the software. Membranes (red) and carbon support film (white) were used to condition the antibody (green) and antibody-C1 complex (yellow) predictions using model interactions. c) 3 D representations of the segmented volumes rendered in Ais. d) Tomographic slices showing particles picked automatically based on the segmented volume shown in panel c. e) Subtomogram averaging result of the 2499 automatically picked antibody platforms. f) Subtomogram averaging result obtained with the 602 automatically picked antibody-C1 complexes. The quadrants in panels e and f show orthogonal slices of the reconstructed density maps and a 3D isosurface model (the latter rendered in ChimeraX).

The antibody platform and antibody-C1 complex models were then applied to the respective datasets, in combination with the membrane and carbon models and the model interactions described above (Fig. 4b). We then launched a relatively large batch segmentation process (3 models × 156 tomograms) and left it to complete overnight.

Once complete, we used Ais to inspect the segmented volumes and original datasets in 3D, and adjusted the threshold value and so that, in as far as possible, only the particles of interest remained visible and the number of false positive particles was minimized (Fig. 4c). After selecting these values, we then launched a batch particle picking process to determine lists of particle coordinates based on the segmented volumes. Next, we used EMAN2¹ (spt_boxer.py) to extract volumes using these coordinates as an input (Fig. 4d), which resulted in 2499 volumes for the antibody platform reconstruction and 602 for the antibody-C1 complex (n.b. these numbers can be highly dependent on the threshold value). These volumes, or subtomograms, were used as the input for subtomogram averaging using EMAN2¹ and Dynamo¹¹ (see Methods), without further curation – i.e., we did not manually discard any of the extracted volumes.

After applying the same approach to subtomogram averaging as used previously¹⁰, the resulting averages were indeed highly similar to the original reconstructions (Figs. 4e,f, Fig. S6). These results demonstrate that Ais can be succesfully used to automate particle picking, and thus to significantly reduce the amount of time spent on what is often a laborious processing step.

Many-feature segmentations of complex in situ datasets

Aside from particle picking, segmentation is also often used to visualize and study the complex internal structure of a sample, as for example encountered when applying tomography to whole cells. Here too, the accuracy of a segmentation can be a critical factor in the success of downstream analyses such as performing measurements on the basis of 3D models generated via segmentation.

A challenging aspect of segmentation of cellular samples is that these datasets typically contain many features that are biologically distinct, but visually and computationally difficult to distinguish. For example, one challenge that is often encountered is that of distinguishing between various linearly shaped components: lipid membranes, actin filaments, microtubules, and intermediate filaments, which all appear as linear features with a relatively high density. To show the utility of Ais for the accurate segmentation of complex cellular tomograms, we next demonstrate a number of examples of such feature-rich segmentations.

The first example is a segmentation of seven distinct features observed in the base of Chlamydomonas reinhardtii cilia (Fig. 5a), using the data by van den Hoek et al.¹² that was deposited in the Electron Microscopy Public Image Archive (EMPIAR)¹³ with accession number 11078. The features are: membranes, ribosomes, microtubule doublets, axial microtubules, non-microtubular filaments, interflagellar transport trains (IFTs), and glycocalyx. This dataset was particularly intricate (the supplementary information to the original publication lists more than 20 features that can be identified across the dataset) and some rare features, such as the IFTs, required careful annotation across all tomograms before we could compile a sufficiently large training dataset. The final segmentation correctly annotates most of the selected characteristics present in the sample: the ribosome exclusion zone that surrounds the ciliary base¹² is clearly recognizable, and the structures of the glycocalyx, membranes, and microtubule doublets within the cilia are well defined. Some fractions of the meshwork of stellate fiber and Y-link proteins are also detected within the cilium.

Segmentations of complex *in situ* tomograms.
a) A segmentation of seven distinct features observed in the base of *C. reinhardtii* cilia¹² (EMPIAR-11078, tomogram 12): membranes (gray), ribosomes (magenta), microtubule doublets (green) and axial microtubules (green), non-microtubular fi laments within the cilium (blue), interflagellar transport trains (yellow), and glycocalyx (orange). Inset: a perpendicular view of the axis of the cilium. The arrows in the adjacent panel indicate these structures in a tomographic slice. b) A segmentation of six features observed in and around mitochondria in a mouse neuron with Huntington disease phenotype ¹⁴ (EMD-29207): membranes (gray), mitochondrial granules (yellow), membranes of the mitochondrial cristae (red), microtubules (green), actin (turquoise), and ribosomes (magenta). c) Left: a segmentation of ten different cellular components found in tomograms of coronavirus infected mammalian cells ¹⁶ : double membrane vesicles (DMVs, light red), single membranes (gray), viral nucleocapsid proteins (red), viral pores in the DMVs (blue), nucleic acids in the DMVs (pink), microtubules (green), actin (cyan), intermediate fi laments (orange), ribosomes (magenta), and mitochondrial granules (yellow). Right: a representative slice, with examples of each of the features (except the mitochondrial granules) indicated by arrows of the corresponding colour.

In the second example, we show a segmentation of six features found in and around a mitochondrion in a mouse neuron (Fig. 5b), using the data by Wu et al.¹⁴ as available in the EMDataBank (EMDB)¹⁵ with accession number 29207. In the original publication, the authors developed a segmentation method to detect and perform measurements on the granules found within mitochondria. Using Ais, we were able to prepare models to segment these granules, as well as microtubules, actin filaments, ribosomes, and to distinguish between the highly similar vesicular membranes on the one hand, and the membranes of the mitochondrial cristae on the other.

Lastly, we used Ais to generate a 3D model of ten distinct cellular features observed in coronavirus infected mammalian cells (Fig. 5c, Fig. S7), using data by Wolff et al.¹⁶: single membranes, double membrane vesicles (DMVs), actin filaments, intermediate filaments, microtubules, mitochondrial granules, ribosomes, coronaviral nucleocapsid proteins, coronaviral pores in the DMVs, and the nucleic acids within the DMV replication organelles. The software was able to accurately distinguish between single membranes and double membranes, as well as to discriminate between the various filaments of the cytoskeleton. Moreover, we could identify the molecular pores within the DMV, and upon manually thresholding the segmented volumes found no immediately apparent false positive predictions of these pores (Fig. S8). We could also segment the nucleocapsid proteins, thus distinguishing viral particles from other, similarly sized, single membrane vesicles, as well as detect the nucleic acids found within the DMVs. Although the manual annotation took some hours in this case, the processing of the full volume took approximately 3 minutes per feature once the models were trained, and models can of course be applied to other volumes without requiring additional training.

To conclude, our aim with the development of Ais was to simplify and improve the accuracy of automated tomogram segmentation in order to make this processing step more accessible to many cryoET users. Here, we have attempted to create an intuitive and organized user interface that streamlines the whole workflow from annotation, to model preparation, to volume processing and particle picking and inspecting the results. Additionally, the model repository at aiscryoet.org is designed to aid users in achieving results even faster, by removing the need to generate custom models for common segmentation tasks. To help users become familiar with the software, documentation and tutorials are available at ais-cryoet.readthedocs.org and video tutorials can be accessed via youtube.com/@scNodes. By demonstrating the use of Ais in automating segmentation and particle picking for subtomogram averaging, and making the software available as an open-source project, we thus hope to help accelerate research and dissemination of data involving cryoET.

Methods

Model comparisons

The comparison presented in Table 1 was prepared with the use of the same training datasets for all models, consisting of 53/52/58 positive images showing membranes/carbon/antibody platforms and corresponding annotations alongside 159/51/172 negative images that did not contain the feature of interest, but rather the other two features, reconstruction artefacts, isolated protein, or background noise. Images were 64 × 64 pixels in size and the dataset was resampled, in random orientations, such that every positive image was copied 10 times and that the ratio of negatives to positives was 1.3:1. Models were trained for 50 epochs with 32 images per batch, with the exception of the ResNet and Pix2pix antibody platform models, which were trained for 30 epochs to avoid a divergence that occurred during training after a larger number of epochs, due to the large number of model parameters and relatively low number of unique input images.

The reported loss is that calculated on the training dataset itself, i.e., no validation split was applied. During regular use of the software a validation split is also not applied in order to make full use of an input set of ground truth annotations. The software also reports the overall loss during training, rather than the validation loss.

Data visualization

Images shown in the figures were either captured within the software (Fig. 1, Fig. 3bc ‘original image’, Fig. 4abc), output from the software that was colourized in Inkscape (Fig. 2c, Fig. 3bc), or output from the software that was rendered using ChimeraX¹⁷ (Fig. 3d, Fig. 4ef, Fig. 5). For the panels in Fig. 3d, segmented volumes were rendered as isosurfaces at a manually chosen suitable isosurface level and with the use of the ‘hide dust’ function (the same settings were used for each panel, different settings used for each model).

Hardware

The software does not require a GPU, but works optimally when a CUDA capable GPU is available. For the measurements shown in Table 1 we used an NVIDIA Quadro P2200 GPU on a PC with an Intel i9-10900K CPU. We’ve also extensively used the software on a less powerful system equipped with an NVIDIA T1000 and an Intel i3-10100 CPU, as well as on various systems with intermediate specifications, and found that the software reaches interactive segmentation rates in most cases. For batch processing of many volumes, a more powerful GPU is useful.

Tomogram reconstruction and subtomogram averaging

Data collection and subtomogram averaging (Figs. 3 and 4) was performed as described in a previously published article¹⁰. Briefly, tilt series were collected on a Talos Arctica 200 kV system equipped with a Gatan K3 detector with energy filter at a pixel size of 1.74 Å per pixel using a dose-symmetric tilt scheme with range ±57° and tilt increments of 3° with a total dose of 60 e/Å². Tomograms were reconstructed using IMOD¹⁸. Particle picking was done in Ais (and is explained in more detail in the online documentation). Subtomogram averaging was done using a combination of EMAN¹ and Dynamo¹¹. For a detailed description of the subtomogram averaging procedure, see Abendstein et al. (2023)¹⁰.

Open-source software

This project depends critically on a number of open-source software components, including: Python, Tensorflow¹⁹, numpy²⁰, scipy²¹, scikit-image²², mrcfile²³, and imgui²⁴.

Software availability

A standalone version of the software is available as ‘Ais-cryoET’ on the Python package index and at github.com/bionanopatterning/Ais. We have also integrated the functionality into scNodes²⁵, our dedicated processing suite for correlated light and electron microscopy. In the combined package, the segmentation editor contains additional features for visualization of fluorescence data and the scNodes correlation editor can be used to prepare correlated datasets for segmentation. Documentation for both versions of Ais can be found at ais-cryoet.readthedocs.org. Video tutorials are available via youtube.com/@scNodes.

Supporting information

Supplementary Information

Acknowledgements

We thank A. Koster and M. Barcena for helpful discussions and providing us with access to the coronaviral replication organelle datasets. We are also grateful to van den Hoek et al.¹²and Wu et al.¹⁴, for uploading the data that we used for Fig. 5 onto EMPIAR and EMDB, as well as to the authors of various other datasets uploaded to these databases that are not discussed in this manuscript but that were very helpful for testing the software. This research was supported by the following grants to THS: European Research Council H202 Grant 759517; European Union’s Horizon Europe Program IMAGINE grant 101094250, and the Netherlands Organization for Scientific Research Grant VI.Vidi.193.014.

Author Contributions

MGFL and LA collected and processed the data. MGFL wrote the software with input from all other authors. THS and LMV supervised the project.

Competing Interests Statement

The authors declare no competing interests.

References

1
1. Galaz-Montoya J. G.
2. Flanagan J.
3. Schmid M. F.
4. Ludtke S. J.
2015Single particle tomography in EMAN2J Struct Biol 190:279–290https://doi.org/10.1016/j.jsb.2015.04.016 Google Scholar
2
1. Belevich I.
2. Joensuu M.
3. Kumar D.
4. Vihinen H.
5. Jokitalo E.
2016Microscopy image browser: a platform for segmentation and analysis of multidimensional datasetsPLoS biology 14:e1002340https://doi.org/10.1371/journal.pbio.1002340 Google Scholar
3
1. Luengo I.
2. et al.
2017SuRVoS: Super-Region Volume Segmentation workbenchJ Struct Biol 198:43–53https://doi.org/10.1016/j.jsb.2017.02.007 Google Scholar
4
1. Bankhead P.
2. et al.
2017QuPath: Open source software for digital pathology image analysisSci Rep 7:16878https://doi.org/10.1038/s41598-017-17204-5 Google Scholar
5
1. Szegedy C.
2. et al.
2014Going deeper with convolutionsarXiv https://doi.org/10.48550/arXiv.1409.4842 Google Scholar
6
1. Szegedy C.
2. Ioffe S.
3. Vanhoucke V.
4. Alemi A.
2016Inception-v4, Inception-ResNet and the Impact of Residual Connections on LearningarXiv https://doi.org/10.48550/arXiv.1602.07261 Google Scholar
7
1. Ronneberger O.
2. Fisher P.
3. Brox T.
2015U-Net: Convolutional Networks for Biomedical Image SegmentationarXiv https://doi.org/10.48550/arXiv.1505.04597 Google Scholar
8
1. Simonyan K.
2. Zisserman A.
2015Very Deep Convolutional Networks for Large-scale Image RegognitionarXiv https://doi.org/10.48550/arXiv.1409.1556 Google Scholar
9
1. Isola P.
2. Zhu J.-Y.
3. Zhou T.
4. Efros A. A.
2017Image-to-Image Translation with Conditional Adversarial NetworksIn: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) pp. 5967–5976https://doi.org/10.1109/CVPR.2017.632 Google Scholar
10
1. Abendstein L.
2. et al.
2023Complement is activated by elevated IgG3 hexameric platforms and deposits C4b onto distinct antibody domainsNat Commun 14:4027https://doi.org/10.1038/s41467-023-39788-5 Google Scholar
11
1. Castano-Diez D.
2. Kudryashev M.
3. Arheit M.
4. Stahlberg H.
2012Dynamo: a flexible, user-friendly development tool for subtomogram averaging of cryo-EM data in high-performance computing environmentsJ Struct Biol 178:139–151https://doi.org/10.1016/j.jsb.2011.12.017 Google Scholar
12
1. van den Hoek H.
2. et al.
2022In situ architecture of the ciliary base reveals thestepwise assembly of intraflagellar transport trainsScience 377:543–548https://doi.org/10.1126/science.abm6704 Google Scholar
13
1. Iudin A.
2. et al.
2023EMPIAR: the electron microscopy public image archiveNucleic Acids Research 51:D1503–D1511Google Scholar
14
1. Wu G. H.
2. et al.
2023CryoET reveals organelle phenotypes in huntington disease patient iPSC-derived and mouse primary neuronsNat Commun 14:692https://doi.org/10.1038/s41467-023-36096-w Google Scholar
15
1. Lawson C. L.
2. et al.
2015EMDataBank unified data resource for 3DEMNucleic Acids Research 44:D396–D403https://doi.org/10.1093/nar/gkv1126 Google Scholar
16
1. Wolff G.
2. et al.
2020A molecular pore spans the double membrane of the coronavirus replication organelleScience 369:1395–1398https://doi.org/10.1126/science.abd3629 Google Scholar
17
1. Goddard T. D.
2. et al.
2018UCSF ChimeraX: Meeting modern challenges in visualization and analysisProtein Sci 27:14–25https://doi.org/10.1002/pro.3235 Google Scholar
18
1. Mastronarde D. N.
2. Held S. R.
2017Automated tilt series alignment and tomographic reconstruction in IMODJ Struct Biol 197:102–113https://doi.org/10.1016/j.jsb.2016.07.011 Google Scholar
19
1. Abadi M.
2. et al.
2015Large-Scale Machine Learning on Heterogeneous Distributed SystemsarXiv https://doi.org/10.48550/arXiv.1603.04467 Google Scholar
20
1. Harris C. R.
2. et al.
2020Array programming with NumPyNature 585:357–362https://doi.org/10.1038/s41586-020-2649-2 Google Scholar
21
1. Virtanen P.
2. et al.
2020SciPy 1.0: fundamental algorithms for scientific computing in PythonNat Methods 17:261–272https://doi.org/10.1038/s41592-019-0686-2 Google Scholar
22
1. van der Walt S.
2. et al.
2014scikit-image: image processing in PythonPeerJ 2:e453https://doi.org/10.7717/peerj.453 Google Scholar
23
1. Burnley T.
2. Palmer C. M.
3. Winn M.
2017Recent developments in the CCP-EM software suiteActa Crystallographica Section D 73:469–477https://doi.org/10.1107/S2059798317007859 Google Scholar
24
1. Cornut O. Dear Imgui
2023GitHub repositoryhttps://github.com/ocornut/imgui
25
1. Last M. G. F.
2. Voortman L. M.
3. Sharp T. H.
2023scNodes: a correlation and processing toolkit for super-resolution fluorescence and electron microscopyNat Methods 20:1445–1446https://doi.org/10.1038/s41592-023-01991-z Google Scholar

Article and author information

Author information

Mart G.F. Last
Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, 2333ZC, The Netherlands
ORCID iD: 0000-0002-3739-8863
- To whom correspondence should be addressed: t.sharp@bristol.ac.uk, mgflast@gmail.com
Leoni Abendstein
Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, 2333ZC, The Netherlands
ORCID iD: 0000-0001-7634-5353
Lenard M. Voortman
Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, 2333ZC, The Netherlands
ORCID iD: 0000-0001-9794-067X
Thomas H. Sharp
Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, 2333ZC, The Netherlands, School of Biochemistry, University of Bristol, Bristol, BS8 1TD, United Kingdom
ORCID iD: 0000-0002-1990-2333
- To whom correspondence should be addressed: t.sharp@bristol.ac.uk, mgflast@gmail.com

Version history

Preprint posted: April 4, 2024
Sent for peer review: April 20, 2024
Reviewed Preprint version 1: June 21, 2024
Reviewed Preprint version 2: November 8, 2024
Version of Record published: December 20, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.98552. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, public reviews, and a provisional response from the authors.

Reviewing Editor
Sjors Scheres
MRC Laboratory of Molecular Biology, Cambridge, United Kingdom
Senior Editor
Felix Campelo
Universitat Pompeu Fabra, Barcelona, Spain

Reviewer #1 (Public Review):

This paper describes "Ais", a new software tool for machine-learning-based segmentation and particle picking of electron tomograms. The software can visualise tomograms as slices and allows manual annotation for the training of a provided set of various types of neural networks. New networks can be added, provided they adhere to a Python file with an (undescribed) format. Once networks have been trained on manually annotated tomograms, they can be used to segment new tomograms within the same software. The authors also set up an online repository to which users can upload their models, so they might be re-used by others with similar needs. By logically combining the results from different types of segmentations, they further improve the detection of distinct features. The authors demonstrate the usefulness of their software on various data sets. Thus, the software appears to be a valuable tool for the cryo-ET community that will lower the boundaries of using a variety of machine-learning methods to help interpret tomograms.

https://doi.org/10.7554/eLife.98552.1.sa2

Reviewer #2 (Public Review):

Summary:

Last et al. present Ais, a new deep learning-based software package for the segmentation of cryo-electron tomography data sets. The distinguishing factor of this package is its orientation to the joint use of different models, rather than the implementation of a given approach. Notably, the software is supported by an online repository of segmentation models, open to contributions from the community.

The usefulness of handling different models in one single environment is showcased with a comparative study on how different models perform on a given data set; then with an explanation of how the results of several models can be manually merged by the interactive tools inside Ais.

The manuscripts present two applications of Ais on real data sets; one is oriented to showcase its particle-picking capacities on a study previously completed by the authors; the second one refers to a complex segmentation problem on two different data sets (representing different geometries as bacterial cilia and mitochondria in a mouse neuron), both from public databases.

The software described in the paper is compactly documented on its website, additionally providing links to some YouTube videos (less than an hour in total) where the authors videocapture and comment on major workflows.

In short, the manuscript describes a valuable resource for the community of tomography practitioners.

Strengths:

A public repository of segmentation models; easiness of working with several models and comparing/merging the results.

Weaknesses:

A certain lack of concretion when describing the overall features of the software that differentiate it from others.

https://doi.org/10.7554/eLife.98552.1.sa1

Reviewer #3 (Public Review):

Summary:

In this manuscript, Last and colleagues describe Ais, an open-source software package for the semi-automated segmentation of cryo-electron tomography (cryo-ET) maps. Specifically, Ais provides a graphical user interface (GUI) for the manual segmentation and annotation of specific features of interest. These manual annotations are then used as input ground-truth data for training a convolutional neural network (CNN) model, which can then be used for automatic segmentation. Ais provides the option of several CNNs so that users can compare their performance on their structures of interest in order to determine the CNN that best suits their needs. Additionally, pre-trained models can be uploaded and shared to an online database.

Algorithms are also provided to characterize "model interactions" which allows users to define heuristic rules on how the different segmentations interact. For instance, a membrane-adjacent protein can have rules where it must colocalize a certain distance away from a membrane segmentation. Such rules can help reduce false positives; as in the case above, false negatives predicted away from membranes are eliminated.

The authors then show how Ais can be used for particle picking and subsequent subtomogram averaging and for the segmentation of cellular tomograms for visual analysis. For subtomogram averaging, they used a previously published dataset and compared the averages of their automated picking with the published manual picking. Analysis of cellular tomogram segmentation was primarily visual.

Strengths:

CNN-based segmentation of cryo-ET data is a rapidly developing area of research, as it promises substantially faster results than manual segmentation as well as the possibility for higher accuracy. However, this field is still very much in the development and the overall performance of these approaches, even across different algorithms, still leaves much to be desired. In this context, I think Ais is an interesting package, as it aims to provide both new and experienced users with streamlined approaches for manual annotation, access to a number of CNNs, and methods to refine the outputs of CNN models against each other. I think this can be quite useful for users, particularly as these methods develop.

Weaknesses:

Whilst overall I am enthusiastic about this manuscript, I still have a number of comments:

On page 5, paragraph 1, there is a discussion on human judgement of these results. I think a more detailed discussion is required here, as from looking at the figures, I don't know that I agree with the authors' statement that Pix2pix is better. I acknowledge that this is extremely subjective, which is the problem. I think that a manual segmentation should also be shown in a figure so that the reader has a better way to gauge the performance of the automated segmentation.

On page 7, the authors mention terms such as "emit" and "absorb" but never properly define them, such that I feel like I'm guessing at their meaning. Precise definitions of these terms should be provided.

For Figure 3, it's unclear if the parent models shown (particularly the carbon model) are binary or not. The figure looks to be grey values, which would imply that it's the visualization of some prediction score. If so, how is this thresholded? This can also be made clearer in the text.

Figure 3D was produced in ChimeraX using the hide dust function. I think some discussion on the nature of this "dust" is in order, e.g. how much is there and how large does it need to be to be considered dust? Given that these segmentations can be used for particle picking, this seems like it may be a major contributor to false positives.

Page 9 contains the following sentence: "After selecting these values, we then launched a batch particle picking process to determine lists of particle coordinates based on the segmented volumes." Given how important this is, I feel like this requires significant description, e.g. how are densities thresholded, how are centers determined, and what if there are overlapping segmentations?

The FSC shown in Figure S6 for the auto-picked maps is concerning. First, a horizontal line at FSC = 0 should be added. It seems that starting at a frequency of ~0.045, the FSC of the autopicked map increases above zero and stays there. Since this is not present in the FSC of the manually picked averages, this suggests the automatic approach is also finding some sort of consistent features. This needs to be discussed.

Page 11 contains the statement "the segmented volumes found no immediately apparent false positive predictions of these pores". This is quite subjective and I don't know that I agree with this assessment. Unless the authors decide to quantify this through subtomogram classification, I don't think this statement is appropriate.

In the methods, the authors note that particle picking is explained in detail in the online documentation. Given that this is a key feature of this software, such an explanation should be in the manuscript.

https://doi.org/10.7554/eLife.98552.1.sa0

Author response:

Thank you for organising the review and providing us with the reviewer's feedback. These comments are very useful, and we would like to express our gratitude to the reviewers for their efforts.

The reviewers all point out a number of related improvements, relating to: 1) describing various processing steps more clearly, in the online documentation but also in the manuscript itself (e.g. for particle picking), 2) describing more clearly what features Ais offers, how these compare to those of other programmes, and how they might be interfaced with in third-party programmes (e.g. the expected format of models), and 3) a degree of subjectivity in discussion of the results presented in the manuscript (e.g. our statement that Pix2pix performed better in some cases than did other architectures).

We will address these points, as well as the various other suggestions, in the upcoming revised manuscript and updates to Ais.

https://doi.org/10.7554/eLife.98552.1.sa4

Ais: streamlining segmentation of cryo-electron tomography datasets

Significance of findings

Strength of evidence

Abstract

Introduction

Results and discussion

An overview of the user interface and functionalities.

A library of neural network architectures supports varied applications

Comparison of some of the default models available in Ais.

A comparison of different neural networks for tomogram segmentation.

Fine-tuning segmentation results with model interactions

Model interactions can significantly increase segmentation accuracy.

Automating particle picking for subtomogram averaging

Automated particle picking for sub-tomogram averaging of antibody complexes.

Many-feature segmentations of complex in situ datasets

Segmentations of complex in situ tomograms.