Taxonomy | Set of quantitatively derived data clusters defined by a specific computational algorithm on a specific dataset(s). Taxonomies are given a unique label and can be annotated with metadata about the taxonomy, including details of the algorithms and relevant cell and cell set IDs. | Any clustering result in a cell type classification manuscript |
Dataset | Feature information (e.g., gene expression) and associated metadata from a set of cells collected as part of a single project. | Gene expression from 6000 human MOp nuclei |
Ontology | A structured controlled vocabulary for cell types. | Cell Ontology |
Marker gene(s) | A gene (gene set) which, when expressed in a cell, can be used to accurately assign that cell to a specific cell set. | GAD2; PVALB; CHODL |
Taxonomy ID* | An identifier uniquely tagging a taxonomy of the format CCN[YYYYMMDD][#]. | CCN201910120 |
Cell | A single entry in a taxonomy representing data from a single cell (or cell compartment, such as the nucleus). Cells have metadata including a unique ID. | N/A |
Cell set | Any tagged group of cells in a taxonomy. This includes cell types, groups of cell types, and potentially other informative groupings (e.g., all cells from one donor, organ, cortical layer, or transgenic line). Cell sets have several IDs and descriptors (as discussed below) and can also have other metadata. | A cell type; a group of cell types; all cells from layer two in MTG; all cells from donor X |
Provisional cell type | Quantitatively derived data cluster defined within a taxonomy. This is a specific example of a cell set that is of high importance, as most other cell sets are groupings of one or more provisional cell types. Here, the term ‘cell type’ is synonymous with ‘provisional cell type.’ . | A cell type defined in a specific study |
Dendrogram | A hierarchical organization of provisional cell types defined for a specific taxonomy. Dendrograms have a specific semantic and visualizable structure and include nodes (representing multiple provisional cell types) and leaves (representing exactly one). Not all taxonomies include a dendrogram (e.g., if the structure of cell sets is non-hierarchical). | N/A |
Community structure | Non-hierarchical relationships between cell types defined as groups of cell types in a graph. | N/A |
Cell set accession ID* | A unique ID across all tracked datasets and taxonomies. This tag labels the taxonomy and numbers each cell type. CS[taxonomy id]_[unique # within taxonomy] | CS201910120_1 |
Cell set label* | An ID unique within a single taxonomy that is used for assigning cells to cell sets defined as a combination of multiple ‘provisional cell types’. | MTG 12 MTG 01–08 |
Cell set alias* | Any cell set descriptor. It can be defined computationally from the data, or manually based on new experiments, prior knowledge, or a combination of both. Cell aliases beyond the ‘preferred’ or ‘aligned’ are defined as ‘cell set additional aliases’. | (Any ‘cell set aligned alias’); Interneuron 1; Rosehip |
Cell set preferred alias* | The primary cell set alias (e.g., what cell types might be called in a publication). This can sometimes match the aligned alias, but not always, and can be left unassigned. | Inh L1-2 PAX6 CDH12; ADARB2 (CGE); Chandelier; [blank] |
Cell set aligned alias* | Analogous to ‘gene symbol’. At most one biologically driven name for linking matching cell sets across taxonomies and with a reference taxonomy. | L2/3 IT 4; Pvalb 3; Microglia 2 |
Cell set structure* | The location in the brain (or body) from where cells in the associated set were primarily collected. | Neocortex |
Cell set ontology tag* | A tag from a standard ontology (e.g., UBERON) corresponding to the listed cell set structure. | UBERON:0001950 |
Cell set alias assignee* | Person responsible for assigning a specific cell set alias in a specific taxonomy (e.g., the person who built the taxonomy or uploaded the data, or a field expert). | (First author of manuscript) |
Cell set alias citation* | The citation or permanent data identifier corresponding to the taxonomy where the cell set was originally reported. | (Manuscript DOI); [blank] |
Reference taxonomy | A taxonomy based on one or a combination of high-confidence datasets, to be used as a baseline of comparison for datasets collected from the same organ system. | Cross-species cortical cell type classification |
Morpho-electric(ME) type | A provisional cell type defined using a combination of morphological and electrophysiological features. | ME_Exc_7 |
Governing body | A forum of subject-matter experts to guide policy and manage change of the CCN and associated ontologies and databasing efforts. | N/A |