Figures and data

Schematic phylogenetic tree of [NiFe]-CODH, with selected CODHs marked with their respective position. Tree was built using IQ-TREE, 1000 ultrafast bootstrap, containing 5508 putative CODH sequences and one outgroup (MBE6442607.1 hydroxylamine reductase [Desulfovibrio desulfuricans]) for rooting. Detailed searchable tree with bootstrap values can be found in Supplementary File 9_Tree5.

Operons of selected [NiFe]-CODH. * NtCODH formally known as MtCODH formerly known as CtCODH due to renaming of host organism (Gtari and Ventura, 2025). ** NvCODH formerly known as DvCODH due to renaming of host organism (Waite et al., 2020).

(A) Frequency of X amounts of [NiFe]-CODH from the same clade co-occurring in the same organism. (B) Probability matrix of [NiFe]-CODH from different clades co-occurring in the same organism. The equation for probability can be found in the Method section. A long table format of the matrix can be found in Supplementary File 2_TableS1. Additional raw data can also be found in Supplementary File 1_Table S1.

(A) Phylogenetic tree of putative [NiFe]-CODH unrooted with 1376 sequences. The coloured rings mark if the operon from a protein/leaf contains one or more of a certain type of protein using the same coloring as in Fig. 4C. The types of proteins are from centre to outer ring: one carbon pool, CooC, CooT, CooJ, Ferredoxin, FeS, hydrogenase, hydrogenase-maturase, NAD(P)/FAD-dependent oxidoreductase, regulation, ABC transporter, transporter. Detailed searchable tree with bootstrap values can be found in Supplementary File 5_Tree1 and Supplementary File 6_Tree2. (B) Distribution of operon size for CODH and HCP genes. (C) Proportion of [NiFe]-CODH from one clade being coded near a certain type of protein. The dotted line marks a proportion of 10%, which we set as our significance cut-off. Raw data can be found in Supplementary File 1_Table S1.