Lifestyles are displayed next to species names (blue: free-living, green: endoparasitoid, yellow: ectoparasitoid, gray: unknown). The number of EVEs and domesticated EVEs (dEVEs) found in each …
File including all aligned cluster sequences scored from A to X.
The phylogeny of cluster21304 corresponds to the clustering of a set of viral and candidate viral insertion genes sharing a homology. In red are represented the loci of viral origin, and in blue are …
The column ‘Sp names’ contains the species name, followed by the name of the scaffold in which the endogenous viral element (EVE) has been identified. The ‘Viral family’ column refers to the …
The figure compares the synteny of the IVSPER between Hyposoter didymator ichnovirus (HdIV) and the Campoplegninae of our dataset. Homologous genes with synteny between the two species are indicated …
The phylogeny includes 12 subfamilies of the Ophioniformes group within the superfamily Ichneumonoidea. Several species of these subfamilies have been examined for the presence of ichnovirus-like …
(A) Phylogeny of early diverging families of Chalcidoidea (423 UCEs and 127,979 bp were analyzed to get the tree, best-fit model = GTR + F + R10). (B) Phylogeny of the family Eulophidae to which the …
Phylogeny of 124 Hymenoptera species. Two Coleoptera species were used to root the tree. The aLRT bootstrap scores are represented along the nodes. The sources refer to the platform or laboratory in …
A represents the distribution of the number of endogenization Events and B the number of endogenous viral elements (EVEs). The percentage of Events or EVEs is shown next to the bars.
The panel (A) refers to the four known cases (Venturia canescens, Fopius arisanus, Cotesia congregata, and Microplitis demolitor) involving Nudivirus donors while the panel (B) refers to the known …
In all three panels, events inferred as corresponding to domestications are displayed in orange, while events not inferred as domestications are displayed in yellow. (A) Distribution of the number …
(A) Distribution of viral endogenization events (Event) and B of domestication events (dEVEs) across Hymenoptera lifestyles. Crosses indicate the expected proportion of events associated with the …
Violin plots represent the posterior distribution of the coefficients obtained under the different GLM models (after exponential transformation to obtain a rate relative to free-living species). The …
The ectoparasitoid lifestyle is in yellow, the endoparasitoid lifestyle is in green, and the free-living lifestyle is in blue. A binomial negative zero-inflated GLM model was used, with free-living …
The ectoparasitoid lifestyle is in yellow, the endoparasitoid lifestyle is in green, and the free-living lifestyle is in blue (the intercept). Coefficients have been transformed into exponential and …
The ectoparasitoid lifestyle is in yellow, the endoparasitoid lifestyle is in green, the free-living lifestyle is in blu.e and the eusocial lifestyle is in purple. A binomial negative zero-inflated …
Specifically, this figure shows the relationships between Naldaviricetes double-stranded DNA viruses and endogenous viral elements (EVEs) from hymenopteran species, where at least three …
File including all aligned cluster sequences included in the Naldaviricetes phylogenetic analysis.
The size of the dots corresponds to the number of candidate EVEs inside the scaffold. The color represents the genomic entity from which the EVE probably originated (brown: Nudiviridae, blue = LbFV, …
The plot show regions homologous to viral ORFs in the Platygaster orseoliae filamentous virus (PoFV) genome (A). The colored regions correspond to the predicted ORFs in the PoFV genome (gray ORFS in …
The panel A represents the Cluster_25710 which corresponds to the integrase protein. The panel B represents the Cluster_26675 which corresponds to the ac81 protein. Taxa in red correspond to …
Summary statistics table for candidate EVEs.
General information regarding the species used in this study.
Summary statistics table for control endogenous viral elements (EVEs).
Information for individual loci.
Endogenized loci (scoring from A to D) are displayed in the first sheet, whereas exogenous loci (from E to X) are displayed in the second sheet.
Table listing the names of virus species known to probably interact with insects.
The data is taken from the virushostdb database (Mihara et al., 2016) (version of 24/03/2023), which lists a wide variety of virus species associated with their presumed hosts. We have also added two important exploratory studies of RNA viruses (Shi et al., 2016; Wu et al., 2020). The viral genomic structures associated with the viruses were retrieved from the ICTV website (V2022_MSL38). Each column represents the information retrieved for each virus species from one of the three sources listed. The Hostdb_Host_lineage column corresponds to the information of the insect host observed interacting with the virus of interest. If a column with the suffix ’Shi’ or ’Haoming’ contains information for a virus species, then this means that this RNA virus species was found in their dataset in an insect.
Datasets and detailed statistics are presented in the manuscript.
Additional information from the double-stranded DNA (dsDNA) Naldaviricetes phylogenetic analysis in Figure 5, including the best partition models chosen for each partition and the number of genes used for each species of the tree.
Biosample information regarding the 34 Hymenoptera species sequenced for this study.
Table representing the overlap between transposable elements and the clusters of homologous endogenous viral elements (EVEs).
The transposable elements were inferred using the RepeatModeler RepeatPeps database.
Information on the RNAseq datasets used in this study.
Details of the tblastn analysis for Platygaster orseoliae endogenous viral elements (PoEFV) and complementary information.
File including all the cluster phylogenies.
Leafs highlighted in green represent endogenous viral elements (EVEs) (scored from A to D), while leafs highlighted in red represent free-living viruses or loci annotated as putative free-living sequences (scored from D to X). The letter at the end of the taxon label represents the endogenization score assigned to the candidate. The assigned viral family of the free-living genes appears next to the pipe. The numbers right next to the ‘Event’ refers to the assigned Event number. If EVEs were found under selection (either by RNAseq or dN/dS analysis), the end of the leaf name will be ‘Selective_pressure_YES,’ while if not, the name will be ‘Selective_pressure_NO.’ Ultra-Fast Bootstrap values can be found next to the nodes of the phylogenies. For each phylogeny, the putative consensus protein name as well as the putative viral family is given at the top of the figure.