Human cytomegalovirus interactome analysis identifies degradation hubs, domain associations and viral protein functions

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Human cytomegalovirus (HCMV) extensively modulates host cells, downregulating >900 human proteins during viral replication and degrading ≥133 proteins shortly after infection. The mechanism of degradation of most host proteins remains unresolved, and the functions of many viral proteins are incompletely characterised. We performed a mass spectrometry-based interactome analysis of 169 tagged, stably-expressed canonical strain Merlin HCMV proteins, and two non-canonical HCMV proteins, in infected cells. This identified a network of >3400 virus-host and >150 virus-virus protein interactions, providing insights into functions for multiple viral genes. Domain analysis predicted binding of the viral UL25 protein to SH3 domains of NCK Adaptor Protein-1. Viral interacting proteins were identified for 31/133 degraded host targets. Finally, the uncharacterised, non-canonical ORFL147C protein was found to interact with elements of the mRNA splicing machinery, and a mutational study suggested its importance in viral replication. The interactome data will be important for future studies of herpesvirus infection.

Introduction

Human cytomegalovirus (HCMV) persistently infects the majority of the worldwide population (Mocarski et al., 2013). Following primary infection under the control of a healthy immune system, a latent infection is established that persists lifelong (Reeves et al., 2005). In immunocompromised individuals, particularly transplant recipients and AIDS patients, virus reactivated from latency to induce lytic infection is capable of affecting almost any organ system and causing serious disease (Nichols et al., 2002). HCMV infection in utero is a leading cause of deafness and intellectual disability in newborns, affecting ~1/200 pregnancies (Mocarski et al., 2013).

Small-molecule disruption of critical virus-virus or virus-host protein interactions could provide novel therapeutic strategies. Indeed, disruption of interactions between antiviral restriction factors (ARFs) and viral antagonists can facilitate endogenous inhibition of infection (Nathans et al., 2008). Systematic characterisation of all viral protein interactions thus has important implications for antiviral therapy, and is particularly important for HCMV, for which only a few drugs are available.

HCMV encodes 170 canonical protein-coding genes (Gatherer et al., 2011), and a substantial number of non-canonical open reading frames (ORFs) that potentially encode additional proteins have been identified by ribosomal footprinting and proteomics (Nightingale et al., 2018; Stern-Ginossar et al., 2012). During productive infection in vitro, HCMV gene expression is conventionally divided into immediate-early, early and late phases over a replication cycle lasting ~96 hr. Five temporal classes of viral protein expression have been defined by measuring viral protein profiles over time (Weekes et al., 2014). Latent infection with HCMV occurs in a restricted range of cell types, and may involve a somewhat more limited range of viral gene expression (Goodrum and McWeeney, 2018; Schwartz and Stern-Ginossar, 2019). However, at least some viral proteins function similarly during both productive infection and latency. For example, UL138, which plays roles in the establishment and maintenance of latent infection, downregulates Multidrug Resistance-Associated Protein 1 (MRP1) during both phases of infection (Weekes et al., 2013; Weekes et al., 2014).

The functions of many canonical HCMV proteins remain poorly understood, and it is not yet clear how many, if any, non-canonical ORFs encode functional polypeptides. We have shown previously that >900 host proteins are downregulated >3 fold over the course of HCMV infection, with 133 proteins degraded in the proteasome or lysosome during the early phase (Nightingale et al., 2018; Weekes et al., 2014). However, it is not yet known which viral factors target these proteins, and certain proteins, including MHC class I molecules and natural killer cell ligands, can be targeted by more than one viral factor (Fielding et al., 2014; Hsu et al., 2015; van der Wal et al., 2002; Wilkinson et al., 2008).

Here, an examination of each canonical and a subset of non-canonical HCMV proteins in infected cells revealed an extensive network of >3400 high confidence virus-host and >150 virus-virus interactions. This provided insights into the functions of multiple uncharacterised or partly characterised viral proteins. The data enabled identification of individual viral factors that target 31 host proteins for degradation. Novel interactions between selected viral and host protein domains were also tested experimentally. In addition, the study provided the first evidence for a functional role for a non-canonical HCMV ORF in viral infection. The extensive interactome data generated in this study predicts viral proteins important in key cellular pathways, and may lead to the development of new antiviral therapeutics.

Results

Construction of the HCMV-host interactome

To build a global picture of all HCMV virus-host and virus-virus protein interactions, 170 stable cell lines were generated from immortalised primary human fetal foreskin fibroblasts (HFFF-TERTs), each expressing a single, canonical HCMV ORF with a C-terminal V5 tag to facilitate immunoprecipitation (IP). Two non-canonical ORFs, ORFL147C and ORFS343C, were also included on the basis of either high or low expression respectively, relative to all other viral ORFs detected previously by proteomics (Figure 1 – Figure Supplement 1A, Supplementary file 1A) (Fielding et al., 2017; Weekes et al., 2014). Prior to profiling by IP-mass spectrometry (IP-MS), expression of each tagged viral ‘bait’ protein was validated by immunoblotting (IB), MS or RT-qPCR, apart from UL136 which could not be detected by any method (Figure 1 – Figure Supplement 1B, Supplementary file 1B). To examine the full range of virus-virus interactions in addition to virus-host interactions, IP was performed in cells infected with Merlin strain HCMV at multiplicity of infection (MOI) of 2 for 60 hr. Merlin contains a full length genome and expresses all HCMV genes apart from UL128 and RL13. All detectable viral proteins are expressed at 60 hr post-infection (PI) with this strain (Weekes et al., 2014) (Figure 1—figure supplement 1E). A schematic and details of the IP-MS strategy are shown in Figure 1.

Figure 1 with 2 supplements see all

Download asset Open asset

Schematic of the IP strategy.

IP samples were generated and analysed in technical duplicate, using the method originally described in Huttlin et al. (2017); Huttlin et al. (2015) and discussed in detail in the Materials and methods section. For 153 baits with zero or one transmembrane (TM) region predicted by Uniprot, an NP40-based lysis buffer was used; for 18 baits with >1 TM region, a digitonin-based buffer was used, as this has previously been demonstrated to improve identifications of interacting proteins (‘prey’) (Babu et al., 2012) (Supplementary file 1B). Each dataset was scored separately using the CompPASS algorithm (Huttlin et al., 2015; Sowa et al., 2009) to better model detergent-specific variation in IP-MS background. Data reported for each prey protein in every IP include: (a) the number of peptide spectral matches (PSMs), averaged between technical replicates; (b) an entropy score, which compares the number of PSMs between replicates to eliminate proteins that are not detected consistently; (c) a z-score, calculated in comparison to the average and standard deviation of PSMs observed across all IPs; and (d) a normalised WD (NWD) score. The NWD score addresses whether (i) the protein is detected across all IPs, and (ii) whether it is detected reproducibly among replicates. It was calculated as described in Behrends et al. (2010) using the fraction of runs in which a protein was observed, the observed number of PSMs, the average and standard deviation of PSMs observed for that protein across all IPs, and the number of replicates (1 or 2) containing the protein of interest. NWD scores were normalised so that the top 2% earned scores of ≥1.0. Stringent filters were applied to remove inconsistent and low-confidence protein identifications across all IPs and thus minimise both false protein identifications and associations (Huttlin et al., 2015). These included: (a) a minimum PSM score of 1.5 (i.e. ≥3 peptides per protein across both replicates); (b) an entropy score of ≥0.75; and (c) an NWD or z-score in the top 2%. Previous studies have estimated a 5% false discovery rate when employing a similar strategy with a top NWD score cutoff of 2% (Sowa et al., 2009). Interactions passing these criteria are named ‘high confidence interacting proteins’ (HCIPs) (Supplementary file 2B), and were used in all subsequent analyses. For added stringency, the supervised learning algorithm CompPass Plus was employed. This additionally assessed batch variations, overall spectral counts, unique peptide counts and protein detection frequency. Shannon entropy quantified a protein’s consistency of detection across technical duplicate LC-MS analyses, removing inconsistent protein identifications (Huttlin et al., 2017). CompPass plus was developed for interactomes with ≥96 baits and in the present study was only applied to the 153 baits solubilized in NP40. Interactions that passed CompPass filters, had CompPass Plus p(Interactor)>0.75 and in which the prey was identified by ≥2 unique peptides were considered as very high confidence interacting proteins (VHCIPs). These are indicated in green shading in Supplementary file 2B. To facilitate global analysis of all data, and because digitonin-solubilised interactions were not analysed using CompPass plus, HCIPs as opposed to VHCIPs were examined for the remainder of this study. The identification of an interacting protein as a VHCIP nevertheless adds additional confidence that the interaction observed is likely to be genuine.

For HCMV UL120 and UL142, no interacting proteins passed the stringent filters employed. For seven further proteins, only the bait itself passed filtering, leaving 162 viral baits with ≥1 HCIP. In total, 3572 interactions were detected across all 162 baits, with a range of 1–174 interactions per bait, reflecting a scale-free degree distribution typical of protein interaction networks. The median number of interactions per bait was 9, similar to previously observed in the Bioplex 2.0 human interactome (Huttlin et al., 2015) (Materials and methods; Supplementary file 2A, Figure 1—figure supplement 2A). Data were validated from previously reported virus-virus and virus-host interactions described in BioGRID, IntAct, Uniprot, MINT and Virus Mentha (Figure 1—figure supplement 2B, Supplementary files 2–3) (Calderone et al., 2015; Chatr-Aryamontri et al., 2013; Licata et al., 2012; Orchard et al., 2014).

Systematic analysis of viral protein function

Systematic analysis of protein interactions can improve understanding of viral protein function. To analyse the functions of all viral proteins simultaneously, DAVID software (Huang et al., 2009) was employed to determine which pathways were enriched amongst the 3416 human proteins that interacted with viral baits (Figure 2 centre, Figure 2—figure supplement 1, Supplementary file 4A-B).

Figure 2 with 2 supplements see all

Download asset Open asset

Systematic analysis of interactome data predicts novel functions for viral proteins.

DAVID software with default settings (Huang et al., 2009) was applied to determine which pathways were enriched amongst all HCIPs in the interactome, in comparison to all human proteins as background. Benjamini-Hochberg adjusted p-values are shown as blue surrounds to each pathway enriched at p<0.05. Viral baits are linked to enriched pathways where > 33% of human interacting proteins belonged to a given pathway, and examples are shown around the outside of the figure. These examples are indicated in the central part of the figure by purple shading. For example, 6/9 (67%) human HCIPs for UL43 were part of the 14-3-3 protein family. Viral baits are shown as large turquoise circles, and interacting viral proteins as smaller turquoise circles. Members of enriched pathways are shown in orange or yellow (for NuRD complex and histone deacetylation, protein membership of both pathways is indicated by half-orange, half-yellow circles). Solid lines indicate interactions identified by this interactome, and dashed lines indicated interactions derived from human Bioplex 2.0 and subsequent unpublished data (Huttlin et al., 2017 and http://bioplex.hms.harvard.edu/downloadInteractions.php). Full data are shown in Supplementary file 4. As an alternative approach to highlight cellular functions that predominantly related to individual viral proteins, Figure 2—figure supplement 1 shows pathways with p<0.05 (after Benjamini-Hochberg adjustment) and for which > 33% of the identified cellular protein members of the pathway interacted with a given viral bait.

Nucleosome remodeling (NuRD) complex components were significantly enriched among HCMV-interacting proteins. The NuRD complex plays major roles in cellular chromatin remodeling, and is known to be co-opted by HCMV UL29 and UL38 to enhance expression of immediate-early genes (Savaryn et al., 2013; Terhune et al., 2010). The interaction of UL29 and UL38 in a complex with all components of NuRD was confirmed, in addition to p53 (Savaryn et al., 2013). UL29 was also found to interact with multiple human proteins that function in histone deacetylation, which had not been observed previously (Figure 2).

UL87, UL79, UL91 and UL95 are essential for viral replication and necessary for transcriptional activation of viral genes expressed with ‘true late’ kinetics. UL92 has a similar function, and it has been suggested that these five proteins may form one or more complexes that modulate RNA polymerase II activity (Isomura et al., 2011; Omoto and Mocarski, 2013; Omoto and Mocarski, 2014). Interactome data confirmed that UL87 interacted with UL79, UL91 and UL95 but did not detect an interaction with UL92. This latter observation, and in fact the lack of identification of any viral-viral UL92 interactions, may be explained by our finding that UL92 was one of the two least abundantly expressed viral proteins during HCMV infection (Supplementary file 1A, bottom). UL87 also interacted with all 12 components of the RNA polymerase II (RPII) complex and the associated protein RPII Associated Protein 2 (RPAP2) (Figure 2). The UL87-RPII interaction was anticipated by analogy to the orthologous RPII-interacting Epstein-Barr virus protein BcRF1, but had not previously been demonstrated. Interaction of UL87, UL95 and UL79 with the UL97 protein kinase was also novel.

Collectively, these confirmatory data indicate that the HCMV interactome has the power to predict new functions for uncharacterised or partly characterised viral proteins, particularly where a bait interacts with multiple protein components of the same pathway. For example, UL72 is a temporal protein profile 3 (Tp3)-class HCMV protein derived from deoxyuridine 5'-triphosphate nucleotidohydrolase (dUTPase) in other herpesviruses, but lacks dUTPase activity (Caposio et al., 2004; McGeehan et al., 2001). UL72 interacted with all 10 components of the CCR4-NOT (carbon catabolite repressor 4-negative on TATA) complex, which is a key regulator of gene expression from production of mRNAs in the nucleus to their degradation in the cytoplasm (Yi et al., 2018). The interaction between UL72 and CNOT2/CNOT7 was confirmed by co-IP (Figure 3A–B). It remains to be determined how UL72 modulates CCR4-NOT function.

Figure 3

Download asset Open asset

Validation of interactome data by co-IP.

(A) Co-IPs validating that UL72 interacts with CCR4-NOT Transcription Complex Subunits 7 and 2 (CNOT7 and CNOT2), conducted in HEK293T cells. For all experiments in this figure, left panels show an IB of 1–2% of input sample, and right panels shown an anti-V5 co-IP. Cells were transiently transfected with two plasmids, one expressing the C-terminally V5-tagged viral protein and the other expressing the C-terminally HA-tagged cellular prey. Bait proteins were detected with anti-V5, and prey with antibodies against CNOT7 or CNOT2 protein. Controls included GFP or the viral UL34 protein. CANX – calnexin loading control. This figure is representative of n = 1 experiment (CNOT2); n = 2 experiments (CNOT7). Expected sizes: CNOT7: 33 kDa; CNOT2: 52 kDa; CANX: 72 kDa; UL72: 44 kDa; UL34: 45 kDa. (B) Co-IPs validating that UL72 interacts with CNOT7 and CNOT2, conducted in HFFF-TERT cells overexpressing C-terminally V5-tagged UL72. Proteins were detected as described in (A). This figure is representative of n = 2 experiments (CNOT2); n = 1 experiment (CNOT7). Expected sizes: CNOT7: 33 kDa; CNOT2: 52 kDa; CANX: 72 kDa; UL72: 44 kDa; UL34: 45 kDa. (C) Co-IP validating the interaction between RL1 and CUL4A, conducted in HEK293T cells as described in (A), but with detection of CUL4A using anti-HA. This figure is representative of n = 4 experiments. Expected sizes: CUL4A: 77 kDa; RL1: 35 kDa; UL34: 45 kDa; CANX: 72 kDa. (D) HCMV UL71 interacted with multiple interferon-stimulated proteins, including TRIM22. (E) Co-IP validating the interaction between UL71 and TRIM22, conducted as described in (C). This figure is representative of n = 3 experiments. Expected sizes: TRIM22: 56 kDa; UL71: 40 kDa; UL34: 45 kDa; CANX: 72 kDa.

The hitherto uncharacterised viral UL145 protein is known to recruit the Cullin 4 E3 ligase scaffold and associated adaptor proteins, and to degrade helicase-like transcription factor (HLTF) (Nightingale et al., 2018). Interactome data suggested that all human proteins interacting with UL145 and the paralogous RL1 were part of the ubiquitin conjugation pathway (Supplementary file 2, Supplementary file 4), and furthermore that RL1 interacted with Cullin 4 (CUL4, Figure 2). The interaction with CUL4A was validated by co-IP (Figure 3C). Proteins that are degraded after binding RL1/CUL4 still require identification; it is possible that their abundance after degradation may have been insufficient to enable identification in this study. Multiple other HCMV proteins additionally interacted with elements of the ubiquitin transfer or conjugation pathways, including the inhibitor of apoptosis UL36, which bound the Cullin one scaffold, E3 ligase UBR5, and F-box component FBOX3. Similarly, DNA helicase/primase component UL102 interacted with E3 ligase RNF114 and E2 conjugating enzyme UBE2L6 (Figure 2 and Supplementary file 2).

The tegument protein UL71 has an essential function in the final steps of secondary envelopment leading to infectious viral particles, but is expressed with Tp3 kinetics, suggesting the possibility of a role earlier during infection (Dietz et al., 2018; Meissner et al., 2012; Weekes et al., 2014). UL71 interacted with multiple interferon-stimulated proteins (Figure 3D), including TRIM22, which restricts replication of HIV-1, influenza A and hepatitis B and C viruses (Lian and Sun, 2017). The UL71-TRIM22 interaction was validated by co-IP, suggesting that investigation of a putative innate immune role for UL71 will be important (Figure 3E).

In addition to characterising baits that interacted with multiple members of individual cellular pathways, an alternative approach identified pathways whose members interacted predominantly with single baits (Figure 2—figure supplement 1). The US28 G-protein coupled receptor (GPCR) functions in both lytic and latent HCMV infection via constitutive signaling to activate distinct intracellular pathways (Krishna et al., 2018). Here, US28 interacted with all quantified members of thick filament/muscle myosin complexes, namely myosin heavy and light chain components, a myosin binding protein and titin. This suggests an unanticipated role for US28 in processes such as regulation of the actin cytoskeleton or cytoskeletal remodeling (Wang et al., 2018). Other viral proteins may have novel functions modulating vesicular transport. For example, the US27 GPCR interacted with multiple components of the SNARE complex, whose primary function is to mediate vesicle fusion (Han et al., 2017). Envelope glycoprotein UL132 interacted with the AP-2 adaptor complex, which functions in clathrin-mediated endocytosis (Figure 2—figure supplement 1) (Collins et al., 2002).

To gain further insights into temporal regulation of protein-protein interactions, we determined which functions were enriched amongst human HCIPs for each of the five temporal classes of HCMV bait (Weekes et al., 2014). A clear relation to functions required at different stages of the viral life-cycle was observed (Figure 2—figure supplement 2A, Supplementary file 4C). For example, Tp1 and Tp2 protein HCIPs were enriched in NuRD complex members, proteins involved in histone deacetylation and proteins with SANT domains (which function in chromatin remodelling). Tp3 HCIPs were enriched in functions required for viral genomic replication and immune evasion, whilst Tp5 HCIPs were directed at intracellular trafficking and secretion (Figure 2—figure supplement 2A). For viral-viral protein interactions, two patterns emerged – (a) interaction of viral proteins within the same temporal class, or between adjacent classes; (b) interaction of proteins from the largest class (Tp5) with members of each of the five classes (Figure 2—figure supplement 2B, Supplementary file 4D). For example, Tp1 and Tp2 class proteins UL29 and UL38 interacted, as previously reported (Supplementary file 3, Figure 2). Tp1-class tegument proteins US23 and US24 interacted. The majority of Tp5 interactions were with other Tp5 proteins, 15/37 of which were tegument-tegument, capsid-capsid or tegument-capsid protein interactions (Figure 2—figure supplement 2B). Certain interactions between proteins in different temporal classes have also been reported; for example, between the Tp5 DNA polymerase accessory protein UL44 and Tp2 DNA polymerase UL54. Clearly, other novel interactions also exist between quite distinctly expressed proteins, for example between the functionally unknown Tp2-class membrane protein UL14 and two Tp5-class proteins: membrane protein UL121 and envelope glycoprotein UL4.

Association between functional domains revealed by protein-protein interactions

Certain domains perform related functions within diverse proteins, often via interactions with complementary structures. The function and interaction(s) of these domains can be predicted by analysing interactions between their parent proteins (Finn et al., 2014; Huttlin et al., 2015). Although domains that co-occur frequently do not necessarily interact directly, these associations can nevertheless provide insights into domain biology.

By mapping Pfam domains to every bait and prey protein in the interactome, it was possible to identify domain pairs that interact with unusual frequency (Figure 4A) (Finn et al., 2014). This correctly predicted that HCMV glycoprotein UL141 interacts with TNFR cysteine-rich domains (TNFR c6), which has been demonstrated for TNFRSF10B and predicted for TNFRSF10A (Nemčovičová et al., 2013). UL141 also interacted with TNFRSF10D as reported (Smith et al., 2013) and was found to interact with TNFRSF1A, suggesting that these interactions may also occur via the TNFR c6 domain (Figure 4A, Supplementary file 5B).

Figure 4 with 1 supplement see all

Download asset Open asset

Interaction between UL25 and NCK1 identified by domain association analysis.

(A) Table depicting significant associations between domains present in HCMV baits (top) and human or viral prey (side). Pfam domains were mapped onto every bait and prey protein in the interactome (Finn et al., 2014). The numbers of interactions emanating from proteins containing each domain were tallied individually, along with the numbers of interactions linking each observed domain pair. Contingency tables were then populated to relate domain associations. For each pair, Fisher’s exact test determined the likelihood of a non-random association. p values were adjusted for multiple hypothesis testing (Benjamini and Hochberg, 1995). Coloured boxes identify domain pairs that associate at a 1% false discovery rate (FDR). Red boxes indicate domain pairs from this analysis discussed in the text. Domain associations are only shown for domains occurring in at least two viral proteins. Supplementary file 5 shows the full underlying data. (B) All HCIPs for UL25 and a subset of HCIPs for UL26 (full data are shown in Figure 4—figure supplement 1). DAVID analysis identified that members of the C-terminal to LisH (CTLH) complex and COPII vesicle coat proteins were enriched among UL26 HCIPs (Figure 2—figure supplement 1). Domain association analysis suggested that interaction of UL26 with CTLH components may occur via interaction of the viral US22 domain with either cellular CLTH or LisH domains (Supplementary file 5). Dashed lines represent human-human interactions derived either from Bioplex 2.0 as described in Figure 2 or from curated or experimental data in the STRING database. CPSF - Cleavage and polyadenylation specificity factor. (C) Schematic of NCK1 and UL25 protein structures, indicating the position of point mutations or truncation for (D). (D) Co-IP demonstrating that the UL25 proline-rich C-terminal domain associates with the first NCK1 SH3 domain, conducted as described in Figure 3. HEK293T cells were transiently transfected with the indicated plasmids, one expressing the C-terminally V5-tagged viral protein and the other expressing C-terminally HA-tagged NCK1. These proteins were detected with anti-V5 and anti-HA. Mutations or truncations of each gene are indicated in the figure and in (C). GAPDH – loading control. This figure is representative of n = 3 experiments. Expected sizes: NCK1: 43 kDa; UL25: 74 kDa; UL26: 21 kDa; GAPDH: 36 kDa.

Domain analysis predicted that certain Herpes pp85 proteins interact with host SH3 domains. Underlying interactome data suggested that the viral tegument pp85 phosphoprotein UL25 interacted with SH3 domain-containing proteins NCK1 (Non-catalytic region of protein tyrosine kinase 1) and NCK2. Additionally, UL25 interacted with two other human proteins and the viral tegument protein UL26. UL26 had more diverse targets, including NCK2 but not NCK1 (Figure 4A–B, Supplementary file 2, Supplementary file 5).

SH3 domains are known to interact with proline-rich regions (Kurochkina and Guha, 2013). UL25 has a proline-rich C-terminus, and NCK1 has three N-terminal SH3 regions. A series of mutations or truncations (Figure 4C) suggested that the UL25 C-terminus interacts with the first NCK1 SH3 domain alone, validating and extending the prediction from domain association analysis (Figure 4D).

NCK1 is a multifunctional cytoplasmic adaptor protein with known roles in signal transduction from receptor tyrosine kinases, cytoplasmic remodeling via regulation of actin polymerization, apoptosis and the DNA damage response (Buvall et al., 2013; Keyvani Chahi et al., 2016; Ngoenkam et al., 2014). Interaction of UL25 with NCK1 may thus fulfill a variety of functions. One possibility may include inhibition of immune synapse formation. HCMV UL135 is known to dispel association between F-actin filaments in target cells and the immune synapse (Stanton et al., 2014). UL25 might regulate actin polymerisation in a complementary manner in order to achieve a similar aim.

Viral proteins that degrade cellular prey

We previously described a multiplexed approach for discovering proteins that have innate immune function on the basis of their active degradation by the proteasome or lysosome during the early phase of HCMV infection. Using three orthogonal proteomic/transcriptomic screens to quantify protein degradation, 133 proteins were shown to be degraded in the proteasome or lysosome during early phase infection, which were enriched in novel antiviral restriction factors (Nightingale et al., 2018). To facilitate the mapping of viral gene functions, a final screen employed a panel of HCMV mutants, each deleted in contiguous gene blocks dispensable for virus replication in vitro. However, this screen did not confidently identify the genetic loci that targeted 121/133 degraded proteins. Furthermore, even for 12/133 confidently identified loci, characterization of which individual viral genes degraded cellular targets often proved arduous. For example, to identify UL145 as the gene within the UL133-UL150 block that targeted HLTF to the proteasome, 19 single viral gene deletion mutants required testing (Nightingale et al., 2018).

Interactome data revealed viral baits for 31/133 degraded prey (Supplementary file 6). The ubiquitin E3 ligase ITCH (Itchy E3 Ubiquitin Protein Ligase) is known to be targeted for degradation by viral UL42 (Koshizuka et al., 2016). In addition to ITCH, UL42 interacted with Neural Precursor Cell Expressed, Developmentally Down-Regulated 4 (NEDD4)- family E3 ligases NEDD4 and NEDD4-like (NEDD4L), which were degraded during early HCMV infection (Figure 5A–B) (Nightingale et al., 2018). These interactions were validated by co-IP using both C- and N-terminally V5 tagged UL42, and UL42 was shown to be sufficient for degradation of NEDD4 (Figure 5D–E, Figure 5—figure supplement 1). UL42 protein has not been detected in any of our previous proteomic studies (Fielding et al., 2017; Nightingale et al., 2018; Weekes et al., 2014), however UL42 transcript was quantified by Stern-Ginossar et al (Stern-Ginossar et al., 2012). Although expression of this transcript peaked at 72 hr of infection, it was nevertheless clearly detectable at early time points, suggesting that UL42 protein is likely to be expressed contemporaneously with degradation of NEDD4 and NEDD4L (Figure 5C). The route of degradation of each of the UL42 targets requires further characterisation. MG132 and leupeptin both inhibited degradation of each protein (Figure 5B), which may correspond to the known effects of MG132 on lysosomal cathepsins in addition to the proteasome (Wiertz et al., 1996), or effects of leupeptin on certain proteasomal proteases in addition to lysosomal proteases.

Figure 5 with 1 supplement see all

Download asset Open asset

UL42 identified as a hub of E3 destruction by a combination of interactome and degradation data.

US10 interacts with LRFN3, which is rapidly downregulated from the PM during HCMV infection. (A) High-confidence cellular interactors of UL42. 57% of UL42 interactors exhibited ubiquitin protein transferase activity (Figure 2, counting NEDD4 only once). UL42 interacted with NEDD4, NEDD4 isoform four and NEDD4L, in addition to HECT, C2 and WW Domain Containing E3 Ubiquitin Protein Ligases HECW1 and 2. NEDD4-4: isoform 4 of NEDD4. (B) ITCH, NEDD4 and NEDD4L are degraded during early HCMV infection (data from Nightingale et al., 2018). Protein degradation was measured using three orthogonal tandem mass tag (TMT)-based proteomic screens. The first measured protein abundance throughout early infection in the presence or absence of inhibitors of the proteasome or lysosome. The second compared transcript and protein abundance over time to distinguish between degraded and transcriptionally regulated proteins. The third employed an unbiased global pulse-chase to compare the rates of protein degradation during HCMV infection against mock infection (NEDD4 and NEDD4L were not quantified in this latter screen). Benjamini-Hochberg adjusted Significance A values were used to estimate p-values in the top panels; **p<0.005, ***p<0.0005. Mean and SEM are shown for transcript quantitation (n = 3) in the middle panels. A p-value for the difference between rates of degradation is shown in the bottom panel; ***p<0.0005. All calculations and statistics are described in Nightingale et al. (2018). (C) UL42 transcript is expressed contemporaneously with NEDD4 and NEDD4L degradation. Protein profiles from Figure 5B (red colour, Nightingale et al., 2018) are overlaid with a UL42 transcript profile (blue colour, Stern-Ginossar et al., 2012). UL42 transcript was not detected in our previous RNAseq analysis (Nightingale et al., 2018). (D) Validation of interaction between UL42 and NEDD4 (left panel) and NEDD4L (right panel) by co-IP, conducted as described in Figure 3. HEK293T cells were transiently transfected with the indicated plasmids, one expressing the C-terminally V5-tagged viral protein and the other expressing C-terminally HA-tagged NEDD4 or NEDD4L. These proteins were detected with anti-V5 and anti-HA. This figure is representative of n = 2 experiments (NEDD4); n = 1 experiment (NEDD4L). Expected sizes: NEDD4: 104–149 kDa; NEDD4L: 96–111 kDa; UL42: 14 kDa; UL34: 45 kDa; CANX: 72 kDa. (E) UL42 was sufficient to degrade NEDD4. HFFF-TERTs expressing UL42 or controls were lysed and immunoblotted as indicated. Anti-NEDD4 was used to detect endogenous NEDD4. This figure is representative of n = 1 experiment. Expected sizes: NEDD4: 104–149 kDa; UL42: 14 kDa; UL34: 45 kDa; CANX: 72 kDa. (F) LRFN3 was rapidly downregulated from the PM during HCMV infection, in the presence of upregulated transcript (mean and SEM are shown for transcript quantitation (n = 3); data are from Nightingale et al., 2018). (G) HCIPs of US10, including LRFN3. (H) Validation of the interaction between US10 and LRFN3 by co-IP, conducted as described in Figure 3. Prey were detected using anti-HA. This figure is representative of n = 2 experiments. Expected sizes: LRFN3: 66 kDa; US10: 21 kDa; UL34: 45 kDa; CANX: 72 kDa.

To test the sensitivity of the interactome for detecting interactions with weakly-expressed prey, cell surface adhesion molecule Leucine Rich Repeat And Fibronectin Type III Domain Containing 3 (LRFN3) was examined. This protein was previously quantified by a single peptide in samples enriched for plasma membrane (PM) proteins only (Nightingale et al., 2018; Weekes et al., 2014). LRFN3 was rapidly downregulated from the PM, accompanied by upregulation of transcript over the same period, suggesting either degradation or retention within the infected cell (Figure 5F). Only the ER-resident transmembrane glycoprotein US10 interacted with LRFN3, and this was validated by co-IP (Figure 5G–H). US10 may downregulate this cell surface molecule in a manner similar to the reported degradation of HLA-G (Park et al., 2010).

ORFL147C is a novel viral protein required for viral replication

It had hitherto been unclear whether any of the 604 HCMV ORFs identified by ribosome profiling (RP-ORFs) encoded functional polypeptides (Stern-Ginossar et al., 2012). The abundance of the two RP-ORFs examined in this interactome was in the same range as canonical HCMV proteins, with ORFL147C present at ~25 x lower copy number than the most abundant tegument protein UL83 and ~275 x higher copy number than the membrane protein US18. ORFS343C was ~3 x more abundant than US18 (Figure 1—figure supplement 1A). ORFL147C had 80 human HCIPs and ORFS343C 23 human HCIPs (Supplementary file 2).

The coding sequence of ORFL147C is oriented parallel to the 5’ end of UL56 (Figure 6A), which is a canonical gene encoding a subunit of terminase. ORFL147C is expressed with Tp4 kinetics (Figure 6B). Enrichment analysis of ORFL147C HCIPs suggested functions in RNA binding, mRNA splicing or transcription (Figure 6C–D). We validated the interaction of ORFL147C with Muscleblind Like Splicing Regulator 1 (MBNL1) and CUG Triplet Repeat RNA-Binding Protein 1 (CELF1), two proteins with roles in mRNA splicing and RNA binding (Figure 6E).

Figure 6 with 1 supplement see all

Download asset Open asset

HCMV ORFL147C interactors function in RNA binding, splicing and transcription.

(A) Diagram of the ORFL147C coding sequence and relation to neighbouring viral genes. (B) Expression kinetics of ORFL147C, taken from Weekes et al. (2014). Data was taken from experiments WCL2 and WCL3, enabling assessment of 24, 48, 72 and 96 hr time points in biological duplicate. Error bars show range. Mean expression was normalised to a maximum of 1. (C) Enrichment analysis of 80 human HCIPs interacting with ORFL147C. (i) DAVID analysis using all human proteins as background. Benjamini-Hochberg adjusted p-values are shown. (ii) Reactome database analysis (Fabregat et al., 2018) showing results with a minimum of 4 entities per enriched pathway. Full details of interacting proteins are given in Supplementary file 7A-B. (D) A subset of HCIPs for ORFL147C (full data are shown in Figure 6—figure supplement 1). Dashed lines represent human-human interactions derived from Bioplex 2.0 as described in Figure 2, in addition to known interactions that had been experimentally determined or derived from curated data as part of the STRING database. (E) Validation of interaction between ORFL147C and MBNL1 and CELF1 by co-IP, conducted as described in Figure 3. HEK293T cells were transiently transfected with the indicated plasmids, one expressing the C-terminally V5-tagged viral protein and the other expressing C-terminally HA-tagged MBNL1 or CELF1. These proteins were detected with anti-V5 and anti-HA. GAPDH – calnexin loading control. This figure is representative of n = 1 experiment. Expected sizes: MBNL1: 33–42 kDa; CELF1: 50–55 kDa; ORFL147C: 50 kDa; UL25: 74 kDa; GAPDH: 36 kDa. (F) Growth analysis of an ORFL147C-deficient recombinant. The ORFL147C and wild-type viruses were HCMV strain Merlin recombinants in which the enhanced GFP (eGFP) gene was cloned as a 3’-terminal fusion with immediate-early gene UL36, with a self-cleaving P2A peptide releasing the reporter following synthesis. Insertion of GFP does not impede UL36 function in such recombinants (Nightingale et al., 2018). Cells were infected at a MOI of 1, and supernatants harvested and titred every two days. Cells were infected in biological duplicates, and each supernatant was titred in technical duplicates. Mean values are shown, and error bars represent SD. p-values for a difference between wild-type and ORFL147C-deficient virus were estimated using a two-tailed Student’s t-test. ***p<0.001, ****p<0.0001. This figure is representative of n = 2 experiments. All data for this figure are also shown in Figure 6—source data 1. (G) ORFL147C protein is not expressed during infection with the ORFL147C-deficient recombinant (MOI = 2, 48 hr post infection). Viral protein expression was analysed using tandem mass tag-based proteomics as previously described (Nightingale et al., 2018). ORFL147C protein was measured at the same level as during mock infection in cells infected with the ORFL147C-deficient recombinant, attributable to noise. All data for this figure are also shown in Figure 6—source data 2.

Figure 6—source data 1 Growth analysis of an ORFL147C-deficient recombinant.: https://cdn.elifesciences.org/articles/49894/elife-49894-fig6-data1-v2.xlsx
Download elife-49894-fig6-data1-v2.xlsx
Figure 6—source data 2 Tandem mass tag-based proteomics analysis of ORFL147C protein expression.: https://cdn.elifesciences.org/articles/49894/elife-49894-fig6-data2-v2.xlsx
Download elife-49894-fig6-data2-v2.xlsx

To test whether ORFL147C plays an important role in viral replication, possibly via a splicing or transcriptional mechanism, an HCMV recombinant was generated in which the three most N-terminal methionine residues in ORF147C were mutated without modifying the coding sequence of UL56. The growth of ΔORFL147C virus was significantly impaired, suggesting that ORFL147C plays an important functional role during viral infection (Figure 6F–G). The large HCIP network for ORFL147C suggests that various mechanisms underlying this observation need to be examined; it is as yet unclear whether splicing or transcriptional effects are important.

Discussion

In the present study, we report the largest host-pathogen interactome to date and the first comprehensive interactome map for a DNA virus in infected cells. This has suggested functions and domain associations for multiple uncharacterised or partly characterised viral proteins, in addition to providing evidence that the non-canonical HCMV proteins ORFL147C and ORFS343C may be functional. The searchable database provided details virus-virus and virus-host interactions for 162/171 HCMV proteins, and will be of significant value in future studies of HCMV and other herpesviruses.

Different herpesviruses exhibit certain common functions (Mocarski Jr, 2007). A previous study identified 564 human HCIPs of Kaposi’s sarcoma-associated herpesvirus (KSHV) (Davis et al., 2015). Comparison of HCMV and KSHV interactomes revealed that baits from both viruses interacted with 176 identical human prey, including RNA Pol II, CCR4-NOT and CTLH components, and elements of the ubiquitin conjugation pathway. It will be important in future studies to determine which of these common functions are mediated by orthologous proteins, and which by distinct viral mechanisms. Conversely, certain HCMV prey did not interact with KSHV baits, including mRNA splicing machinery components (Figure 7). Comparisons with interactomes from additional herpesviruses when generated will help to delineate functions exhibited by all herpesvirus genera, from those more specific to individual viruses or viral subfamilies.

Figure 7

Download asset Open asset

Overlap in functions targeted by different viruses.

(A) DAVID analysis of pathway enrichment among 176 HCIPs that interacted both with HCMV baits (this study) and KSHV baits (Davis et al., 2015), in comparison to all human proteins as background. Benjamini-Hochberg adjusted p-values are shown for each pathway. Full details of interacting viral and host proteins are given in Supplementary file 7A. (B) DAVID analysis of pathway enrichment among HCIPs that only interacted with HCMV but not KSHV baits, in comparison to all human proteins as background. As the KSHV interactome was performed in HEK293T cells as opposed to HFFFs, the list of HCMV HCIPs was first filtered to include proteins that were clearly detectable in HEK293Ts, using the list of ~50,000 unfiltered bait-prey interactions from KSHV to indicate protein expression (Davis et al., 2015). Subsequently, both high confidence interacting prey of KSHV baits, and first degree interactors of these prey from the human interactome, were excluded (Huttlin et al., 2017), to leave a list of proteins that only interacted with HCMV. Benjamini-Hochberg adjusted p-values are shown for each pathway. Full details of interacting viral and host proteins are given in Supplementary file 7B.

The combination of interactome data generated in the present study with our previous screens of protein degradation during early HCMV infection (Nightingale et al., 2018) identified the viral UL42 protein as a hub of degradation for multiple ubiquitin E3 ligases, and predicted novel interactions between viral baits and 29 other degraded cellular prey. More broadly, we discovered that HCMV devotes multiple proteins to interactions with the ubiquitin conjugation pathway, with 18 viral proteins interacting with two or more E3 ligases (defined in Medvar et al., 2016) and 51 viral proteins interacting with one or more E3 ligase. Details of such interactions can potentially identify viral mechanisms of cellular protein degradation. For example, UL25 interacted with the adaptor protein WD Repeat Domain 26 (WDR26), which can recruit substrates to the Cullin-4 RING ubiquitin ligase family (Higa et al., 2006). UL25 interacted with UL26, which itself interacted with 9 out of 10 members of the CTLH complex, a homologue of the yeast glucose-induced degradation-deficient machinery. This complex has inherent E3 ligase activity, but so far substrates have not been well defined (Francis et al., 2013; Salemi et al., 2017). Finally, UL26 also interacted with other ligases and scaffolds, such as Cullin three and SMAD Specific E3 Ubiquitin Protein Ligase 2 (SMURF2). Future work is likely to identify whether UL25 or UL26 prey are degraded, and which of these cellular pathways are employed.

The present study also highlights other viral ‘hubs’ of protein degradation. For example, HCMV UL20 was previously found to be rapidly degraded, with the suggestion it may target unidentified cellular proteins to lysosomes (Jelcic et al., 2011). Here, we identify candidate cellular targets. For example, UL20 interacted with Interleukin 6 Signal Transducer (IL6ST), the neonatal Fc receptor (FCGRT), Ephrin A2 (EPHA2), and Interferon Gamma Receptor 1 (IFNGR1), all of which we have previously shown are rescued from degradation by application of the lysosomal protease inhibitor Leupeptin. Interestingly, all four proteins were also rescued by targeted deletion of members of the viral US12-US21 family of paralogous genes (Fielding et al., 2017). This suggests that there may be cooperativity between the US12-US21 proteins and UL20, possibly with UL20 acting in a final common pathway.

All systematic interactomes of this type include false discoveries and fail to detect certain genuine interactions. However, a particular advantage of considering multiple interactions simultaneously in comparison to isolated IP-MS experiments is a much lower false discovery rate (estimated ~5%), as non-specific interacting proteins can be excluded because they are commonly identified in multiple different IPs (Sowa et al., 2009). The present study also identified a subset of VHCIPs by employing two distinct filtering strategies, which will assist future investigations based on our data. It is difficult to estimate a true false negative rate, since there is no gold standard for assessing true interactions, and the published literature also suffers from false discoveries. One factor that may contribute to missed identifications is the abundance of the prey protein. The present study clearly has the ability to identify some interacting proteins present at low cellular abundance, exemplified by identification of the interaction between US10 and LRFN3. LRFN3 was below the limit of detection in two unbiased quantitative proteomic studies of >8000 proteins from whole cell lysates of HFFFs (Supplementary file 1C). However, 36% of previously described interactions that were not identified in the present study were also unquantified in whole cell lysates (Supplementary file 3), suggesting that protein abundance may play a significant role in interaction discovery. Furthermore, degradation of human prey proteins during HCMV infection may also impact the limit of detection by MS. For example, although RL1 interacted with the Cullin four scaffold and two associated proteins, no other high confidence RL1 prey were identified. It will therefore be important to repeat this interactome in the presence of lysosomal and proteasomal inhibition to identify such targets. Additionally, for future investigations of our data, validation of interactions in which the prey protein has low cellular abundance as indicated in Supplementary file 2B may be best performed by overexpression studies as opposed to attempts to co-IP the endogenous protein.

Overexpression of each bait throughout the course of infection may have led to temporal dysregulation of the expression of other viral proteins, and may have facilitated interactions that would usually commence earlier or later than 60 hr of infection. However, as 153/153 quantified viral ORFs were expressed at 60 hr (Weekes et al., 2014), the observed interactions should occur at this phase of infection even if either bait or prey protein or both were not maximally expressed. Stable overexpression of the viral bait might have enabled false positive interactions. However, certain proteins endogenously expressed by HCMV are already under the control of strong promoters (Mocarski et al., 2013). Indeed, the abundance of certain stably expressed proteins may actually have been lower than the abundance of the same proteins expressed during HCMV infection. From our IBAQ analysis of host and viral protein abundance averaged across 24, 48 and 72 hr of HCMV infection, the most abundant viral protein (UL83) was expressed ~2.4 fold more than the most abundant host protein (Galectin-1), and the least abundant viral protein ~62 fold more than the least abundant host protein (Supplementary file 1A, Supplementary file 1C), suggesting that the range of expression of viral proteins was already shifted towards the higher end of host protein expression. Prior human interactome studies have found no correlation between bait protein expression and the number of HCIPs (Sowa et al., 2009). Alternative strategies to conduct an interactome study would also suffer from potential confounding issues. For example, introduction of a tag into the viral genome prior to or after each coding sequence may facilitate expression of the bait at the same time and level as during infection with unmodified virus. However, due to the occurrence of polycistronic transcription of viral genes and overlapping viral ORFs (Stern-Ginossar et al., 2012), introduction of a tag may disrupt expression of neighbouring genes.

A large number of noncanonical ORFs were identified by ribosome profiling as potentially being translated (RP-ORFs, Stern-Ginossar et al., 2012), and 13 novel ORFs from a six-frame translation of the HCMV genome sequence were recognised as being represented in MS data (6FT-ORFs, Nightingale et al., 2018). However, these studies produced no evidence that any of these ORFs encode functional proteins. The present study identified three RP-ORFs and 2/13 6FT-ORFs as interactors of canonical HCMV proteins, and identified seven additional interacting 6FT-ORFs for the first time. There is thus a case for functional investigations of a modest number of additional ORFs, and initial prediction of these functions can be achieved by interaction analysis. For example, although the precise function of ORFL147C remains to be determined, we validated interactions with proteins involved in mRNA splicing including MBNL1 and CELF1. Other interactors with roles in RNA binding, such as Ribonucleotide PTB-binding 1 (RAVER1) modulates alternative splicing events. Spliced transcripts have long been recognized from HCMV at all times post infection (Rawlinson and Barrell, 1993), and more recently up to 100 splice junctions have been identified (Balázs et al., 2017; Gatherer et al., 2011; Stern-Ginossar et al., 2012).

Only three drugs are currently available to treat HCMV infection, and all suffer from significant side effects and the threat of the development of resistance. In the context of the increasing frequency of transplantation, innovative therapeutic strategies are required. The identification of key interactions in virus-virus or virus-host protein complexes may be important in this regard, since small molecule inhibitors may be able to disrupt these interactions or restore endogenous antiviral restriction by preventing host protein degradation (Cen et al., 2010; Nathans et al., 2008; Pery et al., 2015). To identify bait-prey pairs amenable to straightforward therapeutic interruption, it is desirable to identify factors targeted by a single viral protein, for example members of the CNOT complex by UL72. In addition to the interaction between UL72 and individual CNOT members, CNOT effector function could also be an antiviral target, for example employing inhibitors of the CNOT7 deadenylase (Maryati et al., 2014). Ideally, similar interactions involving several distinct pathways might be targeted simultaneously to inhibit viral replication in a way that is refractory to resistance. As an additional strategy, the recent identification of putative ligands for the viral GPCRs may facilitate approaches to targeting cytotoxins exclusively to infected cells (Krishna et al., 2017). These considerations illustrate the potential of the interactome data in the present study for identifying biologically important protein-protein interactions and developing antiviral therapies based on their disruption.

Materials and methods

Key resources table

Reagent type	Designation	Source or reference	Identifiers	Additional information
Strain, strain background (HCMV)	HCMV Merlin	Stanton et al., 2010	RCMV1111
Strain, strain background (HCMV)	HCMV Merlin UL36-GFP deltaORFL147C	This paper	RCMV2697	Available from Dr Michael Weekes’ lab, University of Cambridge
Strain, strain background (HCMV)	HCMV Merlin UL36-GFP	Nightingale et al., 2018	RCMV2582
Strain, strain background (Escherichia coli)	E. coli. (α-Select Silver Competent Cells)	Bioline	Cat#BIO-85026
Cell line (Homo-sapiens)	HFFF immortalised with human telomerase (HFFF-TERT)	McSharry et al., 2001
Cell line (Homo-sapiens)	Human Embryonic Kidney 293 T cells	Menzies et al., 2018	ATCC Cat#CRL-3216, RRID:CVCL_0063
Antibody	Anti-V5 Agarose Affinity Gel	Sigma-Aldrich	Cat#A7345; RRID:AB_10062721	(30 µl/mL)
Antibody	Mouse monoclonal anti-GAPDH	R and D Systems	Cat#MAB5718; RRID:AB_10892505	(1:10.000)
Antibody	Rabbit polyclonal anti-Calnexin	LifeSpan Biosciences	Cat#LS-B6881; RRID:AB_11186721	(1:10.000)
Antibody	Rabbit monoclonal anti-HA (C29F4)	Cell Signaling Technologies	Cat#3724S; RRID:AB_1549585	(1:1000)
Antibody	Mouse monoclonal anti-V5	Thermo	Cat#R960-25; RRID:AB_2556564	(1:5000)
Antibody	Rabbit polyclonal anti-CNOT2	Novus Biologicals	Cat#NBP2-56034; RRID:AB_2801658	(1:1000)
Antibody	Rabbit monoclonal anti-CNOT7	Abcam	Cat#ab195587; RRID:AB_2801659	(1:1000)
Antibody	Mouse monoclonal anti-NEDD4	R and D Systems	Cat#MAB6218; RRID:AB_10920762	(1:1000)
Antibody	IRDye 680RD goat anti-mouse IgG	LI-COR	Cat#925–68070, RRID:AB_2651128	(1:10.000)
Antibody	IRDye 800CW goat anti-rabbit IgG	LI-COR	Cat#925–32211, RRID:AB_2651127	(1:10.000)
Antibody	IRDye 680RD goat anti-rabbit IgG	LI-COR	Cat#926–68071; RRID:AB_10956166	(1:10.000)
Antibody	IRDye 800CW goat anti-mouse IgG	LI-COR	Cat#926–32210; RRID:AB_621842	(1:10.000)
Antibody	Human TruStain FcX	BioLegend	Cat#422302; RRID:AB_2818986	1:20
Recombinant DNA reagent	pHAGE-pSFFV	Nightingale et al., 2018
Recombinant DNA reagent	pDONR223	Nightingale et al., 2018
Recombinant DNA reagent	pDONR221-MBLN1	Harvard PlasmID	Cat#HsCD00079833
Recombinant DNA reagent	pDONR221-CUGBP1	Harvard PlasmID	Cat#HsCD00039403
Recombinant DNA reagent	pOTB7-CUL4A	Harvard PlasmID	Cat#HsCD00325140
Recombinant DNA reagent	pCMV-SPORT6-NEDD4L	Harvard PlasmID	Cat#HsCD00337956
Recombinant DNA reagent	pENTR223-NCK1	Harvard PlasmID	Cat#HsCD00370605
Recombinant DNA reagent	pDONR223-CNOT2	Harvard PlasmID	Cat#HsCD00080019
Recombinant DNA reagent	pHAGE-CNOT7	Harvard PlasmID	Cat#HsCD00453329
Recombinant DNA reagent	PHAGE-P-CMVt-N-HA Nedd4 wt	Addgene	Cat#24124
Recombinant DNA reagent	pDONR221-LRFN3	Harvard PlasmID	Cat#HsCD00041564
Sequence-based reagent	M13-F	GENEWIZ	PCR primers	GTAAAACGACGGCCAG
Sequence-based reagent	M13-R	GENEWIZ	PCR primers	CAGGAAACAGCTATGAC
Sequence-based reagent	pHAGE-pSFFV-Seq	This paper	PCR primers	CGCGCCAGTCCTCCGATTG
Sequence-based reagent	GAW-CMVp-F	This paper	PCR primers	GGGACAAGTTTGTACAAAAAAGCAGCTGAAGACACCGGGACCGATC
Sequence-based reagent	attB2-V5-R	This paper	PCR primers	GGGGACCACTTTGTACAAGAAAGCTGGGTTTACGTAGAATCAAGACCTAGGAGC
Peptide, recombinant protein	V5 Epitope Tag	Alpha Diagnostic International	Cat#SP-59199–5
Peptide, recombinant protein	Trypsin	Promega	Cat#V5111
Commercial assay or kit	BCA Protein Assay Kit	Thermo Fisher	Cat#23227
Commercial assay or kit	Micro BCA Protein Assay Kit	Thermo Fisher	Cat#23235
Commercial assay or kit	RNeasy Mini Kit	Qiagen	Cat#74104
Commercial assay or kit	Empore SPE Disks	Supelco	Cat#66883 U
Commercial assay or kit	GoScript Reverse Transcriptase kit	Promega	Cat#A5001
Commercial assay or kit	Power SYBR Green PCR Master Mix	Thermo Fisher	Cat#4367659
Commercial assay or kit	Gateway BP Clonase II Enzyme Mix	Invitrogen	Cat#56481
Commercial assay or kit	Gateway LR Clonase Enzyme Mix	Invitrogen	Cat#56484
Chemical compound, drug	Dexamethasone	Sigma-Aldrich	Cat#D4902
Chemical compound, drug	DL-Dithiothreitol	Sigma-Aldrich	Cat#43815–1G
Software, algorithm	‘MassPike’, a Sequest-based software pipeline for quantitative proteomics.	Professor Steven Gygi’s lab, Harvard Medical School, Boston, USA.
Software, algorithm	SEQUEST	Eng et al., 1994
Software, algorithm	DAVID software	https://david.ncifcrf.gov/	DAVID, RRID:SCR_001881
Software, algorithm	Reactome software	https://reactome.org/	Reactome, RRID:SCR_003485
Software, algorithm	Image Studio Lite	LI-COR	Ver. 5.2; Image Studio Lite, RRID:SCR_013715
Software, algorithm	Cytoscape	The Cytoscape Consortium	Ver 3.7.1; Cytoscape, RRID:SCR_003032
Software, algorithm	DNASTAR Lasergene - SeqBuilder	DNASTAR, Inc	Ver. 12; DNASTAR: Lasergene Core Suite, RRID:SCR_000291
Software, algorithm	FlowJo	FlowJo	Ver. 10; FlowJo, RRID:SCR_008520
Software, algorithm	CompPass	Sowa et al., 2009
Software, algorithm	CompPass Plus	Huttlin et al., 2015
Other	Orbitrap Fusion Mass Spectrometer	ThermoFisher Scientific	Cat#IQLAAEGAAP FADBMBCX	Instrument
Other	Orbitrap Fusion Lumos Mass Spectrometer	ThermoFisher Scientific	Cat#IQLAAEGAAP FADBMBHQ	Instrument
Other	Raw Mass Spectrometry Data Files	This paper	ProteomeXchange Consortium via the PRIDE partner repository with dataset identifier PXD014845.	Raw data

Share this article

Cite this article

Schematic of the IP strategy.

Systematic analysis of interactome data predicts novel functions for viral proteins.

Validation of interactome data by co-IP.

Interaction between UL25 and NCK1 identified by domain association analysis.

UL42 identified as a hub of E3 destruction by a combination of interactome and degradation data.

HCMV ORFL147C interactors function in RNA binding, splicing and transcription.

Figure 6—source data 1

Figure 6—source data 2

Overlap in functions targeted by different viruses.

Author details

Luis V Nobre

Contribution

Competing interests

Katie Nightingale

Contribution

Competing interests

Benjamin J Ravenhill

Contribution

Competing interests

Robin Antrobus

Contribution

Competing interests

Lior Soday

Contribution

Competing interests

Jenna Nichols

Contribution

Competing interests

James A Davies

Contribution

Competing interests

Sepehr Seirafian

Contribution

Competing interests

Eddie CY Wang

Contribution

Competing interests

Andrew J Davison

Contribution

Competing interests

Gavin WG Wilkinson

Contribution

Competing interests

Richard J Stanton

Contribution

Competing interests

Edward L Huttlin

Contribution

Competing interests

Michael P Weekes

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organisms