Cell membrane glycan contents are biochemical factors that constitute a kinetic barrier to viral particle uptake in a protein-nonspecific manner

  1. Yoshihisa Kaizuka  Is a corresponding author
  2. Rika Machida
  1. National Institute for Materials Science, Japan
7 figures and 4 additional files

Figures

Figure 1 with 1 supplement
List of glycoproteins in the human genome: predictions and measurement of glycosylation, and virus infection inhibition assays using sample proteins from the list.

(A) Schematic of inhibition of virus infection by membrane glycoproteins. (B) The number of membrane proteins and predicted glycosylated proteins in human genome from UniProt. (C) The number of predicted glycosylation sites per the number of amino acid sequence of ectodomain for 2515 membrane associated proteins, plotted along with the number of ectodomain amino acid sequence. Color indicates the measured rate for glycosylation per molecule (PNA/mol) per amino acid. 0T, 4T, and 14T indicate truncation mutants of MUC1 that contain 0, 4, and 14 tandem repeat sequences, respectively. (D) Flow cytogram for the binding of Alexa Fluor 647 labeled PNA to HEK293T cells expressing MUC1(42 tandem repeats) tagged with SNAP surface 488 and the linear regression of the data to the reaction model (red dashed line, see the method section for details). (E) Relations of the measured PNA/mol and the number of predicted glycosylation sites for the indicated molecules. (F) SARS-CoV2-PP infection assay in HEK293T cells expressing ACE2, TMPRSS2, and each of designated membrane protein. Dots were measured values of the integral of GFP expressions from infected viruses in those samples adjusted by the total ACE2 expressions at the time of infection and were plotted along with the mean density of membrane protein at the time of infection. Red lines indicate learned predicted infection rates mean from Bayesian hierarchical inference based on sigmoidal function, and purple area represents one sigma below and above the red lines. (G) Relations between the measured rate for glycosylation per molecule (PNA/mol) and molecular specific IC50 density in sigmoidal inhibitory function inferred from Bayesian hierarchical modeling in F (σ_IC50). (H) Relations between σ_IC50 and estimated molecular weight including glycans in the experimental system. (I) Purification and analysis of recombinant proteins, non-glycosylated (bacterial or B) and glycosylated (G) MUC1 (14TR) tagged with SNAP surface 488. Coomassie Brilliant Blue stained (left), glycan stained (middle), and fluorescent (right) for proteins in SDS-PAGE.

Figure 1—figure supplement 1
List of glycoproteins in the human genome: technical details for predictions and measurement of glycosylation, and virus infection inhibition assays using sample proteins from the list.

(A) Glycosylation site prediction by Net O Glyc/Net N Glyc and GlycoEP. (B) Comparisons of the number of predicted glycosylation sites by both software packages. (C) Flow cytograms demonstrating the binding of three different fluorescent labeled lectins (PNA, WGA, SSA) to HEK 293T cells expressing MUC1(14TR) proteins. Protein expression probed by SNAP tag fluorescence (upper left), PNA binding (upper right), WGA binding (bottom left), and SSA binding (bottom right) was shown. (D) Two-dimensional flow cytograms of PNA binding assay to HEK 293T cells expressing various membrane proteins. Protein density was shown in either SNAP tag (SNAP surface 488) or mTagBFP2 fluorescence. Two different PNA concentrations were used to have wide enough dynamic range in PNA signals to resolve glycosylation in both low- and high-glycosylated proteins. PNA signals between low- and high- PNA concentrations and between SNAP tag and mTagBFP2 were adjusted with data of sample measured in both conditions (CD43, CD44). (E) Examples of flow cytograms in virus infection inhibition assays in HEK293T cells and SARS-CoV2-PP. Expressions of ACE2-mTagBFP2 (left) and SNAP-MUC1(14TR) (middle) were measured at the time of infection, and expression of GFP (right) was measured 2 days after the infection in the same batch of cells. Dark is null HEK293T cells without ACE2 and MUC1 expression. Green lines for HEK293T-ACE2 cells without MUC1 expression. Red lines for HEK293T-ACE2-MUC1 cells. (F) Infection inhibition assays for HEK293T cells and various other viruses (lentivirus, adenovirus, and adeno-associated virus). Mean GFP expression of infected cells is plotted along with mean MUC1 (14TR) densities in cells at the time of infection. These were measured by flow cytometry, in a similar manner as in the assay for SARS-CoV2-PP. (G) Measurement of surface densities of SNAP – Surface 488 labeled SNAP – MUC1 (14TR) proteins in HEK 293T cells, transfected with different amounts of plasmid DNAs. (H) Integrated amount of GFP expression in SARS-CoV2-PP infected HEK 293T cells compared with integrated amount of ACE2-mTagBFP2 expression at the time of infection. (I) Graphical model for Bayesian hierarchical sigmoidal regression of infection inhibition assay data. (J) Results of four repeated rounds of MCMC samplings by Numpyro for inferences for each parameter. (K) Relations between amino acid sequence for ectodomain of each protein and molecular specific IC50 density in sigmoidal inhibitory function inferred from Bayesian hierarchical modeling (σ_IC50). (L) Flow cytometry measurements for liposomes containing serially diluted dye-conjugated lipids and fluorescent membrane incorporating molecules (Bodipy-FL, peryelene, and DID) with indicated mol%. Linear fitting shown was used for calibration. (M) Fluorescence emission spectrum for equimolar molecules (50 µM for green and far-red channels, and 100 µM for blue channel), excited at 405 nm, 488 nm, and 638 nm, respectively. Membrane dyes were measured as incorporated in liposomes. Purified recombinant mTagBFP2 was used.

Figure 2 with 1 supplement
Virus infection in epithelium regulated by glycan contents in each cell.

(A) Maximum projection of Z-stack images at 1 µm intervals taken with a confocal microscope. SARS-CoV2-pp-infected, air-liquid interface (ALI)-cultured Calu-3 cell monolayers were chemically fixed and imaged by binding of Alexa Fluor 647-labeled Neu5AC-specific lectin from Sambucus sieboldiana (SSA) and GFP expression from the infecting virus. (B) Density scatter plot of normalized fluorescence intensities in all pixels in (A) in both GFP and SSA channels. Color indicates the pixel density. (C) SARS-CoV2-pp infected Calu-3 ALI monolayer, imaged by immunofluorescence for each membrane protein and GFP. Maximal projection of z-stack of images is shown. (D) Pearson correlation for fluorescence intensities of lectin/antibody and GFPs in image pixels in maximal projection of z-stack images. Error bars are standard error of mean from images from three or more different samples. Lectins used were SSA, MAL (from Maackia amurensis, Neu5AC specific), WGA (from Triticum vulgaris, GlcNAc specific), PNA, and DSA (from Datura stramonium, GlcNAc specific). (E) SARS-CoV2-pp infected Calu-3 ALI monolayer, imaged by immunofluorescence for CD44 and SSA lectin binding. Maximal projection of z-stack images is shown. (F) Pearson correlation of lectin/antibody signal and SSA signal in image pixels in maximal projection of z-stack images. Error bars are standard error of mean from images from three or more different samples. (G) TOS analysis for GFP and SSA signal pixels in image of (A). (H) Correlation in top 10% in both axes in the TOS analysis. Error bars are standard error of mean from images from three or more different samples. *MUC1 has a small mean expression level and variance and is more affected by measurement noise than other molecules when calculating the Pearson correlation function (C and F). In addition, the number of cells in which expression can be detected is small, so no significant correlation was detected by TOS analysis (H). Scale bars = 40 µm (A) and 20 µm (C, E).

Figure 2—figure supplement 1
Virus infection in epithelium regulated by glycan contents in each cell.

(A-B) List of highly expressed membrane proteins in air liquid interface cultured Calu-3 cells, aligned in the order of normalized expression shown in bulk RNA-seq analysis by Wyler E et al. (GSE148729). Ratios for total glycans of each molecule are calculated by combining with predicted glycosylation rate determined in our study and the ratio of expression from the RNA seq result. The panel A is the list for proteins predicted to have predicted glycosylation rate >0.15, and the panel B is the list for other membrane proteins. (C) Maximal projection of z-stack of images. SARS-CoV2-pp infected air-liquid interface (ALI) cultured Calu-3 cell monolayer, imaged by the binding of Alexa Fluor 647-labeled lectins specific for indicated glycans or the binding of primary and Alexa Fluor 555-labeled secondary antibodies against indicated proteins, and the expression of GFP derived from infected viruses. (D) Maximal projection of z-stack of images. SARS-CoV2-pp infected air-liquid interface (ALI) cultured Calu-3 cell monolayer, imaged by the binding of Alexa647-labeled Neu5AC-specific lectin from Sambucus sieboldiana (SSA) and the binding of primary and Alexa Fluor 555-labeled secondary antibodies against indicated proteins. (E) Correlation in top 10% in both axes in TOS analysis, for colocalizations in SSA and glycoproteins. Error bars are standard error of mean from images from three or more different samples. (F-G) DID-labeled SARS-CoV2-pp was incubated on ALI Calu-3 cells for 10 min or 3 hr, and the Pearson correlation between DID and SSA was calculated at the pixel level. Correlation values were compared to those of positively and negatively correlated samples (ACE2, SSA, versus GFP, respectively). Example virus and lectin images after 10 min virus incubation are shown in G. Scale bar 20 µm.

Figure 3 with 1 supplement
Structure of interface between viruses and cell membranes and polymer brush theory.

(A) Flow cytogram of DID labeled SARS-CoV2-PP binding to HEK293T cells expressing ACE2, TMPRSS2, and/or various truncation mutants of MUC1. (B) Schematic diagram of the interface structure between the virus and the cell membrane during the process from cell-virus binding to virus uptake via stable interface formation. (C) Basics of polymer brush theory and free energy of the polymer in the course of mushroom to brush transitions. d, spacing distance between polymers, RF, Flory radius of polymers, fint, and fel, intermolecular and elastic free energies per a single polymer. (D) Two types of interfacial structures of particle binding to polymer graft surfaces in the conventional polymer brush model. (E) Two additional structure types of particle-surface interface, specialized cases for virus-cell interface. (F) A chart for free energy of the system U during the process of virus-cell interface formation.

Figure 3—figure supplement 1
Virus binding to cells and fluid phase endocytosis were not significantly affected by surface glycoproteins.

(A) Dual-color flow cytogram (DID and SNAP surface 488) of DID-labeled SARS-CoV2-pp-bound HEK293T cells expressing MUC1 with tandem repeat domains of the indicated lengths. MUC1 was labeled and quantified with SNAP surface 488. (B) Normalized mean fluorescence intensity of FITC-labeled dextran endocytosed into HEK293T cells expressing the indicated membrane proteins, measured by flow cytometry. Data are normalized to a control sample without exogenous membrane protein expression and plotted against the molecular weight of the introduced membrane protein. Three different MUC1 molecules with different molecular weights (0TR, 14TR, and 42TR) were used. Each dot represents an independent measurement.

Figure 4 with 1 supplement
In situ FLIM–FRET measurements for protein sizes and conformational predictions.

(A) Example of FLIM images for cells expressing SNAP–Surface 488 conjugated CD24 proteins and being incorporated with PlasMem Bright Red dyes in different surface densities. Schematic drawing (left) depicts a geometry of the FRET from a single donor dye conjugated to SNAP tag at the end of protein ectodomain to populations of acceptor dyes incorporated in plasma membrane. (B) FRET efficiency estimated from FLIM imaging for cells expressing each protein at different mean acceptor densities. Lines are predicted mean FRET efficiencies from Bayesian hierarchical inference (see Method details), and purple area represents one sigma below and above the lines. (C) Relations between inferred Flory radius from FLIM – FRET analyses and inferred σ_IC50 from infection inhibition assays. Measured glycosylation rate PNA/mol/AA was depicted in color. (D) Relations between inferred Flory radius from FLIM – FRET analyses and estimated molecular weight including glycans. Dot line is the result of linear regression. (E) Relations between inferred Flory radius from FLIM – FRET analyses and amino acid length. Dot lines are the fit to the Flory model for RF ~Nν. (F) Relations between the distance between coordinates of two amino acids at both ends of ectodomain in Alpha Fold2 predicted conformations and number of amino acids for all 2515 proteins in the list. The number of predicted glycosylation sites per amino acid was depicted in color. The dot line indicates the Flory model RF ~Nν, where ν=0.14. (G) Relations between the distance between coordinates of two amino acids at both ends of ectodomain in Alpha Fold2 predicted conformations for our sample molecules and measured Flory radius from FLIM–FRET assays. A dot line indicates where the measured RF is equal to the Alpha Fold 2 predicted length. (H) Schematic diagram of protein conformational dynamics and two length scales, RF and Alpha Fold 2 prediction. Yellow and red stars indicate the two ends of the ectodomain.

Figure 4—figure supplement 1
Technical details for In situ FLIM–FRET measurements for protein sizes and conformational predictions.

(A) FRET efficiency estimated from FLIM imaging for HEK 293T cells expressing SNAP-VAMP2 conjugated with SNAP – Surface 488, where cell membranes contained different mean acceptor densities of either PlasMem Bright Red or MemGlow 590. Lines are learned predicted mean FRET efficiencies from Bayesian hierarchical inference (see method), and purple area represents one sigma below and above the lines. (B) FRET efficiency estimated from FLIM imaging for cells expressing each protein at different mean acceptor densities. Lines are learned predicted mean FRET efficiencies from Bayesian hierarchical inference (see Method details), and the purple area represents one sigma below and above the lines. (C) Graphical model for Bayesian hierarchical regression of FLIM – FRET data. (D) Results of four repeated rounds of MCMC samplings by Numpyro for inferences for each parameter. (E) Protein size derived from the predicted ectodomain conformations by AlphaFold 2, plotted on Figure 1C by color map. (F) Predicted local Distance Difference test scores (pLDDT) for the predicted ectodomain conformations by AlphaFold 2 were subtracted from 100 and were plotted on Figure 1C by color map. By applying this color map that is inverse to original pLDDT score, proteins predicted not to fit well to reference models were highlighted by darker blue marks. (G) Alpha Fold 2 (ver 2.2 in Google Colaboratory) predicted conformation of MUC1 (0TR), with predicted LDDT along with amino acid and the map of predicted aligned error. (H) Relations between pLDDT subtracted from 100 in Alpha Fold 2 prediction and measured Flory radius from FLIM – FRET assays. The dot line indicates where the measured RF is equal to the Alpha Fold 2 predicted length. (I) Conformation drawings of proteins predicted by Alpha Fold 2. Yellow and red stars indicate the two ends of ectodomain.

Figure 5 with 1 supplement
Biochemical reconstitution of protein packing in membrane surface.

(A) Lipid bilayers coated on silica beads for incorporating bacterial (b-) and glycosylated (mammalian expressed, g-) proteins, schematic and fluorescent images. (B) Representative result for flow cytometry analyses of protein binding to lipid bilayer coated silica beads. Bar is the standard deviation in each measurement. Lines are regression curves to receptor binding model Bx/(KD +x), where x is protein concentration, KD is the dissociation constant, and B is the saturated density. (C) Relations of surface area coverage by bound proteins and concentrations of proteins used for membrane binding. Surface area was normalized by assuming all bound proteins were in a hemisphere of radius RF and the ratio of coverage was calculated. Protein concentrations were normalized by KD. The dot line in the plot indicates the coverage when the hemisphere of radius RF aligned in a hexagonal close packing. (D) Schematic for structures and free energies for glycosylated and non-glycosylated proteins with similar RF that are at diluted and more condensed densities on the membrane surface.

Figure 5—figure supplement 1
Dynamics is not a factor to influence the packing of glycoprotein.

(A) An example of a snapshot image of single molecular SNAP–cell TMR star conjugated g-MUC1 (14TR) (upper left) and a tracked path of its diffusive motion on a supported lipid bilayer (upper right). Statistics of diffusion coefficients of g-MUC1 and b-MUC1 calculated from tracked paths (lower). n.s.: nonspecific in a two-sided unpaired t-test with Welch’s correction. (B) Representative result for flow cytometry analyses of protein binding to lipid bilayer coated silica beads. Bar is standard deviation in each measurement. Lines are regression curves to receptor binding model Bx/(KD +x), where x is protein concentration, KD is the dissociation constant, and B is the saturated density.

Figure 6 with 1 supplement
Superresolution imaging of virus and cellular proteins for analyses of virus – cell interface.

(A) Dual color STORM images of SNAP-MUC1/SNAP-VAMP2 in cells and Spike in SARS-CoV-2-PP bound to the cells. Whole cell images (left) were reconstructed from STORM data, and coordinates determined by STORM were individually plotted in expanded images on the right. (B) Schematic of calculation of cross-correlation. Mutual distance between all combinations of two dyes was calculated from the coordinates determined by STORM imaging. (C) Examples of histogram of mutual distances between all combinations of two dyes were calculated from coordinates H(r), and the cross-correlation function C(r) that is the normalized radial average of H(r). Δr, a size of shell and the bin size for histogram, was set to be 5 nm. (D) Examples of simulated STORM images, and calculated C(r) from these images. h denotes the distance between virus and cell when they are separated, and density plot illustrates average protein densities along membranes in these images. (E) Plot of C(r)_(out/in) calculated for all simulated images, along with density_(in/out) and h. Blue dashed lines indicate upper and lower bounds for C(r)_(out/in) calculated for stable virus – cell interface, and red dot line is the fifth polynomial regression to data point and purple area represents one sigma below and above the line. (F) Traces of C(r) for individual cells expressing MUC1 and VAMP2. (G) C(r)_(out/in) calculated for all STORM images. (H) density_(in/out) for all STORM images converted from C(r)_(out/in) based on the regression in E was plotted along with RF for each protein determined by FLIM – FRET (Figure 4).

Figure 6—figure supplement 1
STORM images of all analyzed cells, expressing designated proteins.

The detected spots of SNAP-surface Alexa 647 bound to each membrane protein are shown in red, and the spots of CF568-conjugated anti-mouse IgG secondary antibody that recognizes Spike on SARS-CoV2-PP are shown in green. For cells, a pair of two-color composite images and a CF658-only image are shown. mNumbers on axes are coordinates in nanometer.

Models for three distinct cases of virus – cell interface.

(A) Energy and force involved in molecular exclusion from virus-cell interface. (B) Transient distribution of membrane proteins nearby the interface. (C) Two-dimensional distribution of viruses and glycoproteins on the surface horizontal direction of the cell membrane. (D-F) Schematic of virus-cell interface structure and corresponding free energy chart. (D) Virus-cell interface with very small membrane proteins. Due to the low energy penalty in constructing virus-cell interface entrapping these small molecules, the virus can form the interface without excluding proteins and can infect cells. (E-F). Virus-cell interface structure for larger membrane proteins. Interface formation requires protein exclusion. Due to membrane viscosity and inhomogeneous field in adhesive energy, excluded proteins become crowded at proximity of the interface. In the case for highly glycosylated proteins (F), intermolecular repulsion between excluded proteins becomes very high at high density regions. Such repulsion generates a high kinetic barrier for molecular exclusion, preventing the process from proceeding to subsequent steps for infection. In contrast, in the case of low glycosylated proteins (E), their intermolecular repulsion and energy barrier are lower, and thus these molecules are easier to be excluded from the interface, and virus infection is not strongly inhibited.

Additional files

Supplementary file 1

List of all reagents and software used in this study.

https://cdn.elifesciences.org/articles/101175/elife-101175-supp1-v1.xlsx
Supplementary file 2

List of 2515List human membrane proteins whose ectodomain amino sequences were used for glycosylation predictions by NetN/O Glyc and GlycoEP.

Length of amino acid sequences in ectodomains as well as ratio of glycosylation site per amino acid are listed for all molecules.

https://cdn.elifesciences.org/articles/101175/elife-101175-supp2-v1.pdf
Supplementary file 3

Glycosylation prediction data for MUC1 truncation mutants and mouse proteins used in this study.

https://cdn.elifesciences.org/articles/101175/elife-101175-supp3-v1.xlsx
MDAR checklist
https://cdn.elifesciences.org/articles/101175/elife-101175-mdarchecklist1-v1.docx

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Yoshihisa Kaizuka
  2. Rika Machida
(2025)
Cell membrane glycan contents are biochemical factors that constitute a kinetic barrier to viral particle uptake in a protein-nonspecific manner
eLife 14:RP101175.
https://doi.org/10.7554/eLife.101175.3