The outbreak of Coronavirus disease (COVID-19) has posed a great threat to public health, and we are amid a pandemic. The disease is caused by Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). Like other coronaviruses, the SARS-CoV-2 genome encodes spike (S) glycoproteins, which protrude from the surface of mature virions. The S glycoprotein plays essential roles in virus attachment, fusion, and entry into the host cell. The surface location of the S glycoprotein renders it a direct target for host immune responses, making it the main target of neutralizing antibodies. In the light of its crucial roles in viral infection and adaptive immunity, the S protein is the focus of most vaccine strategies as well as therapeutic interventions. In this review, we highlight and describe the recent progress that has been made in the biosynthesis, structure, function, and antigenicity of the SARS-CoV-2 S glycoprotein, aiming to provide valuable insights into the design and development of S protein-based vaccines as well as therapeutics.
The ongoing global pandemic poses a social, economic, and public health-related challenge of unprecedented sorts. The etiological agent of COVID-19 is a new member of the Coronaviridae family that is closely related to severe acute respiratory syndrome coronavirus (SARS-CoV) and was recently referred to as SARS-CoV-2 by the Coronavirus Study Group of the International Committee on Taxonomy of Viruses. The virus has spread rapidly and sustainably around the globe, resulting in over twenty-one million cases and more than 750,000 deaths as of August 15, 2020.
Coronaviruses (CoVs) are enveloped positive-sense RNA viruses. Enveloped CoVs enter host cells and initiate infection through the fusion of viral and cellular membrane. Membrane fusion is mediated by the large type I transmembrane S glycoprotein on the viral envelope and the cognate receptor on the surface of host cells. The surface-exposed location of the S glycoprotein allows it to carry out membrane fusion and renders it a direct target for host immune responses, making it the major target of neutralizing antibodies. Because of its central roles in viral infection and eliciting protective humoral and cell-mediated immune responses in hosts during infection, the S protein is the primary target for vaccine design as well as antiviral therapeutics.
The SARS-CoV-2 spike (S) glycoprotein is a major component of the virus envelope, essential for receptor binding, fusion, virus entry, and a target of host immune defense. The SARS-CoV-2 S glycoprotein is a class I fusion protein produced as a prominent 1273 amino acid inactive precursor (S0). Unique to SARS-CoV-2 is the insertion of a polybasic RRAR furin-like cleavage motif in the S1/S2 cleavage site. Proteolytic cleavage of the S protein generates the S2 stalk that is conserved across human coronaviruses and the less conserved S1 cap. The N-terminal domain (NTD) and the receptor-binding domain (RBD) are located in the S1 subunit. The fusion peptide (FP), two heptad repeats (HR1 and HR2), central helix (CH), transmembrane (TM) domain, and cytoplasmic tail (CT) are located in the S2 subunit. Three S1/S2 protomers non-covalently associate to form the functional S-trimer. Like other fusion proteins, the SARS-CoV-2 S-trimer is metastable and undergoes significant structural rearrangement from a prefusion conformation to a thermostable postfusion conformation upon S-protein receptor binding and proteolytic cleavage, either at the plasma membrane or following endocytosis Rearrangement exposes the hydrophobic FP allowing insertion into the host cell membrane, facilitating virus/host cell membrane alignment, fusion, and virus entry.
Synthesis, Processing, and Trafficking of the SARS-CoV-2 S Glycoprotein
The SARS-CoV-2 S glycoprotein is synthesized as a 1273-amino acid polyprotein precursor on the rough endoplasmic reticulum (RER). The unprocessed precursor harbors an endoplasmic reticulum (ER) signal sequence located at the N terminus, which targets the S glycoprotein to the RER membrane and is removed by cellular signal peptidases in the lumen of the ER. A single stop-transfer, membrane-spanning sequence located at the C terminus of the S protein prevents it from being fully released into the lumen of the ER and subsequent secretion from the infected cell. Co-translationally, N-linked, high-mannose oligosaccharide side chains are added during synthesis. Shortly after synthesis, the S glycoprotein monomers trimerize, which might be thought to facilitate the transport from the ER to the Golgi complex. Once in the Golgi complex, most of the high-mannose oligosaccharide side chains are modified to more complex forms, and O-linked oligosaccharide side chains are also added.
SARS-CoV-2 S Protein Structure and Function
As mentioned above, the SARS-CoV-2 S glycoprotein plays pivotal role in viral infection and pathogenesis. Mature S glycoprotein on the viral surface is a heavily glycosylated trimer, each protomer of which is composed of 1260 amino acids (residues 14-1273). The surface subunit S1 is composed of 672 amino acids (residues 14–685) and organized into four domains: an N-terminal domain (NTD), a C-terminal domain (CTD, also known as the receptor-binding domain(RBD), and two subdomains (SD1 and SD2). The transmembrane S2 subunit is composed of 588 amino acids (residues 686-1273) and contains an N-terminal hydrophobic fusion peptide (FP), two heptad repeats (HR1 and HR2), a transmembrane domain (TM), and a cytoplasmic tail (CT), arranged as FP-HR1-HR2-TM-CT
SARS-CoV-2 S Glycoprotein-Mediated Membrane Fusion
Membrane fusion and viral entry of SARS-CoV-2 is initiated by binding of RBD in the viral S glycoprotein transiently sampling the functional conformation to ACE2 on the surface of target cells. After receptor engagement at the plasma membrane or ensuing virus endocytosis by the host cell, a second cleavage (S2′ cleavage site) is generated, which is mediated by a cellular serine protease TMPRSS2 or endosomal cysteine proteases cathepsins B and L. Protease cleavage at S2′ site frees the fusion peptide from the new S2 N-terminal region, further destabilizes the SARS-CoV-2 S glycoprotein and may initiate S2-mediated membrane fusion cascade. Following the second cleavage, the fusion peptide at the N terminus of the S2 trimer is inserted into the host membrane, forming the pre-hairpin intermediate state. Since the pre-hairpin intermediate state is extremely unstable, the S2 fusion protein is refolded quickly and irreversibly into the stable postfusion state. These large conformational rearrangements pull the viral and host cell membrane into close proximity, leading ultimately to membrane fusion.
Concluding Remarks and Prospects
SARS-CoV-2 is a highly contagious pathogen that continues to spread quickly around the globe, making COVID-19 one of the worst pandemics recorded in history. A safe and efficacious vaccine will be one of the best solutions to reduce or eliminate the COVID-19 pandemic. Unfortunately, no vaccines for any of the known human CoVs have been licensed. However, several potential SARS-CoV and MERS-CoV vaccines have advanced into human clinical trials for years, suggesting the development of effective vaccines against human CoVs has always been challenging. However, it has been shown that both SARS-CoV and SARS-CoV-2 could readily induce neutralizing antibodies following natural infection or immunization.
Moreover, a growing number of neutralizing monoclonal antibodies targeting the SARS-CoV-2 S glycoprotein with high potency have been isolated from plenty of convalescent donors and humanized mice, some of which have been shown to afford protection against SARS-CoV-2 challenge in animal models. It thus seems that vaccine candidates designed to elicit such neutralizing antibodies are feasible. It is widely accepted that the S protein of SARS-CoV-2 is the most promising immunogen for producing protective immunity. However, the S protein has likely evolved to perform its functions while evading host neutralizing antibody responses and thus should be engineered to ensure optimal immune response. The immunogen design strategies described in this review based on the wealth of the SARS-CoV-2 S glycoprotein research related to its biosynthesis, structure, function, antigenicity, and immunogenicity will likely contribute to the ultimate success of safe and efficacious vaccines against SARS-CoV-2/COVID-19.Read more