The SARS-CoV-2 furin cleavage site is yet again in the news - this time because of a quote by Nobel laureate David Baltimore.

The site is not a "smoking gun", nor does it "make a powerful challenge to the idea of a natural origin".

Quite the opposite, so a little science 🧵👇
The furin cleavage site (FCS) / polybasic cleavage site is present in SARS-CoV-2 at the S1/S2 junction of the spike protein where it mediates the cutting (by the host protease furin, among others) of the spike, which is required for infection of cells.
The FCS was created by an out-of-frame insertion of "CTCCTCGGCGGG" creating the "(P)RRAR" amino acid sequence, which constitutes a suboptimal polybasic cleavage site that is important for expanding SARS-CoV-2 host range, its transmission and pathogenesis, etc.
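To make "out-of-frame" concrete, here is a minimal Python sketch. The 12-nucleotide insert is the one quoted above; the flanking bases (`T` before, `CACGT` after) and the partial codon table are assumptions added for illustration, not something stated in this thread.

```python
# Partial codon table: only the codons needed for this illustration.
CODON_TABLE = {
    "TCT": "S", "TCA": "S", "CCT": "P",
    "CGG": "R", "CGT": "R", "GCA": "A",
}

INSERT = "CTCCTCGGCGGG"   # 12-nt insertion quoted in the thread
LEFT_FLANK = "T"          # assumed: first base of the codon the insert splits
RIGHT_FLANK = "CACGT"     # assumed: rest of that codon plus the following codon


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string using the partial table."""
    codons = [nt[i:i + 3] for i in range(0, len(nt) - len(nt) % 3, 3)]
    return "".join(CODON_TABLE[c] for c in codons)


without_insert = translate(LEFT_FLANK + RIGHT_FLANK)
with_insert = translate(LEFT_FLANK + INSERT + RIGHT_FLANK)

print(without_insert)  # SR
print(with_insert)     # SPRRAR
```

Under these assumed flanks the insert lands one base into a codon, so its 12 nucleotides are read across codon boundaries (TCT CCT CGG CGG GCA CGT) rather than as four standalone codons.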
FCSs are abundant, including being highly prevalent in coronaviruses. While SARS-CoV-2 is the first example of a SARSr virus with an FCS, other betacoronaviruses (the genus for SARS-CoV-2) have FCSs, including MERS and HKU1. https://www.sciencedirect.com/science/article/pii/S1873506120304165?via%3Dihub
There is nothing mysterious about having a "first example" of a virus with an FCS. Viruses sampled to date only give us a teeny-tiny fraction of all the viruses circulating in the wild. Fragments - such as the CTCCTCGGCGGG - come and go all the time. https://www.biorxiv.org/content/10.1101/2021.02.03.429646v1
How did SARS-CoV-2 acquire the FCS? We don't know. However, we do know that four main mechanisms often lead to insertions:

(1) mutation

(2) polymerase slippage

(3) template switching

(4) recombination

All of which play key roles in coronavirus (incl. SARS-CoV-2) evolution.
Template switching likely also plays an important role during the ongoing evolution of SARS-CoV-2: https://www.biorxiv.org/content/10.1101/2021.04.23.441209v1

We need to see this in the context of the decades of evolution of the SARS-CoV-2 ancestor and related viruses in bats. It's safe to say indels come and go.
The FCS itself, (P)RRAR, is not an optimal site (for cleavage) and has never previously been used in CoV experiments to the best of my knowledge - unlike more optimal sites, which have been inserted into SARSr CoVs for basic research: https://www.sciencedirect.com/science/article/pii/S0042682206000900
If we zoom in on the (P)RRAR site in SARS-CoV-2 and compare it to the one found in (some) FCoV sequences, we can see there's a fair bit of homology outside the FCS too - including likely O-linked glycans being conserved.
The (P)RRAR FCS isn't optimal and while it's 'sufficient' for SARS-CoV-2's 'success' as a pandemic virus, it's not an ideal site as defined by the canonical R‐X‐K/R‐R FCS seen in many proteins (viral and otherwise). https://onlinelibrary.wiley.com/doi/full/10.1002/cti2.1073
Importantly, however, in recent months we have started seeing the "P" mutating towards residues creating more optimal furin sites - P681H and, especially, P681R, which can be found in B.1.1.7 and B.1.617.x, suggesting the virus may evolve towards more efficient usage of the site.
🚨 So Baltimore's first point - that the FCS found in SARS-CoV-2 is somehow unusual - is simply incorrect. FCSs are found in a multitude of different coronaviruses, indels come and go frequently, and the exact (P)RRAR can be found in other coronaviruses.
Now, the codons. Here, Baltimore is talking about the two codons coding for the first two arginines (R) following the P - CGG. The CGG codon is rare in viruses because it's an example of an unmethylated "CpG" site that can be bound by TLR9, leading to immune cell activation.
🚨 Despite being rare, however, CGG codons *are* found in all coronaviruses, albeit at low frequency. Specifically, of all arginine codons, CGG is used at these frequencies in these viruses:

SARS: 5%
SARS2: 3%
SARSr: 2%
ccCoVs: 4%
HKU9: 7%
FCoV: 2%

Nothing unusual here.
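For readers who want to see what "share of all arginine codons" means in practice, here is a minimal Python sketch. The function name `arginine_codon_usage` and the toy sequence are hypothetical, made up for illustration; this is not the pipeline used to produce the percentages above.

```python
from collections import Counter

# The six codons that encode arginine in the standard genetic code.
ARG_CODONS = {"CGT", "CGC", "CGA", "CGG", "AGA", "AGG"}


def arginine_codon_usage(cds: str) -> dict[str, float]:
    """Return each arginine codon's share of all arginine codons in an in-frame CDS."""
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    counts = Counter(c for c in codons if c in ARG_CODONS)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {codon: counts[codon] / total for codon in sorted(ARG_CODONS)}


# Toy sequence (made up, not a real viral gene): 4 arginine codons, one is CGG.
toy_cds = "ATGAGACGTCCTCGGAGAGCATAA"
usage = arginine_codon_usage(toy_cds)
print(f"CGG share of arginine codons: {usage['CGG']:.0%}")
```

Run over a whole genome's coding sequences instead of the toy string, the same ratio gives a per-virus figure comparable in kind to the list above.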
We see CGG multiple times in different ways - here's an example comparing another "PR" stretch between SARS-CoV-2, RaTG13, and SARS-CoV in the N gene. Note how SARS-CoV-2 and RaTG13 both use CGG, while SARS-CoV uses CGC for the first R, and later R's are coded by CGT or AGA.
One final point about the CGG codons in the FCS - if they were somehow "unnatural", we'd see SARS-CoV-2 evolve away from "CGG" during the ongoing pandemic. We have more than a million genomes to analyze, so what do we find if we look at synonymous mutations at the "CGG_CGG" site?
🚨 Remarkably stable. Specifically, CGG is 99.87% conserved in the first codon and 99.84% conserved in the second.

This is *very* strong evidence that SARS-CoV-2 'prefers' CGG in these positions.
R is coded by six different codons, yet the simple single transition "CGA" is only observed in ~0.02% of sequences. The second most 'popular' codon at these sites is "CGT" (a transversion) at 0.11% frequency.

In other words - there is nothing unusual about the codons either.
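The transition/transversion labels used above can be checked with a short Python sketch. The helper names are hypothetical and chosen for illustration; the classification itself is the standard one (purine to purine or pyrimidine to pyrimidine is a transition, anything else a transversion).

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
ARG_CODONS = ["CGT", "CGC", "CGA", "CGG", "AGA", "AGG"]


def classify_substitution(ref: str, alt: str) -> str:
    """Classify a single-base change as a transition or a transversion."""
    if ref == alt:
        raise ValueError("ref and alt must differ")
    same_class = ({ref, alt} <= PURINES) or ({ref, alt} <= PYRIMIDINES)
    return "transition" if same_class else "transversion"


def synonymous_neighbours(codon: str) -> dict[str, str]:
    """Arginine codons exactly one substitution away from `codon`, with the change type."""
    result = {}
    for other in ARG_CODONS:
        diffs = [(a, b) for a, b in zip(codon, other) if a != b]
        if len(diffs) == 1:
            result[other] = classify_substitution(*diffs[0])
    return result


neighbours = synonymous_neighbours("CGG")
print(neighbours)
```

Of the arginine codons one step from CGG, only CGA is reached by a transition (G to A); CGT, CGC and AGG all need a transversion, and AGA needs two changes.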
So Baltimore's second point is also false, invalidating his hypothesis that the "FCS [...] with its arginine codons [...] was the smoking gun for the origin of the virus".

Baltimore does not provide any evidence to support his hypothesis and the data support a natural origin.
Does this disprove a lab leak? No. However, it disproves there being a "smoking gun" in the FCS and lends further evidence to natural emergence - but it also does not *prove* that scenario.

To this day, we have yet to see any scientific evidence supporting a lab leak.
Variants of 👇 have come up - it's false. Specifically:

1. The events are not independent, hence the calculation is incorrect.

2. It's the same argument used by creationists about "irreducible complexity" - also false:

https://en.wikipedia.org/wiki/Irreducible_complexity

https://www.americanprogress.org/issues/religion/news/2006/04/10/1934/the-flaws-in-intelligent-design/
As to Richard's final point - well... #introspection
You can follow @K_G_Andersen.