COVID-19: genetic network analysis provides ‘snapshot’ of pandemic origins

Today I came across an article by Peter Forster of the University of Cambridge. The study was also published in the Proceedings of the National Academy of Sciences (PNAS).



In a phylogenetic network analysis of 160 complete human severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genomes, we find three central variants distinguished by amino acid changes, which we have named A, B, and C, with A being the ancestral type according to the bat outgroup coronavirus. The A and C types are found in significant proportions outside East Asia, that is, in Europeans and Americans. In contrast, the B type is the most common type in East Asia, and its ancestral genome appears not to have spread outside East Asia without first mutating into derived B types, pointing to founder effects or immunological or environmental resistance against this type outside Asia. The network faithfully traces routes of infections for documented coronavirus disease 2019 (COVID-19) cases, indicating that phylogenetic networks can likewise be successfully used to help trace undocumented COVID-19 infection sources, which can then be quarantined to prevent recurrent spread of the disease worldwide.

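In short: the authors ran a phylogenetic network analysis on 160 complete human SARS-CoV-2 genomes and found three central variants, distinguished by amino acid changes, which they named A, B, and C. Type A is the ancestral one, being the closest to the bat coronavirus used as the outgroup. Types A and C occur in significant proportions outside East Asia, that is, among Europeans and Americans. Type B, by contrast, is the most common type in East Asia, and its ancestral genome does not appear to have spread beyond East Asia without first mutating into derived B types, which points to founder effects or to immunological or environmental resistance against this type outside Asia. The network faithfully traces the infection routes of documented COVID-19 cases, which suggests that phylogenetic networks could also help trace undocumented infection sources, so that they can be quarantined to keep the disease from spreading again worldwide.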

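This differs from the reports I had heard earlier from Japan and Taiwan, which described five variants in total, three of them found in China, but it fits with the idea that the place where the virus first emerged was not in China. I have not tracked down the sources of the Japanese and Taiwanese reports, because my aim here is to go through Peter Forster's article.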


There are two subclusters of A which are distinguished by the synonymous mutation T29095C. In the T-allele subcluster, four Chinese individuals (from the southern coastal Chinese province of Guangdong) carry the ancestral genome, while three Japanese and two American patients differ from it by a number of mutations.
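For readers unfamiliar with labels such as T29095C: the first letter is the nucleotide in the ancestral sequence, the number is the position in the genome, and the last letter is the nucleotide that replaces it. A minimal Python sketch of how such a label can be split into its parts follows; the names `Substitution` and `parse_substitution` are my own and do not come from the paper.

```python
import re
from typing import NamedTuple


class Substitution(NamedTuple):
    """One nucleotide substitution, e.g. T29095C (hypothetical helper type)."""
    ref: str       # nucleotide before the change
    position: int  # 1-based position in the genome
    alt: str       # nucleotide after the change


# A label is one base, a run of digits, then one base.
_LABEL = re.compile(r"^([ACGT])(\d+)([ACGT])$")


def parse_substitution(label: str) -> Substitution:
    """Split a label such as 'T29095C' into (ref, position, alt)."""
    match = _LABEL.match(label)
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    ref, position, alt = match.groups()
    return Substitution(ref, int(position), alt)


# The mutation that separates the two subclusters of type A.
print(parse_substitution("T29095C"))
```

This only reads the label. Whether a change is synonymous or not depends on the codon it falls in, which the label alone does not tell you.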


It is noteworthy that nearly half (15/33) of the types in this subcluster, however, are found outside East Asia, mainly in the United States and Australia.



For type B, all but 19 of the 93 type B genomes were sampled in Wuhan (n = 22), in other parts of eastern China (n = 31), and, sporadically, in adjacent Asian countries (n = 21). Outside of East Asia, 10 B-types were found in viral genomes from the United States and Canada, one in Mexico, four in France, two in Germany, and one each in Italy and Australia. Node B is derived from A by two mutations: the synonymous mutation T8782C and the nonsynonymous mutation C28144T changing a leucine to a serine. Cluster B is striking with regard to mutational branch lengths: While the ancestral B type is monopolized (26/26 genomes) by East Asians, every single (19/19) B-type genome outside of Asia has evolved mutations. This phenomenon does not appear to be due to the month-long time lag and concomitant mutation rate acting on the viral genome before it spread outside of China.
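The counts in this passage are easy to lose track of, so here is a small Python tally that checks them against each other. The numbers are exactly those quoted above; the grouping into "East Asia" and "elsewhere" and the variable names are my own.

```python
# Type B genome counts by sampling location, as quoted in the passage above.
b_type_counts = {
    "Wuhan": 22,
    "other parts of eastern China": 31,
    "adjacent Asian countries": 21,
    "United States and Canada": 10,
    "Mexico": 1,
    "France": 4,
    "Germany": 2,
    "Italy": 1,
    "Australia": 1,
}

# My own grouping: the first three rows are the East Asian samples.
EAST_ASIA = {"Wuhan", "other parts of eastern China", "adjacent Asian countries"}

east_asia_total = sum(n for place, n in b_type_counts.items() if place in EAST_ASIA)
elsewhere_total = sum(n for place, n in b_type_counts.items() if place not in EAST_ASIA)

print("East Asia:", east_asia_total)                    # 22 + 31 + 21
print("Elsewhere:", elsewhere_total)                    # the "all but 19"
print("All type B:", east_asia_total + elsewhere_total)
```

The tally gives 74 genomes from East Asia and 19 from elsewhere, which adds up to the 93 type B genomes and matches the 19/19 non-Asian genomes that carry additional mutations.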



The 'C' variant is the major European type, found in early patients from France, Italy, Sweden and England. It is absent from the study’s Chinese mainland sample, but seen in Singapore, Hong Kong and South Korea.

