Posted on by Dr. Francis Collins
Many people who contract COVID-19 have only a mild illness, or sometimes no symptoms at all. But others develop respiratory failure that requires oxygen support or even a ventilator to help them recover . It’s clear that this happens more often in men than in women, as well as in people who are older or who have chronic health conditions. But why does respiratory failure also sometimes occur in people who are young and seemingly healthy?
A new study suggests that part of the answer to this question may be found in the genes that each one of us carries . While more research is needed to pinpoint the precise underlying genes and mechanisms responsible, a recent genome-wide association (GWAS) study, just published in the New England Journal of Medicine, finds that gene variants in two regions of the human genome are associated with severe COVID-19 and correspondingly carry a greater risk of COVID-19-related death.
The two stretches of DNA implicated as harboring risks for severe COVID-19 are known to carry some intriguing genes, including one that determines blood type and others that play various roles in the immune system. In fact, the findings suggest that people with blood type A face a 50 percent greater risk of needing oxygen support or a ventilator should they become infected with the novel coronavirus. In contrast, people with blood type O appear to have about a 50 percent reduced risk of severe COVID-19.
These new findings—the first to identify statistically significant susceptibility genes for the severity of COVID-19—come from a large research effort led by Andre Franke, a scientist at Christian-Albrecht-University, Kiel, Germany, along with Tom Karlsen, Oslo University Hospital Rikshospitalet, Norway. Their study included 1,980 people undergoing treatment for severe COVID-19 and respiratory failure at seven medical centers in Italy and Spain.
In search of gene variants that might play a role in the severe illness, the team analyzed patient genome data for more than 8.5 million so-called single-nucleotide polymorphisms, or SNPs. The vast majority of these single “letter” nucleotide substitutions found all across the genome are of no health significance, but they can help to pinpoint the locations of gene variants that turn up more often in association with particular traits or conditions—in this case, COVID-19-related respiratory failure. To find them, the researchers compared SNPs in people with severe COVID-19 to those in more than 1,200 healthy blood donors from the same population groups.
The analysis identified two places that turned up significantly more often in the individuals with severe COVID-19 than in the healthy folks. One of them is found on chromosome 3 and covers a cluster of six genes with potentially relevant functions. For instance, this portion of the genome encodes a transporter protein known to interact with angiotensin converting enzyme 2 (ACE2), the surface receptor that allows the novel coronavirus that causes COVID-19, SARS-CoV-2, to bind to and infect human cells. It also encodes a collection of chemokine receptors, which play a role in the immune response in the airways of our lungs.
The other association signal popped up on chromosome 9, right over the area of the genome that determines blood type. Whether you are classified as an A, B, AB, or O blood type, depends on how your genes instruct your blood cells to produce (or not produce) a certain set of proteins. The researchers did find evidence suggesting a relationship between blood type and COVID-19 risk. They noted that this area also includes a genetic variant associated with increased levels of interleukin-6, which plays a role in inflammation and may have implications for COVID-19 as well.
These findings, completed in two months under very difficult clinical conditions, clearly warrant further study to understand the implications more fully. Indeed, Franke, Karlsen, and many of their colleagues are part of the COVID-19 Host Genetics Initiative, an ongoing international collaborative effort to learn the genetic determinants of COVID-19 susceptibility, severity, and outcomes. Some NIH research groups are taking part in the initiative, and they recently launched a study to look for informative gene variants in 5,000 COVID-19 patients in the United States and Canada.
The hope is that these and other findings yet to come will point the way to a more thorough understanding of the biology of COVID-19. They also suggest that a genetic test and a person’s blood type might provide useful tools for identifying those who may be at greater risk of serious illness.
 Characteristics of and important lessons from the Coronavirus Disease 2019 (COVID-19) outbreak in China: Summary of a report of 72 314 cases from the Chinese Center for Disease Control and Prevention. Wu Z, McGoogan JM, et. al. 2020 Feb 24. [published online ahead of print]
 Genomewide association study of severe Covid-19 with respiratory failure. Ellinghaus D, Degenhardt F, et. a. NEJM. June 17, 2020.
Andre Franke (Christian-Albrechts-University of Kiel, Germany)
Tom Karlsen (Oslo University Hospital Rikshospitalet, Norway)
Posted on by Dr. Francis Collins
Cancer is a disease of the genome. It can be driven by many different types of DNA misspellings and rearrangements, which can cause cells to grow uncontrollably. While the first oncogenes with the potential to cause cancer were discovered more than 35 years ago, it’s been a long slog to catalog the universe of these potential DNA contributors to malignancy, let alone explore how they might inform diagnosis and treatment. So, I’m thrilled that an international team has completed the most comprehensive study to date of the entire genomes—the complete sets of DNA—of 38 different types of cancer.
Among the team’s most important discoveries is that the vast majority of tumors—about 95 percent—contained at least one identifiable spelling change in their genomes that appeared to drive the cancer . That’s significantly higher than the level of “driver mutations” found in past studies that analyzed only a tumor’s exome, the small fraction of the genome that codes for proteins. Because many cancer drugs are designed to target specific proteins affected by driver mutations, the new findings indicate it may be worthwhile, perhaps even life-saving in many cases, to sequence the entire tumor genomes of a great many more people with cancer.
The latest findings, detailed in an impressive collection of 23 papers published in Nature and its affiliated journals, come from the international Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium. Also known as the Pan-Cancer Project for short, it builds on earlier efforts to characterize the genomes of many cancer types, including NIH’s The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC).
In these latest studies, a team including more than 1,300 researchers from around the world analyzed the complete genomes of more than 2,600 cancer samples. Those samples included tumors of the brain, skin, esophagus, liver, and more, along with matched healthy cells taken from the same individuals.
In each of the resulting new studies, teams of researchers dug deep into various aspects of the cancer DNA findings to make a series of important inferences and discoveries. Here are a few intriguing highlights:
• The average cancer genome was found to contain not just one driver mutation, but four or five.
• About 13 percent of those driver mutations were found in so-called non-coding DNA, portions of the genome that don’t code for proteins .
• The mutations arose within about 100 different molecular processes, as indicated by their unique patterns or “mutational signatures.” [3,4].
• Some of those signatures are associated with known cancer causes, including aberrant DNA repair and exposure to known carcinogens, such as tobacco smoke or UV light. Interestingly, many others are as-yet unexplained, suggesting there’s more to learn with potentially important implications for cancer prevention and drug development.
• A comprehensive analysis of 47 million genetic changes pieced together the chronology of cancer-causing mutations. This work revealed that many driver mutations occur years, if not decades, prior to a cancer’s diagnosis, a discovery with potentially important implications for early cancer detection .
The findings represent a big step toward cataloging all the major cancer-causing mutations with important implications for the future of precision cancer care. And yet, the fact that the drivers in 5 percent of cancers continue to remain mysterious (though they do have RNA abnormalities) comes as a reminder that there’s still a lot more work to do. The challenging next steps include connecting the cancer genome data to treatments and building meaningful predictors of patient outcomes.
To help in these endeavors, the Pan-Cancer Project has made all of its data and analytic tools available to the research community. As researchers at NIH and around the world continue to detail the diverse genetic drivers of cancer and the molecular processes that contribute to them, there is hope that these findings and others will ultimately vanquish, or at least rein in, this Emperor of All Maladies.
 Pan-Cancer analysis of whole genomes. ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium. Nature. 2020 Feb;578(7793):82-93.
 Analyses of non-coding somatic drivers in 2,658 cancer whole genomes. Rheinbay E et al; PCAWG Consortium. Nature. 2020 Feb;578(7793):102-111.
 The repertoire of mutational signatures in human cancer. Alexandrov LB et al; PCAWG Consortium. Nature. 2020 Feb;578(7793):94-101.
 Patterns of somatic structural variation in human cancer genomes. Li Y et al; PCAWG Consortium. Nature. 2020 Feb;578(7793):112-121.
 The evolutionary history of 2,658 cancers. Gerstung M, Jolly C, Leshchiner I, Dentro SC et al; PCAWG Consortium. Nature. 2020 Feb;578(7793):122-128.
The Genetics of Cancer (National Cancer Institute/NIH)
NIH Support: National Cancer Institute, National Human Genome Research Institute
Posted on by Dr. Francis Collins
Tropical medicine has its share of wily microbes. Among the most clever is the mosquito-borne protozoan Plasmodium falciparum, which is the cause of the most common—and most lethal—form of malaria. For decades, doctors have used antimalarial drugs against P. falciparum. But just when malaria appeared to be well on its way to eradication, this parasitic protozoan mutated in ways that has enabled it to resist frontline antimalarial drugs. This resistance is a major reason that malaria, one of the world’s oldest diseases, still claims the lives of about 400,000 people each year .
This is a situation with which I have personal experience. Thirty years ago before traveling to Nigeria, I followed directions and took chloroquine to prevent malaria. But the resistance to the drug was already widespread, and I came down with malaria anyway. Fortunately, the parasite that a mosquito delivered to me was sensitive to another drug called Fansidar, which acts through another mechanism. I was pretty sick for a few days, but recovered without lasting consequences.
While new drugs are being developed to thwart P. falciparum, some researchers are busy developing tools to predict what mutations are likely to occur next in the parasite’s genome. And that’s what is so exciting about the image above. It presents the unprecedented, 3D atomic-resolution structure of a protein made by P. falciparum that’s been a major source of its resistance: the chloroquine-resistance transporter protein, or PfCRT.
In this cropped density map, you see part of the protein’s biochemical structure. The colorized area displays the long, winding chain of amino acids within the protein as helices in shades of green, blue and gold. These helices enclose a central cavity essential for the function of the protein, whose electrostatic properties are shown here as negative (red), positive (blue), and neutral (white). All this structural information was captured using cryo-electron microscopy (cryo-EM). The technique involves flash-freezing molecules in liquid nitrogen and bombarding them with electrons to capture their images with a special camera.
This groundbreaking work, published recently in Nature, comes from an NIH-supported multidisciplinary research team, led by David Fidock, Matthias Quick, and Filippo Mancia, Columbia University Irving Medical Center, New York . It marks a major feat for structural biology, because PfCRT is on the small side for standard cryo-EM and, as Mancia discovered, the protein is almost featureless.
These two strikes made Mancia and colleagues wonder at first whether they would swing and miss at their attempt to image the protein. With the help of coauthor Anthony Kossiakoff, a researcher at the University of Chicago, the team complexed PfCRT to a bulkier antibody fragment. That doubled the size of their subject, and the fragment helped to draw out PfCRT’s hidden features. One year and a lot of hard work later, they got their homerun.
PfCRT is a transport protein embedded in the surface membrane of what passes for the gut of P. falciparum. Because the gene encoding it is highly mutable, the PfCRT protein modified its structure many years ago, enabling it to pump out and render ineffective several drugs in a major class of antimalarials called 4-aminoquinolines. That includes chloroquine.
Now, with the atomic structure in hand, researchers can map the locations of existing mutations and study how they work. This information will also allow them to model which regions of the protein to be on the lookout for the next adaptive mutations. The hope is this work will help to prolong the effectiveness of today’s antimalarial drugs.
For example, the drug piperaquine, a 4-aminoquinoline agent, is now used in combination with another antimalarial. The combination has proved quite effective. But recent reports show that P. falciparum has acquired resistance to piperaquine, driven by mutations in PfCRT that are spreading rapidly across Southeast Asia .
Interestingly, the researchers say they have already pinpointed single mutations that could confer piperaquine resistance to parasites from South America. They’ve also located where new mutations are likely to occur to compromise the drug’s action in Africa, where most malarial infections and deaths occur. So, this atomic structure is already being put to good use.
Researchers also hope that this model will allow drug designers to make structural adjustments to old, less effective malarial drugs and perhaps restore them to their former potency. Perhaps this could even be done by modifying chloroquine, introduced in the 1940s as the first effective antimalarial. It was used worldwide but was largely shelved a few decades later due to resistance—as I experienced three decades ago.
Malaria remains a constant health threat for millions of people living in subtropical areas of the world. Wouldn’t it be great to restore chloroquine to the status of a frontline antimalarial? The drug is inexpensive, taken orally, and safe. Through the power of science, its return is no longer out of the question.
 World malaria report 2019. World Health Organization, December 4, 2019
 Structure and drug resistance of the Plasmodium falciparum transporter PfCRT. Kim J, Tan YZ, Wicht KJ, Erramilli SK, Dhingra SK, Okombo J, Vendome J, Hagenah LM, Giacometti SI, Warren AL, Nosol K, Roepe PD, Potter CS, Carragher B, Kossiakoff AA, Quick M, Fidock DA, Mancia F. Nature. 2019 Dec;576(7786):315-320.
 Determinants of dihydroartemisinin-piperaquine treatment failure in Plasmodium falciparum malaria in Cambodia, Thailand, and Vietnam: a prospective clinical, pharmacological, and genetic study. van der Pluijm RW, Imwong M, Chau NH, Hoa NT, et. al. Lancet Infect Dis. 2019 Sep;19(9):952-961.
Malaria (National Institute of Allergy and Infectious Diseases/NIH)
Fidock Lab (Columbia University Irving Medical Center, New York)
Video: David Fidock on antimalarial drug resistance (BioMedCentral/YouTube)
Kossiakoff Lab (University of Chicago)
Mancia Lab (Columbia University Irving Medical Center)
Matthias Quick (Columbia University Irving Medical Center)
NIH Support: National Institute of Allergy and Infectious Diseases; National Institute of General Medical Sciences; National Heart, Lung, and Blood Institute