Posted on by Dr. Francis Collins
Over the past year, it’s been so inspiring to watch tens of thousands of people across the country selflessly step forward for vaccine trials and other research studies to combat COVID-19. And they are not alone. Many generous folks are volunteering to take part in other types of NIH-funded research that will improve health all across the spectrum, including the more than 360,000 who’ve already enrolled in the pioneering All of Us Research Program.
Now in its second year, All of Us is building a research community of 1 million participant partners to help us learn more about how genetics, environment, and lifestyle interact to influence disease and affect health. So far, more than 80 percent of participants who have completed all the initial enrollment steps are Black, Latino, rural, or from other communities historically underrepresented in biomedical research.
This community will build a diverse foundation for precision medicine, in which care is tailored to the individual, not the average patient as is now often the case. What’s also paradigm shifting about All of Us is its core value of sharing information back with participants about themselves. It is all done responsibly through each participant’s personal All of Us online account and with an emphasis on protecting privacy.
All of Us participants share their health information in many ways, such as taking part in surveys, offering access to their electronic health records, and providing biosamples (blood, urine, and/or saliva). In fact, researchers recently began genotyping and sequencing the DNA in some of those biosamples, and then returning results from analyses to participants who’ve indicated they’d like to receive such information. The first phase of DNA analysis, genotyping, will provide insights into their genetic ancestry and four traits, including bitter taste perception and tolerance for lactose.
Results of the second phase of DNA analysis, sequencing, will likely be ready in the coming year. These personalized reports will give interested participants information about how their bodies are likely to react to certain medications and about whether they face an increased risk of developing certain health conditions, such as some types of cancer or heart disease. To better understand their results, participants can make a phone appointment with a genetic counselor who is affiliated with the program.
This week, I had the pleasure of delivering the keynote address at the All of Us Virtual Face-to-Face. This lively meeting was attended by a consortium of more than 2,000 All of Us senior staff, program leads with participating healthcare provider organizations and federally qualified health centers, All of Us-supported researchers, community partners, and the all-important participant ambassadors.
If you are interested in becoming part of the All of Us community, I welcome you—there’s plenty of time to get involved! To learn more, just go to Join All of Us.
Join All of Us (NIH)
Posted on by Dr. Francis Collins
Back in April 2003, when the international Human Genome Project successfully completed the first reference sequence of the human DNA blueprint, we were thrilled to have achieved that feat in just 13 years. Sure, the U.S. contribution to that first human reference sequence cost an estimated $400 million, but we knew (or at least we hoped) that the costs would come down quickly, and the speed would accelerate. How far we’ve come since then! A new study shows that whole genome sequencing—combined with artificial intelligence (AI)—can now be used to diagnose genetic diseases in seriously ill babies in less than 24 hours.
Take a moment to absorb this. I would submit that there is no other technology in the history of planet Earth that has experienced this degree of progress in speed and affordability. And, at the same time, DNA sequence technology has achieved spectacularly high levels of accuracy. The time-honored adage that you can only get two out of three for “faster, better, and cheaper” has been broken—all three have been dramatically enhanced by the advances of the last 16 years.
Rapid diagnosis is critical for infants born with mysterious conditions because it enables them to receive potentially life-saving interventions as soon as possible after birth. In a study in Science Translational Medicine, NIH-funded researchers describe the development of a highly automated genome-sequencing pipeline that’s capable of routinely delivering a diagnosis to anxious parents and health-care professionals dramatically earlier than has typically been possible.
While the cost of rapid DNA sequencing continues to fall, challenges remain in utilizing this valuable tool to make quick diagnostic decisions. In most clinical settings, the wait for whole-genome sequencing results still runs more than two weeks. Attempts to obtain faster results also have been labor intensive, requiring dedicated teams of experts to sift through the data, one sample at a time.
In the new study, a research team led by Stephen Kingsmore, Rady Children’s Institute for Genomic Medicine, San Diego, CA, describes a streamlined approach that accelerates every step in the process, making it possible to obtain whole-genome test results in a median time of about 20 hours and with much less manual labor. They propose that the system could deliver answers for 30 patients per week using a single genome sequencing instrument.
Here’s how it works: Instead of manually preparing blood samples, Kingsmore’s team used special microbeads to isolate DNA much more rapidly and with very little labor. The approach reduced the time for sample preparation from 10 hours to less than three. Then, using a state-of-the-art DNA sequencer, they sequenced those samples to obtain good-quality whole-genome data in just 15.5 hours.
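To see how those figures add up, here is a small back-of-the-envelope sketch in Python. It uses only numbers quoted in this post (10 hours and "less than three" hours for sample preparation, 15.5 hours for sequencing, and a median of about 20 hours overall). The function and constant names are made up for illustration, and treating "less than three" as exactly 3 hours is an assumption that gives an upper bound.

```python
# Durations in hours, taken from the figures quoted in this post.
MANUAL_PREP_HOURS = 10.0   # manual sample preparation
BEAD_PREP_HOURS = 3.0      # "less than three" hours, so treated as an upper bound
SEQUENCING_HOURS = 15.5    # sequencing run on the instrument
MEDIAN_TOTAL_HOURS = 20.0  # "about 20 hours" median from sample to result


def prep_hours_saved(manual: float = MANUAL_PREP_HOURS,
                     beads: float = BEAD_PREP_HOURS) -> float:
    """Hours saved by switching from manual to bead-based sample prep."""
    return manual - beads


def wet_lab_hours(prep: float = BEAD_PREP_HOURS,
                  sequencing: float = SEQUENCING_HOURS) -> float:
    """Time from blood sample to raw whole-genome data."""
    return prep + sequencing


def hours_left_for_analysis(total: float = MEDIAN_TOTAL_HOURS) -> float:
    """Rough share of the median turnaround left for everything else."""
    return total - wet_lab_hours()


if __name__ == "__main__":
    print(f"saved on sample prep:     {prep_hours_saved():.1f} h")
    print(f"prep plus sequencing:     {wet_lab_hours():.1f} h")
    print(f"left for remaining steps: {hours_left_for_analysis():.1f} h")
```

On these assumptions, sample preparation and sequencing account for roughly 18.5 of the 20 or so hours, which is why the remaining interpretation step has to be so fast.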
The next potentially time-consuming challenge is making sense of all that data. To speed up the analysis, Kingsmore’s team took advantage of a machine-learning system called MOON. The automated platform sifts through all the data using artificial intelligence to search for potentially disease-causing variants.
The researchers paired MOON with a clinical language processing system, which allowed them to extract relevant information from the child’s electronic health records within seconds. Teaming that patient-specific information with data on more than 13,000 known genetic diseases in the scientific literature, the machine-learning system could pick out a likely disease-causing mutation from 4.5 million potential variants in an impressive 5 minutes or less!
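For readers who like to see an idea in code, here is a toy Python sketch of the general principle: rank candidate variants by how well the clinical features of the associated disease match the features pulled from the patient's health record. This is not how MOON works internally, which this post does not describe. The Jaccard overlap score, the class and function names, and every variant, disease, and feature below are hypothetical stand-ins chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    variant: str                 # hypothetical variant label
    disease: str                 # hypothetical disease name
    disease_features: frozenset  # clinical features linked to that disease


def jaccard(a, b) -> float:
    """Overlap between two feature sets: 0.0 (disjoint) to 1.0 (identical)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def rank_candidates(patient_features, candidates):
    """Return (score, candidate) pairs, best phenotype match first."""
    scored = [(jaccard(patient_features, c.disease_features), c)
              for c in candidates]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    # Features as they might be extracted from a health record (made up).
    patient = {"seizures", "low muscle tone", "feeding difficulty"}
    candidates = [
        Candidate("variant-B", "disease-B",
                  frozenset({"hearing loss", "kidney cysts"})),
        Candidate("variant-A", "disease-A",
                  frozenset({"seizures", "low muscle tone",
                             "developmental delay"})),
        Candidate("variant-C", "disease-C", frozenset({"seizures"})),
    ]
    for score, cand in rank_candidates(patient, candidates):
        print(f"{cand.variant}: {score:.2f}")
```

A real system would weigh far more evidence, such as how rare a variant is and how damaging it is predicted to be. The sketch only shows why patient-specific clinical information narrows millions of variants to a short list so quickly.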
To put the system to the test, the researchers first evaluated its ability to reach a correct diagnosis in a sample of 101 children with 105 previously diagnosed genetic diseases. In nearly every case, the automated diagnosis matched the opinions reached previously via the more lengthy and laborious manual interpretation of experts.
Next, the researchers tested the automated system in assisting diagnosis of seven seriously ill infants in the intensive care unit, and three previously diagnosed infants. They showed that their automated system could reach a diagnosis in less than 20 hours. That’s compared to the fastest manual approach, which typically took about 48 hours. The automated system also required about 90 percent less manpower.
The system nailed a rapid diagnosis for three of the seven infants without returning any false-positive results. Those diagnoses were made with an average time savings of more than 22 hours. In each case, the early diagnosis immediately influenced the treatment those children received. That’s key given that, for young children suffering from serious and unexplained symptoms such as seizures, metabolic abnormalities, or immunodeficiencies, time is of the essence.
Of course, artificial intelligence may never replace doctors and other healthcare providers. Kingsmore notes that 106 years after the invention of the autopilot, two pilots are still required to fly a commercial aircraft. Likewise, health care decisions based on genome interpretation also will continue to require the expertise of skilled physicians.
Still, such a rapid automated system will prove incredibly useful. For instance, this system can provide immediate provisional diagnosis, allowing the experts to focus their attention on more difficult unsolved cases or other needs. It may also prove useful in re-evaluating the evidence in the many cases in which manual interpretation by experts fails to provide an answer.
The automated system may also be useful for periodically reanalyzing data in the many cases that remain unsolved. Keeping up with such reanalysis is a particular challenge considering that researchers continue to discover hundreds of disease-associated genes and thousands of variants each and every year. The hope is that in the years ahead, the combination of whole genome sequencing, artificial intelligence, and expert care will make all the difference in the lives of many more seriously ill babies and their families.
 Diagnosis of genetic diseases in seriously ill children by rapid whole-genome sequencing and automated phenotyping and interpretation. Clark MM, Hildreth A, Batalov S, Ding Y, Chowdhury S, Watkins K, Ellsworth K, Camp B, Kint CI, Yacoubian C, Farnaes L, Bainbridge MN, Beebe C, Braun JJA, Bray M, Carroll J, Cakici JA, Caylor SA, Clarke C, Creed MP, Friedman J, Frith A, Gain R, Gaughran M, George S, Gilmer S, Gleeson J, Gore J, Grunenwald H, Hovey RL, Janes ML, Lin K, McDonagh PD, McBride K, Mulrooney P, Nahas S, Oh D, Oriol A, Puckett L, Rady Z, Reese MG, Ryu J, Salz L, Sanford E, Stewart L, Sweeney N, Tokita M, Van Der Kraan L, White S, Wigby K, Williams B, Wong T, Wright MS, Yamada C, Schols P, Reynders J, Hall K, Dimmock D, Veeraraghavan N, Defay T, Kingsmore SF. Sci Transl Med. 2019 Apr 24;11(489).
DNA Sequencing Fact Sheet (National Human Genome Research Institute/NIH)
Genomics and Medicine (NHGRI/NIH)
Genetic and Rare Disease Information Center (National Center for Advancing Translational Sciences/NIH)
Stephen Kingsmore (Rady Children’s Institute for Genomic Medicine, San Diego, CA)
NIH Support: National Institute of Child Health and Human Development; National Human Genome Research Institute; National Center for Advancing Translational Sciences
Posted on by Dr. Francis Collins
In seeking the biological answer to the question of what it means to be human, the brain’s cerebral cortex is a good place to start. This densely folded, outer layer of grey matter, which is vastly larger in Homo sapiens than in other primates, plays an essential role in human consciousness, language, and reasoning.
Now, an NIH-funded team has pinpointed a key set of genes—found only in humans—that may help explain why our species possesses such a large cerebral cortex. Experimental evidence shows these genes prolong the development of stem cells that generate neurons in the cerebral cortex, which in turn enables the human brain to produce more mature cortical neurons and, thus, build a bigger cerebral cortex than our fellow primates.
That sounds like a great advantage for humans! But there’s a downside. Researchers found the same genomic changes that facilitated the expansion of the human cortex may also render our species more susceptible to certain rare neurodevelopmental disorders.
Posted on by Dr. Francis Collins
Imagine how long it would take to analyze the 37 trillion or so cells that make up the human body if you had to do it by hand, one by one! Still, single-cell analysis is crucial to gaining a comprehensive understanding of our biology. The cell is the unit of life for all organisms, and all cells are certainly not the same. Think about it: even though each cell contains the same DNA, some make up your skin while others build your bones; some of your cells might be super healthy while others could be headed down the road to cancer or Alzheimer’s disease.
So, it’s no surprise that many NIH-funded researchers are hard at work in the rapidly emerging field known as single-cell analysis. In fact, one team recently reported impressive progress in improving the speed and efficiency of a method to analyze certain epigenetic features of individual cells. Epigenetics refers to a multitude of chemical and protein “marks” on a cell’s DNA, patterns that vary among cells and help to determine which genes are switched on or off. These marks play a major role in defining cellular identity as a skin cell, liver cell, or pancreatic cancer cell.
The team’s rather simple but ingenious approach relies on attaching a unique combination of two DNA barcodes to each cell prior to analyzing epigenetic marks all across the genome, making it possible for researchers to pool hundreds of cells without losing track of each of them individually. Using this approach, the researchers could profile thousands of individual cells simultaneously for less than 50 cents per cell, a 50- to 100-fold drop in price. The new approach promises to yield important insights into the role of epigenetic factors in our health, from the way neurons in our brains function to whether or not a cancer responds to treatment.
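The barcoding idea can be illustrated with a short Python sketch. It assumes a simplified picture in which every cell gets one barcode from a first set and one from a second set, and every sequencing read carries both. The barcode names, the read format, and the function names are hypothetical and are not taken from the study.

```python
from collections import defaultdict
from itertools import product


def barcode_pairs(first_set, second_set):
    """All distinct two-barcode combinations available for labeling cells."""
    return list(product(first_set, second_set))


def demultiplex(reads):
    """Group pooled reads back into per-cell lists.

    Each read is a (barcode1, barcode2, payload) tuple; the barcode pair
    identifies the cell the read came from.
    """
    by_cell = defaultdict(list)
    for barcode1, barcode2, payload in reads:
        by_cell[(barcode1, barcode2)].append(payload)
    return dict(by_cell)


if __name__ == "__main__":
    first = ["A1", "A2", "A3"]
    second = ["B1", "B2", "B3", "B4"]
    print(len(barcode_pairs(first, second)), "unique cell labels")

    # Reads from three cells, mixed together in one pool (made-up data).
    pooled = [
        ("A1", "B2", "read-1"),
        ("A3", "B4", "read-2"),
        ("A1", "B2", "read-3"),
        ("A2", "B1", "read-4"),
    ]
    for cell, cell_reads in demultiplex(pooled).items():
        print(cell, cell_reads)
```

Because the number of unique labels is the product of the two set sizes, a modest number of barcodes can tag a large number of cells, which is what makes pooling possible without losing track of any single cell.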
Posted on by Dr. Francis Collins
It’s hard to believe, but it’s been almost 15 years since we successfully completed the Human Genome Project, ahead of schedule and under budget. I was proud to stand with my international colleagues in a celebration at the Library of Congress on April 14, 2003 (which happens to be my birthday), to announce that we had stitched together the very first reference sequence of the human genome at a total cost of about $400 million. As remarkable as that achievement was, it was just the beginning of our ongoing effort to understand the human genome, and to use that understanding to improve human health.
That first reference human genome was sequenced using automated machines that were the size of small phone booths. Since then, breathtaking progress has been made in developing innovative technologies that have made DNA sequencing far easier, faster, and more affordable. Now, a report in Nature Biotechnology highlights the latest advance: the sequencing and assembly of a human genome using a pocket-sized device. The genome was generated using several “nanopore” devices that can be purchased online with a “starter kit” for just $1,000. In fact, this new genome sequence, completed in a matter of weeks, includes some notoriously hard-to-sequence stretches of DNA, filling several key gaps in our original reference genome.