Posted on by Dr. Francis Collins
One of the great mysteries in biology is how we humans ended up with such large, complex brains. In search of clues, researchers have spent years studying the protein-coding genes activated during neurodevelopment. But some answers may also be hiding in non-coding regions of the human genome, where sequences called regulatory elements increase or decrease the activity of genes.
A fascinating example involves a type of regulatory element called a human accelerated region (HAR). Although “human” is part of this element’s name, it turns out that the genomes of all vertebrates—not just humans—contain the DNA segments now designated as HARs.
In most organisms, HARs show a relatively low rate of mutation, which means these regulatory elements have been highly conserved across species throughout evolutionary time . The big exception is Homo sapiens, in which HARs have exhibited a much higher rate of mutations.
The accelerated rate of HARs mutations observed in humans suggest that, over the course of very long periods of time, these genomic changes might have provided our species with some sort of evolutionary advantage. What might that be? Many have speculated the advantage might involve the brain because HARs are often associated with genes involved in neurodevelopment. Now, in a paper published in the journal Neuron, an NIH-supported team confirms that’s indeed the case .
In the new work, researchers found that about half of the HARs in the human genome influence the activity, or expression, of protein-coding genes in neural cells and tissues during the brain’s development . The researchers say their study—the most comprehensive to date of the 3,171 HARs in the human genome—firmly establishes that this type of regulatory element helps to drive patterns of neurodevelopmental gene activity specific to humans.
Yet to be determined is precisely how HARs affect the development of the human brain. The quest to uncover these details will no doubt shed new light on fundamental questions about the brain, its billions of neurons, and their trillions of interconnections. For example, why does human neural development span decades, longer than the life spans of most primates and other mammals? Answering such questions could also reveal new clues into a range of cognitive and behavioral disorders. In fact, early research has already made tentative links between HARs and neurodevelopmental conditions such as autism spectrum disorder and schizophrenia .
The latest work was led by Kelly Girskis, Andrew Stergachis, and Ellen DeGennaro, all of whom were in the lab of Christopher Walsh while working on the project. An NIH grantee, Walsh is director of the Allen Discovery Center for Brain Evolution at Boston Children’s Hospital and Harvard Medical School, which is supported by the Paul G. Allen Foundation Frontiers Group, and is an Investigator of the Howard Hughes Medical Institute.
Though HARs have been studied since 2006, one of the big challenges in systematically assessing them has been technological. The average length of a HAR is about 269 bases of DNA, but current technologies for assessing function can only easily analyze DNA molecules that span 150 bases or less.
Ryan Doan, who was then in the Walsh Lab, and his colleagues solved the problem by creating a new machine called CaptureMPRA. (MPRA is short for “massively parallel reporter assays.”) This technological advance cleverly barcodes HARs and, more importantly, makes it possible to analyze HARs up to about 500 bases in length.
Using CaptureMPRA technology in tandem with cell culture studies, researchers rolled up their sleeves and conducted comprehensive, full-sequence analyses of more than 3,000 HARs. In their initial studies, primarily in neural cells, they found nearly half of human HARs are active to drive gene expression in cell culture. Of those, 42 percent proved to have increased ability to enhance gene expression compared to their orthologues, or counterparts, in chimpanzees.
Next, the team integrated these data with an existing epigenetic dataset derived from developing human brain cells, as well as additional datasets generated from sorted brain cell types. They found that many HARs appeared to have the ability to increase the activity of protein-coding genes, while a smaller—but very significant—subset of the HARs appeared to be enhancing gene expression specifically in neural progenitor cells, which are responsible for making various neural cell types.
The data suggest that as the human HAR sequences mutated and diverged from other mammals, they increased their ability to enhance or sometimes suppress the activity of certain genes in neural cells. To illustrate this point, the researchers focused on two HARs that appear to interact specifically with a gene referred to as R17. This gene can have highly variable gene expression patterns not only in different human cell types, but also in cells from other vertebrates and non-vertebrates.
In the human cerebral cortex, the outermost part of the brain that’s responsible for complex behaviors, R17 is expressed only in neural progenitor cells and only at specific time points. The researchers found that R17 slows the progression of neural progenitor cells through the cell cycle. That might seem strange, given the billions of neurons that need to be made in the cortex. But it’s consistent with the biology. In the human, it takes more than 130 days for the cortex to complete development, compared to about seven days in the mouse.
Clearly, to learn more about how the human brain evolved, researchers will need to look for clues in many parts of the genome at once, including its non-coding regions. To help researchers navigate this challenging terrain, the Walsh team has created an online resource displaying their comprehensive HAR data. It will appear soon, under the name HAR Hub, on the University of California Santa Cruz Genome Browser.
 An RNA gene expressed during cortical development evolved rapidly in humans. Pollard KS, Salama SR, Lambert N, Lambot MA, Coppens S, Pedersen JS, Katzman S, King B, Onodera C, Siepel A, Kern AD, Dehay C, Igel H, Ares M Jr, Vanderhaeghen P, Haussler D. Nature. 2006 Sep 14;443(7108):167-72.
 Rewiring of human neurodevelopmental gene regulatory programs by human accelerated regions. Girskis KM, Stergachis AB, DeGennaro EM, Doan RN, Qian X, Johnson MB, Wang PP, Sejourne GM, Nagy MA, Pollina EA, Sousa AMM, Shin T, Kenny CJ, Scotellaro JL, Debo BM, Gonzalez DM, Rento LM, Yeh RC, Song JHT, Beaudin M, Fan J, Kharchenko PV, Sestan N, Greenberg ME, Walsh CA. Neuron. 2021 Aug 25:S0896-6273(21)00580-8.
 Mutations in human accelerated regions disrupt cognition and social behavior. Doan RN, Bae BI, Cubelos B, Chang C, Hossain AA, Al-Saad S, Mukaddes NM, Oner O, Al-Saffar M, Balkhy S, Gascon GG; Homozygosity Mapping Consortium for Autism, Nieto M, Walsh CA. Cell. 2016 Oct 6;167(2):341-354.
Christopher Walsh Laboratory (Boston Children’s Hospital and Harvard Medical School)
NIH Support: National Institute of Neurological Disorders and Stroke; National Institute of Mental Health; National Institute of General Medical Sciences; National Cancer Institute
Posted on by Dr. Francis Collins
In days mostly gone by, it was fashionable in some circles for people to hand out calling cards to mark their arrival at special social events. This genteel human tradition is now being adapted to the lab to allow certain benign viruses to issue their own high-tech calling cards and mark their arrival at precise locations in the genome. These special locations show where there’s activity involving transcription factors, specialized proteins that switch genes on and off and help determine cell fate.
The idea is that myriad, well-placed calling cards can track brain development over time in mice and detect changes in transcription factor activity associated with certain neuropsychiatric disorders. This colorful image, which won first place in this year’s Show Us Your BRAINs! Photo and Video contest, provides a striking display of these calling cards in action in living brain tissue.
The image comes from Allen Yen, a PhD candidate in the lab of Joseph Dougherty, collaborating with the nearby lab of Rob Mitra. Both labs are located in the Washington University School of Medicine, St. Louis.
Yen and colleagues zoomed in on this section of mouse brain tissue under a microscope to capture dozens of detailed images that they then stitched together to create this high-resolution overview. The image shows neural cells (red) and cell nuclei (blue). But focus in on the neural cells (green) concentrated in the brain’s outer cortex (top) and hippocampus (two lobes in the upper center). They’ve been labelled with calling cards that were dropped off by adeno-associated virus .
Once dropped off, a calling card doesn’t bear a pretentious name or title. Rather, the calling card, is a small mobile snippet of DNA called a transposon. It gets dropped off with the other essential component of the technology: a specialized enzyme called a transposase, which the researchers fuse to one of many specific transcription factors of interest.
Each time one of these transcription factors of interest binds DNA to help turn a gene on or off, the attached transposase “grabs” a transposon calling card and inserts it into the genome. As a result, it leaves behind a permanent record of the interaction.
What’s also nice is the calling cards are programmed to give away their general locations. That’s because they encode a fluorescent marker (in this image, it’s a green fluorescent protein). In fact, Yen and colleagues could look under a microscope and tell from all the green that their calling card technology was in place and working as intended.
The final step, though, was to find out precisely where in the genome those calling cards had been left. For this, the researchers used next-generation sequencing to produce a cumulative history and map of each and every calling card dropped off in the genome.
These comprehensive maps allow them to identify important DNA-protein binding events well after the fact. This innovative technology also enables scientists to attribute past molecular interactions with observable developmental outcomes in a way that isn’t otherwise possible.
While the Mitra and Dougherty labs continue to improve upon this technology, it’s already readily adaptable to answer many important questions about the brain and brain disorders. In fact, Yen is now applying the technology to study neurodevelopment in mouse models of neuropsychiatric disorders, specifically autism spectrum disorder (ASD) . This calling card technology also is available for any lab to deploy for studying a transcription factor of interest.
This research is supported by the Brain Research through Advancing Innovative Neurotechnologies® (BRAIN) Initiative. One of the major goals of BRAIN Initiative is to accelerate the development and application of innovative technologies to gain new understanding of the brain. This award-winning image is certainly a prime example of striving to meet this goal. I’ll look forward to what these calling cards will tell us in the future about ASD and other important neurodevelopmental conditions affecting the brain.
 A viral toolkit for recording transcription factor-DNA interactions in live mouse tissues. Cammack AJ, Moudgil A, Chen J, Vasek MJ, Shabsovich M, McCullough K, Yen A, Lagunas T, Maloney SE, He J, Chen X, Hooda M, Wilkinson MN, Miller TM, Mitra RD, Dougherty JD. Proc Natl Acad Sci U S A. 2020 May 5;117(18):10003-10014.
 A MYT1L Syndrome mouse model recapitulates patient phenotypes and reveals altered brain development due to disrupted neuronal maturation. Jiayang Chen, Mary E. Lambo, Xia Ge, Joshua T. Dearborn, Yating Liu, Katherine B. McCullough, Raylynn G. Swift, Dora R. Tabachnick, Lucy Tian, Kevin Noguchi, Joel R. Garbow, John N. Constantino. bioRxiv. May 27, 2021.
Autism Spectrum Disorder (National Institute of Mental Health/NIH)
Dougherty Lab (Washington University School of Medicine, St. Louis)
Mitra Lab (Washington University School of Medicine)
Show Us Your BRAINs! Photo and Video Contest (BRAIN Initiative/NIH)
NIH Support: National Institute of Neurological Disorders and Stroke; National Institute of Mental Health; National Center for Advancing Translational Sciences; National Human Genome Research Institute; National Institute of General Medical Sciences
Posted on by Dr. Francis Collins
Many factors influence our risk of illness from SARS-CoV-2, the coronavirus responsible for COVID-19. That includes being careful to limit our possible exposures to the virus, as well as whether we have acquired immunity from a vaccine or an earlier infection. But once a person is infected, a host of other biological factors, including age and pre-existing medical conditions, will influence one’s risk of becoming severely ill.
While earlier studies have tied COVID-19 severity to genetic variations in a person’s antiviral defenses and blood type, we still have a lot to learn about how a person’s genetic makeup influences COVID-19 susceptibility and severity. So, I was pleased to see the recent findings of an impressive global effort to map the genetic underpinnings of SARS-CoV-2 infection and COVID-19 severity, which involved analyzing the genomes of many thousands of people with COVID-19 around the globe.
This comprehensive search led to the identification of 13 regions of the human genome that appear to play a role in COVID-19 infection or severity. Though more research is needed to sort out these leads, they represent potentially high-quality clues to the pathways that this virus uses to cause illness, and help to explain why some people are more likely to become infected with SARS-CoV-2 or to develop severe disease.
The international effort, known as The COVID-19 Host Genetics Initiative, is led by Andrea Ganna, Institute for Molecular Medicine Finland, Helsinki, and colleagues in the United States and around the world. Teasing out those important genetic influences is no easy task. It requires vast amounts of data, so Ganna reached out to the scientific community via Twitter to announce a new COVID-19 gene-hunting effort and ask for help. Thousands of researchers around the world answered his call. The new study, published in the journal Nature, includes data collected through the initiative as of January 2021, and represents nearly 50,000 COVID-19 patients and another 2 million uninfected controls .
In search of common gene variants that may influence who becomes infected with SARS-CoV-2 and how sick they will become, Ganna’s international team turned to genome-wide association studies (GWAS). As part of this, the team analyzed patient genome data for millions of so-called single-nucleotide polymorphisms, or SNPs. While these single “letter” nucleotide substitutions found all across the genome are generally of no health significance, they can point the way to the locations of gene variants that turn up more often in association with particular traits or conditions—in this case, COVID-19 susceptibility or severity. To find them, the researchers compared SNPs in people with COVID-19 to those in about 2 million healthy blood donors from the same population groups. They also looked for variants that turned up significantly more often in people who became severely ill.
Their analyses uncovered a number of gene variants associated with SARS-CoV-2 infection or severe COVID-19 in 13 regions of the human genome, six of which were new. Four of the 13 affect a person’s risk for becoming infected with SARS-CoV-2. The other nine influence a person’s risk for developing severe illness following the infection.
Interestingly, some of these gene variants already were known to have associations with other types of lung or autoimmune diseases. The new findings also help to confirm previous studies suggesting that the gene that determines a person’s blood type may influence a person’s susceptibility to SARS-CoV-2 infection, along with other genes that play a role in immunity. For example, the findings show overlap with variants within a gene called TYK2, which was earlier shown to protect against autoimmune-related diseases. Some of the variants also point to the need for further work to study previously unexplored biological processes that may play potentially important roles in COVID-19.
Two of the new variants associated with disease severity were discovered only by including individuals with East Asian ancestry, highlighting the value of diversity in such analyses to gain a more comprehensive understanding of the biology. One of these newfound variants is close to a gene known as FOXP4, which is especially intriguing because this gene is known to play a role in the airways of the lung.
The researchers continue to look for more underlying clues into the biology of COVID-19. In fact, their latest unpublished analysis has increased the number of COVID-19 patients from about 50,000 to 125,000, making it possible to add another 10 gene variants to the list.
 Mapping the human genetic architecture of COVID-19. COVID-19 Host Genetics Initiative. Nature. 2021 Jul 8.
COVID-19 Research (NIH)
Posted on by Dr. Francis Collins
The spinal cord, as a key part of our body’s central nervous system, contains millions of neurons that actively convey sensory and motor (movement) information to and from the brain. Scientists have long sorted these spinal neurons into what they call “cardinal” classes, a classification system based primarily on the developmental origin of each nerve cell. Now, by taking advantage of the power of single-cell genetic analysis, they’re finding that spinal neurons are more diverse than once thought.
This image helps to visualize the story. Each dot represents the nucleus of a spinal neuron in a mouse; humans have a very similar arrangement. Most of these neurons are involved in the regulation of motor control, but they also differ in important ways. Some are involved in local connections (green), such as those that signal outward to a limb and prompt us to pull away reflexively when we touch painful stimuli, such as a hot frying pan. Others are involved in long-range connections (magenta), relaying commands across spinal segments and even upward to the brain. These enable us, for example, to swing our arms while running to help maintain balance.
It turns out that these two types of spinal neurons also have distinctive genetic signatures. That’s why researchers could label them here in different colors and tell them apart. Being able to distinguish more precisely among spinal neurons will prove useful in identifying precisely which ones are affected by a spinal cord injury or neurodegenerative disease, key information in learning to engineer new tissue to heal the damage.
This image comes from a study, published recently in the journal Science, conducted by an NIH-supported team led by Samuel Pfaff, Salk Institute for Biological Studies, La Jolla, CA. Pfaff and his colleagues, including Peter Osseward and Marito Hayashi, realized that the various classes and subtypes of neurons in our spines arose over the course of evolutionary time. They reasoned that the most-primitive original neurons would have gradually evolved subtypes with more specialized and diverse capabilities. They thought they could infer this evolutionary history by looking for conserved and then distinct, specialized gene-expression signatures in the different neural subtypes.
The researchers turned to single-cell RNA sequencing technologies to look for important similarities and differences in the genes expressed in nearly 7,000 mouse spinal neurons. They then used this vast collection of genomic data to group the neurons into closely related clusters, in much the same way that scientists might group related organisms into an evolutionary family tree based on careful study of their DNA.
The first major gene expression pattern they saw divided the spinal neurons into two types: sensory-related and motor-related. This suggested to them that one of the first steps in spinal cord evolution may have been a division of labor of spinal neurons into those two fundamentally important roles.
Further analyses divided the sensory-related neurons into excitatory neurons, which make neurons more likely to fire; and inhibitory neurons, which dampen neural firing. Then, the researchers zoomed in on motor-related neurons and found something unexpected. They discovered the cells fell into two distinct molecular groups based on whether they had long-range or short-range connections in the body. Researches were even more surprised when further study showed that those distinct connectivity signatures were shared across cardinal classes.
All of this means that, while previously scientists had to use many different genetic tags to narrow in on a particular type of neuron, they can now do it with just two: a previously known tag for cardinal class and the newly discovered genetic tag for long-range vs. short-range connections.
Not only is this newfound ability a great boon to basic neuroscientists, it also could prove useful for translational and clinical researchers trying to determine which specific neurons are affected by a spinal injury or disease. Eventually, it may even point the way to strategies for regrowing just the right set of neurons to repair serious neurologic problems. It’s a vivid reminder that fundamental discoveries, such as this one, often can lead to unexpected and important breakthroughs with potential to make a real difference in people’s lives.
 Conserved genetic signatures parcellate cardinal spinal neuron classes into local and projection subsets. Osseward PJ 2nd, Amin ND, Moore JD, Temple BA, Barriga BK, Bachmann LC, Beltran F Jr, Gullo M, Clark RC, Driscoll SP, Pfaff SL, Hayashi M. Science. 2021 Apr 23;372(6540):385-393.
What Are the Parts of the Nervous System? (Eunice Kennedy Shriver National Institute of Child Health and Human Development/NIH)
Spinal Cord Injury (National Institute of Neurological Disorders and Stroke/NIH)
Samuel Pfaff (Salk Institute, La Jolla, CA)
NIH Support: National Institute of Mental Health; National Institute of Neurological Disorders and Stroke; Eunice Kennedy Shriver National Institute of Child Health and Human Development