Four years ago, Valerie Arboleda accomplished something most young medical geneticists rarely do. She helped discover a rare congenital disease now known as KAT6A syndrome . From the original 10 cases to the more than 100 diagnosed today, KAT6A kids share a single altered gene that causes neuro-developmental delays, most prominently in learning to walk and talk, plus a spectrum of possible abnormalities involving the head, face, heart, and immune system.
Now, Arboleda wants to accomplish something even more groundbreaking. With a 2017 NIH Director’s Early Independence Award, she will develop ways to mine Big Data—the voluminous amounts of DNA sequence and other biological information now stored in public databases—to unearth new clues into the biology of rare disorders like KAT6A syndrome. If successful, Arboleda’s work could bring greater precision to the diagnosis and potentially treatment of Mendelian disorders, as well as provide greater clarity into the specific challenges that might lie ahead for an affected child.
The human genome contains more than 20,000 protein-coding genes, which carry the instructions for proteins essential to the structure and function of our cells, tissues and organs. Some of these genes are very similar to each other because, as the genomes of humans and other mammals evolve, glitches in DNA replication sometimes result in extra copies of a gene being made. Those duplicates can be passed along to subsequent generations and, on very rare occasions, usually at a much later point in time, acquire additional modifications that may enable them to serve new biological functions. By starting with a protein shape that has already been fine-tuned for one function, evolution can produce a new function more rapidly than starting from scratch.
Pretty cool! But it leads to a question that’s long perplexed evolutionary biologists: Why don’t duplicate genes vanish from the gene pool almost as soon as they appear? After all, instantly doubling the amount of protein produced in an organism is usually a recipe for disaster—just think what might happen to a human baby born with twice as much insulin or clotting factor as normal. At the very least, duplicate genes should be unnecessary and therefore vulnerable to being degraded into functionless pseudogenes as new mutations arise over time
An NIH-supported team offers a possible answer to this question in a study published in the journal Science. Based on their analysis of duplicate gene pairs in the human and mouse genomes, the researchers suggest that extra genes persist in the genome because of rapid changes in gene activity. Instead of the original gene producing 100 percent of a protein in the body, the gene duo quickly divvies up the job . For instance, the original gene might produce roughly 50 percent and its duplicate the other 50 percent. Most importantly, organisms find the right balance and the duplicate genes can easily survive to be passed along to their offspring, providing fodder for continued evolution.
Tags: DNA, DNA replication, duplicate genes, evolution, evolutionary biology, gene copies, gene duplication, gene expression, gene pool, genes, genomics, genotype, GTEx, human genome, mouse genome, pseudogene, The Genotype-Tissue Expression Project