Researchers at the University of California (UC) San Diego School of Medicine have been awarded $11.7 million to launch the Genetic & Social Determinants of Health: Center for Admixture Science and Technology (CAST) to address the issue of admixed individuals whose DNA reflect multiple ancestries. CAST will use the largest genomic datasets of individuals with diverse ancestry, in combination with socioeconomic data, to better predict health and disease in admixed individuals.
Historical and recent mixing of Europeans, Native Americans, Africans, and Asians has resulted in a relatively large number of admixed individuals in the U.S. Their genomes are a patchwork of DNA segments associated with different races and ethnicities, and may reflect ancestries outside of the individual’s self-identified race. The issue is physicians do not yet know how these DNA segments interact with each other to shape health outcomes, so these genomes are more difficult for them to interpret.
CAST is one of the latest additions to the renowned Centers of Excellence in Genomic Science (CEGS) funded by the NIH. Each center focuses on a unique aspect of genomics research with the intention of blazing new trails in our understanding of human biology and disease.
“To bring the CEGS program to our campus is a huge honor, and a national recognition of UC San Diego as a major player in genomics,” said Lucila Ohno-Machado, MD, PhD, Distinguished Professor of Medicine at UC San Diego School of Medicine, chair of the Department of Biomedical Informatics at UC San Diego Health, and founding faculty of the Halıcıoğlu Data Science Institute.
Ohno-Machado will lead the center with Kelly Frazer, PhD, professor of pediatrics and director of the Institute for Genomic Medicine at UC San Diego School of Medicine, and Melissa Gymrek, PhD, assistant professor at UC San Diego School of Medicine and Jacobs School of Engineering.
Researchers need data on many people’s genomes and health outcomes in order to find consistent relationships among them. The health of individuals from different racial and ethnic groups is also affected by social factors, so this information must be included in models of disease. To do all this, CAST will develop computational tools to combine, protect, and analyze data from two national studies: All of Us Research Program and the Million Veterans Program. These projects aim to recruit one million participants each, equipping CAST with an unprecedentedly large and diverse pool of data.
Their ultimate goal is for anyone to be able to visit their physician, have their genome sequenced, and learn not only if they are at higher risk for any particular disease, but also which prevention and treatment plans are best suited for them.
“As it stands, white people will be able to do this, but our existing knowledge may not be useful to most others,” said Gymrek. “We want to bring the genomic revolution to everyone.”
“People may not realize that a large number of people living in America are likely admixed, so we would be excluding a large portion of our community if we were not taking these mixed genomes into account,” added Ohno-Machado.
CAST will use advanced approaches to study admixed genomes. Their models will consider each individual’s unique patchwork of ancestry, rather than grouping individuals into established categories like “white” or “Asian.” And while most groups focus on changes in single nucleotide polymorphisms (SNPs), the CAST team will consider a much broader spectrum of genetic variation. This includes investigating tandem repeats and the major histocompatibility complex (MHC), which is one of the most diverse sections of the genome across races, in part because it is related to immune function, which is tailored to each population’s local environment.
CAST will also innovate the way large-scale and complex data is processed. The team will develop privacy-preserving algorithms that consult the data in the All of Us and the Million Veterans enclaves without needing to centralize the data in a single place. They will also use natural language processing to extract information on social determinants of health from patients’ clinical notes.
These innovations are expected to come from collaborations between informatics researchers at UC San Diego, the Broad Institute, University of Texas Health, Indiana University and the Veterans Administration.
“I really think we have the dream team here,” said Frazer. “We’re excited to use our complementary expertise to push the limits of genomic medicine at UC San Diego and beyond.”