Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

De novo genome assemblies from two Indigenous Americans from Arizona identify new polymorphisms in non-reference sequences

View through CrossRef
ABSTRACT There is a collective push to diversify human genetic studies by including underrepresented populations. However, analyzing DNA sequence reads involves the initial step of aligning the reads to the GRCh38/hg38 reference genome which is inadequate for non-European ancestries. To help address this issue, we created a modified hg38 reference map using de novo sequence assemblies from Indigenous Americans living in Arizona (IAZ). Using HiFi SMRT long-read sequencing technology, we generated de novo genome assemblies for one female and one male IAZ individual. Each assembly included ∼17 Mb of DNA sequence not present (non-reference sequence; NRS) in hg38, which consists mostly of repeat elements. Forty NRSs totaling 240 kb were uniquely anchored to the hg38 primary assembly generating a modified hg38-NRS reference genome. DNA sequence alignment and variant calling were then conducted with WGS sequencing data from 387 IAZ cohorts using both the hg38 and modified hg38-NRS reference maps. Variant calling with the hg38-NRS map identified ∼50,000 single nucleotide variants present in at least 5% of the WGS samples which were not detected with the hg38 reference map. We also directly assessed the NRSs positioned within genes. Seventeen NRSs anchored to regions including an identical 187 bp NRS found in both de novo assemblies. The NRS is located in HCN2 79 bp downstream of exon 3 and contains several putative transcriptional regulatory elements. Genotyping of the HCN2 -NRS revealed that the insertion is enriched in IAZ (MAF = 0.45) compared to Caucasians (MAF = 0.15) and African Americans (MAF = 0.03). This study shows that inclusion of population-specific NRSs can dramatically change the variant profile in an under-represented ethnic groups and thereby lead to the discovery of previously missed common variations. AUTHOR SUMMARY GRCh38/hg38 reference genome has been the standard reference for large-scale human genetics studies. However, it does not adequately represent sequences of non-European ancestry. In this study, using long-read sequencing technology, we constructed de novo sequence assemblies from two Indigenous Americans from Arizona. We then compared the de novo assemblies to the hg38 reference genome to identify non-reference sequences (NRSs). We integrated these NRSs into our whole-genome sequencing (WGS) variant calling pipeline to improve read alignment and variant detection. We also directly assessed the NRSs positioned within genes. Inclusion of population-specific NRSs dramatically changed the variant profile of our study group with under-represented ethnicity, revealing common variation not detected by our previous population-level WGS and genotyping studies.
Title: De novo genome assemblies from two Indigenous Americans from Arizona identify new polymorphisms in non-reference sequences
Description:
ABSTRACT There is a collective push to diversify human genetic studies by including underrepresented populations.
However, analyzing DNA sequence reads involves the initial step of aligning the reads to the GRCh38/hg38 reference genome which is inadequate for non-European ancestries.
To help address this issue, we created a modified hg38 reference map using de novo sequence assemblies from Indigenous Americans living in Arizona (IAZ).
Using HiFi SMRT long-read sequencing technology, we generated de novo genome assemblies for one female and one male IAZ individual.
Each assembly included ∼17 Mb of DNA sequence not present (non-reference sequence; NRS) in hg38, which consists mostly of repeat elements.
Forty NRSs totaling 240 kb were uniquely anchored to the hg38 primary assembly generating a modified hg38-NRS reference genome.
DNA sequence alignment and variant calling were then conducted with WGS sequencing data from 387 IAZ cohorts using both the hg38 and modified hg38-NRS reference maps.
Variant calling with the hg38-NRS map identified ∼50,000 single nucleotide variants present in at least 5% of the WGS samples which were not detected with the hg38 reference map.
We also directly assessed the NRSs positioned within genes.
Seventeen NRSs anchored to regions including an identical 187 bp NRS found in both de novo assemblies.
The NRS is located in HCN2 79 bp downstream of exon 3 and contains several putative transcriptional regulatory elements.
Genotyping of the HCN2 -NRS revealed that the insertion is enriched in IAZ (MAF = 0.
45) compared to Caucasians (MAF = 0.
15) and African Americans (MAF = 0.
03).
This study shows that inclusion of population-specific NRSs can dramatically change the variant profile in an under-represented ethnic groups and thereby lead to the discovery of previously missed common variations.
AUTHOR SUMMARY GRCh38/hg38 reference genome has been the standard reference for large-scale human genetics studies.
However, it does not adequately represent sequences of non-European ancestry.
In this study, using long-read sequencing technology, we constructed de novo sequence assemblies from two Indigenous Americans from Arizona.
We then compared the de novo assemblies to the hg38 reference genome to identify non-reference sequences (NRSs).
We integrated these NRSs into our whole-genome sequencing (WGS) variant calling pipeline to improve read alignment and variant detection.
We also directly assessed the NRSs positioned within genes.
Inclusion of population-specific NRSs dramatically changed the variant profile of our study group with under-represented ethnicity, revealing common variation not detected by our previous population-level WGS and genotyping studies.

Related Results

Burden of the Beast
Burden of the Beast
Introduction Throughout the COVID-19 pandemic, and its fluctuating waves of infections and the emergence of new variants, Indigenous populations in Australia and worldwide have re...
Tlacoqualli in Monequi "The Center Good"
Tlacoqualli in Monequi "The Center Good"
Photo by Andrew James on Unsplash INTRODUCTION Since its inception, bioethics has focused on Western conceptions of ethics and science. This has provided a strong foundation to bui...
Reclaiming the Wasteland: Samson and Delilah and the Historical Perception and Construction of Indigenous Knowledges in Australian Cinema
Reclaiming the Wasteland: Samson and Delilah and the Historical Perception and Construction of Indigenous Knowledges in Australian Cinema
It was always based on a teenage love story between the two kids. One is a sniffer and one is not. It was designed for Central Australia because we do write these kids off there. N...
The Impact of IL28B Gene Polymorphisms on Drug Responses
The Impact of IL28B Gene Polymorphisms on Drug Responses
To achieve high therapeutic efficacy in the patient, information on pharmacokinetics, pharmacodynamics, and pharmacogenetics is required. With the development of science and techno...
Indigenous Intellectual Property: A Handbook of Contemporary Research
Indigenous Intellectual Property: A Handbook of Contemporary Research
Edited CollectionRimmer, Matthew (Ed.) (2015) Indigenous Intellectual Property: A Handbook of Contemporary Research. Research Handbooks in Intellectual Property. Edward Elgar, Chel...
A fresh approach to indigenous business education
A fresh approach to indigenous business education
Purpose – The social and economic disadvantages confronted by many Indigenous Australians are well known. A close look at Indigenous employment highlights that Indi...
Indigenous Rights
Indigenous Rights
The attention given to indigenous rights has increased since the approval of the United Nations Declaration on the Rights of Indigenous Peoples (UNDRIP) in 2007. Although it is a s...
Contemporary Indigenous Social and Political Thought
Contemporary Indigenous Social and Political Thought
The contemporary continental emergence of a significant number of indigenous intellectuals who have been trained in the academic fields of social sciences (history, anthropology, s...

Back to Top