Javascript must be enabled to continue!
Silhouette scores for assessment of SNP genotype clusters
View through CrossRef
Abstract
Background
High-throughput genotyping of single nucleotide polymorphisms (SNPs) generates large amounts of data. In many SNP genotyping assays, the genotype assignment is based on scatter plots of signals corresponding to the two SNP alleles. In a robust assay the three clusters that define the genotypes are well separated and the distances between the data points within a cluster are short. "Silhouettes" is a graphical aid for interpretation and validation of data clusters that provides a measure of how well a data point was classified when it was assigned to a cluster. Thus "Silhouettes" can potentially be used as a quality measure for SNP genotyping results and for objective comparison of the performance of SNP assays at different circumstances.
Results
We created a program (ClusterA) for calculating "Silhouette scores", and applied it to assess the quality of SNP genotype clusters obtained by single nucleotide primer extension ("minisequencing") in the Tag-microarray format. A Silhouette score condenses the quality of the genotype assignment for each SNP assay into a single numeric value, which ranges from 1.0, when the genotype assignment is unequivocal, down to -1.0, when the genotype assignment has been arbitrary. In the present study we applied Silhouette scores to compare the performance of four DNA polymerases in our minisequencing system by analyzing 26 SNPs in both DNA polarities in 16 DNA samples. We found Silhouettes to provide a relevant measure for the quality of SNP assays at different reaction conditions, illustrated by the four DNA polymerases here. According to our result, the genotypes can be unequivocally assigned without manual inspection when the Silhouette score for a SNP assay is > 0.65. All four DNA polymerases performed satisfactorily in our Tag-array minisequencing system.
Conclusion
"Silhouette scores" for assessing the quality of SNP genotyping clusters is convenient for evaluating the quality of SNP genotype assignment, and provides an objective, numeric measure for comparing the performance of SNP assays. The program we created for calculating Silhouette scores is freely available, and can be used for quality assessment of the results from all genotyping systems, where the genotypes are assigned by cluster analysis using scatter plots.
Springer Science and Business Media LLC
Title: Silhouette scores for assessment of SNP genotype clusters
Description:
Abstract
Background
High-throughput genotyping of single nucleotide polymorphisms (SNPs) generates large amounts of data.
In many SNP genotyping assays, the genotype assignment is based on scatter plots of signals corresponding to the two SNP alleles.
In a robust assay the three clusters that define the genotypes are well separated and the distances between the data points within a cluster are short.
"Silhouettes" is a graphical aid for interpretation and validation of data clusters that provides a measure of how well a data point was classified when it was assigned to a cluster.
Thus "Silhouettes" can potentially be used as a quality measure for SNP genotyping results and for objective comparison of the performance of SNP assays at different circumstances.
Results
We created a program (ClusterA) for calculating "Silhouette scores", and applied it to assess the quality of SNP genotype clusters obtained by single nucleotide primer extension ("minisequencing") in the Tag-microarray format.
A Silhouette score condenses the quality of the genotype assignment for each SNP assay into a single numeric value, which ranges from 1.
0, when the genotype assignment is unequivocal, down to -1.
0, when the genotype assignment has been arbitrary.
In the present study we applied Silhouette scores to compare the performance of four DNA polymerases in our minisequencing system by analyzing 26 SNPs in both DNA polarities in 16 DNA samples.
We found Silhouettes to provide a relevant measure for the quality of SNP assays at different reaction conditions, illustrated by the four DNA polymerases here.
According to our result, the genotypes can be unequivocally assigned without manual inspection when the Silhouette score for a SNP assay is > 0.
65.
All four DNA polymerases performed satisfactorily in our Tag-array minisequencing system.
Conclusion
"Silhouette scores" for assessing the quality of SNP genotyping clusters is convenient for evaluating the quality of SNP genotype assignment, and provides an objective, numeric measure for comparing the performance of SNP assays.
The program we created for calculating Silhouette scores is freely available, and can be used for quality assessment of the results from all genotyping systems, where the genotypes are assigned by cluster analysis using scatter plots.
Related Results
The Impact of IL28B Gene Polymorphisms on Drug Responses
The Impact of IL28B Gene Polymorphisms on Drug Responses
To achieve high therapeutic efficacy in the patient, information on pharmacokinetics, pharmacodynamics, and pharmacogenetics is required. With the development of science and techno...
Expression and polymorphism of genes in gallstones
Expression and polymorphism of genes in gallstones
ABSTRACT
Through the method of clinical case control study, to explore the expression and genetic polymorphism of KLF14 gene (rs4731702 and rs972283) and SR-B1 gene...
Pergeseran Bentuk Siluet Kostum Tari Jaipongan Tahun 1980-2010
Pergeseran Bentuk Siluet Kostum Tari Jaipongan Tahun 1980-2010
ABSTRACKThis article is aimed at identifying the shift in silhouette of Jaipongan costume from its first appearance (in 1980) until thirty years later (2010). The silhouette of Jai...
Hubungan antara SNP rs3761863 terhadap kejadian reaksi reversal pada pasien MH tipe borderline di RSUP Prof. Dr. I.G.N.G. Ngoerah
Hubungan antara SNP rs3761863 terhadap kejadian reaksi reversal pada pasien MH tipe borderline di RSUP Prof. Dr. I.G.N.G. Ngoerah
Introduction: Reversal reaction (RR) is one of the morbidity burdens for Hansen's disease (MH) patients undergoing multi-drug therapy. Some risk factors for RR include age, stress,...
Characterization and Preparation of Sago Starch (SS) Based Reinforced with Silver Nanoparticle (SNP)
Characterization and Preparation of Sago Starch (SS) Based Reinforced with Silver Nanoparticle (SNP)
This paper reported on the properties of sago starch (SS) films impregnated with different concentration of sliver nanoparticles (SNP) of 100, 2000, 5000 rpm with weight ratio of 1...
Sianlihan laatuominaisuuksien genominen analyysi SNP-markkereiden avulla
Sianlihan laatuominaisuuksien genominen analyysi SNP-markkereiden avulla
Sianlihan laadulla on tärkeä merkitys teollisten lihatuotteiden eri prosessointivaiheissa ja kuluttajien käyttäessä tuorelihaa ruuan valmistukseen. Sianlihasta valmistetun ruuan tu...
Assessment of economic and environmental impacts of two typical cotton genotypes with contrasting potassium efficiency
Assessment of economic and environmental impacts of two typical cotton genotypes with contrasting potassium efficiency
AbstractIt is essential to produce optimal crop yields while reducing adverse environmental impacts of overfertilization. Therefore, nutrient‐efficient plants may play a major role...
Utility of silhouette showcards to assess adiposity in three countries across the epidemiological transition
Utility of silhouette showcards to assess adiposity in three countries across the epidemiological transition
Abstract
Background: The Pulvers’ silhouette showcards provide a non-invasive, easy-to-use, and possibly cross-culturally acceptable way of assessing an individual’s percep...

