Javascript must be enabled to continue!
Abstract P3-09-03: Breast cancer subtype distribution among HapMap classified ethnic groups
View through CrossRef
Abstract
Background: There are well established disparities, related to ethnicity, in the presentation, distribution of subtypes and prognosis of breast cancer. To understand the possible biological bases for these differences, it is important to determine in an unbiased manner the ethnicity/geographical origin of patients presenting with breast cancer. We have downloaded Affymetrix SNP 6.0 data generated by The Cancer Genome Atlas project (TCGA) and have extracted ethnically informative SNPs to infer the ethnicity/geographical origin of these patients.
Materials and Methods: The level one SNP data (Affymetrix SNP Array 6.0) for 536 normal DNAs from IBC patients as well as other relevant data were downloaded from the TCGA project data portal (https://tcga-data.nci.nih.gov/tcga/tcgaHome2.jsp). 103 samples were excluded for technical reasons (lack of subtype data or technical outliers). Bioconductor package “crlmm” was used for genotype calling. The HapMap PCA loadings for Affymetrix SNP probes were downloaded from http://www.stats.ox.ac.uk/~davison/software/shellfish/shellfish.php. Probe annotations were extracted from the Affymetrix GenomeWideSNP_6_na30.annot.csv file. “EIGINSTRAT” was used for Principle Component Analysis (PCA). Samples with unknown ethnicity were manually assigned based on their HapMap classification.
Results: Samples were clustered and ethnicity inferred for three major ethnic groups (White, African and Asian) based on 168,905 informative SNPs. These clusters were generally in agreement with the clinical assignment of White, Black or African American and Asian. 10 samples, clinically assigned as Black or Africa American, were placed between the White and African clusters based on the HapMap clustering suggesting admixture. This contrasts other large breast cancer cohorts such as Curtis, et al. 2012, where there were virtually no African genotype patients and there was evident admixture on the European – Asian axis. 113 samples with unknown ethnicity were assigned based on their HapMap classification. Finally there are 371 Whites, 25 African, 20 Asians, and 10 Admixtures with subtype information available. Subtype distribution were LumA 176 (46.68%), LumB 90 (23.87%), Her2 37 (9.81%), Basal 68 (18.04%) in White group; LumA 15 (33.33%), LumB 8 (17.78%), Her2 9 (20%), Basal 13 (28.89%) in non-Whites. The subtype distribution between White and non-White populations was significantly different (p < 0.05, chi-square test), 7 Normal-like samples were excluded due to the low count. The Basal subtype had higher frequency among Africans compared to other ethnic groups. The LumB subtype also showed higher frequency in the 10 admixture samples, half of which were of the LumB subtype.
Discussion: Inferring ethnicity/geographical origin by genotype removes bias known to occur in self reporting. We have successfully classified the TCGA breast cancer cohort into different ethnicity groups based on their ethnically informative SNP data. Although the SNP-based classification is generally in agreement with the clinical assignment, some samples are identified as admixtures between White and African. The subtype distribution varies by ethnicity/geographical origin and in the admixed population.
Citation Information: Cancer Res 2012;72(24 Suppl):Abstract nr P3-09-03.
American Association for Cancer Research (AACR)
Title: Abstract P3-09-03: Breast cancer subtype distribution among HapMap classified ethnic groups
Description:
Abstract
Background: There are well established disparities, related to ethnicity, in the presentation, distribution of subtypes and prognosis of breast cancer.
To understand the possible biological bases for these differences, it is important to determine in an unbiased manner the ethnicity/geographical origin of patients presenting with breast cancer.
We have downloaded Affymetrix SNP 6.
0 data generated by The Cancer Genome Atlas project (TCGA) and have extracted ethnically informative SNPs to infer the ethnicity/geographical origin of these patients.
Materials and Methods: The level one SNP data (Affymetrix SNP Array 6.
0) for 536 normal DNAs from IBC patients as well as other relevant data were downloaded from the TCGA project data portal (https://tcga-data.
nci.
nih.
gov/tcga/tcgaHome2.
jsp).
103 samples were excluded for technical reasons (lack of subtype data or technical outliers).
Bioconductor package “crlmm” was used for genotype calling.
The HapMap PCA loadings for Affymetrix SNP probes were downloaded from http://www.
stats.
ox.
ac.
uk/~davison/software/shellfish/shellfish.
php.
Probe annotations were extracted from the Affymetrix GenomeWideSNP_6_na30.
annot.
csv file.
“EIGINSTRAT” was used for Principle Component Analysis (PCA).
Samples with unknown ethnicity were manually assigned based on their HapMap classification.
Results: Samples were clustered and ethnicity inferred for three major ethnic groups (White, African and Asian) based on 168,905 informative SNPs.
These clusters were generally in agreement with the clinical assignment of White, Black or African American and Asian.
10 samples, clinically assigned as Black or Africa American, were placed between the White and African clusters based on the HapMap clustering suggesting admixture.
This contrasts other large breast cancer cohorts such as Curtis, et al.
2012, where there were virtually no African genotype patients and there was evident admixture on the European – Asian axis.
113 samples with unknown ethnicity were assigned based on their HapMap classification.
Finally there are 371 Whites, 25 African, 20 Asians, and 10 Admixtures with subtype information available.
Subtype distribution were LumA 176 (46.
68%), LumB 90 (23.
87%), Her2 37 (9.
81%), Basal 68 (18.
04%) in White group; LumA 15 (33.
33%), LumB 8 (17.
78%), Her2 9 (20%), Basal 13 (28.
89%) in non-Whites.
The subtype distribution between White and non-White populations was significantly different (p < 0.
05, chi-square test), 7 Normal-like samples were excluded due to the low count.
The Basal subtype had higher frequency among Africans compared to other ethnic groups.
The LumB subtype also showed higher frequency in the 10 admixture samples, half of which were of the LumB subtype.
Discussion: Inferring ethnicity/geographical origin by genotype removes bias known to occur in self reporting.
We have successfully classified the TCGA breast cancer cohort into different ethnicity groups based on their ethnically informative SNP data.
Although the SNP-based classification is generally in agreement with the clinical assignment, some samples are identified as admixtures between White and African.
The subtype distribution varies by ethnicity/geographical origin and in the admixed population.
Citation Information: Cancer Res 2012;72(24 Suppl):Abstract nr P3-09-03.
Related Results
Breast Carcinoma within Fibroadenoma: A Systematic Review
Breast Carcinoma within Fibroadenoma: A Systematic Review
Abstract
Introduction
Fibroadenoma is the most common benign breast lesion; however, it carries a potential risk of malignant transformation. This systematic review provides an ove...
Desmoid-Type Fibromatosis of The Breast: A Case Series
Desmoid-Type Fibromatosis of The Breast: A Case Series
Abstract
IntroductionDesmoid-type fibromatosis (DTF), also called aggressive fibromatosis, is a rare, benign, locally aggressive condition. Mammary DTF originates from fibroblasts ...
Abstract OI-1: OI-1 Decoding breast cancer predisposition genes
Abstract OI-1: OI-1 Decoding breast cancer predisposition genes
Abstract
Women with one or more first-degree female relatives with a history of breast cancer have a two-fold increased risk of developing breast cancer. This risk i...
Predictors of False-Negative Axillary FNA Among Breast Cancer Patients: A Cross-Sectional Study
Predictors of False-Negative Axillary FNA Among Breast Cancer Patients: A Cross-Sectional Study
Abstract
Introduction
Fine-needle aspiration (FNA) is commonly used to investigate lymphadenopathy of suspected metastatic origin. The current study aims to find the association be...
Spanish Breast Cancer Research Group (GEICAM)
Spanish Breast Cancer Research Group (GEICAM)
This section provides current contact details and a summary of recent or ongoing clinical trials being coordinated by Spanish Breast Cancer Research Group (GEICAM). Clinical trials...
PO-285 A review of effects of exercise on the quality of life in breast cancer survivors
PO-285 A review of effects of exercise on the quality of life in breast cancer survivors
Objective Breast cancer is one of the most common malignant tumors in women.The number of women diagnosed with breast cancer each year is also increasing.It is also the leading cau...
Effect of type lll collagen coating of electrospun scaffolds on breast cancer cell apoptosis
Effect of type lll collagen coating of electrospun scaffolds on breast cancer cell apoptosis
Breast cancer arises from the epithelial or the connective tissue components of the breast. Breast cancer is the most commonly diagnosed cancer in women, with about half a million ...
Early-onset breast cancer in East Africa : genetics, tumor characteristics and clinical management
Early-onset breast cancer in East Africa : genetics, tumor characteristics and clinical management
<p dir="ltr">Background: Breast cancer is a leading cause of cancer-related mortality among women worldwide, with particularly poor outcomes in many low-and middle- income co...

