Javascript must be enabled to continue!
Mammo-Bench: A Large-scale Benchmark Dataset of Mammography Images
View through CrossRef
Abstract
Breast cancer remains a significant global health concern, and machine learning algorithms and computer-aided detection systems have shown great promise in enhancing the accuracy and efficiency of mammography image analysis. However, there is a critical need for large, benchmark datasets for training deep learning models for breast cancer detection. In this work we developed Mammo-Bench, a large-scale benchmark dataset of mammography images, by collating data from six well-curated resources, viz., DDSM, INbreast, KAU-BCMD, CMMD, CDD-CESM and DMID. To ensure consistency across images from diverse sources while preserving clinically relevant features, a preprocessing pipeline that includes breast segmentation, pectoral muscle removal, and intelligent cropping is proposed. The dataset consists of 19,731 high-quality mammographic images from 6,500 patients across 6 countries and is one of the largest open-source mammography databases to the best of our knowledge. To show the efficacy of training on the large dataset, performance of ResNet101 architecture was evaluated on Mammo-Bench and the results compared by training independently on a few member datasets and an external dataset, VinDr-Mammo. An accuracy of 78.8% (with data augmentation of the minority classes) and 77.8% (without data augmentation) was achieved on the proposed benchmark dataset, compared to the other datasets for which accuracy varied from 25 – 69%. Noticeably, improved prediction of the minority classes is observed with the Mammo-Bench dataset. These results establish baseline performance and demonstrate Mammo-Bench’s utility as a comprehensive resource for developing and evaluating mammography analysis systems.
Title: Mammo-Bench: A Large-scale Benchmark Dataset of Mammography Images
Description:
Abstract
Breast cancer remains a significant global health concern, and machine learning algorithms and computer-aided detection systems have shown great promise in enhancing the accuracy and efficiency of mammography image analysis.
However, there is a critical need for large, benchmark datasets for training deep learning models for breast cancer detection.
In this work we developed Mammo-Bench, a large-scale benchmark dataset of mammography images, by collating data from six well-curated resources, viz.
, DDSM, INbreast, KAU-BCMD, CMMD, CDD-CESM and DMID.
To ensure consistency across images from diverse sources while preserving clinically relevant features, a preprocessing pipeline that includes breast segmentation, pectoral muscle removal, and intelligent cropping is proposed.
The dataset consists of 19,731 high-quality mammographic images from 6,500 patients across 6 countries and is one of the largest open-source mammography databases to the best of our knowledge.
To show the efficacy of training on the large dataset, performance of ResNet101 architecture was evaluated on Mammo-Bench and the results compared by training independently on a few member datasets and an external dataset, VinDr-Mammo.
An accuracy of 78.
8% (with data augmentation of the minority classes) and 77.
8% (without data augmentation) was achieved on the proposed benchmark dataset, compared to the other datasets for which accuracy varied from 25 – 69%.
Noticeably, improved prediction of the minority classes is observed with the Mammo-Bench dataset.
These results establish baseline performance and demonstrate Mammo-Bench’s utility as a comprehensive resource for developing and evaluating mammography analysis systems.
Related Results
Intervention to Improve Mammography Screening at a Student-Run Free Clinic
Intervention to Improve Mammography Screening at a Student-Run Free Clinic
Background: Most late-stage breast cancer occurs in those who have never been screened. Uninsured women have lower screening rates than insured women. Student-run free clinics (SRF...
Abstract P5-01-02: Glandular dose in contrast-enhanced dual-energy mammography
Abstract P5-01-02: Glandular dose in contrast-enhanced dual-energy mammography
Abstract
INTRODUCTION
Estimation of the Average Glandular Dose (AGD) absorbed by the breast during x-ray based examination is an established part of q...
Psychological predictors of intention and avoidance of attending organized mammography screening in Norway: applying the Extended Parallel Process Model
Psychological predictors of intention and avoidance of attending organized mammography screening in Norway: applying the Extended Parallel Process Model
Abstract
Background
Mammography screening is the main method for early detection of breast cancer in Norway. Few studies have focused on psychologic...
Knowledge, attitude and uptake of mammography among female health workers in two tertiary health facilities of Sokoto state, Nigeria
Knowledge, attitude and uptake of mammography among female health workers in two tertiary health facilities of Sokoto state, Nigeria
Background: Globally, breast cancer is the most common cancer in women and early detection remains the cornerstone of breast cancer control in terms of outcome and survival. Female...
Forecasting Rockfall Retention Performance of catch Bench configurations in Mining Models Calibrated to Rockfall Field Data
Forecasting Rockfall Retention Performance of catch Bench configurations in Mining Models Calibrated to Rockfall Field Data
ABSTRACT:
Rockfall hazards pose significant safety risks in open-pit mining, endangering both personnel and equipment. To mitigate these risks, catch benches are ...
Ultra-high resolution, multi-scale, context-aware approach for detection of small cancers on mammography
Ultra-high resolution, multi-scale, context-aware approach for detection of small cancers on mammography
AbstractWhile detection of malignancies on mammography has received a boost with the use of Convolutional Neural Networks (CNN), detection of cancers of very small size remains cha...
Mammo-SAE: Interpreting Breast Cancer Concept Learning with Sparse Autoencoders
Mammo-SAE: Interpreting Breast Cancer Concept Learning with Sparse Autoencoders
Abstract
Interpretability is critical in high-stakes domains such as medical imaging, where understanding model decisions is essential for clinical adoption. In this work, ...
EVOLUTION IN BREAST IMAGING
EVOLUTION IN BREAST IMAGING
Evolution in breast imaging has dramatically changed and upgraded by wide variety of technologic advances from the early days of direct-exposure film mammography to xeromammograp...

