BreCaHAD: A dataset for breast cancer histopathological annotation and diagnosis

Creative Commons License

Aksac A., Demetrick D. J., Ozyer T., Alhajj R.

BMC Research Notes, vol.12, no.1, 2019 (Scopus) identifier identifier identifier

  • Publication Type: Article / Article
  • Volume: 12 Issue: 1
  • Publication Date: 2019
  • Doi Number: 10.1186/s13104-019-4121-7
  • Journal Name: BMC Research Notes
  • Journal Indexes: Scopus
  • Keywords: Annotation, Breast cancer, Dataset, H&E staining, Histopathology, Nottingham histologic score
  • Istanbul Medipol University Affiliated: Yes


Objectives: Histopathological tissue analysis by a pathologist determines the diagnosis and prognosis of most tumors, such as breast cancer. To estimate the aggressiveness of cancer, a pathologist evaluates the microscopic appearance of a biopsied tissue sample based on morphological features which have been correlated with patient outcome. Data description: This paper introduces a dataset of 162 breast cancer histopathology images, namely the breast cancer histopathological annotation and diagnosis dataset (BreCaHAD) which allows researchers to optimize and evaluate the usefulness of their proposed methods. The dataset includes various malignant cases. The task associated with this dataset is to automatically classify histological structures in these hematoxylin and eosin (H&E) stained images into six classes, namely mitosis, apoptosis, tumor nuclei, non-tumor nuclei, tubule, and non-tubule. By providing this dataset to the biomedical imaging community, we hope to encourage researchers in computer vision, machine learning and medical fields to contribute and develop methods/tools for automatic detection and diagnosis of cancerous regions in breast cancer histology images.