Multiple mask and boundary scoring R-CNN with cGAN data augmentation for bladder tumor segmentation in WLC videos


Freitas N. R., Vieira P. M., Tinoco C., Anacleto S., Oliveira J. F., Vaz A. I. F., ...Daha Fazla

ARTIFICIAL INTELLIGENCE IN MEDICINE, cilt.147, 2024 (SCI-Expanded) identifier identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 147
  • Basım Tarihi: 2024
  • Doi Numarası: 10.1016/j.artmed.2023.102723
  • Dergi Adı: ARTIFICIAL INTELLIGENCE IN MEDICINE
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Aerospace Database, Applied Science & Technology Source, Biotechnology Research Abstracts, CINAHL, Communication Abstracts, Compendex, Computer & Applied Sciences, EMBASE, INSPEC, Library, Information Science & Technology Abstracts (LISTA), MEDLINE, Metadex, Civil Engineering Abstracts
  • Anahtar Kelimeler: Multi-pathology detection, Lesion localization, Texture-constrained GAN
  • İstanbul Medipol Üniversitesi Adresli: Evet

Özet

Automatic diagnosis systems capable of handling multiple pathologies are essential in clinical practice. This study focuses on enhancing precise lesion localization, classification and delineation in transurethral resection of bladder tumor (TURBT) to reduce cancer recurrence. Despite deep learning models success, medical applications face challenges like small and limited datasets and poor image characterization, including the absence lack of color/texture modeling. To address these issues, three solutions are proposed: (1) an improved texture-constrained version of the pix2pixHD cGAN for data augmentation, addressing the tradeoff of generating high-quality images with enough stochasticity using the Fréchet Inception Distance (FID) measure. (2) Introducing the Multiple Mask and Boundary Scoring R-CNN (MM&BS R-CNN), a new mask sub-net scheme where multiple masks are generated from the different levels of the mask sub-net pipeline, improving segmentation accuracy by including a new scoring module to refine object boundaries. (3) A novel accelerated training strategy based on the SGD optimizer with the second momentum. Experimental results show significant mAP improvements: the data generation scheme improves by more than 12 %; MM&BS R-CNN proposed architecture is responsible for an improvement of about 1.25 %, and the training algorithm based on the second-order momentum increases mAP by 2–3 %. The simultaneous use of all three proposals improved the state-of-the-art mAP by 17.44 %.