Swin Transformer based Siamese Network for Thermal and Optical Image Registration Termal ve Optik Görüntü Çakiştirmasi için Swin Dönüştürücü tabanli Siyam Aǧi


Elsaeidy M., Yagmur I. C., Ateş H. F., GÜNTÜRK B. K.

31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023, İstanbul, Türkiye, 5 - 08 Temmuz 2023 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/siu59756.2023.10224035
  • Basıldığı Şehir: İstanbul
  • Basıldığı Ülke: Türkiye
  • Anahtar Kelimeler: Keypoint, Multi-modal image registration, Transformer network
  • İstanbul Medipol Üniversitesi Adresli: Evet

Özet

The process of multi-modal image registration is fundamental in remote sensing and visual navigation applications. However, existing image registration methods that are designed for single modality images do not provide satisfactory results when applied to multi-modal image registration. In this research, our objective is to achieve highly accurate alignment of both infrared and optical (visible range) images. To accomplish this goal, we explore the effectiveness of the Swin Transformer encoder and cosine loss in enhancing the keypoint-based image registration process. Simulation results show the improvement achieved in multi-modal registration by using a transformer based Siamese network.