Comparison of 3D CNN based deep learning architectures using hyperspectral images


Creative Commons License

Firat H., HANBAY D.

JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, cilt.38, sa.1, ss.521-534, 2022 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 38 Sayı: 1
  • Basım Tarihi: 2022
  • Doi Numarası: 10.17341/gazimmfd.977688
  • Dergi Adı: JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Art Source, Compendex, TR DİZİN (ULAKBİM)
  • Sayfa Sayıları: ss.521-534
  • Anahtar Kelimeler: Hyperspectral image classification, deep learning, 3D convolutional neural network, principal component analysis, CLASSIFICATION
  • İnönü Üniversitesi Adresli: Evet

Özet

Hyperspectral images (HSI) are 3-dimensional (3D) image cubes with two spatial and one spectral dimensions. The development of deep learning methods has had a significant impact on HSI classification. Especially convolutional neural network (CNN) based methods are getting more attention in this field. In this study, we make use of the deep learning architectures LeNet5, AlexNet, VGG16, GoogleNet and ResNet50, which are among the successful examples of CNN for the HSI classification problem. We use a 3D CNN-based hybrid approach when using these architectures. Because, using 3D CNN, spectral-spatial features are extracted simultaneously. In this case, the classification accuracy of HSIs is increased with the spectral-spatial-based deep learning architecture. However, in the proposed model, principal component analysis (PCA) is used as a preprocessing technique for optimal band extraction from HSIs. After applying PCA, 3D cubes are obtained by neighborhood extraction and given to the input of deep learning architectures. Indian pines, Salinas, Botswana and HyRANK-Loukia datasets were used to compare the classification performances of 3D CNN-based deep learning architectures. As a result of the applications, the best classification accuracy was obtained with VGG16 architectures in Indian pines dataset, ResNet50 in Botswana dataset, VGG16 in HyRANK-Loukia dataset, LeNet5 and VGG16 architectures in Salinas dataset.