Comparison of 3D CNN based deep learning architectures using hyperspectral images

Firat, Huseyin; HANBAY, DAVUT

doi:10.17341/gazimmfd.977688

Comparison of 3D CNN based deep learning architectures using hyperspectral images

JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, cilt.38, sa.1, ss.521-534, 2022 (SCI-Expanded, Scopus, TRDizin)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 38 Sayı: 1
Basım Tarihi: 2022
Doi Numarası: 10.17341/gazimmfd.977688
Dergi Adı: JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Art Source, Compendex, TR DİZİN (ULAKBİM)
Sayfa Sayıları: ss.521-534
Anahtar Kelimeler: Hyperspectral image classification, deep learning, 3D convolutional neural network, principal component analysis, CLASSIFICATION
Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
İnönü Üniversitesi Adresli: Evet

Özet

Hyperspectral images (HSI) are 3-dimensional (3D) image cubes with two spatial and one spectral dimensions. The development of deep learning methods has had a significant impact on HSI classification. Especially convolutional neural network (CNN) based methods are getting more attention in this field. In this study, we make use of the deep learning architectures LeNet5, AlexNet, VGG16, GoogleNet and ResNet50, which are among the successful examples of CNN for the HSI classification problem. We use a 3D CNN-based hybrid approach when using these architectures. Because, using 3D CNN, spectral-spatial features are extracted simultaneously. In this case, the classification accuracy of HSIs is increased with the spectral-spatial-based deep learning architecture. However, in the proposed model, principal component analysis (PCA) is used as a preprocessing technique for optimal band extraction from HSIs. After applying PCA, 3D cubes are obtained by neighborhood extraction and given to the input of deep learning architectures. Indian pines, Salinas, Botswana and HyRANK-Loukia datasets were used to compare the classification performances of 3D CNN-based deep learning architectures. As a result of the applications, the best classification accuracy was obtained with VGG16 architectures in Indian pines dataset, ResNet50 in Botswana dataset, VGG16 in HyRANK-Loukia dataset, LeNet5 and VGG16 architectures in Salinas dataset.