Parkinson's detection based on combined CNN and LSTM using enhanced speech signals with Variational mode decomposition

Er, Mehmet; Isik, Esme; IŞIK, İBRAHİM

doi:10.1016/j.bspc.2021.103006

Parkinson's detection based on combined CNN and LSTM using enhanced speech signals with Variational mode decomposition

Er M. B., Isik E., IŞIK İ.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL, cilt.70, 2021 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 70
Basım Tarihi: 2021
Doi Numarası: 10.1016/j.bspc.2021.103006
Dergi Adı: BIOMEDICAL SIGNAL PROCESSING AND CONTROL
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Compendex, EMBASE, INSPEC
Anahtar Kelimeler: Parkinson's disease, Long short-term memory, Variational Mode Decomposition, Convolutional Neural Network, EARLY-DIAGNOSIS, DISEASE, CLASSIFICATION
İnönü Üniversitesi Adresli: Evet

Özet

Parkinson's disease (PD) can cause many non-motor and motor symptoms such as speech and smell. One of the difficulties that Parkinson's patients can experience is a change in speech or speaking difficulties. Therefore, the right diagnosis in the early period is important in reducing the possible effects of speech disorders caused by the disease. Speech signal of Parkinson patients shows major differences compared to normal people. In this study, a new approach based on pre-trained deep networks and Long short-term memory (LSTM) by using melspectrograms obtained from denoised speech signals with Variational Mode Decomposition (VMD) for detecting PD from speech sounds is proposed. The proposed model consists of four steps. In the first step, the noise is removed by applying VMD to the signals. In the second step, mel-spectrograms are extracted from the enhanced sound signals with VMD. In the third step, pre-trained deep networks are preferred to extract deep features from the mel-spectrograms. For this purpose, ResNet-18, ResNet-50 and ResNet-101 models are used as pre-trained deep network architecture. In the last step, the classification process is occurred by giving these features as input to the LSTM model, which is designed to define sequential information from the extracted features. Experiments are performed with the PC-GITA dataset, which consists of two classes and is widely used in the literature. The results obtained from the proposed method are compared with the latest methods in the literature, it is seen that it has a better performance in terms of classification performance.