Item request has been placed!
×
Item request cannot be made.
×
Processing Request
Differentiation of speech in Parkinson's disease and spinocerebellar degeneration using deep neural networks.
Item request has been placed!
×
Item request cannot be made.
×
Processing Request
- Author(s): Eguchi, Katsuki1,2 (AUTHOR) ; Yaguchi, Hiroaki2 (AUTHOR); Kudo, Ikue1 (AUTHOR); Kimura, Ibuki1 (AUTHOR); Nabekura, Tomoko1 (AUTHOR); Kumagai, Ryuto3 (AUTHOR); Fujita, Kenichi1 (AUTHOR); Nakashiro, Yuichi1 (AUTHOR); Iida, Yuki1 (AUTHOR); Hamada, Shinsuke1 (AUTHOR); Honma, Sanae1 (AUTHOR); Takei, Asako1 (AUTHOR); Moriwaka, Fumio1 (AUTHOR); Yabe, Ichiro2 (AUTHOR)
- Source:
Journal of Neurology. Feb2024, Vol. 271 Issue 2, p1004-1012. 9p.
- Subject Terms:
- Additional Information
- Abstract:
Introduction: Assessing dysarthria features in patients with neurodegenerative diseases helps diagnose underlying pathologies. Although deep neural network (DNN) techniques have been widely adopted in various audio processing tasks, few studies have tested whether DNNs can help differentiate neurodegenerative diseases using patients' speech data. This study evaluated whether a DNN model using a transformer architecture could differentiate patients with Parkinson's disease (PD) from patients with spinocerebellar degeneration (SCD) using speech data. Methods: Speech data were obtained from 251 and 101 patients with PD and SCD, respectively, while they read a passage. We fine-tuned a pre-trained DNN model using log-mel spectrograms generated from speech data. The DNN model was trained to predict whether the input spectrogram was generated from patients with PD or SCD. We used fivefold cross-validation to evaluate the predictive performance using the area under the receiver operating characteristic curve (AUC) and accuracy, sensitivity, and specificity. Results: Average ± standard deviation of the AUC, accuracy, sensitivity, and specificity of the trained model for the fivefold cross-validation were 0.93 ± 0.04, 0.87 ± 0.03, 0.83 ± 0.05, and 0.89 ± 0.05, respectively. Conclusion: The DNN model can differentiate speech data of patients with PD from that of patients with SCD with relatively high accuracy and AUC. The proposed method can be used as a non-invasive, easy-to-perform screening method to differentiate PD from SCD using patient speech and is expected to be applied to telemedicine. [ABSTRACT FROM AUTHOR]
- Abstract:
Copyright of Journal of Neurology is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
No Comments.