Convolutional neural network classifies pathological voice change in laryngeal cancer with high accuracy

Hyunbum Kim, Juhyeong Jeon, Yeon Jae Han, Younghoon Joo, Jonghwan Lee, Seungchul Lee, Sun Im

Research output: Contribution to journal › Article › peer-review

64 Scopus citations

Abstract

Voice changes may be the earliest signs of laryngeal cancer. We investigated whether automated voice signal analysis can be used to distinguish patients with laryngeal cancer from healthy subjects. We extracted features using the software package for speech analysis in phonetics (PRAAT) and calculated the Mel-frequency cepstral coefficients (MFCCs) from voice samples of the vowel sound /a:/. The proposed method was tested with six algorithms: support vector machine (SVM), extreme gradient boosting (XGBoost), light gradient boosted machine (LGBM), artificial neural network (ANN), one-dimensional convolutional neural network (1D-CNN), and two-dimensional convolutional neural network (2D-CNN). Their performance was evaluated in terms of accuracy, sensitivity, and specificity. The results were compared with human performance: a total of four volunteers, two of whom were trained laryngologists, rated the same files. The 1D-CNN showed the highest accuracy, 85%, with sensitivity and specificity of 78% and 93%, respectively. The two laryngologists achieved an accuracy of 69.9% but a sensitivity of only 44%. Automated analysis of voice signals could differentiate subjects with laryngeal cancer from healthy subjects with better diagnostic performance than that of the four volunteers.
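The three evaluation measures named in the abstract can be illustrated with a minimal sketch. It assumes binary labels in which 1 denotes laryngeal cancer and 0 denotes a healthy subject. The function name `binary_metrics` and the toy labels are hypothetical and are not taken from the study's code or data.

```python
def binary_metrics(y_true, y_pred):
    """Return (accuracy, sensitivity, specificity) for binary labels.

    Assumed convention (not from the paper): 1 = cancer, 0 = healthy.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # cancer correctly flagged
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # healthy correctly cleared
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # healthy wrongly flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # cancer missed

    accuracy = (tp + tn) / len(pairs)
    # Sensitivity: fraction of true cancer cases that were detected.
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    # Specificity: fraction of healthy subjects correctly identified.
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return accuracy, sensitivity, specificity


# Toy labels for illustration only; these are not results from the study.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 1]
acc, sens, spec = binary_metrics(y_true, y_pred)
print(f"accuracy={acc:.2f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

Reporting sensitivity alongside accuracy matters in a screening setting: a rater can reach moderate accuracy while still missing many cancer cases, which is the pattern the abstract reports for the laryngologists.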

Original language: English
Article number: 3415
Pages (from-to): 1-15
Number of pages: 15
Journal: Journal of Clinical Medicine
Volume: 9
Issue number: 11
State: Published - Nov 2020

Bibliographical note

Publisher Copyright:
© 2020 by the authors. Licensee MDPI, Basel, Switzerland.

Keywords

  • Deep learning
  • Larynx cancer
  • Machine learning
  • Voice change
  • Voice pathology classification
