אפקה - פרויקטי גמר 2020

Diagnostic System for vocal folds Diseases

Motivation- The Problem

Voice pathologies can interfere with voice quality, speech tone, and tone of voice.

In Israel, about one million people are diagnosed with voice pathologies (12% of

the population). These pathologies reduce the quality of life and significantly

affect society and the economy. Existing technologies offer certain benefits, but

they often lead to late/incorrect diagnoses that delay the treatment required for

the patient.

Main goal

The main goal is to characterize the changes in voice and build a classification

algorithm to identify the vocal pathologies. The purpose of this system will be

classifying diseases in a non-invasive method, additionally backing up the

doctor's diagnosis.

The Method

Creating a computerized learning machine system, able to receive vocal records,

and diagnosing vocal pathologies based on those signals.

Michal Wasserman, Nofar Torem

Advisor: Yirmi Hauptman

Medical Engineering

Data Collection

Building a database based on '.wav' files of patients with various pathologies of

the vocal cords, who were treated by a communication clinician from 'Hadassah

Medical Center’.

The Algorithm

First step: Extracting from the '.wav' files the sequence of 'ah' vowel.

Second step:

Developing an algorithm for data analysis of speech signals. The algorithm

includes features extraction from the time and frequency domain analysis.

Third step:

Finding the best kind of classifier based on the features that were extracted by

subjective and objective model.

Results

Discussion & conclusions:

In this work, existing techniques of feature extraction such as LPC, ZRC, RMS,

HNR, shimmer, jitter, and entropy are implemented in order to classify

pathologies of the vocal tract.

Pathologies classification based on a dataset constructed from speech signals

recorder from different subjects is challenging since certain vocal features are

speaker-dependent, which inevitably affects the results obtained. Notwithstanding

the above, it can be seen that the classifier's results in this study were relatively

good since it has an identification percentage of over 70%. It can be seen that the

higher success [%] was obtained for the logistic regression classifier (73%).

An algorithm that is a reliable and automatic detector

for speech pathologies from speech signals. The

algorithm combines different vocal features and uses

machine learning classifier in order to get the correct

identification. Its noninvasive, fast, and early

detection of voice pathology for professionals.

Classifier type Success percentage

[%]

Sensitivity [%] Specificity [%] Accuracy [%]

Linear SVM

71%

70%

5Logistic regression

73%

72%

Cubic KNN

71%

72%

Weighted KNN

71%

72%

Data Analysis

Healthy /with

Pathology