Voice pathology can be successfully identified using computer-aided voice pathology categorization approaches. The goal of this research is to develop a tool using deep learning technique to diagnose voice pathological conditions including age and gender aspects. Here, a dataset of vocal pathology is used to train a Convolutional Neural Network (CNN) which is a pre-trained model to improve classification accuracy. The proposed work exhibits a high level of instantaneous, automatic diagnostic and therapeutic alternatives to achieve classification accuracy. Usually, recordings from people of all ages are used to develop algorithms that can recognize speech abnormalities automatically. However, this approach might not be sufficient due to the acoustic changes in the voice brought on by aging naturally. Additionally, it can be more challenging to identify pathological voice since the quality of older voices changes in ways that are comparable to those linked to vocal problems. Research developing methods that automatically account for speaker age to make it easier to identify speech anomalies is therefore of interest. The current study introduces an age/gender based automated classification utilizing both healthy and pathological voice.

1.
V.
Lyberg-Åhlander
,
R.
Rydell
,
P.
Fredlund
,
C.
Magnusson
and
S.
Wilén
, S, "
Prevalence of voice disorders in the general population, based on the Stockholm public health cohort
."
Journal of Voice
33
, no.
6
(
2019
):
900
905
.
2.
Thomas
,
Prisca
M.
,
Srikanth
Nayak
, and
Usha
Devadas
. "
Prevalence and voice characteristics in an Indian treatment-seeking population for voice disorders
."
Otorhinolaryngology Clinics
13
, no.
3
(
2021
):
110
117
.
3.
S.K.
Ravi
,
P.
Sumanth
,
T.
Saraswathi
,
MAB.
Chinoor
,
N.
Ashwini
, and
E.
Ahemed
, “
Prevalence of communication disorders among school children in Ballari, South India: A cross-sectional study
Clinical Epidemiology and Global Health
,
12
, (
2021
) pp.
100851
.
4.
R.
Mireles-Rios
,
V. M.
Rios
, and
A.
Reyes
Pushed out for missing school: The role of social disparities and school truancy in dropping out
.
Education Sciences
, (
2020
),
10
(
4
), p.
108
.
5.
R.
Islam
,
M.
Tarique
and
E.
Abdel-Raheem E
A survey on signal processing based pathological voice detection techniques
IEEE Access
, (
2020
),
8
, pp.
66749
66776
.
6.
J.A
Gómez-García
,
L.
Moro-Velázquez
,
J.
Mendes-Laureano
,
G.
Castellanos-Dominguez
and
J.I
Godino-Llorente
, “
Emulating the perceptual capabilities of a human evaluator to map the GRB scale for the assessment of voice disorders
Engineering Applications of Artificial Intelligence
, (
2019
)
82
, pp.
236
251
.
7.
Mohammed
,
Mazin
Abed
,
Karrar Hameed
Abdulkareem
,
Salama A.
Mostafa
,
Mohd Khanapi Abd
Ghani
,
Mashael S.
Maashi
,
Begonya
Garcia-Zapirain
,
Ibon
Oleagordia
,
Hosam
Alhakami
, and
Fahad Taha
Al-Dhief
. "
Voice pathology detection and classification using convolutional neural network model
."
Applied Sciences
10
, no.
11
(
2020
):
3723
.
8.
Syed
,
Sidra
Abid
,
Munaf
Rashid
,
Samreen
Hussain
, and
Hira
Zahid
. "
Comparative analysis of CNN and RNN for voice pathology detection
."
BioMed Research International
2021
(
2021
):
1
8
.
9.
Gidaye
,
Girish
,
Jagannath
Nirmal
,
Kadria
Ezzine
, and
Mondher
Frikha
. "
Wavelet sub-band features for voice disorder detection and classification
."
Multimedia Tools and Applications
79
(
2020
):
28499
28523
.
10.
Alves
,
Marco
,
Gabriel
Silva
,
Bruno C.
Bispo
,
María E.
Dajer
, and
Pedro M.
Rodrigues
. "
Voice disorders detection through multiband cepstral features of sustained vowel
."
Journal of Voice
(
2021
).
11.
AL-Dhief
,
Fahad
Taha
,
Nurul Mu’azzah Abdul
Latiff
,
Nik Noordini Nik Abd
Malik
,
Naseer
Sabri
,
Marina Mat
Baki
,
Musatafa Abbas Abbood
Albadr
,
Aymen Fadhil
Abbas
,
Yaqdhan Mahmood
Hussein
, and
Mazin Abed
Mohammed
. "Voice pathology detection using machine learning technique." In
2020 IEEE 5th international symposium on telecommunication technologies (ISTT)
, pp.
99
104
.
IEEE
,
2020
.
12.
Mallela
,
Jhansi
,
Aravind
Illa
,
B. N.
Suhas
,
Sathvik
Udupa
,
Yamini
Belur
,
Nalini
Atchayaram
,
Ravi
Yadav
,
Pradeep
Reddy
,
Dipanjan
Gope
, and
Prasanta Kumar
Ghosh
. "
Voice based classification of patients with Amyotrophic Lateral Sclerosis, Parkinson’s Disease and healthy controls with CNN-LSTM using transfer learning
." In
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp.
6784
6788
.
IEEE
,
2020
.
13.
Wilkinson
,
Nicholas
, and
Thomas
Niesler
. "
A hybrid CNN-BiLSTM voice activity detector
." In
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp.
6803
6807
.
IEEE
,
2021
.
14.
Islam
,
Rumana
,
Mohammed
Tarique
, and
Esam
Abdel-Raheem
. "
A survey on signal processing based pathological voice detection techniques
."
IEEE Access
8
(
2020
):
66749
66776
.
15.
Kotarba
,
Katarzyna
, and
Michal
Kotarba
. "Efficient detection of specific language impairment in children using ResNet classifier." In
2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)
, pp.
169
173
.
IEEE
,
2020
.
16.
Reddy
,
Mittapalle
Kiran
,
Paavo
Alku
, and
Krothapalli Sreenivasa
Rao
. "
Detection of specific language impairment in children using glottal source features
."
IEEE Access
8
(
2020
):
15273
15279
.
17.
Gumelar
,
Agustinus
Bimo
,
Eko Mulyanto
Yuniarno
,
Wiwik
Anggraeni
,
Indar
Sugiarto
,
Vincentius Raki
Mahindara
, and
Mauridhi Hery
Purnomo
. "
Enhancing detection of pathological voice disorder based on deep VGG-16 CNN
." In
2020 3rd International Conference on Biomedical Engineering (IBIOMED)
, pp.
28
33
.
IEEE
,
2020
.
18.
Hossain
,
M.
Shamim
,
Ghulam
Muhammad
, and
Atif
Alamri
. "
Smart healthcare monitoring: a voice pathology detection paradigm for smart cities
."
Multimedia Systems
25
(
2019
):
565
575
.
19.
Ankışhan
,
Haydar
, and
Sıtkı Çağdaş
İnam
. "
Voice pathology detection by using the deep network architecture
."
Applied Soft Computing
106
(
2021
):
107310
.
20.
Harar
,
Pavol
,
Zoltan
Galaz
,
Jesus B.
Alonso-Hernandez
,
Jiri
Mekyska
,
Radim
Burget
, and
Zdenek
Smekal
. "
Towards robust voice pathology detection: Investigation of supervised deep learning, gradient boosting, and anomaly detection approaches across four databases
."
Neural Computing and Applications
32
(
2020
):
15747
15757
.
This content is only available via PDF.
You do not currently have access to this content.