In environments characterized by elevated noise levels, such as airports or aircraft cabins, travelers often find themselves involuntarily speaking loudly and drawing closer to one another in an effort to enhance communication and speech intelligibility. Unfortunately, this unintentional behaviour increases the risk of respiratory particles dispersion, potentially carrying infectious agents like bacteria which makes the contagion control more challenging. The accurate characterization of the risk associated to speaking, in such a challenging noise environment with multiple overlapping speech sources, is therefore of outmost importance. Among the most advanced signal processing strategies that can be used to accurately determine who spoke when and with whom and for how long but most importantly how loudly, at one location, artificial intelligence-based speaker diarization approaches were considered and adapted for this task. The speaker diarization algorithms implementation, specifically adapted in this study to provide the relevant speaker and speech parameters without revealing what was spoken are described in this article. Moreover, the validation and preliminary studies’ results are presented. By extracting pertinent features from participants' speech, the developed algorithms facilitate the calculation of speech duration and overall sound pressure level associated to each sentence and speaker, which are crucial metrics for assessing potential viral contaminant spread. The paper explores the application of speaker diarization algorithms in noisy environments, emphasizing its relevance in multi-participant speaker scenarios within confined spaces. The insights gained from this research contribute to the development of proactive measures, enhancing our ability to effectively contain and manage communicable diseases in air travel settings.
Skip Nav Destination
Article navigation
13 May 2024
186th Meeting of the Acoustical Society of America and the Canadian Acoustical Association
13–17 May 2024
Ottawa, Ontario, Canada
Signal Processing in Acoustics: Paper 4aSP9
July 18 2024
Speech detection models for effective communicable disease risk assessment in air travel environments
Tenon Charly Kone;
Tenon Charly Kone
1
National Research Council Canada, Flight Research Laboratory
, 1200 Montreal Road, Ottawa, ON, K1A 0R6, CANADA
; tenoncharly.kone@nrc-cnrc.gc.ca
Search for other works by this author on:
Sebastian Ghinet;
Sebastian Ghinet
2
National Research Council Canada, Flight Research Laboratory
, 1200 Montreal Road, Ottawa, ON, K1A 0R6, CANADA
; Sebastian.Ghinet@nrc-cnrc.gc.ca
Search for other works by this author on:
Dana Sayed Ahmed;
Dana Sayed Ahmed
3
National Research Council Canada, Flight Research Laboratory
, 1200 Montreal Road, Ottawa, ON, K1A 0R6, CANADA
; Dana.SayedAhmed@nrc-cnrc.gc.ca
Search for other works by this author on:
Anant Grewal
Anant Grewal
4
National Research Council Canada, Flight Research Laboratory
, 1200 Montreal Road, Ottawa, ON, K1A 0R6, CANADA
; Anant.Grewal@nrc-cnrc.gc.ca
Search for other works by this author on:
Proc. Mtgs. Acoust. 54, 055002 (2024)
Article history
Received:
March 19 2024
Accepted:
June 11 2024
Connected Content
Citation
Tenon Charly Kone, Sebastian Ghinet, Dana Sayed Ahmed, Anant Grewal; Speech detection models for effective communicable disease risk assessment in air travel environments. Proc. Mtgs. Acoust. 13 May 2024; 54 (1): 055002. https://doi.org/10.1121/2.0001907
Download citation file:
94
Views
Citing articles via
Flyback sonic booms from Falcon-9 rockets: Measured data and some considerations for future models
Mark C. Anderson, Kent L. Gee, et al.
Related Content
Child-adult speech diarization in naturalistic conditions of preschool classrooms using room-independent ResNet model and automatic speech recognition-based re-segmentation
J. Acoust. Soc. Am. (February 2024)
Semi-supervised audio-driven TV-news speaker diarization using deep neural embeddings
J. Acoust. Soc. Am. (December 2020)
Prof-Life-Log: Monitoring and assessment of human speech and acoustics using daily naturalistic audio streams
J Acoust Soc Am (October 2016)
Improving speaker diarization for naturalistic child-adult conversational interactions using contextual information
J. Acoust. Soc. Am. (February 2020)
Daylong acoustic recordings of family and child speech using the HomeBank database
J Acoust Soc Am (March 2019)