An emerging use of artificial intelligence is automatic emotion recognition. Facial expression identification is an intriguing and challenging problem in computer vision. In data science, one of the most difficult problems is speech emotion recognition. The technology that has been built consists of two stages: the first involves real-time facial and speech capture and the second is categorizing of emotions. Data collection, data analysis, and data visualization are the stages of automated emotion identification. Convolution neural networks are used in the proposed multimodal system to identify emotions from speech and face expressions. Each block in the sequence is made up of convolution layers and sub sampling layers. The most difficult of all the available datasets, FER2013, was used to train the model for face emotion recognition. The accuracy that has been attained for this task is 71%. To address the issue of data deficiency in speech emotion identification, four distinct datasets—CREMA-D, RAVDESS, SAVEE, and TESS were integrated. The accuracy achieved for this challenge is 88%. The suggested approach can recognize eight emotions in total namely “angry, calm, disgust, fear, happy, neutral, sad, and surprised” for both the speech and the face, respectively. Additional effects include batch normalization, early stopping, and dropouts for better performance.
Skip Nav Destination
,
,
,
,
,
Article navigation
20 February 2024
14TH INTERNATIONAL CONFERENCE ON MATERIALS PROCESSING AND CHARACTERIZATION 2023
24–26 March 2023
Hyderabad, India
Research Article|
February 20 2024
Emotion recognition using multi-modal features and CNN classification Available to Purchase
Saba Noor Ayesha Khanum;
Saba Noor Ayesha Khanum
a
1
Muffakham Jah College of Engineering & Technology
, Hyderabad, Telangana 500034, India
aCorresponding author:[email protected]
Search for other works by this author on:
Upendra Kumar Mummadi;
Upendra Kumar Mummadi
b
1
Muffakham Jah College of Engineering & Technology
, Hyderabad, Telangana 500034, India
Search for other works by this author on:
Fahmina Taranum;
Fahmina Taranum
c
1
Muffakham Jah College of Engineering & Technology
, Hyderabad, Telangana 500034, India
Search for other works by this author on:
Syed Shabbeer Ahmad;
Syed Shabbeer Ahmad
d
1
Muffakham Jah College of Engineering & Technology
, Hyderabad, Telangana 500034, India
Search for other works by this author on:
Imtiyaz Khan;
Imtiyaz Khan
e
1
Muffakham Jah College of Engineering & Technology
, Hyderabad, Telangana 500034, India
Search for other works by this author on:
D. Shravani
D. Shravani
f
2
Stanely College of Engineering & Technology, Hyderabad
, Telangana 500001, India
Search for other works by this author on:
Saba Noor Ayesha Khanum
1,a
Upendra Kumar Mummadi
1,b
Fahmina Taranum
1,c
Syed Shabbeer Ahmad
1,d
Imtiyaz Khan
1,e
D. Shravani
2,f
1
Muffakham Jah College of Engineering & Technology
, Hyderabad, Telangana 500034, India
2
Stanely College of Engineering & Technology, Hyderabad
, Telangana 500001, India
aCorresponding author:[email protected]
AIP Conf. Proc. 3007, 030001 (2024)
Citation
Saba Noor Ayesha Khanum, Upendra Kumar Mummadi, Fahmina Taranum, Syed Shabbeer Ahmad, Imtiyaz Khan, D. Shravani; Emotion recognition using multi-modal features and CNN classification. AIP Conf. Proc. 20 February 2024; 3007 (1): 030001. https://doi.org/10.1063/5.0192751
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
63
Views
Citing articles via
Effect of coupling agent type on the self-cleaning and anti-reflective behaviour of advance nanocoating for PV panels application
Taha Tareq Mohammed, Hadia Kadhim Judran, et al.
Design of a 100 MW solar power plant on wetland in Bangladesh
Apu Kowsar, Sumon Chandra Debnath, et al.
With synthetic data towards part recognition generalized beyond the training instances
Paul Koch, Marian Schlüter, et al.
Related Content
Speech emotion recognition using both deep learning and machine learning techniques
AIP Conf. Proc. (November 2022)
Designing a multimodal emotion recognition system using visual and audio modalities
AIP Conf. Proc. (November 2024)
Robustness evaluation of multi-layer perceptron based speech emotion recognition model for Hindi language
AIP Conf. Proc. (December 2023)
Emotion recognition in deep learning across diverse speech and text corpora
AIP Conf. Proc. (April 2025)
Human emotion identification based on features extracted using CNN
AIP Conf. Proc. (October 2022)