Multimodal Emotion Recognition Using Deep Learning Architectures | [IEEE International Conference on Digital Futures and Transformative Technologies (ICoDT2) May 24-26, 2022 (Book of Abstracts) • 2022]

Author(s):

1. Iram Hina: National University of Sciences and Technology, Islamabad, Pakistan

2. Arslan Shaukat: National University of Sciences and Technology, Islamabad, Pakistan

3. Muhammad Usman Akram: National University of Sciences and Technology, Islamabad, Pakistan

Abstract:

Emotions are an essential part of effective communication. This work classifies six basic human emotions: anger, disgust, fear, happiness, sadness, and surprise. The proposed method uses sequential deep convolutional neural networks for the audio and visual modalities. Audio classification is performed by fine-tuning a pretrained AlexNet model, whereas visual classification is performed with a hybrid deep network combining a CNN and an LSTM. Decision-level and score-level fusion are implemented to combine the two modalities. SVM, random forest, K-NN, and logistic regression classifiers are used to classify emotions from the fused audio-visual data. Experiments were performed on the RML and BAUM-1s datasets with LOSO and LOSGO cross-validation, respectively. The recognition rates were very encouraging, which supports the validity of the proposed methodology.
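
The abstract names score-level fusion, decision-level fusion, and LOSO cross-validation without giving details. The following is a minimal Python sketch of those three ideas, not the authors' implementation. The weighted-sum rule, the majority-vote rule, and the names `score_level_fusion`, `decision_level_fusion`, `leave_one_subject_out`, and `audio_weight` are all assumptions made for illustration; the abstract itself reports using SVM, random forest, K-NN, and logistic regression classifiers on the fused data.

```python
from collections import Counter

# The six basic emotions listed in the abstract.
EMOTIONS = ["anger", "disgust", "fear", "happiness", "sadness", "surprise"]


def score_level_fusion(audio_scores, visual_scores, audio_weight=0.5):
    """Fuse per-class scores with a weighted sum and return the top label.

    Assumption: a simple weighted-sum rule; audio_weight is hypothetical.
    """
    if len(audio_scores) != len(EMOTIONS) or len(visual_scores) != len(EMOTIONS):
        raise ValueError("expected one score per emotion class")
    fused = [
        audio_weight * a + (1.0 - audio_weight) * v
        for a, v in zip(audio_scores, visual_scores)
    ]
    best = max(range(len(fused)), key=fused.__getitem__)
    return EMOTIONS[best], fused


def decision_level_fusion(labels):
    """Fuse hard decisions by majority vote; ties go to the earliest label.

    Assumption: majority voting stands in for whatever rule the paper uses.
    """
    if not labels:
        raise ValueError("expected at least one predicted label")
    counts = Counter(labels)
    top = max(counts.values())
    for label in labels:
        if counts[label] == top:
            return label


def leave_one_subject_out(subject_ids):
    """Yield (held_out_subject, train_indices, test_indices) per subject."""
    for held_out in sorted(set(subject_ids)):
        train = [i for i, s in enumerate(subject_ids) if s != held_out]
        test = [i for i, s in enumerate(subject_ids) if s == held_out]
        yield held_out, train, test
```

Passing group identifiers instead of subject identifiers to `leave_one_subject_out` yields one fold per group, which is one plausible way to sketch a group-wise protocol such as LOSGO.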

Page(s): 1-1

DOI: Not available

Published: Journal: IEEE International Conference on Digital Futures and Transformative Technologies (ICoDT2) May 24-26, 2022 (Book of Abstracts), Volume: 1, Issue: 1, Year: 2022

Keywords:

Multimodal Emotion Recognition, Deep Learning Architectures
