Speech Emotion Recognition using Time Distributed CNN and LSTM

Beenaa Salian; Omkar Narvade; Rujuta Tambewagh; Smita Bharne

doi:10.1051/itmconf/20214003006

Open Access

Issue		ITM Web Conf. Volume 40, 2021 International Conference on Automation, Computing and Communication 2021 (ICACC-2021)


Article Number		03006
Number of page(s)		6
Section		Computing
DOI		https://doi.org/10.1051/itmconf/20214003006
Published online		09 August 2021

ITM Web of Conferences 40, 03006 (2021)

Speech Emotion Recognition using Time Distributed CNN and LSTM

Beenaa Salian^*, Omkar Narvade^**, Rujuta Tambewagh^*** and Smita Bharne^****

Ramrao Adik Institute of Technology, Navi Mumbai, India

^* e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.
^** e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.
^*** e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.
^**** e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

Abstract

Speech has several distinguishing characteristic features which has remained a state-of-the-art tool for extracting valuable information from audio samples. Our aim is to develop a emotion recognition system using these speech features, which would be able to accurately and efficiently recognize emotions through audio analysis. In this article, we have employed a hybrid neural network comprising four blocks of time distributed convolutional layers followed by a layer of Long Short Term Memory to achieve the same.The audio samples for the speech dataset are collectively assembled from RAVDESS, TESS and SAVEE audio datasets and are further augmented by injecting noise. Mel Spectrograms are computed from audio samples and are used to train the neural network. We have been able to achieve a testing accuracy of about 89.26%.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.