A Convolutional and Recurrent Neural Network-based Approach for Speech Emotion Recognition

dc.contributor.author: Duch, Piotr
dc.contributor.author: Wiatrowska, Izabela
dc.contributor.author: Kapusta, Paweł
dc.date.accessioned: 2023-09-22T10:40:37Z
dc.date.available: 2023-09-22T10:40:37Z
dc.date.issued: 2023
dc.description.abstract: Speech emotion recognition (SER) is a crucial aspect of human-computer interaction. In this article, we propose a deep learning approach to SER that combines convolutional and recurrent neural networks (CNN and RNN). We evaluated the approach on four audio datasets: CREMA-D, RAVDESS, TESS, and EMOVO. Our experiments tested various feature sets and extraction settings to determine the optimal features for SER. Our results demonstrate that the proposed approach achieves high accuracy rates and outperforms state-of-the-art algorithms. [en_EN]
dc.identifier.citation: Duch P., Wiatrowska I., Kapusta P., A Convolutional and Recurrent Neural Network-based Approach for Speech Emotion Recognition. In: Progress in Polish Artificial Intelligence Research 4, Wojciechowski A. (Ed.), Lipiński P. (Ed.), Series: Monografie Politechniki Łódzkiej No. 2437, Wydawnictwo Politechniki Łódzkiej, Łódź 2023, pp. 267-272, ISBN 978-83-66741-92-8, doi: 10.34658/9788366741928.42
dc.identifier.doi: 10.34658/9788366741928.42
dc.identifier.isbn: 978-83-66741-92-8
dc.identifier.uri: http://hdl.handle.net/11652/4818
dc.identifier.uri: https://doi.org/10.34658/9788366741928.42
dc.language.iso: en [en_EN]
dc.page.number: pp. 267-272
dc.publisher: Wydawnictwo Politechniki Łódzkiej [pl_PL]
dc.publisher: Lodz University of Technology Press [en_EN]
dc.relation.ispartof: Wojciechowski A. (Ed.), Lipiński P. (Ed.), Progress in Polish Artificial Intelligence Research 4, Series: Monografie Politechniki Łódzkiej No. 2437, Wydawnictwo Politechniki Łódzkiej, Łódź 2023, ISBN 978-83-66741-92-8, doi: 10.34658/9788366741928.
dc.rights: For everyone, within the scope of fair use [pl_PL]
dc.rights: Fair use condition [en_EN]
dc.rights.license: PŁ License [pl_PL]
dc.rights.license: LUT License [en_EN]
dc.subject: artificial intelligence [en_EN]
dc.subject: speech emotion recognition [en_EN]
dc.title: A Convolutional and Recurrent Neural Network-based Approach for Speech Emotion Recognition [en_EN]
dc.type: Chapter - monograph [en_EN]

Files

Original files
Name: 42. Convolutional_recurrent_neural_Duch_Wiatrowska_2023.pdf
Size: 335.53 KB
Format: Adobe Portable Document Format

License
Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission