Abstract
Artificial intelligence (AI) and deep learning (DL) have been used to train on and process massive data, improving systems and making them more intelligent when making decisions. Speech Emotion Recognition (SER) is an area of voice research that evaluates the voice signal and classifies different emotions. In recent years, technological advances in deep learning have helped SER to detect and classify emotions effectively, since speech signal processing methods struggle with the variety of frequencies across emotions such as happy, angry, sad, neutral and others. In this study, we used a deep convolutional network architecture (DSCNN) to implement the SER model. It uses simple networks to learn salient and discriminative features from spectrograms of speech signals generated from the RAVDESS dataset. Eight emotions were considered for analysis and classification, and a prediction result of 61% was obtained. Subsequently, an implementation of the DSCNN is proposed in psychology to support the diagnosis and treatment of people suffering from depression and anxiety. With the help of this deep neural network, an effective diagnosis could be obtained in the future and treatment time could be reduced.
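The spectrogram step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the frame length, hop size, window, or sample rate, so the values below (`frame_len=512`, `hop=128`, a Hann window, 16 kHz) are assumptions, and `log_spectrogram` is a hypothetical helper rather than the authors' implementation. A synthetic tone stands in for a RAVDESS clip so the example runs without the dataset; the label tuple lists the eight emotion categories documented for RAVDESS.

```python
import numpy as np

# The eight emotion categories documented for RAVDESS (the abstract names only some of them).
RAVDESS_EMOTIONS = ("neutral", "calm", "happy", "sad",
                    "angry", "fearful", "disgust", "surprised")


def log_spectrogram(signal, frame_len=512, hop=128, eps=1e-10):
    """Return a log-magnitude spectrogram of shape (frame_len // 2 + 1, n_frames).

    frame_len, hop and the Hann window are illustrative assumptions,
    not parameters taken from the paper.
    """
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size < frame_len:
        raise ValueError("signal is shorter than one frame")
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the signal into overlapping frames and window each one.
    frames = np.stack([
        signal[i * hop: i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # One-sided FFT magnitude per frame, transposed to (frequency, time).
    magnitude = np.abs(np.fft.rfft(frames, axis=1)).T
    # Convert to decibels; eps avoids log(0) on silent bins.
    return 20.0 * np.log10(magnitude + eps)


# Synthetic stand-in for a speech clip: one second of a 440 Hz tone
# at an assumed 16 kHz sample rate.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)

spec = log_spectrogram(tone)
# Frequency bin with the highest average energy, converted back to Hz.
peak_bin = int(np.argmax(spec.mean(axis=1)))
peak_hz = peak_bin * sample_rate / 512
```

With these assumed settings, `spec` is a 257 × 122 array (frequency bins × time frames) that could be fed to a convolutional classifier with one output per entry in `RAVDESS_EMOTIONS`; the network itself is not sketched here because the abstract gives no layer details.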
| Original language | English |
|---|---|
| Title of host publication | 2024 IEEE Colombian Conference on Communications and Computing, COLCOM 2024 - Proceedings |
| Editors | Diana Z. Briceno Rodriguez |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798331504724 |
| State | Published - 2024 |
| Event | 2024 IEEE Colombian Conference on Communications and Computing, COLCOM 2024 - Barranquilla, Colombia. Duration: 21 Aug 2024 → 24 Aug 2024 |
Publication series
| Name | 2024 IEEE Colombian Conference on Communications and Computing, COLCOM 2024 - Proceedings |
|---|---|
Conference
| Conference | 2024 IEEE Colombian Conference on Communications and Computing, COLCOM 2024 |
|---|---|
| Country/Territory | Colombia |
| City | Barranquilla |
| Period | 21/08/24 → 24/08/24 |
Bibliographical note
Publisher Copyright: © 2024 IEEE.
Keywords
- deep learning
- psychology
- speech emotion recognition
- speech spectrograms
CACES Knowledge Areas
- 419A Medical Diagnostic and Treatment Technology
Projects
- 1 Active
- Model for the early detection of breast cancer using medical images (MDTCM)
  Plua Moran, D. H. (Col), Valverde Landivar, G. E. (Col), Quiroz Martinez, M. A. (PI), Leon Veas, J. L. (Col) & Leyva Vazquez, M. Y. (Col)
  20/02/20 → …
  Project: Research and Development