Using Speech Signal for Emotion Recognition Using Hybrid Features with SVM Classifier
DOI:
https://doi.org/10.31185/wjcm.102Keywords:
Speech emotion recognition, Feature extraction, Statistical moments, Support vector machine (SVM)Abstract
Emotion recognition is a hot topic that has received a lot of attention and study,owing to its significance in a variety of fields, including applications needing human-computer interaction (HCI). Extracting features related to the emotional state of speech remains one of the important research challenges.This study investigated the approach of the core idea behind feature extraction is the residual signal of the prediction procedure is the difference between the original and the prediction .hence the visibility of using sets of extracting features from speech single when the statistical of local features were used to achieve high detection accuracy for seven emotions. The proposed approach is based on the fact that local features can provide efficient representations suitable for pattern recognition. Publicly available speech datasets like the Berlin dataset are tested using a support vector machine (SVM) classifier. The hybrid features were trained separately. The results indicated that some features were terrible. Some were very encouraging, reaching 99.4%. In this article, the SVM classifier test results with the same tested hybrid features that published in a previous article will be presented, also a comparison between some related works and the proposed technique in speech emotion recognition techniques.
References
M. E. Ayadi, M. S. Kamel, and F. Karray, “Survey on speech emotion recognition: Features, classification schemes, and databases,” Pattern Recognit, vol. 44, no. 3, pp. 572–587, 2011.
I. Chiriacescu, “Automatic Emotion Analysis Based on Speech,” 2010.
K. Wang Speech Emotion Recognition Using Fourier Parameters, vol. 6, pp. 69–75, 2015.
D. D. Joshi and M. B. Zalte Recognition of Emotion from Marathi Speech Using MFCC and DWT Algorithms, pp. 59–63, 2013.
R. Subhashree and G. N. Rathna, “Speech emotion recognition: Performance analysis based on fused algorithms and GMM modeling,” Indian J. Sci. Technol, vol. 9, no. 11, 2016.
A. Milton, S. Roy, and S. Selvi, “SVM Scheme for Speech Emotion Recognition using MFCC Feature,” Int. J. Comput. Appl, 2013.
M. B. Akçay and K. Oguz, “Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, ˘ and classifiers,” Speech Communication, vol. 116, pp. 56–76, 2020.
S. Basu, J. Chakraborty, and M. Aftabuddin, “Emotion recognition from speech using a convolutional neural network with recurrent neural network architecture,” 2017 2nd International Conference on Communication and Electronics Systems (ICCES, pp. 333–336, 2017.
M. S. Likitha, S. R. R. Gupta, K. Hasitha, and A. U. Raju, “Speech-based human emotion recognition using MFCC,” 2018.
P. Shegokar and P. Sircar, “Continuous wavelet transform-based speech emotion recognition,” 2016.
A. Bhavan, P. Chauhan, and R. R. Shah, “Bagged support vector machines for emotion recognition from speech,” Knowledge-Based Syst, vol. 184, pp. 104886–104886, 2019.
Z. Han and J. Wang, “Speech emotion recognition based on Gaussian kernel nonlinear proximal support vector machine,” 2017.
F. A. Hameed and L. E. Georgeb, “Using Speech Signal for Emotion Recognition Using Hybrid Features with ANN Classifier, accepted 2022/10/31 in journal ’solid state phenomena’ to,” 2022.
F. Burkhardt, A. Paeschke, M. Rolfes, W. Sendlmeier, and B. Weiss, “A database of German emotional speech,” 2005.
R. W. Wall, “Simple methods for detecting zero crossing,” IECON’03. 29th Annual Conference of the IEEE Industrial Electronics Society, vol. 3, pp. 2477–2481, 2003.
F. Alías, J. C. Socoró, and X. Sevillano, “A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds,” Appl. Sci, vol. 6, no. 5, pp. 143–143, 2016.
C. Vonesch, T. Blu, and M. Unser, “Generalized Daubechies wavelet families,” IEEE Trans. Signal Process, vol. 55, no. 9, pp. 4415 4429, 2007.
J. S. Walker, “A primer on wavelets and their scientific applications,” 2008. Chapman and hall/CRC.
D. Dacunha-Castelle and M. Duflo Probability and Statistics, vol. II, 2012.
A. B. Downey, Think Stats Probability and Statistics for Programmers. Version 1.6. 0. Massachusetts: Green Tea Press, 2011.
G. Bohm and G. Zech, Introduction to statistics and data analysis for physicists, vol. 1. Hamburg: Desy, 2010.
H. A. Hadi, L. E. George, " Eeg, User, Methods, Two, Sets, Features, On, and Wavelet J. Theor. Appl. Inf. Technol, vol. 95, no. 22, 2017.
R. W. Grubbström and O. Tang, “The moments and central moments of a compound distribution,” Eur. J. Oper. Res, vol. 170, no. 1, pp. 106–119, 2006.
C. G. V. N. Prasad, K. H. Rao, D. Pratima, and B. N. Alekhya, “Unsupervised Learning Algorithms to Identify the Dense Cluster in Large Datasets,” Int. J. Comput. Sci. Telecommun, vol. 2, no. 4, pp. 26–31, 2011.
M. H. Abdul-Hadi and J. Waleed, “Human speech and facial emotion recognition technique using SVM,” 2020 International Conference on Computer Science and Software Engineering (CSASE), pp. 191–196, 2020.
I. E. Naqa and M. J. Murphy, “What are machine learning? " in machine learning in radiation oncology, 3-11,” 2015. Springer
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Fatima A.Hammed, Loay E. George
This work is licensed under a Creative Commons Attribution 4.0 International License.