Speech Emotion Recognition Based on Learning Automata in Fuzzy Petri-net

Volume 12, Issue 3, pp 173-185 http://dx.doi.org/10.22436/jmcs.012.03.01

Authors

Sara Motamed - Science and Research University, Tehran, Iran.
Saeed Setayeshi - Science and Research University, Tehran, Iran.
Zeinab Farhoudi - Science and Research University, Tehran, Iran.
Ali Ahmadi - Science and Research University, Tehran, Iran.

Abstract

This paper explores how the number of fuzzy features and reasoning rules influences the recognition rate for emotional speech. Emotional speech is one of the most effective and natural channels of communication between individuals, and it can also facilitate communication between human and machine. The paper introduces a novel speech emotion recognition method modeled on inference in the human mind. The method is founded on rule inference in a Fuzzy Petri-net (FPN) combined with learning automata. The FPN is used here as a classifier, and this is its first application to speech emotion recognition; it makes it possible to analyze different rules in a dynamic environment such as the human mind. The input of the FPN is computed by the learning automata, which adjust the membership functions for each feature vector in the dynamic environment. The proposed algorithm consists of six stages: preprocessing, feature extraction, learning automata, fuzzification, inference engine, and defuzzification. The proposed model has been compared with other classification models, and the experimental results show that it outperforms them.
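
The learning-automata, fuzzification, and inference stages named in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the membership-function shape, the automaton's update scheme, or the FPN firing rule, so everything below is an assumption chosen for illustration: a triangular membership function, a linear reward-inaction automaton, and a transition that fires with the minimum of its input degrees scaled by a certainty factor. The names `triangular`, `LinearRewardInaction`, and `fire_rule` are hypothetical and are not taken from the paper.

```python
import random


def triangular(x, center, width):
    """Assumed membership shape: 1 at `center`, falling linearly to 0 at distance `width`."""
    return max(0.0, 1.0 - abs(x - center) / width)


class LinearRewardInaction:
    """Linear reward-inaction automaton over a finite set of actions.

    Here an action could stand for one candidate membership-function
    width; that mapping is an assumption, not the paper's design.
    """

    def __init__(self, n_actions, rate=0.1, seed=0):
        self.p = [1.0 / n_actions] * n_actions  # start from a uniform distribution
        self.rate = rate
        self.rng = random.Random(seed)

    def choose(self):
        # Sample an action index according to the current probabilities.
        return self.rng.choices(range(len(self.p)), weights=self.p)[0]

    def update(self, action, rewarded):
        if not rewarded:
            return  # "inaction": a penalty leaves the probabilities unchanged
        for i in range(len(self.p)):
            if i == action:
                self.p[i] += self.rate * (1.0 - self.p[i])
            else:
                self.p[i] *= 1.0 - self.rate


def fire_rule(antecedent_degrees, certainty):
    """Assumed FPN firing rule: min over the input places, scaled by a certainty factor."""
    return min(antecedent_degrees) * certainty
```

In a loop of this kind, the automaton would pick a width, the features would be fuzzified with `triangular`, the rules would be fired with `fire_rule`, and the automaton would be rewarded whenever the resulting label is correct. The reward-inaction update keeps the action probabilities summing to one, so no renormalization step is needed.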

Share and Cite

ISRP Style

Sara Motamed, Saeed Setayeshi, Zeinab Farhoudi, Ali Ahmadi, Speech Emotion Recognition Based on Learning Automata in Fuzzy Petri-net, Journal of Mathematics and Computer Science, 12 (2014), no. 3, 173-185

AMA Style

Motamed Sara, Setayeshi Saeed, Farhoudi Zeinab, Ahmadi Ali, Speech Emotion Recognition Based on Learning Automata in Fuzzy Petri-net. J Math Comput SCI-JM. (2014); 12(3):173-185

Chicago/Turabian Style

Motamed, Sara, Setayeshi, Saeed, Farhoudi, Zeinab, Ahmadi, Ali. "Speech Emotion Recognition Based on Learning Automata in Fuzzy Petri-net." Journal of Mathematics and Computer Science, 12, no. 3 (2014): 173-185

Keywords

Emotional Speech
Fuzzy Rules
Learning Automata
Mel-frequency cepstral coefficients (MFCC)
Fuzzy Petri-net

MSC

68U99
68T10
68T50
68T05
