Speech Emotion Recognition Based on Learning Automata in Fuzzy Petri-net
-
3071
Downloads
-
4718
Views
Authors
Sara Motamed
- Science and Research University, Tehran, Iran.
Saeed Setayeshi
- Science and Research University, Tehran, Iran.
Zeinab Farhoudi
- Science and Research University, Tehran, Iran.
Ali Ahmadi
- Science and Research University, Tehran, Iran.
Abstract
This paper explores how fuzzy features’ number and reasoning rules can influence the rate of emotional speech recognition. The speech emotion signal is one of the most effective and neutral methods in individuals’ relationships that facilitate communication between man and machine. This paper introduces a novel method based on mind inference and recognition of speech emotion recognition. The foundation of the proposed method is the inference of rules in Fuzzy Petri-net (FPN) and the learning automata. FPN is a new method of classification which is introduced for the first time on emotion speech recognition. This method helps to analyze different rules in a dynamic environment like human’s mind. The input of FPN is computed by learning automata. Therefore learning automata has been used to adjust the membership functions for each feature vector in the dynamic environment. The proposed algorithm is divided into different parts: preprocessing; feature extraction; learning automata; fuzzification; inference engine and defuzzification. The proposed model has been compared with different models of classification. Experimental results show that the proposed algorithm outperforms other models.
Share and Cite
ISRP Style
Sara Motamed, Saeed Setayeshi, Zeinab Farhoudi, Ali Ahmadi, Speech Emotion Recognition Based on Learning Automata in Fuzzy Petri-net, Journal of Mathematics and Computer Science, 12 (2014), no. 3, 173-185
AMA Style
Motamed Sara, Setayeshi Saeed, Farhoudi Zeinab, Ahmadi Ali, Speech Emotion Recognition Based on Learning Automata in Fuzzy Petri-net. J Math Comput SCI-JM. (2014); 12(3):173-185
Chicago/Turabian Style
Motamed, Sara, Setayeshi, Saeed, Farhoudi, Zeinab, Ahmadi, Ali. "Speech Emotion Recognition Based on Learning Automata in Fuzzy Petri-net." Journal of Mathematics and Computer Science, 12, no. 3 (2014): 173-185
Keywords
- Emotional Speech
- Fuzzy Rules
- Learning Automata
- Mel frequency Cepstral coefficients (MFCC)
- Fuzzy Petri-net.
MSC
References
-
[1]
T. L. Pao, Y. T. Chen, J. H. Yeh, Comparison of Classification Methods for Detecting Emotion from Mandarin Speech, IEICE Transactions on Information and Systems , vol. E91-D(4) (2008)
-
[2]
T. Nose, T. Kobayashi , A Technique for Estimating Intensity of Emotional Expressions and Speaking Styles in Speech Based on Multiple-Regression HSMM, IEICE Transactions on Information and Systems , (2010)
-
[3]
R. Cowie, E. Douglas-Cowie, N. Tsapatsoulis, G. Votsis, S. Kollias, W. Fellenz, J. G. Taylor, Emotion recognition in human- computer interaction, IEEE Signal Process, vol. 18(1) (2001)
-
[4]
D. Erickson, Expressive speech: Production, perception and application to speech synthesis, Acoustical Science and Technology, vol. 26(4) (2005)
-
[5]
D. Neiberg, K. Elenius, K. Laskowski, Emotion recognition in spontaneous speech using GMMs, Proc. INTERSPEECH , (2006)
-
[6]
T. L. Nwe, S. W. Foo, L. C. D. Silva, Speech emotion recognition using hidden Markov models, Speech Communication, vol. 41(4) (2003)
-
[7]
O. Pierre-Yves , The production and recognition of emotions in speech: features and algorithms, International Journal of Human-Computer Studies, vol. 59 (2003)
-
[8]
C. M. Lee, S. Narayanan , Toward Detecting Emotions in Spoken Dialogs, IEEE Transactions on Speech and Audio Processing, vol. 13(2) (2005)
-
[9]
I. Albrecht, M. Schroder, J. Haber, H.-P. Seidel , Mixed feelings: Expression of non-basic emotions in a muscle-based talking head , Virtual Reality, vol. 8(4) (2005)
-
[10]
D. Wu, T. D. Parsons, S. Narayanan, Acoustic Feature Analysis in Speech Emotion Primitives Estimation, Proc. InterSpeech , (2010)
-
[11]
M. Schroder, R. Cowie, E. D.-cowie, M. Westerdijk, S. Gielen, Acoustic Correlates of Emotion Dimensions in View of Speech Synthesis, Proc. Eurospeech, (2001)
-
[12]
M. Grimm, K. Kroschel, Emotion Estimation in Speech Using a 3D Emotion Space Concept, in Robust Speech Recognition and Understanding, M. Grimm and K. Kroschel , (2007)
-
[13]
I. Kanluan, M. Grimm, K. Kroschel , Audio-Visual Emotion Recognition Using an Emotion Space Concept, Proc. EUSIPCO, (2008)
-
[14]
C. G. Looney , Fuzzy Petri-nets for rule-based decision making , IEEE Transactions on Systems, Man and Cybernetics, Vol. 18, No. 1 (1988)
-
[15]
L. Shiyon, Fuzzy Control, Neuro Control And Intelligent Cybernetics, Harbin Institute of Technology , (1998)
-
[16]
T. Seehapoch, S. Wongathanavasu , Speech Emotion Recognition Using Support Vector Machines , 5th International Conference on Knowledge and Smart Technology, (2013)
-
[17]
J. Price , Design an automatic speech recognition system using Maltab, University of Maryland Estern Shore Princess Anne , (2004)
-
[18]
A. Kamarul, A. B. Ibrahim , Biomedical engineering laboratory student pack, UTM Johor, (2005)
-
[19]
M. Bojanic, V. Crojevic, V. Deliv , Application of Neural Network in Emotional Speech Recognition, IEEE 20-22 , (2012)
-
[20]
M. Schroder, R. Cowie, E. D.-cowie, M. Westerdijk, S. Gielen , Acoustic Correlates of Emotion Dimensions in View of Speech Synthesis, Proc. Eurospeech, (2001)
-
[21]
H. Beigy, M. R. Meybodi, A Mathematical Framework for Cellular Learning Automata, Advances on Complex Systems, Vol. 7, No. 3 (2004)
-
[22]
V. Raghunathan, C. Schurgers, S. Park, M. B. Srivastava, Energy-Aware Wireless Microsensor Networks, IEEE Signal Processing Magazine, Vol 19 (2002)
-
[23]
K. S. Narendra, M. A. L. Thathachar, Learning automata: An introduction, Prentice Hall, (1989)
-
[24]
M. A. L. Thatachar, P. S. Sastry , Varieties of Learning Automata: An Overview , IEEE Transaction on System, Man, and Cybernetics- part B: CYBERNETICS (2002)
-
[25]
J. Winter, Y. Xu, W. C. Lee, Energy Efficient Processing of K Nearest Neighbor Queries in Locationaware Sensor Networks, the Second International Conference on Mobile and Ubiquitous Systems: Networking and Services (Mobiquitous'05), San Diego,CA (2005)
-
[26]
K. S. Narendra, M. A. L. Thathachar, Learning automata a survey , IEEE Transactions on Systems, Man and Cybernetics, vol. 4, no. 4 (1974)
-
[27]
N. Esau, L. Kleinjohann, B. Kleinjohann, An Adaptable fuzzy Model for Emotion Recognitio, , (2003)
-
[28]
F. Burkhardt, A. Paeschke, M. Rolfes, W. Sendlmeier, B. Weiss , A database of German emotional speech, Proceedings of the Interspeech , (2005)
-
[29]
F. Eyden, M. Wollmer, B. Schuller, Opensmile: the munich versatile and fast opensource audio feature extractor, 10 Proceedings of the international conference on Multimedia, (2010)
-
[30]
M. M. Javidi, E. F. Roshan, Speech emotion recognition by using combinations of C5.0, Neural Network (NN), and Support Vector Machine (SVM) classification methods , Journal of mathematics and computer science (2013)
-
[31]
N. Moarefi, A. Yarmohamadi , The implementation of cellular automata with non-identical rule on serial base, Journal of mathematics and computer science , (2012)