Speech Emotion Recognition System Using Gaussian Mixture Model and Improvement proposed via Boosted GMM

Pavitra Patel; A. A. Chaudhari; M. A. Pund; D. H. Deshmukh

doi:10.21013/jte.ICSESD201706

Article Tools

Indexing metadata

How to cite item

Email this article (Login required)

Email the author (Login required)

About The Authors

Pavitra Patel
PRM Institute of Technology and Research, Badnera-Amravati.
India

A. A. Chaudhari
PRM Institute of Technology and Research, Badnera-Amravati.
India

M. A. Pund
PRM Institute of Technology and Research, Badnera-Amravati.
India

D. H. Deshmukh
PRM Institute of Technology and Research, Badnera-Amravati.
India

Published by IRA Academico Research, a publisher member of the Publishers International Linking Association, Inc. (Crossref), USA.

User

Peer Reviewed Open Access

This paper is reviewed in accordance with the Peer Review Program of IRA Academico Research

Speech Emotion Recognition System Using Gaussian Mixture Model and Improvement proposed via Boosted GMM

Pavitra Patel, A. A. Chaudhari, M. A. Pund, D. H. Deshmukh

DOI: http://dx.doi.org/10.21013/jte.ICSESD201706

Abstract

Speech emotion recognition is an important issue which affects the human machine interaction. Automatic recognition of human emotion in speech aims at recognizing the underlying emotional state of a speaker from the speech signal. Gaussian mixture models (GMMs) and the minimum error rate classifier (i.e. Bayesian optimal classifier) are popular and effective tools for speech emotion recognition. Typically, GMMs are used to model the class-conditional distributions of acoustic features and their parameters are estimated by the expectation maximization (EM) algorithm based on a training data set. In this paper, we introduce a boosting algorithm for reliably and accurately estimating the class-conditional GMMs. The resulting algorithm is named the Boosted-GMM algorithm. Our speech emotion recognition experiments show that the emotion recognition rates are effectively and significantly boosted by the Boosted-GMM algorithm as compared to the EM-GMM algorithm.
During this interaction, human beings have some feelings that they want to convey to their communication partner with whom they are communicating, and then their communication partner may be the human or machine. This work dependent on the emotion recognition of the human beings from their speech signal
Emotion recognition from the speaker’s speech is very difficult because of the following reasons: Because of the existence of the different sentences, speakers, speaking styles, speaking rates accosting variability was introduced. The same utterance may show different emotions. Therefore it is very difficult to differentiate these portions of utterance. Another problem is that emotion expression is depending on the speaker and his or her culture and environment. As the culture and environment gets change the speaking style also gets change, which is another challenge in front of the speech emotion recognition system.

Keywords

Speech, Emotion, Gaussian, Boosted.

Full Text:

PDF

©IRA Academico Research & its authors
This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. This article can be used for non-commercial purposes. Mentioning of the publication source is mandatory while referring this article in any future works.

The published works in Journal except the content of the website are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

(ISSN: 2455-4480)

Username
Password
Remember me