

Author: Ślot Krzysztof Bronakowski Łukasz Cichosz Jaroslaw Kim Hyongsuk
Publisher: MDPI
E-ISSN: 1424-8220|9|12|9858-9872
ISSN: 1424-8220
Source: Sensors, Vol.9, Iss.12, 2009-12, pp. : 9858-9872
Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.
Abstract
The following paper introduces a group of novel speech-signal descriptors that reflect phoneme-pronunciation variability and that can be considered as potentially useful features for emotion sensing. The proposed group includes a set of statistical parameters of Poincare maps, derived for formant-frequency evolution and energy evolution of voiced-speech segments. Two groups of Poincare-map characteristics were considered in the research: descriptors of sample-scatter, which reflect magnitudes of phone-uttering variations and descriptors of cross-correlations that exist among samples and that evaluate consistency of variations. It has been shown that inclusion of the proposed characteristics into the pool of commonly used speech descriptors, results in a noticeable increase—at the level of 10%—in emotion sensing performance. Standard pattern recognition methodology has been adopted for evaluation of the proposed descriptors, with the assumption that three- or four-dimensional feature spaces can provide sufficient emotion sensing. Binary decision trees have been selected for data classification, as they provide with detailed information on emotion-specific discriminative power of various speech descriptors.
Related content


The Feature Extraction Based on Texture Image Information for Emotion Sensing in Speech
Sensors, Vol. 14, Iss. 9, 2014-09 ,pp. :


Metamaterials Application in Sensing
Sensors, Vol. 12, Iss. 3, 2012-02 ,pp. :




Sensing and 3D Mapping of Soil Compaction
By Tekin Yücel Kul Basri Okursoy Rasim
Sensors, Vol. 8, Iss. 5, 2008-05 ,pp. :