Comparative Performance Analysis of Metaheuristic Feature Selection Methods for Speech Emotion Recognition

ÖZSEVEN, TURGUT; Arpacıoğlu, Mustafa

doi:10.2478/msr-2024-0010

Comparative Performance Analysis of Metaheuristic Feature Selection Methods for Speech Emotion Recognition

Yazarlar (2)

Prof. Dr. Turgut ÖZSEVEN Tokat Gaziosmanpaşa Üniversitesi, Türkiye

Mustafa Arpacıoğlu
Tokat Gaziosmanpaşa Üniversitesi, Türkiye

Makale Türü	Özgün Makale (SSCI, AHCI, SCI, SCI-Exp dergilerinde yayınlanan tam makale)
Dergi Adı	Measurement Science Review
Dergi ISSN	1335-8871 Wos Dergi Scopus Dergi
Dergi Tarandığı Indeksler	SCI-Expanded
Makale Dili	İngilizce	Basım Tarihi	04-2024
Cilt / Sayı / Sayfa	24 / 2 / 72–82	DOI	10.2478/msr-2024-0010
Makale Linki	http://dx.doi.org/10.2478/msr-2024-0010
UAK Araştırma Alanları	Makine Öğrenmesi

Özet

Emotion recognition systems from speech signals are realized with the help of acoustic or spectral features. Acoustic analysis is the extraction of digital features from speech files using digital signal processing methods. Another method is the analysis of time-frequency images of speech using image processing. The size of the features obtained by acoustic analysis is in the thousands. Therefore, classification complexity increases and causes variation in classification accuracy. In feature selection, features unrelated to emotions are extracted from the feature space and are expected to contribute to the classifier performance. Traditional feature selection methods are mostly based on statistical analysis. Another feature selection method is the use of metaheuristic algorithms to detect and remove irrelevant features from the feature set. In this study, we compare the performance of metaheuristic feature selection algorithms for speech emotion recognition. For this purpose, a comparative analysis was performed on four different datasets, eight metaheuristics and three different classifiers. The results of the analysis show that the classification accuracy increases when the feature size is reduced. For all datasets, the highest accuracy was achieved with the support vector machine. The highest accuracy for the EMO-DB, EMOVA, eNTERFACE’05 and SAVEE datasets is 88.1%, 73.8%, 73.3% and 75.7%, respectively.

Anahtar Kelimeler

acoustic analysis | feature optimization | feature selection | metaheuristic | Speech emotion recognition

Pdf İndir

BM Sürdürülebilir Kalkınma Amaçları

Atıf Sayıları
Scopus	2
Google Scholar	3

Comparative Performance Analysis of Metaheuristic Feature Selection Methods for Speech Emotion Recognition

Dergi Adı	Measurement Science Review
Yayıncı	Sciendo
Açık Erişim	Evet
ISSN	1335-8871
E-ISSN	1335-8871
CiteScore	1,9
SJR	0,240
SNIP	0,538

Comparative Performance Analysis of Metaheuristic Feature Selection Methods for Speech Emotion Recognition

Paylaş