0% Complete
English
صفحه اصلی
/
چهاردهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Enhancing Supervised Learning in Speech Emotion Recognition through Unsupervised Representations
نویسندگان :
Niloufar Faridani
1
Amirali Soltani Tehrani
2
Ramin Toosi
3
1- دانشکده برق و کامپیوتر دانشگاه تهران
2- دانشکده برق و کامپیوتر دانشگاه تهران
3- دانشکده برق و کامپیوتر دانشگاه تهران
کلمات کلیدی :
Speech Emotion Recognition،Self-supervised Learning،Convolutional Neural Network
چکیده :
Speech Emotion Recognition (SER) is pivotal in enhancing human-computer interaction by enabling a deeper understanding of emotional states across various applications, contributing to more empathetic and effective communication. This study proposes an innovative approach integrating self-supervised feature extraction with supervised classification for emotion recognition from small audio segments. In the preprocessing step, to eliminate the need to craft audio features, we employed a self-supervised feature extractor based on the Wav2Vec model to capture acoustic features from audio data. Then, the output feature maps of the preprocessing step are fed to a custom-designed Convolutional Neural Network (CNN)–-based model to perform emotion classification. Utilizing the ShEMO dataset as our testing ground, the proposed method surpasses two baseline methods, i.e., support vector machine classifier and transfer learning of a pre-trained CNN. Comparing the proposed method to the state-of-the-art techniques in the SER task indicates the superiority of the proposed method. Our findings underscore the pivotal role of deep unsupervised feature learning in elevating the landscape of SER, offering enhanced emotional comprehension in the realm of human-computer interactions.
لیست مقالات
لیست مقالات بایگانی شده
Artificial Empathy in AI-Based Mental Health: A Review
Shabnam Moradi
Combinatorial Auction Based on Social Choice in the Internet of Things
Maede Esmaeili - Faria Nassiri-Mofakham - Fatemeh Hassanvand
Fast Online Character Recognition Using a Novel Local-Global Feature Extraction Method
Ayoub Parvizi - Dr Mohammad Kazemifard - Ziba Imani
Inner and Outer Bearing Fault Diagnosis of electrical Motors Using a Proposed Algorithm and Vibration Signals
Vahid Safari Dehnavi - Masoud Shafiee
Dealing with Black-hole Attacks in Inter-vehicle Networks Using the Packet Delivery Rate Algorithm
Marzieh Sedighi - Mehdi Hamidkhani - Mostafa Sadeghi
ارائۀ چارچوب هستانشناسی برای شهر هوشمند مبتنی بر سیستمهای سایبر-فیزیکی
علی اصغر قائمی - جعفر حبیبی - سید حسن میریان
Hardware Imperfection Effects in Wireless Virtual Reality System with Hybrid Beamforming
Nasim Alikhani - Abbas Mohammadi
Using Trust Statements and Ratings by GraphSAGE to Alleviate Cold Start in Recommender Systems
Seyedeh Niusha Motevallian - Dr Seyed Mohammad Hossein Hasheminejad
بکارگیری الگوریتم بهینه سازی فاخته و منطق فازی به منظور بهبود زمانبندی وظایف در محیط محاسبات مه
فاطمه دوامی - حمید جلیلوند - فاطمه نجفی
FiReT: A Neural Radiance Fields Framework for Wireless Field Reconstruction and Transmitter Placement
Negar Pouya - Armin Soleymani - Gholamreza Moradi - Farzaneh Abdollahi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.2