0% Complete
English
صفحه اصلی
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Embedded speech encoder for low-resource languages
نویسندگان :
Alireza A.Tabatabaei
1
Pouria Sameti
2
Ali Bohlooli
3
1- University of Isfahan
2- University of Isfahan
3- University of Isfahan
کلمات کلیدی :
Embedded Systems،Embedded AI،Embedded Speech embedding
چکیده :
Although high-performance artificial intelligence (AI) models require substantial computational resources, embedded systems are constrained by limited hardware capabilities, such as memory and processing power. On the other hand, embedded systems have a broad range of applications, making the integration of AI and embedded systems a prominent topic in both hardware and AI research. Creating powerful speech embeddings for embedded systems is challenging, as such models, like Wave2Vec, are typically computationally intensive. Additionally, the scarcity of data for many low-resource languages further complicates the development of high-performance models. To address these challenges, we utilized BERT to generate speech embeddings. BERT was selected because, in addition to producing meaningful embeddings, it is trained on numerous low-resource languages and facilitates the design of efficient decoders. This study introduces a compact speech encoder tailored for low-resource languages, capable of functioning as an encoder across a diverse range of speech tasks. To achieve this, we utilized BERT to generate meaningful embeddings. However, due to the high dimensionality of BERT embeddings, which imposes significant computational demands on many embedded systems, we applied dimensionality reduction techniques. The reduced-dimensional vectors were subsequently used as labels for speech data to train a model composed of convolutional neural networks (CNNs) and fully connected layers. Finally, we demonstrated the encoder's effectiveness through an application in speech command recognition.
لیست مقالات
لیست مقالات بایگانی شده
A hybrid CNN–transformer framework for retinal disease classification
Hanie Zomorrodi - Hassan Khotanlou
ارائه یک مدل تصمیم گیری چند معیاره فازی به منظور بهبود دقت فرایند تصمیم گیری به هنگام اختلال هوانوردی
فاطمه عطا عبدالرزاق - نگار مجمع
PC-MCLD: Pose-Constrained and Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Hanieh Fazli - Reza Azmi
طبقه بندی روش های شناسایی داده های تکراری در جهت تسهیل فرایند پاکسازی داده ها
مهدی جعفری - احمد عبدالله زاده بار فروش
Distributed Deep Reinforcement Learning for Energy-Efficient and Low-Latency Load Balancing in Mobile Edge Computing
Pooria Azizi - Siavash Khorsandi
FedCloak: Backdoor-Based Covert Channels in Federated Learning
Mohammad Matin Rezaeifard - Fatemeh Zahedi - Seyed Arsalan Vasegh Rahim Parvar - Reza Ebrahimi Atani
A parallel approach to the fractional time delay model for predicting the spread of COVID-19
Mahdi Movahedian Moghaddam - Kourosh Parand
Predicting Concentration of Particulate Matter (PM2.5) in Hamedan using Machine Learning Algorithms
Anita Karim Ghassabpour - Hatam Abdoli - Muharram Mansoorizadeh - Saeid Seyedi
Particle Swarm Optimization-Based Framework for 3D Swarm Robotic Navigation Using Artificial Potential Field Dynamics
Samim Kamyab - Masoud Shirzadeh - Ghoncheh Zand
پیشبینی بستری مجدد بیماران با استفاده از استخراج مفاهیم زیستپزشکی از متون بالینی
فهیمه شاهرخ شهرکی - رسول سامانی - دکتر ناصر قدیری فهیمه شاهرخ شهرکی - رسول سامانی - ناصر قدیری -
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0