0% Complete
English
صفحه اصلی
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Embedded speech encoder for low-resource languages
نویسندگان :
Alireza A.Tabatabaei
1
Pouria Sameti
2
Ali Bohlooli
3
1- University of Isfahan
2- University of Isfahan
3- University of Isfahan
کلمات کلیدی :
Embedded Systems،Embedded AI،Embedded Speech embedding
چکیده :
Although high-performance artificial intelligence (AI) models require substantial computational resources, embedded systems are constrained by limited hardware capabilities, such as memory and processing power. On the other hand, embedded systems have a broad range of applications, making the integration of AI and embedded systems a prominent topic in both hardware and AI research. Creating powerful speech embeddings for embedded systems is challenging, as such models, like Wave2Vec, are typically computationally intensive. Additionally, the scarcity of data for many low-resource languages further complicates the development of high-performance models. To address these challenges, we utilized BERT to generate speech embeddings. BERT was selected because, in addition to producing meaningful embeddings, it is trained on numerous low-resource languages and facilitates the design of efficient decoders. This study introduces a compact speech encoder tailored for low-resource languages, capable of functioning as an encoder across a diverse range of speech tasks. To achieve this, we utilized BERT to generate meaningful embeddings. However, due to the high dimensionality of BERT embeddings, which imposes significant computational demands on many embedded systems, we applied dimensionality reduction techniques. The reduced-dimensional vectors were subsequently used as labels for speech data to train a model composed of convolutional neural networks (CNNs) and fully connected layers. Finally, we demonstrated the encoder's effectiveness through an application in speech command recognition.
لیست مقالات
لیست مقالات بایگانی شده
A Blockchain Architecture for Secure, High-Speed P2P Energy Trades with Game-Theoretic Coalition Formation
Amin Aboutalebi Najafabadi - Seyed Hossein Hosseinian
Classification of Personality Traits on Facebook Using Key Phrase Extraction, Language Models and Machine Learning
Faezeh Safari - Abdolah Chalechale
A Biased Random Key Genetic Algorithm for the Dial-a-Ride Problem
ُSomayeh Sohrabi - Koorush Ziarati - Morteza Keshtkaran
A Deep Learning Framework for Phase-Aware Feature Representation to Improve Sound Source Direction and Distance Estimation
Zahra Abolfazli - Hamid Reza Abutalebi
تاثیر مدیریت دانش مشتری بر توسعه محصول جدید و نوآورانه با رویکرد مدل سازی معادلات ساختاری با استفاده از حداقل مربعات جزئی: مطالعۀ موردی شرکت کاله
دکتر آرش خسروی - سیده فاطمه حسینی - دکتر مرتضی رجب زاده آرش خسروی - سیده فاطمه حسینی - مرتضی رجب زاده -
یک سیستم پاسخ به نفوذ در شبکه های اینترنت اشیاء با استفاده از شبکه های مبتنی بر نرم افزار
احسان شاهرخی مینا - رضا محمدی - محمد نصیری
Improving Transition Cow Index Accuracy through CatBoost-Based Prediction of First Test-Day Milk Yield
Hoda Safaeipour - Sepehr Ebadi
Electrophysiological Modeling and Interactive Approaches of Electrical Circuits and Hypergraphs for Understanding Neural Circuit Dynamics
Arian Baymani - Maryam Naderi Soorki
Optimal selection of seed nodes by reducing the influence of common nodes in the influence maximization problem
Farzaneh Kazemzadeh - Ali Asghar Safaei - Mitra Mirzarezaee
Vi-Net: A Deep Violent Flow Network for Violence Detection in Video Sequences
Tahereh Zarrat Ehsan - Seyed Mehdi Mohtavipour
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0