0% Complete
English
صفحه اصلی
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Integrating Wasserstein GANs for High-Speed Transformer-Based Neural Machine Translation
نویسندگان :
Parisa Nekoogol
1
Mostafa Salehi
2
1- دانشگاه تهران
2- دانشگاه تهران
کلمات کلیدی :
Neural Machine Translation،Generative Adversarial Networks،Reinforcement Learning،Transformer
چکیده :
Neural machine translation (NMT), a key achievement in natural language processing (NLP), continues to face challenges such as producing low-quality output for complex sentences and lacking natural fluency. This study aimed to improve machine translation quality by integrating Generative Adversarial Networks (GANs) with an NMT model. Initially, the baseline NMT model, derived from previous research and based on recurrent neural networks (RNNs), was reconstructed and implemented. Subsequently, this architecture was replaced with the advanced Transformer architecture, and the system was developed using a Wasserstein Generative Adversarial Network (WGAN). To overcome the crucial problem of textual data discontinuity (non-differentiability), the Self-Critical Sequence Training (SCST) method, a reinforcement learning (RL) algorithm, was employed. A core objective was to analyze the performance benefits of adversarial training when applied to a robust Transformer-based generator. The research concluded that while adversarial training enhances the model's performance in generating more fluent translations, this particular improvement is more substantial and notable for models based on recurrent neural networks compared to the Transformer architecture.
لیست مقالات
لیست مقالات بایگانی شده
طراحی واسط کاربری مبتنی بر رفتار و احساسات کاربران در سیستم های هوشمند
فاطمه صبائی - دکتر احمد عبداله زاده بارفروش
IT-based and Non-IT-based methods to separate and collect waste
Hoda Harati - Farzad Haghighi-Rad - Reza Yousefi Zenouz
شناسایی حملات فیشینگ با استفاده از الگوریتم عقاب آتشین و شبکه عصبی کانولوشن
علی کوشاری - مهدی فرتاش
Improving Training Stability in Variational Autoencoders Through the Integration of Score Matching Loss
Amirreza Mokhtari Rad - Pouya Ardehkhani - Hormehr Alborzi
User Preferences Elicitation in Bilateral Automated Negotiation Using Recursive Least Square Estimation
Farnaz Salmanian - Dr Hamid Jazayeri - Dr Javad Kazemitabar
Electrophysiological Modeling and Interactive Approaches of Electrical Circuits and Hypergraphs for Understanding Neural Circuit Dynamics
Arian Baymani - Maryam Naderi Soorki
ارائه مدل هشت مولفه ای استراتژی جامع هوش مصنوعی سازمانی
محمد کاظم صیادی - نیلوفر مرادحاصل - علیرضا یاری
مدل یادگیری ماشین برای تشخیص تقلب در کارتهای اعتباری با رویکرد بهینهسازی AUC و تنظیم خودکار ابرپارامترها
محمد مهدی متولی
Improving hypergraph attention and hypergraph convolution networks
Mustafa Mohammadi Gharasuie - Mahmood Shabankhah - Ali Kamandi
An LLM-Based Approach for Clarifying the Decisions of Vision Models in Autonomous Vehicles
Omid Mosalmani - Mohammad Javad Rashti - Seyed Enayat Alavi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0