0% Complete
English
صفحه اصلی
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Integrating Wasserstein GANs for High-Speed Transformer-Based Neural Machine Translation
نویسندگان :
Parisa Nekoogol
1
Mostafa Salehi
2
1- دانشگاه تهران
2- دانشگاه تهران
کلمات کلیدی :
Neural Machine Translation،Generative Adversarial Networks،Reinforcement Learning،Transformer
چکیده :
Neural machine translation (NMT), a key achievement in natural language processing (NLP), continues to face challenges such as producing low-quality output for complex sentences and lacking natural fluency. This study aimed to improve machine translation quality by integrating Generative Adversarial Networks (GANs) with an NMT model. Initially, the baseline NMT model, derived from previous research and based on recurrent neural networks (RNNs), was reconstructed and implemented. Subsequently, this architecture was replaced with the advanced Transformer architecture, and the system was developed using a Wasserstein Generative Adversarial Network (WGAN). To overcome the crucial problem of textual data discontinuity (non-differentiability), the Self-Critical Sequence Training (SCST) method, a reinforcement learning (RL) algorithm, was employed. A core objective was to analyze the performance benefits of adversarial training when applied to a robust Transformer-based generator. The research concluded that while adversarial training enhances the model's performance in generating more fluent translations, this particular improvement is more substantial and notable for models based on recurrent neural networks compared to the Transformer architecture.
لیست مقالات
لیست مقالات بایگانی شده
An Eco-Friendly Cosmopolitan (EFC) by Recycling Scientific/Industrial Towns (RSITs)
Engineer Reza Khalilian - Dr. Abdalhossein Rezai - Dr. Mohammadreza Talakesh
Extending Interaction Flow Modeling Language as a Profile for Form-making Systems
Ghazaleh Shahin - Dr Bahman Zamani
Impact of ICT and Digital Evolution on Capital Structure in Companies
Ali Noori
Agentic Username Suggestion and Multimodal Gender Detection in Online Platforms: Introducing the PNGT-26K Dataset
Farbod Bijary - Mohsen Ebadpour - Amirhosein Tajbakhsh
شبکههای نرمافزار محور در کلان داده: مطالعهی راهکارهای امنیتی و چالشها
احسان سلیمانی دهکردی - محمدرضا ملاخلیلی میبدی
An Improved Image Classification Based In Feature Extraction From Convolutional Neural Network: Application To Flower Classification
Faeze Sadati - Dr Behrooz Rezaie
A Fuzzy Cluster-Based Routing Algorithm to Extend Wireless Sensor Network Lifetime
Mostafa Mirzaie - Armin Mazinani - Dr Sayyed Majid Mazinani
STANet: Spatio-Temporal Attention-Enhanced WaveNet for Crime Hotspot Prediction
Rojan Roshankar - Mohammad Reza Keyvanpour
An integrated approach for estimating software cost estimation using Adaptive Neuro-Fuzzy Inference System and the Grey Wolf Optimization algorithm
Maryam Karimi - Taghi Javdani Gandomani - Mahdi Mosleh
Open-domain question classification and completion in conversational information search
Omid Mohammadi Kia - Mahmood Neshati - Mahsa Soudi Alamdari
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.2