0% Complete
English
صفحه اصلی
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Integrating Wasserstein GANs for High-Speed Transformer-Based Neural Machine Translation
نویسندگان :
Parisa Nekoogol
1
Mostafa Salehi
2
1- دانشگاه تهران
2- دانشگاه تهران
کلمات کلیدی :
Neural Machine Translation،Generative Adversarial Networks،Reinforcement Learning،Transformer
چکیده :
Neural machine translation (NMT), a key achievement in natural language processing (NLP), continues to face challenges such as producing low-quality output for complex sentences and lacking natural fluency. This study aimed to improve machine translation quality by integrating Generative Adversarial Networks (GANs) with an NMT model. Initially, the baseline NMT model, derived from previous research and based on recurrent neural networks (RNNs), was reconstructed and implemented. Subsequently, this architecture was replaced with the advanced Transformer architecture, and the system was developed using a Wasserstein Generative Adversarial Network (WGAN). To overcome the crucial problem of textual data discontinuity (non-differentiability), the Self-Critical Sequence Training (SCST) method, a reinforcement learning (RL) algorithm, was employed. A core objective was to analyze the performance benefits of adversarial training when applied to a robust Transformer-based generator. The research concluded that while adversarial training enhances the model's performance in generating more fluent translations, this particular improvement is more substantial and notable for models based on recurrent neural networks compared to the Transformer architecture.
لیست مقالات
لیست مقالات بایگانی شده
A Mathematical Optimization Approach for Preference Learning in Movie Recommender Systems with Shared Accounts
Milad Khademali - Fazlollah Aghamohammadi - Marjan Kaedi - Alireza Nasiri
Improving Transition Cow Index Accuracy through CatBoost-Based Prediction of First Test-Day Milk Yield
Hoda Safaeipour - Sepehr Ebadi
بررسی روشها، مجموعههای داده و معیارهای ارزیابی در حوزهی پرسش از متون درون تصویر
کبری فرشیدی - حسن ختنلو - محرم منصوری زاده - الهام علی قارداش
بررسی تأثیر استقرار استاندارد COBIT در افزایش بهره وری سازمانها (مطالعه موردی: شعب نمایندگیهای همراه اول، ایرانسل، رایتل)
دکتر محمد ابراهیم سمیع - ساره رحمانیان محمد ابراهیم سمیع - ساره رحمانیان -
Ensemble Model Based on an Improved Convolutional Neural Network with a Domain-agnostic Data Augmentation Technique
Faraz Fatahnaie - Armin Azhdehnia - Seyyed Amir Asghari - Mohammadreza Binesh Marvasti
Paths-oriented Test Data Generation using Genetic Algorithm
Mohammad Reza Hassanpour Charmchi - Dr Bagher Rahimpour cami
Using Trust Statements and Ratings by GraphSAGE to Alleviate Cold Start in Recommender Systems
Seyedeh Niusha Motevallian - Dr Seyed Mohammad Hossein Hasheminejad
Information Technology Risk Management Model for Remote Control Vehicles
Hamid Reza Naji - Aref Ayati
A Nano-based High-Speed QCA circuit for Information Security with Image Masking
Saeid Seyedi - Hatam Abdoli
بهینهسازی مسیر وسیله ی نقلیه ی هوایی بدون سرنشین جهت کاهش زمان جمع آوری داده از حسگرها در شبکه ی اینترنت اشیا مبتنی بر الگوریتم یادگیری تقویتی عمیق
محمد ناظمی جنابی - هادی اشعریون - مهدی پورقلی
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0