12th International Conference on Information and Knowledge Technology
Conceptual Intelligent Model for Visual Question Answering using Attention Mechanism and Relational Reasoning
Authors :
Elham Alighardash
1
Hassan Khotanlou
2
Vahid Pour Amin
3
1- Bu-Ali Sina University
2- Bu-Ali Sina University
3- Sayyed Jamaleddin Asadabadi University
Keywords :
visual question answering, attention mechanism, visual reasoning, zero-shot learning
Abstract :
In recent years, Visual Question Answering (VQA) has attracted a great deal of research interest and has become a hot topic in computer vision. Many sub-problems have been raised in this regard, and considerable effort has been made to solve them. Examples include attending to the salient elements of different modalities, discovering inter- and intra-modality correlations, choosing a proper information fusion method, using supplementary information from external knowledge bases, visual reasoning, and accepting correct answers that were not seen in the training set. This paper focuses on reinforcing the model by reasoning about complex questions, applying the attention mechanism, and leveraging knowledge graphs (KG) to improve the generated answers. Moreover, the proposed conceptual model includes a zero-shot learning method that admits unlabeled correct answers by implementing a semantic space mapping approach. The research suggests using the fact-based VQA knowledge base to integrate the scene graph with additional information. Implementing the proposed framework is expected to improve both the accuracy and the efficiency of predicting appropriate answers.
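The semantic space mapping idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' model: the linear projection, the cosine-similarity scoring, and every name and number below (`project`, `predict_answer`, `WEIGHTS`, `ANSWER_EMBEDDINGS`, the toy vectors) are assumptions made for illustration only. The sketch shows how a fused image-question feature vector, once mapped into the same space as answer embeddings, can select an answer that never appeared as a training label.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def project(features, weights):
    """Hypothetical linear map from fused features into the answer embedding space."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]


def predict_answer(features, weights, answer_embeddings):
    """Return the candidate answer whose embedding is closest to the projected features."""
    query = project(features, weights)
    return max(answer_embeddings, key=lambda a: cosine(query, answer_embeddings[a]))


# Toy 2-D answer embeddings (assumed values). "puppy" stands in for an answer
# that was never a training label but still has an embedding.
ANSWER_EMBEDDINGS = {
    "dog": [1.0, 0.0],
    "cat": [0.0, 1.0],
    "puppy": [0.9, 0.1],
}

# Assumed 2x3 projection matrix; in practice this would be learned.
WEIGHTS = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
]

fused_features = [0.8, 0.2, 0.5]  # placeholder for attended image-question features
prediction = predict_answer(fused_features, WEIGHTS, ANSWER_EMBEDDINGS)
print(prediction)
```

Because the prediction is a nearest-neighbour lookup in embedding space rather than a choice among fixed classifier outputs, any answer with an embedding can be returned, including ones without training labels.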
Papers List
List of archived papers
Enhancing kNN-Based Intrusion Detection with Differential Evolution with Auto-Enhanced Population Diversity
Zohre Karimi - Zeinab Torabi
Mode Selection and Resource Allocation in D2D-Enabled MC-NOMA using Matching Theory
Alireza Gholamrezaee - Hamid Farrokhi - Javad Zeraatkar Moghaddam
Distributed Learning Automata-based Algorithm for Finding K-Clique in Complex Social Networks
Mohammad Mehdi Daliri Khomami - Alireza Rezvanian - Ali Mohammad Saghiri - Mohammad Reza Meybodi
An Attention-Enhanced Hybrid Deep Learning Framework for Detecting Denial-of-Wallet Attacks in Serverless Platforms
Mohammad Mehmandoost - HadiShahriar Shahhoseini
A perceptual loss for screen content image super-resolution
Hossein Sekhavaty-Moghadam - Marzieh Hosseinkhani - Dr Azadeh Mansouri
Evaluation and Planning of the Proposed Implementation of Artificial Intelligence in Iran's Petrochemical Industry
امین رضا انصاری - احد قائمی - سید مهدی کوچک کوثری
A Deep Learning Model with Multi-Scale Temporal Representation for Information Cascade Prediction in Social Networks
مبینا پناهی - مهدی عمادی
Effective Design of Reversible 2×2 Vedic Multiplier With Low Cost
Mojtaba Noorallahzadeh - Mohammad Mosleh - Ali Shahidikia
Application of Artificial Intelligence and Remote Sensing for Oil Spill Detection
Amir Reza Ziaee - Masomeh Azimzadeh - Parvin Ahmadi
A Framework for Systematic Stability Assessment of Post-hoc Explanations in Text Classification
Parman Mohammadalizadeh - Parham Mohammadalizadeh - Ayda Mahmoudian