0% Complete
فارسی
Home
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Leveraging Retrieval-Augmented Generation for Persian University Knowledge Retrieval
Authors :
Arshia Hemmat
1
Mohammad Hassan Heydari
2
Kianoosh Vadaei
3
Afsaneh Fatemi
4
1- University of Isfahan
2- University of Isfahan
3- University of Isfahan
4- University of Isfahan
Keywords :
Large Language Models،Natural Language Processing،Retrieval Augmented Generation،Dataset Generation،QuestionAnswering System
Abstract :
This paper introduces an innovative approach using Retrieval-Augmented Generation (RAG) pipelines with Large Language Models (LLMs) to enhance information retrieval and query response systems for university-related question answering. By systematically extracting data from the university's official website, primarily in Persian, and employing advanced prompt engineering techniques, we generate accurate and contextually relevant responses to user queries. We developed a comprehensive university benchmark, UniversityQuestionBench (UQB), to rigorously evaluate our system’s performance. UQB focuses on Persian-language data, assessing accuracy and reliability through various metrics and real-world scenarios. Our experimental results demonstrate significant improvements in the precision and relevance of generated responses, enhancing user experiences, and reducing the time required to obtain relevant answers. In summary, this paper presents a novel application of RAG pipelines and LLMs for Persian-language data retrieval, supported by a meticulously prepared university benchmark, offering valuable insights into advanced AI techniques for academic data retrieval and setting the stage for future research in this domain.\footnote{Dataset is publicly available at \url{https://huggingface.co/datasets/UIAIC/UQB}}
Papers List
List of archived papers
Design and modeling of a waiter robot
Amin Mohammadnejad - Hami Tourajizadeh
UltraLearn: Next-Generation CyberSecurity Learning Platform
Saeed Raisi - Saeid Ghasemshirazi - Ghazaleh Shirvani
HTCAR: Hierarchical Text Classification based on aggregation of Representations
Ali Bavand - Mohammad Mehdi Homayounpour - Ahmad Nickabadi
Dealing with Black-hole Attacks in Inter-vehicle Networks Using the Packet Delivery Rate Algorithm
Marzieh Sedighi - Mehdi Hamidkhani - Mostafa Sadeghi
نقشه های شناختی فازی پیشرفته (FCM) رویکردی برای مدل سازی سیستم های پیچیده ی پویا
فریبا اسلامی امیرآبادی - کمال میرزایی بدرآبادی
بکارگیری الگوریتم بهینه سازی فاخته و منطق فازی به منظور بهبود زمانبندی وظایف در محیط محاسبات مه
فاطمه دوامی - حمید جلیلوند - فاطمه نجفی
Cryptanalysis of two password authenticated key exchange schemes
Mohammad Ali Poorafsahi - Hamid Mala
پیشبینی حجم ترافیک شهری با استفاده از دادههای سرویس نشان مورد مطالعاتی: خیابان کمال اصفهان
مهسا لطیفی - جمشید مالکی
Web Service Ranking based on QoS and Use Prefer
Seyed Hossein Siadat - Danial Ramezani - Fatemeh Ahani
شناسایی حملات رومینگ تلفنهمراه با استفاده از یادگیری ماشین
سعیده سیف الدین - سجاد شیرعلی شهرضا
more
Samin Hamayesh - Version 42.5.2