15th International Conference on Information and Knowledge Technology
Knowledge Extraction from Technical Reports Based on Large Language Models: An Exploratory Study
Authors:
Parsa Bakhtiari (1)
Hassan Bashiri (2)
Alireza Khalilipour (3)
Masoud Nasiripour (4)
Moharram Challenger (5)
1- Hamedan University of Technology
2- Hamedan University of Technology
3- University of Antwerp
4- Hamedan University of Technology
5- University of Antwerp
Keywords:
Knowledge Extraction, Large Language Model, Fine Tuning
Abstract:
Organizations and companies accumulate vast numbers of documents over the years. These documents contain valuable information and knowledge that can be instrumental in resolving the ambiguities and challenges experts face. Information retrieval and knowledge management systems retrieve the documents relevant to users' informational needs, which addresses only part of the challenge of extracting knowledge from these collections. With the emergence of generative artificial intelligence and large language models, which exhibit strong capabilities in understanding textual documents, knowledge extraction solutions have shifted towards these models. Large language models acquire general knowledge through pre-training, and there are various approaches to infusing domain-specific knowledge into that general understanding. This research first examines the possible techniques for fine-tuning a large language model in a specific domain. We then fine-tune the model on a collection of documents and technical reports from industry. Finally, we measure the improvement in the large language model's capability to extract domain-specific knowledge.