15th International Conference on Information and Knowledge Technology
Knowledge Extraction from Technical Reports Based on Large Language Models: An Exploratory Study
Authors:
Parsa Bakhtiari (1)
Hassan Bashiri (2)
Alireza Khalilipour (3)
Masoud Nasiripour (4)
Moharram Challenger (5)
1- Hamedan University of Technology
2- Hamedan University of Technology
3- University of Antwerp
4- Hamedan University of Technology
5- University of Antwerp
Keywords:
Knowledge Extraction, Large Language Model, Fine Tuning
Abstract:
Organizations and companies accumulate large volumes of documents over the years. These documents contain valuable information and knowledge that can help experts resolve the ambiguities and challenges they face. Information retrieval and knowledge management systems retrieve documents relevant to users' information needs, which addresses part of the challenge of extracting knowledge from such collections. With the emergence of generative artificial intelligence and large language models that exhibit strong capabilities in understanding text, knowledge extraction solutions have shifted toward these models. Large language models acquire general knowledge through pre-training, and there are various approaches for infusing domain-specific knowledge into that general understanding. This research first examines the possible techniques for fine-tuning a large language model for a specific domain. We then fine-tune the model on a collection of industrial documents and technical reports. Finally, we measure the improvement in the model's ability to extract domain-specific knowledge.
Article list
Archived article list
Knowledge Graph Based Retrieval-Augmented Generation for Multi-Hop Question Answering Enhancement
Mahdi Amiri Shavaki - Pouria Omrani - Ramin Toosi - Mohammad Ali Akhaee
Context Awareness Gate for Retrieval Augmented Generation
Mohammad Hassan Heydari - Arshia Hemmat - Erfan Naman - Afsaneh Fatemi
Writer-Independent Signature Verification with Enhanced AlexNet and Preprocessing Analysis
Mohammadreza Gholipour Shahraki - Mohammad Ghasemzadeh
A Hybrid Method to Reduce the Voltage Consumption in the Spiking Neural Networks
Shaghayegh Mehdizadeh saraj - Seyyed Amir Asghari - Mohammadreza Binesh Marvasti
Aspect-Based Sentiment Analysis of After-Sales Service Quality: A Case Study of Snowa and Competitors Using Digikala Reviews
Safiyeh Samadanian - Marjan Kaedi
Predicting Patient Readmission Using Biomedical Concept Extraction from Clinical Texts
Fahimeh Shahrokh Shahraki - Rasoul Samani - Nasser Ghadiri
Optimal control of robotic hand for rehabilitation using fractional order systems and EEG signal processing
Mehran Safari Dehnavi - Vahid Safari Dehnavi - Masoud Shafiee
Improving Traceability in the Supply Chain Using Blockchain Technology
Seyed Emad Mousavi - Mehrdad Ashtiani
A Survey on Utilizing Reinforcement Learning in Wireless Sensor Networks Routing Protocols
Ali Forghani Elah Abadi - Seyedeh Elham Asghari - Sepideh Sharifani - Seyyed Amir Asghari - Mohammadreza Binesh Marvasti
Classification of Personality Traits on Facebook Using Key Phrase Extraction, Language Models and Machine Learning
Faezeh Safari - Abdolah Chalechale