0% Complete
English
صفحه اصلی
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Knowledge Extraction from Technical Reports Based on Large Language Models: An Exploratory Study
نویسندگان :
Parsa Bakhtiari
1
Hassan Bashiri
2
Alireza Khalilipour
3
Masoud Nasiripour
4
Moharram Challenger
5
1- دانشگاه صنعتی همدان
2- دانشگاه صنعتی همدان
3- University of Antwerp
4- دانشگاه صنعتی همدان
5- University of Antwerp
کلمات کلیدی :
Knowledge Extraction،Large Language Model،Fine Tuning
چکیده :
Organizations and companies possess a vast amount of documents generated over the years. These documents contain valuable information and knowledge that can be instrumental in resolving ambiguities and challenges experts face. Information retrieval and knowledge management systems are tools for extracting documents relevant to users’ informational needs, addressing part of the knowledge extraction challenge from these document collections. With the emergence of generative artificial intelligence and large language models that exhibit strong capa- bilities in understanding textual documents, knowledge extraction solutions have shifted towards utilizing these models. Large language models possess general knowledge obtained from pre- training methods, and there are various approaches to infuse domain-specific knowledge into the general understanding of the language model. This research first examines the possible techniques for fine-tuning a large language model in a specific domain. We then train the model using fine-tuning methods on a collection of documents and technical reports from the industry. Finally, we measure the improvement in the large language model’s capability to extract domain-specific knowledge.
لیست مقالات
لیست مقالات بایگانی شده
A Neural-based Approach to Aid Early Parkinson's Disease Diagnosis
Dr Armin Salimi-badr - Mohammad Hashemi
دستهبندی متون خبری فارسی با یادگیری فعال
مینا طباطبائی - دکتر سعیده ممتازی
Extending Interaction Flow Modeling Language as a Profile for Form-making Systems
Ghazaleh Shahin - Dr Bahman Zamani
جمعآوری، تحلیل و خلاصه سازی نظرات کاربران فارسی زبان در شبکههای اجتماعی پیرامون بیماری فراگیر کووید-19
محمدرضا شمس - محمد یاسین فخار محمدرضا شمس - محمد یاسین فخار -
KGLM-QA: A Novel Approach for Knowledge Graph-Enhanced Large Language Models for Question Answering
Alireza Akhavan safaei - Pegah Saboori - Reza Ramezani - Mohammadali Nematbakhsh
IT-based and Non-IT-based methods to separate and collect waste
Hoda Harati - Farzad Haghighi-Rad - Reza Yousefi Zenouz
تحویل بهینه جریان پخش زنده HTTP: یک رویکرد ترکیبی سرور- شبکه
فائزه امینی تهرانی - احمدرضا منتظرالقائم
AOV-IDS: Arithmetic Optimizer with Voting classifier for Intrusion Detection System
Amir Soltany Mahboob - Mohammad Reza Ostadi Moghaddam - Shima Yousefi
شناسایی حملات رومینگ تلفنهمراه با استفاده از یادگیری ماشین
سعیده سیف الدین - سجاد شیرعلی شهرضا
A High-Speed Quantum Reversible Controlled Adder/Subtractor Circuit
Negin Mashayekhi - Mohammad Reza Reshadinezhad - Shekoofeh Moghimi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0