0% Complete
English
صفحه اصلی
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
GanjNet: Leveraging Network Modeling with Large Language Models for Persian Word Sense Induction
نویسندگان :
Amir Mohammad Kouyeshpour
1
Hadi Veisi
2
Saman Haratizadeh
3
1- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
2- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
3- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
کلمات کلیدی :
Word Sense Induction،Network Modeling،Community Detection،Large Language Models،Persian NLP،Lexical Semantics
چکیده :
Abstract—This paper introduces GanjNet, a novel approach to Word Sense Induction (WSI) in the Persian language that leverages network modeling and community detection in conjunction with large language models (LLMs). We present a method that constructs semantic graphs from lexical substitutes generated by LLMs and applies community detection algorithms to uncover and distinguish word senses in unannotated text. GanjNet addresses challenges such as limited annotated resources, high degrees of polysemy, and context-sensitive meanings in Persian. By leveraging unsupervised techniques, we enhance sense induction without relying on extensive labeled data. Our experiments demonstrate that GanjNet outperforms existing methods on a custom dataset derived from MirasText, achieving a V-measure of 47% and a paired F-score of 58%, compared to the best baseline method with a V-measure of 41% and a paired F-score of 53%. These results showcase the potential of integrating community detection and LLMs for unsupervised semantic tasks in morphologically rich languages like Persian. Moreover, GanjNet’s flexibility offers practical applicability across various domains, including automatic thesaurus and WordNet generation, as well as assisting writers in context-sensitive word choice, demonstrating its broader impact on natural language understanding.
لیست مقالات
لیست مقالات بایگانی شده
Designing an AI-assisted toolbox for fitness activity recognition based on deep CNN
Ali Bidaran - Dr Saeed Sharifian
Energy–Aware Clustering Routing Protocol to Improve the Multi-hop WSN Lifetime
Alireza Gholamrezaee - Hoda Gholamrezaee - Mahtab Hadiyan
Classification of mental states of human concentration based on EEG signal
Mehran Safari Dehnavi - Vahid Safari Dehnavi - Dr Masoud Shafiee
The risk prediction of heart disease by using neuro-fuzzy and improved GOA
Vahid Safari Dehnavi - Masoud Shafiee
An efficient hybrid approach for performance-based alternative design evaluation in systems engineering
Abbas Chaman Para - Maryam Nooraei Abadeh - Sondos Bahadori
Scattering Wavelet-Based Image Quality Assessment Metric for Medical Images
Sina Omidvar - Jamshid Shanbehzadeh
Knowledge Extraction from Technical Reports Based on Large Language Models: An Exploratory Study
Parsa Bakhtiari - Hassan Bashiri - Alireza Khalilipour - Masoud Nasiripour - Moharram Challenger
Business Process Improvement Challenges: A Systematic Literature Review
Hanieh Kashfi - Fereidoon Shams Aliee
ISPREC: Integrated Scientific Paper Recommendation using heterogeneous information network
Elaheh Jafari - Dr Bita Shams - Dr Saman Haratizadeh
رویکردی در تشخیص خودکار بوهای بد در مدل های معماری سازمانی با استفاده از تحلیل گرافی
زهرا رحیمی تمندگانی - شهره آجودانیان
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0