0% Complete
English
صفحه اصلی
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
GanjNet: Leveraging Network Modeling with Large Language Models for Persian Word Sense Induction
نویسندگان :
Amir Mohammad Kouyeshpour
1
Hadi Veisi
2
Saman Haratizadeh
3
1- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
2- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
3- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
کلمات کلیدی :
Word Sense Induction،Network Modeling،Community Detection،Large Language Models،Persian NLP،Lexical Semantics
چکیده :
Abstract—This paper introduces GanjNet, a novel approach to Word Sense Induction (WSI) in the Persian language that leverages network modeling and community detection in conjunction with large language models (LLMs). We present a method that constructs semantic graphs from lexical substitutes generated by LLMs and applies community detection algorithms to uncover and distinguish word senses in unannotated text. GanjNet addresses challenges such as limited annotated resources, high degrees of polysemy, and context-sensitive meanings in Persian. By leveraging unsupervised techniques, we enhance sense induction without relying on extensive labeled data. Our experiments demonstrate that GanjNet outperforms existing methods on a custom dataset derived from MirasText, achieving a V-measure of 47% and a paired F-score of 58%, compared to the best baseline method with a V-measure of 41% and a paired F-score of 53%. These results showcase the potential of integrating community detection and LLMs for unsupervised semantic tasks in morphologically rich languages like Persian. Moreover, GanjNet’s flexibility offers practical applicability across various domains, including automatic thesaurus and WordNet generation, as well as assisting writers in context-sensitive word choice, demonstrating its broader impact on natural language understanding.
لیست مقالات
لیست مقالات بایگانی شده
Task Scheduling for Real-time Object Detection: Methods and Performance Comparison in ADAS Applications
Mahdi Seyfipoor - Sayyed Muhammad Jaffry - Siamak Mohamadi
Intelligent Transportation System (ITS) Using Internet of Things (IoT)
Engineer Reza Khalilian - Dr. Abdalhossein Rezai - Dr. Sayyed Mohammad Reza Talakesh
ارائه یک رویکرد معنایی مبتنی بر آنتولوژی به منظور شناسایی تاکتیکهای معماری
احسان شریفی - دکتر احمد عبدالله زاده بارفروش
A novel approach audio watermarking based on (GBT,DCT,SVD)
Mahdi Mosleh
طراحی و پیاده سازی بستر اجرای بازی جنگ سایبری
مریم نصراصفهانی - بهروز ترک لادانی - بهروز شاهقلی قهفرخی - حسین قجاوند بلتیجه - نوید شیرمحمدی - مهدی شمس - محمدامین آقاکبیری
SBST challenges from the perspective of the test techniques
Sepideh Kashefi Gargari - Dr Mohammad Reza Keyvanpour
An integrated approach for estimating software cost estimation using Adaptive Neuro-Fuzzy Inference System and the Grey Wolf Optimization algorithm
Maryam Karimi - Taghi Javdani Gandomani - Mahdi Mosleh
A Graph Attention-Based Autoencoder for Critical Path Anomaly Detection in Microservices
Mahdi Naderi - Hossein Momeni - Shayan Shahini
Investigating the impact of management information systems (MIS) on organizational transparency with an emphasis on work ethics
Sadegh Balouch - Omid mehdi Ebadati
خوشه بندی مقید داده ها به کمک اتوماتای یادگیر سلولی
شکوفه علی محمدی - احمدعلی آبین
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.2