0% Complete
English
صفحه اصلی
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
GanjNet: Leveraging Network Modeling with Large Language Models for Persian Word Sense Induction
نویسندگان :
Amir Mohammad Kouyeshpour
1
Hadi Veisi
2
Saman Haratizadeh
3
1- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
2- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
3- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
کلمات کلیدی :
Word Sense Induction،Network Modeling،Community Detection،Large Language Models،Persian NLP،Lexical Semantics
چکیده :
Abstract—This paper introduces GanjNet, a novel approach to Word Sense Induction (WSI) in the Persian language that leverages network modeling and community detection in conjunction with large language models (LLMs). We present a method that constructs semantic graphs from lexical substitutes generated by LLMs and applies community detection algorithms to uncover and distinguish word senses in unannotated text. GanjNet addresses challenges such as limited annotated resources, high degrees of polysemy, and context-sensitive meanings in Persian. By leveraging unsupervised techniques, we enhance sense induction without relying on extensive labeled data. Our experiments demonstrate that GanjNet outperforms existing methods on a custom dataset derived from MirasText, achieving a V-measure of 47% and a paired F-score of 58%, compared to the best baseline method with a V-measure of 41% and a paired F-score of 53%. These results showcase the potential of integrating community detection and LLMs for unsupervised semantic tasks in morphologically rich languages like Persian. Moreover, GanjNet’s flexibility offers practical applicability across various domains, including automatic thesaurus and WordNet generation, as well as assisting writers in context-sensitive word choice, demonstrating its broader impact on natural language understanding.
لیست مقالات
لیست مقالات بایگانی شده
DRL-Based Phase Optimization for O-RIS in Dual-Hop Hard Switching FSO/RIS-aided RF and UWOC Systems
Aboozar Heydaribeni - Hamzeh Beyranvand - Sahar Eslami
Short-Term Traffic Flow Prediction Based on a Recurrent Deep Neural Networks: Study in Tehran
Dr Monireh عبدوس - Taha Vajed Samei
Persian Language Understanding in Task-oriented Dialogue System for Online Shopping
Zeinab Borhanifard - Hossein Basafa - Seyedeh Zahra Razavi - Heshaam Faili
یک روش انتخاب ویژگی نیمهنظارتی جدید بر اساس منظمسازی هسین
دکتر راضیه شیخ پور راضیه شیخ پور -
Improving Long-Term Engagement of Insurance Brokerages by Providing Gamified Configurations Based on The Delphi Method
Hosein Bayati - Fattaneh Taghiyareh - Sahand Hashemi
PC-MCLD: Pose-Constrained and Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Hanieh Fazli - Reza Azmi
خوشه بندی مقید داده ها به کمک اتوماتای یادگیر سلولی
شکوفه علی محمدی - احمدعلی آبین
Multi-Modal Longitudinal Tooth Labeling with Temporal Graph–Transformer Integration
Maral Mirza mohammadi - Mahdi Tarom
تشخیص خودکار اختلال عروقی ماکولا با عنوان عروق گسترش یافته در تصاویر آنژیوگرافی حاصل از تصویربرداری OCTA
راضیه گنجی - دکتر محسن ابراهیمی مقدم - دکتر رامین نوری نیا
A Data-Driven Hybrid Algorithm for 2D Path Planning via Modeling and Metaheuristic-Based Identification
Vahid Safari Dehnavi - Masoud Shafiee
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.2