0% Complete
فارسی
Home
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Agentic Username Suggestion and Multimodal Gender Detection in Online Platforms: Introducing the PNGT-26K Dataset
Authors :
Farbod Bijary
1
Mohsen Ebadpour
2
Amirhosein Tajbakhsh
3
1- دانشگاه صنعتی امیرکبیر (پلیتکنیک تهران)
2- دانشگاه صنعتی امیرکبیر (پلیتکنیک تهران)
3- دانشگاه علم و صنعت ایران
Keywords :
agentic ai،multimodal learning،persian nlp،multilingual nlp،gender detection
Abstract :
Persian names present unique challenges for natural language processing applications, particularly in gender detection and digital identity creation, due to transliteration inconsistencies and cultural-specific naming patterns. Existing tools exhibit significant performance degradation on Persian names, while the scarcity of comprehensive datasets further compounds these limitations. To address these challenges, the present research introduces PNGT-26K, a comprehensive dataset of Persian names, their commonly associated gender, and their English transliteration, consisting of approximately 26,000 tuples. As a demonstration of how this resource can be utilized, we also introduce two frameworks, namely Open Gender Detection and Nominalist. Open Gender Detection is a production-grade, ready-to-use framework for using existing data from a user, such as profile photo and name, to give a probabilistic guess about the person's gender. Nominalist, the second framework introduced by this paper, utilizes agentic AI to help users choose a username for their social media accounts on any platform. It can be easily integrated into any website to provide a better user experience. The PNGT-26K dataset, Nominalist, and Open Gender Detection frameworks are publicly available on Github.
Papers List
List of archived papers
Improving Transition Cow Index Accuracy through CatBoost-Based Prediction of First Test-Day Milk Yield
Hoda Safaeipour - Sepehr Ebadi
Application of Artificial Intelligence and Remote Sensing for Oil Spill Detection
َAmir Reza Ziaee - Masomeh Azimzadeh - Parvin Ahmadi
From Faces to Words: An Efficient Persian Visual Lip Reading
Mana Amini - Sajjad Aemmi - Azadeh Ashouri - Reza Akhoundzadeh - Kourosh Hassanzadeh - Mohammad Reza Mohammadi
Task Scheduling for Real-time Object Detection: Methods and Performance Comparison in ADAS Applications
Mahdi Seyfipoor - Sayyed Muhammad Jaffry - Siamak Mohamadi
خوشهبندی موثر در استخراج توضیحات مفهوممحور خودکار برای شبکههای پیچشی
سعید معروف - مریم امیرمزلقانی - رضا صفابخش
بررسی تأثیر استقرار استاندارد COBIT در افزایش بهره وری سازمانها (مطالعه موردی: شعب نمایندگیهای همراه اول، ایرانسل، رایتل)
دکتر محمد ابراهیم سمیع - ساره رحمانیان محمد ابراهیم سمیع - ساره رحمانیان -
شبکههای نرمافزار محور در کلان داده: مطالعهی راهکارهای امنیتی و چالشها
احسان سلیمانی دهکردی - محمدرضا ملاخلیلی میبدی
Knowledge Extraction from Technical Reports Based on Large Language Models: An Exploratory Study
Parsa Bakhtiari - Hassan Bashiri - Alireza Khalilipour - Masoud Nasiripour - Moharram Challenger
A Demand Response Schema in Industry: Smart Scheduling Approach for Industrial Processes
Negin Shafinezhad - Hamid Abrishami - Maryam Mahmoodi
شناسایی جایگاه مالونیلاسیون در پروتئینها با بهرهگیری از استخراج ویژگی و تکنیکهای پردازش زبان طبیعی
حنانه رجبیون - محمد قاسم زاده - وحید رنجبر بافقی
more
Samin Hamayesh - Version 43.8.0