16th International Conference on Information and Knowledge Technology
Agentic Username Suggestion and Multimodal Gender Detection in Online Platforms: Introducing the PNGT-26K Dataset
Authors:
Farbod Bijary (1), Mohsen Ebadpour (2), Amirhosein Tajbakhsh (3)
1- Amirkabir University of Technology (Tehran Polytechnic)
2- Amirkabir University of Technology (Tehran Polytechnic)
3- Iran University of Science and Technology
Keywords:
agentic AI, multimodal learning, Persian NLP, multilingual NLP, gender detection
Abstract:
Persian names present unique challenges for natural language processing applications, particularly in gender detection and digital identity creation, due to transliteration inconsistencies and culture-specific naming patterns. Existing tools exhibit significant performance degradation on Persian names, and the scarcity of comprehensive datasets compounds these limitations. To address these challenges, this research introduces PNGT-26K, a comprehensive dataset of approximately 26,000 tuples, each consisting of a Persian name, its commonly associated gender, and its English transliteration. To demonstrate how this resource can be used, we also introduce two frameworks: Open Gender Detection and Nominalist. Open Gender Detection is a production-grade, ready-to-use framework that uses existing user data, such as a profile photo and name, to produce a probabilistic estimate of the person's gender. Nominalist uses agentic AI to help users choose a username for their social media accounts on any platform, and it can be easily integrated into any website to provide a better user experience. The PNGT-26K dataset and the Nominalist and Open Gender Detection frameworks are publicly available on GitHub.
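As a rough illustration of how a dataset of (Persian name, gender, transliteration) tuples could feed a probabilistic gender estimate, the sketch below looks a name up in a small index and optionally fuses the result with a score from an image model. This is not the Open Gender Detection API. The record fields, the two sample rows, the 0.9 lookup confidence, and the fusion rule are all assumptions made for the example, and the rows are placeholders, not entries copied from PNGT-26K.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass(frozen=True)
class NameRecord:
    """One hypothetical tuple: Persian name, English transliteration, gender."""
    persian: str
    transliteration: str
    gender: str  # "M" or "F" (assumed encoding)


# Placeholder rows for illustration only; not taken from the real dataset.
SAMPLE_RECORDS = [
    NameRecord("علی", "Ali", "M"),
    NameRecord("مریم", "Maryam", "F"),
]


def build_index(records: Iterable[NameRecord]) -> Dict[str, str]:
    """Map both the Persian form and the lowercased transliteration to a gender."""
    index: Dict[str, str] = {}
    for record in records:
        index[record.persian] = record.gender
        index[record.transliteration.lower()] = record.gender
    return index


def name_probability_female(name: str, index: Dict[str, str],
                            confidence: float = 0.9) -> float:
    """Return P(female) from a name lookup, or 0.5 when the name is unknown.

    `confidence` is an assumed constant, not a value reported by the paper.
    """
    gender = index.get(name.strip().lower())
    if gender is None:
        return 0.5
    return confidence if gender == "F" else 1.0 - confidence


def fuse(p_name: float, p_image: Optional[float] = None) -> float:
    """Combine two P(female) estimates, treating them as independent
    evidence under a uniform prior (a naive-Bayes-style product rule)."""
    if p_image is None:
        return p_name
    numerator = p_name * p_image
    denominator = numerator + (1.0 - p_name) * (1.0 - p_image)
    if denominator == 0.0:
        return 0.5  # the two sources contradict each other with certainty
    return numerator / denominator


if __name__ == "__main__":
    index = build_index(SAMPLE_RECORDS)
    p_name = name_probability_female("Maryam", index)
    # 0.8 stands in for the output of some image classifier (hypothetical).
    print(fuse(p_name, p_image=0.8))
```

Under the product rule, an uninformative image score of 0.5 leaves the name-based estimate unchanged, while two sources that agree reinforce each other. A real system would likely calibrate both sources against labeled data instead of using a fixed confidence.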