0% Complete
English
صفحه اصلی
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Prompt-Based Composed Fashion Image Retrieval via Gated Detail-Enhanced Dual Cross-Attention Difference Modeling
نویسندگان :
Kosar Keshavarz
1
Reza Azmi
2
1- دانشگاه الزهرا(س)
2- دانشگاه الزهرا(س)
کلمات کلیدی :
Composed image retrieval،Composed query،Contrastive learning،Fashion retrieval،Multimodal retrieval،Text-guided image retrieval
چکیده :
With the rapid growth of online shopping and the vast amount of fashion-related visual content on the internet, accurate methods for fashion image retrieval have become increasingly important to enhance user satisfaction. The fashion domain is inherently fine-grained, characterized by subtle details such as color, pattern, cut, and embellishments, where even small variations lead to distinct styles. To address the limitations of purely text-based or image-based queries, we adopt a text-guided retrieval approach in which a reference image and a natural-language description jointly define the user’s intent. This paper extends sentence-level prompt-based retrieval frameworks by introducing explicit image-difference modeling. The proposed Gated Detail-Enhanced Dual Cross-Attention (GDD-CA) module models the relationship between reference and target images through dual cross-attention and a gated detail-enhancement mechanism, enabling the network to capture subtle, fine-grained visual variations. Experimental results on the Fashion-IQ dataset demonstrate that integrating detail-enhanced image-difference modeling into the prompt-based structure improves retrieval performance, achieving a 1.14% gain in Recall over previous methods.
لیست مقالات
لیست مقالات بایگانی شده
Multi-Modal Longitudinal Tooth Labeling with Temporal Graph–Transformer Integration
Maral Mirza mohammadi - Mahdi Tarom
Kalman Filter–Based Anomaly Detection for User Authentication Failures in Enterprise Logs
Somayeh Soltani - Hossein Nikdel
کشف لبه در تصاویر پزشکی با استفاده از اتوماتای سلولی سلسله مراتبی
مریم علینقی زاده - علیرضا رضوانیان
سیستم پیشنهاددهنده غذای سالم با استفاده از داده کاوی عادت های تغذیه ای کاربران
محمد عباسی - مریم حسینی پزوه - محمدرضا شمس
AN EFFICIENT TASK SCHEDULING IN CLOUD COMPUTING BASED ON ACO ALGORITHM
Zahra Shafahi - Dr Alireza Yari
خوشه بندی شبکههای بیسیم ادهاک مبتنی بر محدودیتهای فازی
پروا کلیبری - کریم صمدزمینی
Two Novel Designs of Efficient Single-Bit Comparators in QCA Technology with Ultra-Low Energy Dissipation
Shobeir Fayazi - Hatam Abdoli
Enhancing Persian Speech Emotion Recognition with Contrastive Learning and Multimodal Fusion
Mobina Esmaeili - Vajiheh Sabeti
تشخیص حمله تزریق داده کاذب با روش OCD در شبکه هوشمند برق
محدثه جلیلی سنجرانی - سعید جلیلی - محمدکاظم شیخ الاسلامی
Video Steganography in HEVC Using Intra-Prediction Modes
Vahidreza Seirafian - Masoud Omomi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0