16th International Conference on Information and Knowledge Technology
Prompt-Based Composed Fashion Image Retrieval via Gated Detail-Enhanced Dual Cross-Attention Difference Modeling
Authors:
Kosar Keshavarz (1), Reza Azmi (2)
1- Alzahra University
2- Alzahra University
Keywords:
Composed image retrieval, Composed query, Contrastive learning, Fashion retrieval, Multimodal retrieval, Text-guided image retrieval
Abstract:
With the rapid growth of online shopping and the vast amount of fashion-related visual content on the internet, accurate methods for fashion image retrieval have become increasingly important to enhance user satisfaction. The fashion domain is inherently fine-grained, characterized by subtle details such as color, pattern, cut, and embellishments, where even small variations lead to distinct styles. To address the limitations of purely text-based or image-based queries, we adopt a text-guided retrieval approach in which a reference image and a natural-language description jointly define the user’s intent. This paper extends sentence-level prompt-based retrieval frameworks by introducing explicit image-difference modeling. The proposed Gated Detail-Enhanced Dual Cross-Attention (GDD-CA) module models the relationship between reference and target images through dual cross-attention and a gated detail-enhancement mechanism, enabling the network to capture subtle, fine-grained visual variations. Experimental results on the Fashion-IQ dataset demonstrate that integrating detail-enhanced image-difference modeling into the prompt-based structure improves retrieval performance, achieving a 1.14% gain in Recall over previous methods.
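The abstract reports its improvement as a gain in Recall without stating the cutoff. As a minimal sketch, assuming the metric is the usual Recall@K used for composed image retrieval (the fraction of queries whose ground-truth target image appears among the top K retrieved items), the evaluation could look like the following. The embeddings, item names, and helper functions here are hypothetical toy values for illustration only; they are not the GDD-CA model or the Fashion-IQ data.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_gallery(query_vec, gallery):
    """Return gallery ids sorted by descending similarity to the query."""
    return sorted(gallery, key=lambda gid: cosine(query_vec, gallery[gid]), reverse=True)


def recall_at_k(rankings, targets, k):
    """Fraction of queries whose ground-truth target is in the top k results."""
    hits = sum(1 for ranked, target in zip(rankings, targets) if target in ranked[:k])
    return hits / len(targets)


# Hypothetical gallery embeddings (assumption: any encoder could produce these).
gallery = {
    "dress_red": [1.0, 0.0, 0.0],
    "dress_blue": [0.0, 1.0, 0.0],
    "shirt_striped": [0.0, 0.0, 1.0],
}

# Hypothetical composed-query embeddings (reference image + text, already fused).
queries = [[0.9, 0.1, 0.0], [0.1, 0.6, 0.7], [0.2, 0.1, 0.9]]
targets = ["dress_red", "dress_blue", "shirt_striped"]

rankings = [rank_gallery(q, gallery) for q in queries]
recall_1 = recall_at_k(rankings, targets, 1)
recall_2 = recall_at_k(rankings, targets, 2)
```

In this toy setup the second query ranks its target second rather than first, so Recall@1 is lower than Recall@2. A reported gain such as the 1.14% above would correspond to a difference in this kind of score between two retrieval models evaluated on the same queries.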