0% Complete
فارسی
Home
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Prompt-Based Composed Fashion Image Retrieval via Gated Detail-Enhanced Dual Cross-Attention Difference Modeling
Authors :
Kosar Keshavarz
1
Reza Azmi
2
1- دانشگاه الزهرا(س)
2- دانشگاه الزهرا(س)
Keywords :
Composed image retrieval،Composed query،Contrastive learning،Fashion retrieval،Multimodal retrieval،Text-guided image retrieval
Abstract :
With the rapid growth of online shopping and the vast amount of fashion-related visual content on the internet, accurate methods for fashion image retrieval have become increasingly important to enhance user satisfaction. The fashion domain is inherently fine-grained, characterized by subtle details such as color, pattern, cut, and embellishments, where even small variations lead to distinct styles. To address the limitations of purely text-based or image-based queries, we adopt a text-guided retrieval approach in which a reference image and a natural-language description jointly define the user’s intent. This paper extends sentence-level prompt-based retrieval frameworks by introducing explicit image-difference modeling. The proposed Gated Detail-Enhanced Dual Cross-Attention (GDD-CA) module models the relationship between reference and target images through dual cross-attention and a gated detail-enhancement mechanism, enabling the network to capture subtle, fine-grained visual variations. Experimental results on the Fashion-IQ dataset demonstrate that integrating detail-enhanced image-difference modeling into the prompt-based structure improves retrieval performance, achieving a 1.14% gain in Recall over previous methods.
Papers List
List of archived papers
Paths-oriented Test Data Generation using Genetic Algorithm
Mohammad Reza Hassanpour Charmchi - Dr Bagher Rahimpour cami
An approach to model the optimal service provisioning in vehicular cloud networks
Farhoud Jafari Kaleibar - Maghsoud Abbaspour
تخلیهی باری وظایف اینترنت اشیاء بر روی مه محاسباتی با استفاده از الگوریتم حشره آبسوار
عفت تقی زاده بیلندی - آرش دلداری - علیرضا صالحان
A Nano-based High-Speed QCA circuit for Information Security with Image Masking
Saeid Seyedi - Hatam Abdoli
تحلیل داده های شهری با رویکرد هوش تجاری
دریا چراغی
Intra Class Feature Learning and Supervised Triplet Sampling for Deep Metric Learning
Hamideh Rafiee - Ahmad Ali Abin - Seyed Soroush Majd - Viet-Vu Vu
بررسی روش یادگیری انتقالی جهت پیشبینی پیوند
علی روحانی فر - کمال میرزایی بدرآبادی
Improving hypergraph attention and hypergraph convolution networks
Mustafa Mohammadi Gharasuie - Mahmood Shabankhah - Ali Kamandi
Fast Duplicate Bug Reports Detector Training using Sampling for Dimension Reduction
Behzad Soleimani Neysiani - Saeed Doostali - Seyed Morteza Babamir - Zahra Aminoroaya
A Demand Response Schema in Industry: Smart Scheduling Approach for Industrial Processes
Negin Shafinezhad - Hamid Abrishami - Maryam Mahmoodi
more
Samin Hamayesh - Version 42.5.2