0% Complete
فارسی
Home
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
PC-MCLD: Pose-Constrained and Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Authors :
Hanieh Fazli
1
Reza Azmi
2
1- دانشگاه الزهرا(س)
2- دانشگاه الزهرا(س)
Keywords :
pose-guided person image synthesis،latent diffusion model،texture consistency،adaptive feature fusion،fashion image generation
Abstract :
Pose-guided person image synthesis (PGPIS) aims to generate a person in a target pose while preserving identity and garment details, yet large pose variations often cause texture misalignment and loss of facial fidelity in existing diffusion models. We propose PC-MCLD, a latent diffusion framework that introduces (i) a pose-aware texture transfer constraint ensuring anatomically consistent correspondence between source and target regions, and (ii) an adaptive weighting mechanism that balances global appearance, garment texture, and facial identity cues during generation. Experiments on the DeepFashion In-Shop benchmark show clear improvements over a reproduced MCLD baseline. At 176×256, PC-MCLD reduces FID by 1.39% and LPIPS by 8.24%; at 352×512, the gains increase to 2.53% in FID and 19.48% in LPIPS. These results demonstrate that PC-MCLD enhances both perceptual quality and structural fidelity under challenging pose changes.
Papers List
List of archived papers
An Optimized GBDT-Based Model Using SMOTE for Effective Diagnosis of Coronary Heart Disease
Elahe Moradi - Mohammad Javadian
A New Method Based on Deep Learning and Time Stabilization of the Propagation Path for Fake News Detection
Fatemeh Torgheh - Dr Mohammad Reza Keyvanpour - Dr Behrooz Masoumi
دستهبندی متون خبری فارسی با یادگیری فعال
مینا طباطبائی - دکتر سعیده ممتازی
Persian deaf sign language recognition system using deep learning
Mohammad Ebrahimi
ارزیابی و برنامهریزی اجرای پیشنهادی هوش مصنوعی در صنعت پتروشیمی ایران
امین رضا انصاری - احد قائمی - سید مهدی کوچک کوثری
یک سیستم پاسخ به نفوذ در شبکه های اینترنت اشیاء با استفاده از شبکه های مبتنی بر نرم افزار
احسان شاهرخی مینا - رضا محمدی - محمد نصیری
Sentiment Analysis of the Amazon Customers Using the BiGRU Neural Network Enhanced by Attention Mechanism
Sara Sinan Salman al-Abedi - Keyvan Mohebbi
OENMOP: Loss-Aware 4×4 and 5×5 and Scalable Non‑blocking Optical Switches Designed for Odd-Even Routing Algorithm for Chip-Scale Interconnection Networks
Negin Bagheri Renani - Elham Yaghoubi - Mina Mohammadirad
بهبود کارایی بارسپاری در شبکه های سلولی با استفاده از ارتباطات مشارکتی در لایه MAC
نبیل الراشدی - رسول صادقی - وائل حسین اللامی - مهدی حمیدخانی
Vi-Net: A Deep Violent Flow Network for Violence Detection in Video Sequences
Tahereh Zarrat Ehsan - Seyed Mehdi Mohtavipour
more
Samin Hamayesh - Version 43.8.0