0% Complete
English
صفحه اصلی
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
PC-MCLD: Pose-Constrained and Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
نویسندگان :
Hanieh Fazli
1
Reza Azmi
2
1- دانشگاه الزهرا(س)
2- دانشگاه الزهرا(س)
کلمات کلیدی :
pose-guided person image synthesis،latent diffusion model،texture consistency،adaptive feature fusion،fashion image generation
چکیده :
Pose-guided person image synthesis (PGPIS) aims to generate a person in a target pose while preserving identity and garment details, yet large pose variations often cause texture misalignment and loss of facial fidelity in existing diffusion models. We propose PC-MCLD, a latent diffusion framework that introduces (i) a pose-aware texture transfer constraint ensuring anatomically consistent correspondence between source and target regions, and (ii) an adaptive weighting mechanism that balances global appearance, garment texture, and facial identity cues during generation. Experiments on the DeepFashion In-Shop benchmark show clear improvements over a reproduced MCLD baseline. At 176×256, PC-MCLD reduces FID by 1.39% and LPIPS by 8.24%; at 352×512, the gains increase to 2.53% in FID and 19.48% in LPIPS. These results demonstrate that PC-MCLD enhances both perceptual quality and structural fidelity under challenging pose changes.
لیست مقالات
لیست مقالات بایگانی شده
A Deep Learning Framework for Phase-Aware Feature Representation to Improve Sound Source Direction and Distance Estimation
Zahra Abolfazli - Hamid Reza Abutalebi
بررسی کارآمدی فناوری وب 0.2 در پشتیبانی از فرآیندهای انسان محور و دانش مبنا
سید احسان ملیحی - فاطمه مشایخی کردکلا
Integrating Wasserstein GANs for High-Speed Transformer-Based Neural Machine Translation
Parisa Nekoogol - Mostafa Salehi
Improving Training Stability in Variational Autoencoders Through the Integration of Score Matching Loss
Amirreza Mokhtari Rad - Pouya Ardehkhani - Hormehr Alborzi
Hardware Imperfection Effects in Wireless Virtual Reality System with Hybrid Beamforming
Nasim Alikhani - Abbas Mohammadi
ML-based Optical Fibre Fault Detection in Smart Surveillance and Traffic Systems
Rushil Patel - Sana Narmawala - Nikunjkumar Mahida - Rajesh Gupta - Sudeep Tanwar - Hossein Shahinzadeh
Targeted Vaccination for COVID-19 Using Mobile Communication Networks
Mohammadmohsen Jadidi - Pegah Moslemi - Saeed Jamshidiha - Iman Masroori - Abbas Mohammadi - Vahid Pourahmadi
Robustness Gap in NLP Models for Vulnerability Descriptions: Benchmarking and Data Augmentation
AmirHossein Majd - Mahdi Yousefikia - Saghar Ghasemzadeh - Amirreza Asari - Arya Khoshnavataher - Seyedeh Leili Mirtaheri
A Novel Decentralized Privacy Preserving Federated Learning Model for Healthcare Applications
Saba Ameri - Reza Ebrahimi Atani
A Demand Response Schema in Industry: Smart Scheduling Approach for Industrial Processes
Negin Shafinezhad - Hamid Abrishami - Maryam Mahmoodi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0