0% Complete
English
صفحه اصلی
/
سیزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
A Comparison between Slimed Network and Pruned Network for Head Pose Estimation
نویسندگان :
Amir Salimiparsa
1
Hadi Veisi
2
Mohammad-shahram Moin
3
1- دانشگاه تهران
2- دانشگاه تهران ٫ دانشکده علوم و فنون نوین
3- پژوهشگاه ارتباطات وفناوری اطلاعات
کلمات کلیدی :
Head pose estimation،MobileNet،Pruning،Quantization،Deep neural networks
چکیده :
Head pose estimation is a critical problem with a wide range of applications. There are many methods that almost solved head pose estimation problems but they are computationally expensive and not suitable for edge devices and embedded systems. In this paper, a deep learning network based on a modified MobileNetV3 architecture is proposed to reduce the computational cost with results comparable to heavy methods. The proposed method is pruned to achieve even less computational cost and results in a network that is more ideal for edge devices and smartphones. The architecture used is MobileNetV3Small which has more inverted residual blocks, making it able to inherit MobileNetV3Large performance but with less width, followed by dense layers. Pruning is enhanced by estimating layer importance and resource reallocation, in order for the informative layers to be less affected by pruning and also to improve performance. In the experiments, the proposed model performs better than many existing heavies with 3.46 MAE before the pruning and 3.61 MAE after the pruning, even though the model has six times fewer parameters than the others and its inference time is about 7ms.
لیست مقالات
لیست مقالات بایگانی شده
UltraLearn: Next-Generation CyberSecurity Learning Platform
Saeed Raisi - Saeid Ghasemshirazi - Ghazaleh Shirvani
LLM-Driven Feature Extraction for Stock Market Prediction: A case study of Tehran Stock Exchange
Siavash Hosseinpour Saffarian - Saman Haratizadeh
An OWA-Powered Dynamic Customer Churn Modeling in the banking industry Based on Customer Behavioral Vectors
Masoud Alizadeh - Mohammad Soleymannejad - Behzad Moshiri
Enhancing Supervised Learning in Speech Emotion Recognition through Unsupervised Representations
Niloufar Faridani - Amirali Soltani Tehrani - Ramin Toosi
بهبود کارایی بارسپاری در شبکه های سلولی با استفاده از ارتباطات مشارکتی در لایه MAC
نبیل الراشدی - رسول صادقی - وائل حسین اللامی - مهدی حمیدخانی
Silicon photonic microring resonators: A Novel optical router based on Negative-First routing algorithm
Negin Bagheri Renani - Elham Yaghoubi
Video Steganography in HEVC Using Intra-Prediction Modes
Vahidreza Seirafian - Masoud Omomi
GanjNet: Leveraging Network Modeling with Large Language Models for Persian Word Sense Induction
Amir Mohammad Kouyeshpour - Hadi Veisi - Saman Haratizadeh
رویکردی در تشخیص خودکار بوهای بد در مدل های معماری سازمانی با استفاده از تحلیل گرافی
زهرا رحیمی تمندگانی - شهره آجودانیان
Smart City Standardized Evaluation :Use Case of Mashhad
Dr ُSeyed Mohammadreza Mirsarraf - Dr Alireza Yari - Dr Navid Zohdi - Ali Motevalizadeh
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.3.1