0% Complete
فارسی
Home
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Detection of Backdoor Attacks in Neural Networks Using Input Optimization
Authors :
Parsa Hashemi Khorsand
1
Ahmad Nickabadi
2
1- Amirkabir University of Technology (Tehran Polytechnic)
2- Amirkabir University of Technology (Tehran Polytechnic)
Keywords :
backdoor attacks،adversarial robustness،backdoor detection،model contamination detection،input optimization،regularization
Abstract :
This paper presents a clean-data-free framework for detecting backdoor attacks in neural networks via input optimization. We introduce two complementary strategies. First, joint input optimization with a cleanliness detector: for each label, we optimize an input that simultaneously (i) maximizes the target-label logit on the suspected model and (ii) maintains in-domain naturalness according to an auxiliary diagnostic model; the resulting patterns are then inspected for trigger-like artifacts. Second, input optimization with the largest feasible regularization coefficient: for each label, we find the largest feasible regularization coefficient that still attains a preset confidence threshold, forming a per-class signature vector; Median Absolute Deviation (MAD) is then used to flag outlier labels as compromised. On MNIST, our framework achieves 89.5 percent detection accuracy on backdoored models with 100 percent recall in poisoned-label flagging, while requiring no access to clean training data. We further compare our methods with Neural Cleanse and the Certified Backdoor Detector (CBD).
Papers List
List of archived papers
A Comparative Evaluation of Machine Learning Models for Anomaly-Based IDS in IoT Networks
Seyed Amir Mousavi - Mostafa Sadeghi - Mohammad Sadeq Sirjani
ارائه یک مدل تصمیم گیری چند معیاره فازی به منظور بهبود دقت فرایند تصمیم گیری به هنگام اختلال هوانوردی
فاطمه عطا عبدالرزاق - نگار مجمع
Automatic identification and reconstruction of Tuberculosis in microscopic images using convolutional auto-encoder network
Ahmad Reza Nadafi - Farahnaz Mohanna
بهبود معاملات الگوریتمی سهام مبتنی بر رویکرد یادگیری تقویتی
مها العطوان - جعفر پورامینی
بررسی روشها، مجموعههای داده و معیارهای ارزیابی در حوزهی پرسش از متون درون تصویر
کبری فرشیدی - حسن ختنلو - محرم منصوری زاده - الهام علی قارداش
Automatic Analysis of Inconsistencies in Inter-Enterprise Business Processes: Introducing a Formal Adaptation Patterns Catalog
Somayeh Ashourian - Shohreh َAjoudanian
A Deep Neural Network-based Method for MmWave Time-varying Channel Estimation
Amirhossein Molazadeh - Zahra Maroufi - Mehrdad Ardebilipour
Two Novel Designs of Efficient Single-Bit Comparators in QCA Technology with Ultra-Low Energy Dissipation
Shobeir Fayazi - Hatam Abdoli
3D Mesh ONoC: Design of low Insertion Loss and Non-blocking Optical Router and Efficient Routing Algorithm
Sanaz Asadinia - Elham Yaghoubi - Mostafa Sadeghi - Mahdi Mehrabi
ارزیابی و برنامهریزی اجرای پیشنهادی هوش مصنوعی در صنعت پتروشیمی ایران
امین رضا انصاری - احد قائمی - سید مهدی کوچک کوثری
more
Samin Hamayesh - Version 43.8.0