0% Complete
فارسی
Home
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Detection of Backdoor Attacks in Neural Networks Using Input Optimization
Authors :
Parsa Hashemi Khorsand
1
Ahmad Nickabadi
2
1- Amirkabir University of Technology (Tehran Polytechnic)
2- Amirkabir University of Technology (Tehran Polytechnic)
Keywords :
backdoor attacks،adversarial robustness،backdoor detection،model contamination detection،input optimization،regularization
Abstract :
This paper presents a clean-data-free framework for detecting backdoor attacks in neural networks via input optimization. We introduce two complementary strategies. First, joint input optimization with a cleanliness detector: for each label, we optimize an input that simultaneously (i) maximizes the target-label logit on the suspected model and (ii) maintains in-domain naturalness according to an auxiliary diagnostic model; the resulting patterns are then inspected for trigger-like artifacts. Second, input optimization with the largest feasible regularization coefficient: for each label, we find the largest feasible regularization coefficient that still attains a preset confidence threshold, forming a per-class signature vector; Median Absolute Deviation (MAD) is then used to flag outlier labels as compromised. On MNIST, our framework achieves 89.5 percent detection accuracy on backdoored models with 100 percent recall in poisoned-label flagging, while requiring no access to clean training data. We further compare our methods with Neural Cleanse and the Certified Backdoor Detector (CBD).
Papers List
List of archived papers
An efficient hybrid approach for performance-based alternative design evaluation in systems engineering
Abbas Chaman Para - Maryam Nooraei Abadeh - Sondos Bahadori
تشخیص و جلوگیری از حمله انعکاسی/تقویتی SSDP در شبکه های نرم افزار محور مبتنی بر 4P با استفاده از الگوریتم های یادگیری ماشین
امیرحسین کرمی - رضا محمدی
Movable Antenna Design for UAV-Aided Federated Learning via Deep Reinforcement Learning
MOHSEN Ahmadzadeh - Saeid Pakravan - Ghosheh Abed Hodtani
A No-Code Platform for Developing Customizable Recommender Systems for Restaurants
Moein-Aldin AliHosseini - MohammadReza Sharbaf
استفاده از هوش مصنوعی در فضای آموزش عالی: آن روی سکه
محمدمتین لیث صفار - عسل آغاز
Improving Training Stability in Variational Autoencoders Through the Integration of Score Matching Loss
Amirreza Mokhtari Rad - Pouya Ardehkhani - Hormehr Alborzi
حفظ حریم خصوصی در انتشار نسخه های متوالی دادههای شبکه اجتماعی با امکان افزایش یال
طاهره سرزهی - دکتر مهری رجایی طاهره سرزهی - مهری رجایی -
OENMOP: Loss-Aware 4×4 and 5×5 and Scalable Non‑blocking Optical Switches Designed for Odd-Even Routing Algorithm for Chip-Scale Interconnection Networks
Negin Bagheri Renani - Elham Yaghoubi - Mina Mohammadirad
Impact of ICT and Digital Evolution on Capital Structure in Companies
Ali Noori
Recommendation Systems in Smart Agriculture: Pathway to a well-designed system
Ahmad Nameni - Amir Ghafarian Daneshmand - Omid Mahdi Ebadati E
more
Samin Hamayesh - Version 42.5.2