0% Complete
فارسی
Home
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
A Framework for Systematic Stability Assessment of Post-hoc Explanations in Text Classification
Authors :
Parman Mohammadalizadeh
1
Parham Mohammadalizadeh
2
Ayda Mahmoudian
3
1- دانشگاه زنجان
2- پژوهشگر مستقل
3- پژوهشگر مستقل
Keywords :
Explainable AI،Explainability Evaluation،Natural Language Processing
Abstract :
Post-hoc explanation methods are widely adopted for interpreting neural text classifiers, yet lack standardized evaluation of their stability under input perturbations. We present a systematic framework for assessing explanation stability through three categories of stress tests: preprocessing variations, semantic paraphrasing, and explainer seed variations. The framework combines quantitative metrics (Jaccard similarity, Spearman correlation, attribution differences) with automated stability card generation for standardized reporting. We evaluate Integrated Gradients, LIME, and SHAP across four model-dataset combinations spanning sentiment analysis and topic classification. Results reveal nuanced stability patterns, including the decoupling of model capacity from explanation reliability and architecture-dependent vulnerability to perturbation types. Our open-source implementation supports standard transformer models and explanation libraries, establishing practical stability assessment as a reproducible evaluation standard for NLP explainability research.
Papers List
List of archived papers
Attention-Enhanced Ensemble Learning for Automated Stenosis Detection in X-ray Coronary Angiography Videos
Marzieh Sadat Hosseini - Ahmad R. Naghsh-Nilchi - Mehran Safayani - Masoumeh Sadeghi
Using Trust Statements and Ratings by GraphSAGE to Alleviate Cold Start in Recommender Systems
Seyedeh Niusha Motevallian - Dr Seyed Mohammad Hossein Hasheminejad
Persian Language Understanding in Task-oriented Dialogue System for Online Shopping
Zeinab Borhanifard - Hossein Basafa - Seyedeh Zahra Razavi - Heshaam Faili
A parallel approach to the fractional time delay model for predicting the spread of COVID-19
Mahdi Movahedian Moghaddam - Kourosh Parand
Extending Interaction Flow Modeling Language as a Profile for Form-making Systems
Ghazaleh Shahin - Dr Bahman Zamani
Handling Data Heterogeneity in Federated Medical Images Classification
Alireza Maleki - Hassan Khotanlou
مدیریت دانش هوشمند مبتنی بر بازیابی-تولید افزوده شده : معماری، ارزیابی و حاکمیت برای دستیار دانش سازمانی
محمدهادی صفری نادری
بیشینهسازی تأثیر در شبکههای اجتماعی بر اساس فعالیت کاربران
فاطمه جعفری - علیرضا رضوانیان
مروری بر الگوریتمهای انتخاب مشتری در یادگیری فدرال
عطیه منعمی بیدگلی - رضا مهدوی
Silicon photonic microring resonators: A Novel optical router based on Negative-First routing algorithm
Negin Bagheri Renani - Elham Yaghoubi
more
Samin Hamayesh - Version 42.5.2