Machine learning models have shown high accuracy for phishing URL detection, yet their robustness under distribution shifts, cross-dataset generalization, and interpretability remain underexplored in practical cybersecurity settings. The study evaluates multiple supervised learning classifiers through testing on various phishing URL datasets which exist in public domain and through their assessment of adversarial perturbation scenarios. Our research evaluates how model performance decreases when we test on new datasets which differ from the original training data while we develop quantitative robustness metrics to measure adversarial resistance. The implementation of explainable AI techniques such as SHAP enables us to interpret model decisions while we analyze how dataset and attack variations affect feature importance. Our results demonstrate that performance and interpretability of systems show considerable changes when tested across different datasets and facing various adversarial attacks which creates a need for development of robust design frameworks to protect phishing detection systems. We propose evaluation guidelines for future research while we show how explainability increases trust and operational understanding without decreasing detection performance.
Building similarity graph...
Analyzing shared references across papers
Loading...
Tarun Chowdary Yegi
Indian Institute of Engineering Science and Technology, Shibpur
Building similarity graph...
Analyzing shared references across papers
Loading...
Tarun Chowdary Yegi (Mon,) studied this question.
www.synapsesocial.com/papers/69df2cb9e4eeef8a2a6b1e8a — DOI: https://doi.org/10.5281/zenodo.19555233
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: