March 3, 2026Open Access

Re-Evaluating Android Malware Detection: Tabular Features, Vision Models, and Ensembles

Key Points

LightGBM model on extended EMBER features shows superior performance compared to all other models, including state-of-the-art methods.
The evaluation uses a large corpus of Android applications with paired static representations for accurate testing.
Two tabular models with EMBER features and two vision-based models utilizing malware images were assessed in the analysis.
Ensemble approaches yield only modest improvements in performance, incurring significant computational and engineering costs.

Abstract

Static, machine learning-based malware detection is widely used in Android security products, where even small increases in false-positive rates can impose significant burdens on analysts and cause unacceptable disruptions for end users. Both tabular features and image-based representations have been explored for Android malware detection. However, existing public benchmark datasets do not provide paired tabular and image representations for the same samples, limiting direct comparisons between tabular models and vision-based models. This work investigates whether carefully engineered, domain-specific tabular features can match or surpass the performance of state-of-the-art deep vision models under strict false-positive-rate constraints, and whether ensemble approaches justify their additional complexity. To enable this analysis, we construct a large corpus of Android applications with paired static representations and evaluate six popular machine learning models on the exact same samples: two tabular models using EMBER features, two tabular models using extended EMBER features, and two vision-based models using malware images. Our results show that a LightGBM model trained on extended EMBER features outperforms all other evaluated models, as well as a state-of-the-art approach trained on a much larger dataset. Furthermore, we develop an ensemble model combining both tabular and vision-based detectors, which yields a modest performance improvement but at the cost of substantial additional computational and engineering overhead.

Bookmark

View Full Paper

Cite This Study

Dayananda et al. (Tue,) studied this question.

synapsesocial.com/papers/69a75b93c6e9836116a23185 https://doi.org/https://doi.org/10.3390/electronics15030544

Bookmark

View Full Paper