For promoting the interpretability of Artificial Intelligence (AI) models, the methods of feature visualization, including Activation Maximization, help people understand how intractable Deep Neural Networks work by visualizing the representations of specific neurons. Meanwhile, the incredible-sounding ideas of reconstructing the input of an AI model through its output, or reconstructing the training data through simple access to the model, are indeed feasible. In fact, in order to explore the privacy leakage in AI models, model inversion techniques intend to reconstruct the private data through black-box or white-box access to AI models. Feature visualization and model inversion share a very similar framework and, in our point of view, this framework has great potential to be exploited for both beneficial and harmful intentions. In this paper, we uniformly refer to such operations of reconstructing data reversely as feature inversion. We will demonstrate feature inversion through a comprehensive analysis of model inversion and feature visualization, which are usually contradictory for the model trainer, as feature visualization boosts the interpretability of AI models while model inversion threatens privacy.
Building similarity graph...
Analyzing shared references across papers
Loading...
Muhammad Luqman Naseem
Zipeng Ye
Zhou Qi
Complex & Intelligent Systems
Harbin Institute of Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Naseem et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69d894326c1944d70ce0519d — DOI: https://doi.org/10.1007/s40747-026-02294-4