The Anatomical Therapeutic Chemical (ATC) code is a drug classification system that indicates the therapeutic potential use of a compound. Predicting ATC codes for drugs using automatic approaches is key to guide clinical trials and for drug repurposing. However, such automatic assignment is challenging due to the hierarchical organization of the code in four levels, possible polypharmacological behavior, and the imbalance and scarcity of annotated data in relation to the large number of compounds and possible ATC codes a drug may have. In this work, we propose a novel multimodal generative approach for predicting ATC codes, which leverages molecular information using a sequence-to-sequence architecture. Our hypothesis explores the idea that describing the chemical structure of the input compounds using two different representations, i.e., modes, the SMILES code and its molecular descriptors, provides complementary information, hence improving the accuracy of the predictions. Furthermore, given the multilabel nature of generative sequence-based models, we also present an additional prediction method to determine when to stop generating ATC labels for each compound. We compared the performance of our proposed methods against several baselines, both for new drugs and in drug repurposing tasks. In all of these cases, the superior performance of our multimodal proposals is clearly demonstrated. The source code and different data sets used to train and evaluate the models are made publicly available.
Building similarity graph...
Analyzing shared references across papers
Loading...
Trinidad Crozes
Eugenia Ulzurrun
Juan A. Páez
Journal of Chemical Information and Modeling
Consejo Nacional de Investigaciones Científicas y Técnicas
Centro de Investigaciones Biológicas Margarita Salas
Universidad Nacional del Sur
Building similarity graph...
Analyzing shared references across papers
Loading...
Crozes et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69e07dad2f7e8953b7cbe9da — DOI: https://doi.org/10.1021/acs.jcim.6c00118