Background/aim: Pyrazole derivatives are of growing interest due to their diverse pharmacological activities. However, their biological activity is often highly sensitive to subtle structural modifications. Existing quantitative structure–activity relationships (QSAR) approaches frequently fail to capture the conformational flexibility and nonlinear structure–activity relationships (SAR) of such heterocyclic scaffolds, creating a gap in the accurate prediction of their biological profiles. Therefore, there is a strong need for more robust and predictive computational frameworks. This study addresses this gap by integrating four-dimensional (4D)-QSAR descriptors with hybrid machine learning (ML) techniques to improve predictive accuracy and provide a more reliable tool for structure-based drug design. In this work, it was aimed to investigate the SAR of a series of pyrazole-based compounds using this advanced integrative computational strategy. Materials and methods: The dataset consisted of 54 pyrazole derivatives, of which 50 compounds were used for model construction and 4 compounds were reserved as a test set for validation. Although the test set was limited in size, the selected compounds were structurally representative of the training set, sharing the same core scaffold while covering different substitution patterns and biological activity values. The 4D-QSAR approach included multiple conformations of each compound and utilized matrix-based representations of geometric and electronic properties to capture dynamic molecular behavior. A pharmacophore model was generated using EMRE software based on the spatial and electronic features of used compounds. EMRE is an in-house software developed by our research group. It has been employed in several previously published 4D-QSAR studies for electron-conformational matrix of contiguity construction, pharmacophore modeling, descriptor matrix generation, and activity prediction. EMRE operates on standard geometric and electronic descriptors derived from quantum-chemical calculations, ensuring methodological transparency and reproducibility despite its proprietary implementation. Comparable performance trends obtained with EMRE-based 4D-QSAR models have been reported in previous studies, supporting the validity of the software for pharmacophore-driven QSAR analysis (Şahin et al., 2011; Sahin and Saripinar, 2020; Sahin et al., 2021). Using this framework, a total of 204 molecular descriptors were computed using Spartan 07. To reduce redundancy and prevent overfitting, descriptor selection was optimized through a genetic algorithm (GA)-based procedure (Fernandez et al., 2011), and only statistically significant descriptors with low intercorrelation were retained for model construction. Subsequently, multiple ML algorithms, including artificial neural network, decision tree, and hybrid models, were evaluated to enhance prediction accuracy. Results: Among all the tested models, the gradient boosting machine and random forest (GBM+RF) hybrid algorithm yielded
Building similarity graph...
Analyzing shared references across papers
Loading...
KADER ŞAHİN
Serhat Kiliçarslan
Serdar Durdağı
TURKISH JOURNAL OF BIOLOGY
Building similarity graph...
Analyzing shared references across papers
Loading...
ŞAHİN et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69a3d811ec16d51705d2ea46 — DOI: https://doi.org/10.55730/1300-0152.2789