ABSTRACT Protein aggregation drives proteinopathies ranging from ALS to systemic amyloidosis, yet the multiscale determinants bridging sequence, structure, and kinetics remain elusive. We present SKALE, an interpretable machine learning framework that integrates sequence motifs, AlphaFold‐derived structural descriptors, and experimental kinetics to decode aggregation mechanisms. SKALE identifies latent hotspots that evade conventional tools and matches high‐performing neural baselines while preserving computational efficiency. In ALS‐linked SOD1 G86R, the model isolates a risk region at residues 72–91 where preserved β‐sheet geometry coincides with weakened hydrogen bonding to drive nucleation. Similarly, analysis of TDP‐43 S332N reveals that a locally unwound helix increases surface exposure, a prediction validated by showing that targeted deletion of model‐identified regions significantly reduces cellular aggregation. The framework generalizes to Tau P301L and PRNP variants where it uncovers distal aggregation‐prone regions to discriminate pathogenic drivers from neutral mutations. Interpretability analysis further disentangles global from mutation‐local mechanisms to reveal that β‐sheet propensity acts as a shared determinant while hydrogen bond dynamics define specific routes to nucleation. These findings establish SKALE as a scalable, disease‐agnostic engine that combines high‐fidelity prediction with biophysical resolution to decode the molecular logic of misfolding and guide therapeutic design.
Building similarity graph...
Analyzing shared references across papers
Loading...
Wei Xuan Wilson Loo
Jia Shen Sio
Keyin Yap
SHILAP Revista de lepidopterología
Aggregate
Monash University Malaysia
Chinese Institute for Brain Research
Sunway University
Building similarity graph...
Analyzing shared references across papers
Loading...
Loo et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69a75b7fc6e9836116a22edc — DOI: https://doi.org/10.1002/agt2.70280