The exploration of extremophiles, microorganisms that thrive in extreme environments, is crucial for advancing biotechnological applications and understanding the limits of life. However, traditional methods for identifying extremophiles are labor-intensive and inefficient. Here we introduce iExtreme, a machine learning model that accurately predicts extremophile characteristics using a Support Vector Machine (SVM) framework based on k-mer features of nucleotides and codon combinations extracted from genome sequences. Our model, trained on a curated data set of 1030 extremophilic genomes, achieves accuracies of 0.988, 0.939, and 0.938 in identifying halophiles, thermophiles, and pH-philes, respectively. Using iExtreme, we identified 520 novel extremophilic species and 5255 genomes in various databases, as well as a significant number of novel extremozymes via structure-based protein clustering, including D-psicose 3-epimerases (DPEase) and α-amylases. These results demonstrate the usefulness of iExtreme.
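To make the feature construction concrete, the following is a minimal sketch of how k-mer features might be computed from a sequence. It is not the iExtreme implementation: the function names `kmer_frequencies` and `codon_pair_frequencies` are hypothetical, the choice of k is arbitrary, and reading "codon combinations" as adjacent in-frame codon pairs is an assumption made for illustration only.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGT"


def kmer_frequencies(seq: str, k: int = 3) -> list[float]:
    """Normalized overlapping k-mer frequencies in fixed lexicographic order.

    Hypothetical helper: k-mers containing symbols outside ACGT (e.g. N)
    are ignored rather than redistributed.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    vocab = ["".join(p) for p in product(ALPHABET, repeat=k)]
    total = sum(counts[w] for w in vocab)
    return [counts[w] / total if total else 0.0 for w in vocab]


def codon_pair_frequencies(cds: str) -> dict[str, float]:
    """Frequencies of adjacent codon pairs in an in-frame coding sequence.

    Assumption: "codon combinations" is interpreted here as adjacent
    codon pairs; the paper may define them differently.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # drop a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    pairs = Counter(a + b for a, b in zip(codons, codons[1:]))
    total = sum(pairs.values())
    return {p: c / total for p, c in pairs.items()} if total else {}


if __name__ == "__main__":
    # Toy sequences, not real genomic data.
    print(len(kmer_frequencies("ACGTACGTTTGA", k=3)))
    print(codon_pair_frequencies("ATGAAATTTATG"))
```

Fixed-length vectors of this kind, one per genome, could then be passed to an SVM classifier such as scikit-learn's `sklearn.svm.SVC`, with one classifier per trait (halophile, thermophile, pH-phile). The kernel, hyperparameters, and feature set actually used by iExtreme are not stated in the abstract.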
L D Liu
Sun Yat-sen University
Hong HUANG
Yu Zhang
Environmental Science & Technology
Hong Kong University of Science and Technology
Shantou University
Key Laboratory of Guangdong Province
Liu et al. studied this question.
synapsesocial.com/papers/6a21ca1d3f99faaa70ecaf8a — DOI: https://doi.org/10.1021/acs.est.5c17912