What does this research mean for the field?

Multi-context seeds (MCS) improve the accuracy of read mapping in sequence similarity searches with little cost in runtime and no memory overhead. The result is classified as a novel finding, and its alignment with existing consensus is rated neutral.

What question did this study set out to answer?

The study aims to improve read mapping accuracy while maintaining fast search speeds through the introduction of multi-context seeds.

March 2, 2026 · Open Access

Multi-context seeds enable fast and high-accuracy read mapping

Key Points

Introduced a novel indexing structure allowing storage of seeds with different lengths.
Implemented multi-context seeds in the strobealign algorithm.
Compared performance metrics of strobealign with and without multi-context seeds.
Multi-context seeds substantially improved read mapping accuracy.
Runtime increased only slightly with the new implementation.
No additional memory was required compared to the previous version.

Abstract

A key step in sequence similarity search is to identify shared seeds between a query and a reference sequence. A well-known tradeoff is that longer seeds offer fast searches but reduce sensitivity in variable regions. We introduce multi-context seeds (MCS), which allow the storage of seeds with different lengths in the same index structure, thus retaining the advantages of both short and long seeds. We demonstrate the applicability of MCS by implementing them in strobealign. Strobealign with MCS substantially improves accuracy compared to the previous version with little cost in runtime and no memory overhead.
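The idea of keeping seeds of different lengths in one index can be pictured with a small sketch. The code below is an illustration under stated assumptions, not strobealign's implementation: the bit split (`CONTEXT_BITS`), the function names (`pack`, `build_index`, `lookup`), and the choice of a sorted array are all hypothetical. It assumes each seed's key places the hash of a short seed in the high bits and the hash of its extended context in the low bits. A query first looks for the full key (a long-seed match) and, if that fails, falls back to any entry that shares the high bits (a short-seed match).

```python
from bisect import bisect_left, bisect_right

# Hypothetical split: the low CONTEXT_BITS bits hold the extended-context hash,
# the remaining high bits hold the short-seed hash.
CONTEXT_BITS = 16
CONTEXT_MASK = (1 << CONTEXT_BITS) - 1


def pack(base_hash: int, context_hash: int) -> int:
    """Combine a short-seed hash (high bits) with a context hash (low bits)."""
    return (base_hash << CONTEXT_BITS) | (context_hash & CONTEXT_MASK)


def build_index(seeds):
    """Build a sorted index from (base_hash, context_hash, position) tuples."""
    entries = sorted((pack(b, c), pos) for b, c, pos in seeds)
    keys = [key for key, _ in entries]
    positions = [pos for _, pos in entries]
    return keys, positions


def lookup(index, base_hash: int, context_hash: int):
    """Try a full-key match first, then fall back to the short-seed prefix."""
    keys, positions = index

    # Long-seed lookup: both the short seed and its context must agree.
    full = pack(base_hash, context_hash)
    lo, hi = bisect_left(keys, full), bisect_right(keys, full)
    if lo < hi:
        return "full", positions[lo:hi]

    # Short-seed fallback: accept every entry that shares the high bits.
    # These entries are contiguous because the index is sorted by key.
    lo = bisect_left(keys, base_hash << CONTEXT_BITS)
    hi = bisect_right(keys, (base_hash << CONTEXT_BITS) | CONTEXT_MASK)
    if lo < hi:
        return "partial", positions[lo:hi]

    return "none", []


index = build_index([(5, 100, 10), (5, 200, 20), (7, 100, 30)])
print(lookup(index, 5, 100))  # long-seed match
print(lookup(index, 5, 999))  # context differs, short-seed fallback
print(lookup(index, 6, 100))  # no match at either level
```

Because both lookups run against the same sorted array, a layout like this needs no second index for the shorter seeds, which is in line with the abstract's claim of no memory overhead. The fallback costs one extra binary search, and only when the full key is missing.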

Cite This Study

Tolstoganov et al. studied this question.

synapsesocial.com/papers/69a52de5f1e85e5c73bf1048 https://doi.org/10.1186/s13059-026-04017-x
