What question did this study set out to answer?

The research aims to improve in vivo predictions of transcription factor binding sites through modeling strategies.

May 8, 2026Open Access

Modeling strategies for in vivo transcription factor binding predictions

Puntos clave

The research aims to improve in vivo predictions of transcription factor binding sites through modeling strategies.
Analyzed various modeling techniques for predicting in vivo transcription factor binding sites.
Developed a cross-TF transfer learning scheme to enhance generalizability.
Evaluated model performance using DNA accessibility, TF RNA expression, and binding motifs.
Achieved a mean AUPR of 0.36 in a challenging cross-TF, cross-cell-type, and cross-chromosomal setting.
Demonstrated that model ensembling and DNA language model embeddings enhance prediction performance.
Identified ground truth ChIP-seq data quality as a key factor influencing model accuracy.

Resumen

Abstract Identification of in vivo transcription factor (TF) binding sites is crucial to understand gene regulation, but the lack of scalability in their experimental identification directs researchers towards computational models. These models are often specific for a given TF, which hinders their generalizability to held-out TFs. In this work, we analyse different modeling strategies to predict in vivo TF binding sites using DNA accessibility, TF RNA expression and binding motif features. We present and test a cross-TF transfer learning scheme that allows learning from the entire training set. We show that model ensembling and DNA language model embeddings increase model performance. We provide an analysis of feature importance and show that ground truth ChIP-seq data quality is an important determinant of model performance. We also test our models in an independent dataset of held-out TFs, and report a mean AUPR of 0.36 in a very challenging cross-TF, cross-cell-type and cross-chromosomal setting, providing estimates of binding for TFs without available ChIP-seq experiments.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo