Characterizing the regulatory logic of transcriptional control at the DNA sequence level by ensembles of thermodynamic models

ORAL

Abstract

Characterizing how the genome encodes the regulatory logic of

transcription is a main challenge of the post-genomic era, which

can be overcome with the aid of theoretical physics tools. Gene

regulation is crucial for embryo development robustness. E.g. pattern

formation along the antero-posterior axis (AP) of Drosophila

embryos happens with striking single nuclei precision during

syncytial blastoderm stage. The pair-rule gene even-skipped (eve)

participates of this process forming seven stripes transverse to

AP observable ~3 hours after egg deposition. Under proper scaling, each stripe is

described by four well-conserved parameters determining its

location, width, intensity, and time of formation. That

simplicity contrasts with the intricate combination of

experimentally observed regulatory mechanisms encoded in specific

enhancers within eve's 16kb loci and challenges us to formulate general models about regulation of expression in metazoans. In this talk we discuss how an

ensemble of fits to data produced by application of simulated

annealing to optimize the parameters of a thermodynamics-based

sequence-level model aids understanding transcriptional

regulation. Quantitative experimental data on reporters driven by

the whole locus of the eve gene in the blastoderm of Drosophila

embryos was used for validating our approach. The fits are

clustered accordingly with their intrinsic regulatory logic. A

multiscale analysis enables visualization of quantitative

features resulting from the deconvolution of the regulatory

profile emergent from the interaction of multiple transcription

factors with the locus of eve. A few clusters of highly active

DNA binding sites within the enhancers collectively modulate

eve's transcription. Analysis of variable enhancers’

length shows the importance of DNA-bound protein-protein interactions

for transcriptional regulation. The interplay between activation

and quenching enables function conservation of enhancers in different species of Drosophila.

*This work was supported by funds from the National Institutes of Health R01 OD010936 and in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior—Brasil (CAPES)—Finance Code 001. D.M.G. was supported by Programa Unificado de Bolsas–USP.

Publication: Barr KA, Reinitz J. A sequence level model of an intact locus predicts the location and function of nonadditive enhancers. PLoS One
2017;12:e0180861. https://doi.org/10.1371/journal.pone.0180861;

Sabino AU, Guerreiro D de M, Kim A-R, Ramos AF, Reinitz J. Characterizing the regulatory logic of transcriptional control at the DNA sequence level by ensembles of thermodynamic models. Bioinformatics. 2025 ; 41( 10): 01-11. http://dx.doi.org/10.1093/bioinformatics/btaf534

Ramos AF. Cells are entities specialized in machine learning.

Presenters

  • Alexandre Ferreira Ramos

Authors

  • Alexandre Ferreira Ramos

  • Alan U. Sabino

    • Universidade de São Paulo
  • Drielly M. Guerreiro

    • Universidade de São Paulo
  • Ah-Ram Kim

    • School of Life Science, Handong Global University
  • John Reinitz

    • University of Chicago