BE Seminar – “Engineering cell state-specific gene regulation with AI and synthetic biology” (Sebastián Castillo Hair, University of Washington)
February 6 at 3:30 PM - 4:30 PM
Cells across tissues, developmental stages, and disease conditions adopt distinct intracellular states – epigenomic, transcriptomic, and proteomic profiles – to compartmentalize function in time and space. The ability to write DNA- and RNA-encoded programs that sense and interface with cellular states has transformative potential for biotechnology, for example in designing gene therapies with cell type- and disease-specificity to minimize off-target effects. However, limited understanding of how cellular processes respond to cellular states remains a major bottleneck. Machine learning (ML) models that learn the regulatory grammar of cell state regulation from large datasets offer a powerful tool to overcome these barriers. In this talk I will present advances in combining ML and high throughput experiments to study and engineer DNA and RNA sequences that recognize cell types, tissues, and developmental stages. First, I will focus on enhancers – DNA sequences that regulate transcription with cell type-specificity. We trained ML models on databases of genomic accessibility – an imperfect proxy of enhancer activity that is available for hundreds of cell types and tissues – and used them to design enhancers targeted to multiple cell types. Experimentally, these enhancers drove stronger and more specific gene expression than endogenous controls when tested on 10 human cell lines, including from muscle, liver, breast, and embryonal tissues, and even in vivo in mouse retinas. In the second part, I will focus on mRNA sequence engineering. We used ML predictors of translation to optimize performance of an mRNA-delivered gene editor, and developed predictors of mRNA degradation in response to microRNAs—non-coding RNAs that are differentially expressed across cell types. Using these, we designed mRNAs with programmable specificity across 10 cell lines. Finally, I will focus on gene regulation during the dramatic cell state transitions that occur during development. By training ML predictors of chromatin accessibility and mRNA regulation during zebrafish embryogenesis and applying explainable AI methods, we uncovered regulatory grammar driving differentiation and gene expression dynamics. Our work shows the potential of combining ML, high throughput experiments, and synthetic biology to decode and engineer cell state-responsive biological systems.
Sebastián Castillo Hair, PhD
Postdoctoral Researcher, University of Washington
Dr. Sebastian Castillo-Hair is a Postdoctoral Researcher in Prof. Georg Seelig’s group at the University of Washington. In his research, he uses machine learning and high throughput experiments to study gene regulation and design synthetic sequences with novel capabilities. He has applied this approach to optimize mRNA-delivered gene editing therapeutics, to engineer human enhancers with cell type-specific activity, and to uncover the sequence determinants of differentiation and gene regulation during zebrafish development. Sebastian received his PhD in Bioengineering from Rice University, where he worked in Prof. Jeffrey Tabor’s group on bacterial optogenetics and development of open-source tools for synthetic biology.
Zoom Link (if unable to attend in-person): https://upenn.zoom.us/j/99195947296?pwd=BsymSHkp1mteVj54KWnR4vhmJsafZK.1