Null networks are a type of random graph that is favored for the analysis of a wide variety of real-world networks, including gene-regulatory networks, food webs, and species co-occurrence matrices. As a hypothesis-generating tool, null networks are invaluable because they can reveal network motifs and unusual large-scale properties of networks...
Specialized or secondary metabolism is a collection of pathways and small molecules that, while beneficial to an organism, are not strictly necessary for survival. Plants use secondary metabolites to, among other things, attract pollinators, defend against biotic and abiotic stressors, and form symbioses. Natural products from plants have seen an...
Gene expression is largely regulated by the binding of transcription factors to short sequences of DNA (transcription factor binding sites) to drive or repress transcription of an adjacent gene. Identifying the key transcription factor binding sites associated with a particular biological outcome provides an avenue for tailored transcriptional regulation in...
Gene regulation is a complex mechanism that controls the spatial and temporal expression of genes in a living cell. My dissertation studies focus on two problems. First, tissue-specific gene expression prediction from DNA sequence and chromatin state, and second, the accurate discovery of small over-represented regulatory circuits in gene regulatory...
The computational identification of gene Transcription Start Sites (TSSs) can provide insights into the regulation and function of genes without performing expensive experiments, particularly in organisms with incomplete annotations. High-resolution general-purpose TSS prediction remains a challenging problem, with little recent progress on the identification and differentiation of TSSs which are...