Eric H Lyons

Associate Professor, Plant Science

Associate Professor, Agricultural-Biosystems Engineering

Advisor, CALS' Office of the Assoc Dean - Research for Cyber Initiatives in Agricultural / Life - Vet Science

Associate Professor, Genetics - GIDP

Associate Professor, BIO5 Institute

Primary Department

School of Plant Sciences

Department Affiliations

School of Plant Sciences

BIO5 Institute

Contact

(520) 626-5070

ericlyons@arizona.edu

Research Interest

Eric Lyons, PhD is an assistant professor at the University of Arizona School of Plant Sciences. Dr. Lyons is internationally known for his work in understanding the evolution, structure, and dynamics of genomes. Core to his research activities is the development of software systems for managing and analyzing genomic data and cyberinfrastructure for the life sciences.Dr. Lyons has published over 30 original research papers and 5 book chapters, many in collaboration with investigators from around the world. He is a frequent presenter at national and international meetings, and has been invited to teach workshops on the analysis of genomic data to plant, vertebrate, invertebrate, microbe, and health researchers.Prior to joining the faculty in the School of Plant Sciences, Dr. Lyons worked with the iPlant Collaborative developing cyberinfrastructure, and managing its scientific activities. In addition, he spent five years working in industry at biotech, pharmaceutical, and software companies. Dr. Lyons’ core software system for managing and analyzing genomic data is called CoGe, and is available for use at http://genomevolution.org

Publications

Bringing your tools to CyVerse Discovery Environment using Docker

Devisetty, U. K., Kennedy, K., Sarando, P., Merchant, N., & Lyons, E. (2016). Bringing your tools to CyVerse Discovery Environment using Docker. F1000Research, 5.

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms

Joyce, B. L., Haug-Baltzell, A. K., Hulvey, J. P., McCarthy, F., Devisetty, U. K., & Lyons, E. (2017). Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms. Journal of visualized experiments: JoVE.

Whole Genome and Tandem Duplicate Retention Facilitated Glucosinolate Pathway Diversification in the Mustard Family

Hofberger, J. A., Lyons, E., Edger, P. P., Pires, J. C., & Schranz, M. E. (2013). Whole Genome and Tandem Duplicate Retention Facilitated Glucosinolate Pathway Diversification in the Mustard Family. Genome biology and evolution, 5(11), 2155--2173.

Using genomic sequencing for classical genetics in E. coli K12

Lyons, E., Freeling, M., Kustu, S., & Inwood, W. (2011). Using genomic sequencing for classical genetics in E. coli K12. PloS one, 6(2), e16717.

We here develop computational methods to facilitate use of 454 whole genome shotgun sequencing to identify mutations in Escherichia coli K12. We had Roche sequence eight related strains derived as spontaneous mutants in a background without a whole genome sequence. They provided difference tables based on assembling each genome to reference strain E. coli MG1655 (NC_000913). Due to the evolutionary distance to MG1655, these contained a large number of both false negatives and positives. By manual analysis of the dataset, we detected all the known mutations (24 at nine locations) and identified and genetically confirmed new mutations necessary and sufficient for the phenotypes we had selected in four strains. We then had Roche assemble contigs de novo, which we further assembled to full-length pseudomolecules based on synteny with MG1655. This hybrid method facilitated detection of insertion mutations and allowed annotation from MG1655. After removing one genome with less than the optimal 20- to 30-fold sequence coverage, we identified 544 putative polymorphisms that included all of the known and selected mutations apart from insertions. Finally, we detected seven new mutations in a total of only 41 candidates by comparing single genomes to composite data for the remaining six and using a ranking system to penalize homopolymer sequencing and misassembly errors. An additional benefit of the analysis is a table of differences between MG1655 and a physiologically robust E. coli wild-type strain NCM3722. Both projects were greatly facilitated by use of comparative genomics tools in the CoGe software package (http://genomevolution.org/).

Screening synteny blocks in pairwise genome comparisons through integer programming

Tang, H., Lyons, E., Pedersen, B., Schnable, J. C., Paterson, A. H., & Freeling, M. (2011). Screening synteny blocks in pairwise genome comparisons through integer programming. BMC Bioinformatics, 12.

PMID: 21501495;PMCID: PMC3088904;Abstract:

Background: It is difficult to accurately interpret chromosomal correspondences such as true orthology and paralogy due to significant divergence of genomes from a common ancestor. Analyses are particularly problematic among lineages that have repeatedly experienced whole genome duplication (WGD) events. To compare multiple "subgenomes" derived from genome duplications, we need to relax the traditional requirements of "one-to-one" syntenic matchings of genomic regions in order to reflect "one-to-many" or more generally "many-to-many" matchings. However this relaxation may result in the identification of synteny blocks that are derived from ancient shared WGDs that are not of interest. For many downstream analyses, we need to eliminate weak, low scoring alignments from pairwise genome comparisons. Our goal is to objectively select subset of synteny blocks whose total scores are maximized while respecting the duplication history of the genomes in comparison. We call this "quota-based" screening of synteny blocks in order to appropriately fill a quota of syntenic relationships within one genome or between two genomes having WGD events.Results: We have formulated the synteny block screening as an optimization problem known as "Binary Integer Programming" (BIP), which is solved using existing linear programming solvers. The computer program QUOTA-ALIGN performs this task by creating a clear objective function that maximizes the compatible set of synteny blocks under given constraints on overlaps and depths (corresponding to the duplication history in respective genomes). Such a procedure is useful for any pairwise synteny alignments, but is most useful in lineages affected by multiple WGDs, like plants or fish lineages. For example, there should be a 1:2 ploidy relationship between genome A and B if genome B had an independent WGD subsequent to the divergence of the two genomes. We show through simulations and real examples using plant genomes in the rosid superorder that the quota-based screening can eliminate ambiguous synteny blocks and focus on specific genomic evolutionary events, like the divergence of lineages (in cross-species comparisons) and the most recent WGD (in self comparisons).Conclusions: The QUOTA-ALIGN algorithm screens a set of synteny blocks to retain only those compatible with a user specified ploidy relationship between two genomes. These blocks, in turn, may be used for additional downstream analyses such as identifying true orthologous regions in interspecific comparisons. There are two major contributions of QUOTA-ALIGN: 1) reducing the block screening task to a BIP problem, which is novel; 2) providing an efficient software pipeline starting from all-against-all BLAST to the screened synteny blocks with dot plot visualizations. Python codes and full documentations are publicly available http://github.com/tanghaibao/quota-alignment. QUOTA-ALIGN program is also integrated as a major component in SynMap http://genomevolution.com/CoGe/SynMap.pl, offering easier access to thousands of genomes for non-programmers. © 2011 Tang et al; licensee BioMed Central Ltd.

Eric H Lyons

Eric H Lyons

Research Interest

Publications

Find Us On

Stay Connected