Ryan N Gutenkunst
Work Summary
We learn history from the genomes of humans, tumors, and other species. Our studies reveal how evolution works at the molecular level, offering fundamental insight into how humans and pathogens adapt to challenges.
We learn history from the genomes of humans, tumors, and other species. Our studies reveal how evolution works at the molecular level, offering fundamental insight into how humans and pathogens adapt to challenges.
PMID: 17922568;PMCID: PMC2000971;Abstract:
Quantitative computational models play an increasingly important role in modern biology. Such models typically involve many free parameters, and assigning their values is often a substantial obstacle to model development. Directly measuring in vivo biochemical parameters is difficult, and collectively fitting them to other experimental data often yields large parameter uncertainties. Nevertheless, in earlier work we showed in a growth-factor- signaling model that collective fitting could yield well-constrained predictions, even when it left individual parameters very poorly constrained. We also showed that the model had a "sloppy" spectrum of parameter sensitivities, with eigenvalues roughly evenly distributed over many decades. Here we use a collection of models from the literature to test whether such sloppy spectra are common in systems biology. Strikingly, we find that every model we examine has a sloppy spectrum of sensitivities. We also test several consequences of this sloppiness for building predictive models. In particular, sloppiness suggests that collective fits to even large amounts of ideal time-series data will often leave many parameters poorly constrained. Tests over our model collection are consistent with this suggestion. This difficulty with collective fits may seem to argue for direct parameter measurements, but sloppiness also implies that such measurements must be formidably precise and complete to usefully constrain many model predictions. We confirm this implication in our growth-factor-signaling model. Our results suggest that sloppy sensitivity spectra are universal in systems biology models. The prevalence of sloppiness highlights the power of collective fits and suggests that modelers should focus on predictions rather than on parameters. © 2007 Gutenkunst et al.
'Orang-utan' is derived from a Malay term meaning 'man of the forest' and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (N(e)) expanded exponentially relative to the ancestral N(e) after the split, while Bornean N(e) declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.
PMID: 17140603;Abstract:
We use observed movement tracks of Atlantic bluefin tuna in the Gulf of Maine and mathematical modeling of this movement to identify possible resource patches. We infer bounds on the overall sizes and distribution of such patches, even though they are difficult to quantify by direct observation in situ. To do so, we segment individual fish tracks into intervals of distinct motion types based on the ratio of net displacement to length of track (Δ D / Δ L) over a time window Δ t. To find the best segmentation, we optimize the fit of a random-walk movement model to each motion type. We compare results from two distinct movement models: biased turning and biased speed, to check the model-dependence of our inferences, and find that uncertainty in choice of movement model dominates the uncertainties of our conclusions. We find that our data are best described using two motion types: "localized" (Δ D / Δ L small) and "long-ranged" (Δ D / Δ L large). The biased turning model leads to significantly better resolution of localized movement intervals than the biased speed model. We hypothesize that localized movement corresponds to exploitation of resource patches. Comparison with visual behavior observations made during tracking suggests that many inferred intervals of localized motion do indeed correspond to feeding activity. From our analysis, we estimate that, on average, bluefin tuna in the Gulf of Maine encounter a resource patch every 2 h, that those patches have an average radius of 0.7-1.2 km, and that, overall, there are at most 5-9 such patches per 100 km2 in the region studied. © 2006 Elsevier Ltd. All rights reserved.