Doctor of Philosophy, Stanford University, GENE-PHD (2018)
Bachelor of Science, Arizona State University, Biological Sci, biochemistry (2011)
Reprogramming somatic cells to induced pluripotent stem cells (iPSCs) is now routinely accomplished by overexpression of the four Yamanaka factors (OCT4, SOX2, KLF4, MYC (or OSKM))1. These iPSCs can be derived from patients' somatic cells and differentiated toward diverse fates, serving as a resource for basic and translational research. However, mechanistic insights into regulators and pathways that initiate the pluripotency network remain to be resolved. In particular, naturally occurring molecules that activate endogenous OCT4 and replace exogenous OCT4 in human iPSC reprogramming have yet to be found. Using a heterokaryon reprogramming system we identified NKX3-1 as an early and transiently expressed homeobox transcription factor. Following knockdown of NKX3-1, iPSC reprogramming is abrogated. NKX3-1 functions downstream of the IL-6-STAT3 regulatory network to activate endogenous OCT4. Importantly, NKX3-1 substitutes for exogenous OCT4 to reprogram both mouse and human fibroblasts at comparable efficiencies and generate fully pluripotent stem cells. Our findings establish an essential role for NKX3-1, a prostate-specific tumour suppressor, in iPSC reprogramming.
View details for DOI 10.1038/s41556-018-0136-x
View details for PubMedID 30013107
Lizards, which are amniote vertebrates like humans, are able to lose and regenerate a functional tail. Understanding the molecular basis of this process would advance regenerative approaches in amniotes, including humans. We have carried out the first transcriptomic analysis of tail regeneration in a lizard, the green anole Anolis carolinensis, which revealed 326 differentially expressed genes activating multiple developmental and repair mechanisms. Specifically, genes involved in wound response, hormonal regulation, musculoskeletal development, and the Wnt and MAPK/FGF pathways were differentially expressed along the regenerating tail axis. Furthermore, we identified 2 microRNA precursor families, 22 unclassified non-coding RNAs, and 3 novel protein-coding genes significantly enriched in the regenerating tail. However, high levels of progenitor/stem cell markers were not observed in any region of the regenerating tail. Furthermore, we observed multiple tissue-type specific clusters of proliferating cells along the regenerating tail, not localized to the tail tip. These findings predict a different mechanism of regeneration in the lizard than the blastema model described in the salamander and the zebrafish, which are anamniote vertebrates. Thus, lizard tail regrowth involves the activation of conserved developmental and wound response pathways, which are potential targets for regenerative medical therapies.
View details for DOI 10.1371/journal.pone.0105004
View details for Web of Science ID 000342687200046
View details for PubMedID 25140675
View details for PubMedCentralID PMC4139331
Supercentenarians (110 years or older) are the world's oldest people. Seventy four are alive worldwide, with twenty two in the United States. We performed whole-genome sequencing on 17 supercentenarians to explore the genetic basis underlying extreme human longevity. We found no significant evidence of enrichment for a single rare protein-altering variant or for a gene harboring different rare protein altering variants in supercentenarian compared to control genomes. We followed up on the gene most enriched for rare protein-altering variants in our cohort of supercentenarians, TSHZ3, by sequencing it in a second cohort of 99 long-lived individuals but did not find a significant enrichment. The genome of one supercentenarian had a pathogenic mutation in DSC2, known to predispose to arrhythmogenic right ventricular cardiomyopathy, which is recommended to be reported to this individual as an incidental finding according to a recent position statement by the American College of Medical Genetics and Genomics. Even with this pathogenic mutation, the proband lived to over 110 years. The entire list of rare protein-altering variants and DNA sequence of all 17 supercentenarian genomes is available as a resource to assist the discovery of the genetic basis of extreme longevity in future studies.
View details for DOI 10.1371/journal.pone.0112430
View details for PubMedID 25390934
The green anole lizard, Anolis carolinensis, is a key species for both laboratory and field-based studies of evolutionary genetics, development, neurobiology, physiology, behavior, and ecology. As the first non-avian reptilian genome sequenced, A. carolinesis is also a prime reptilian model for comparison with other vertebrate genomes. The public databases of Ensembl and NCBI have provided a first generation gene annotation of the anole genome that relies primarily on sequence conservation with related species. A second generation annotation based on tissue-specific transcriptomes would provide a valuable resource for molecular studies.Here we provide an annotation of the A. carolinensis genome based on de novo assembly of deep transcriptomes of 14 adult and embryonic tissues. This revised annotation describes 59,373 transcripts, compared to 16,533 and 18,939 currently for Ensembl and NCBI, and 22,962 predicted protein-coding genes. A key improvement in this revised annotation is coverage of untranslated region (UTR) sequences, with 79% and 59% of transcripts containing 5' and 3' UTRs, respectively. Gaps in genome sequence from the current A. carolinensis build (Anocar2.0) are highlighted by our identification of 16,542 unmapped transcripts, representing 6,695 orthologues, with less than 70% genomic coverage.Incorporation of tissue-specific transcriptome sequence into the A. carolinensis genome annotation has markedly improved its utility for comparative and functional studies. Increased UTR coverage allows for more accurate predicted protein sequence and regulatory analysis. This revised annotation also provides an atlas of gene expression specific to adult and embryonic tissues.
View details for DOI 10.1186/1471-2164-14-49
View details for PubMedID 23343042
View details for PubMedCentralID PMC3561122
Many perspectives on the role of evolution in human health include nonempirical assumptions concerning the adaptive evolutionary origins of human diseases. Evolutionary analyses of the increasing wealth of clinical and population genomic data have begun to challenge these presumptions. In order to systematically evaluate such claims, the time has come to build a common framework for an empirical and intellectual unification of evolution and modern medicine. We review the emerging evidence and provide a supporting conceptual framework that establishes the classical neutral theory of molecular evolution (NTME) as the basis for evaluating disease- associated genomic variations in health and medicine. For over a decade, the NTME has already explained the origins and distribution of variants implicated in diseases and has illuminated the power of evolutionary thinking in genomic medicine. We suggest that a majority of disease variants in modern populations will have neutral evolutionary origins (previously neutral), with a relatively smaller fraction exhibiting adaptive evolutionary origins (previously adaptive). This pattern is expected to hold true for common as well as rare disease variants. Ultimately, a neutral evolutionary perspective will provide medicine with an informative and actionable framework that enables objective clinical assessment beyond convenient tendencies to invoke past adaptive events in human history as a root cause of human disease.
View details for DOI 10.1101/gr.133702.111
View details for PubMedID 22665443
The axial skeleton is a defining feature of vertebrates and is patterned during somitogenesis. Cyclically expressed members of the notch and other signaling pathways, described as the 'segmentation clock', regulate the formation of somite boundaries. Comparisons among vertebrate model systems have revealed fundamental shifts in the regulation of expression among critical genes in the notch pathway. However, insights into the evolution of these expression differences have been limited by the lack of information from non-avian reptiles. We analyzed the segmentation clock of the first Lepidosaurian reptile sequenced, the green anole lizard, Anolis carolinensis, for comparison with avian and mammalian models. Using genomic sequence, RNA-Seq transcriptomic data, and in situ hybridization analysis of somite-stage embryos, we carried out comparative analyses of key genes and found that the anole segmentation clock displays features common to both amniote and anamniote vertebrates. Shared features with anamniotes, represented by Xenopus laevis and Danio rerio, include an absence of lunatic fringe (lfng) expression within the presomitic mesoderm (PSM), a hes6a gradient in the PSM not observed in the chicken or mouse, and EGF repeat structure of the divergent notch ligand, dll3. The anole and mouse share cycling expression of dll1 ligand in the PSM. To gain insight from an Archosaurian reptile, we analysed LFNG and DLL1 expressions in the American alligator. LFNG expression was absent in the alligator PSM, like the anole but unlike the chicken. In contrast, DLL1 expression does not cycle in the PSM of the alligator, similar to the chicken but unlike the anole. Thus, our analysis yields novel insights into features of the segmentation clock that are evolutionarily basal to amniotes versus those that are specific to mammals, Lepidosaurian reptiles, or Archosaurian reptiles.
View details for DOI 10.1016/j.ydbio.2011.11.021
View details for Web of Science ID 000300857800027
View details for PubMedID 22178152
As the cost of DNA sequencing drops, we are moving beyond one genome per species to one genome per individual to improve prevention, diagnosis, and treatment of disease by using personal genotypes. Computational methods are frequently applied to predict impairment of gene function by nonsynonymous mutations in individual genomes and single nucleotide polymorphisms (nSNPs) in populations. These computational tools are, however, known to fail 15%-40% of the time. We find that accurate discrimination between benign and deleterious mutations is strongly influenced by the long-term (among species) history of positions that harbor those mutations. Successful prediction of known disease-associated mutations (DAMs) is much higher for evolutionarily conserved positions and for original-mutant amino acid pairs that are rarely seen among species. Prediction accuracies for nSNPs show opposite patterns, forecasting impediments to building diagnostic tools aiming to simultaneously reduce both false-positive and false-negative errors. The relative allele frequencies of mutations diagnosed as benign and damaging are predicted by positional evolutionary rates. These allele frequencies are modulated by the relative preponderance of the mutant allele in the set of amino acids found at homologous sites in other species (evolutionarily permissible alleles [EPAs]). The nSNPs found in EPAs are biochemically less severe than those missing from EPAs across all allele frequency categories. Therefore, it is important to consider position evolutionary rates and EPAs when interpreting the consequences and population frequencies of human mutations. The impending sequencing of thousands of human and many more vertebrate genomes will lead to more accurate classifiers needed in real-world applications.
View details for DOI 10.1101/gr.091991.109
View details for Web of Science ID 000269482200006
View details for PubMedID 19546171
View details for PubMedCentralID PMC2752122