Current Role at Stanford
Associate Director, PharmGKB
Education & Certifications
PhD, Stanford University, Biophysics (2002)
SB, Massachusetts Institute of Technology, Biology (1993)
Associate Director, PharmGKB
As pharmacogenomics becomes integrated into clinical practice, curation of published studies becomes increasingly important. At the Pharmacogenomics Knowledgebase (PharmGKB; www.pharmgkb.org), pharmacogenetic associations reported in published articles are manually curated and evaluated. Standard terminologies are used, making findings uniform and unambiguous. Lack of information, clarity, or standards in the original report can make it difficult or impossible to curate. We provide 10 rules to help authors ensure that their results are accurately captured and integrated.
View details for DOI 10.1002/cpt.15
View details for PubMedID 25670512
PSB brings together top researchers from around the world to exchange research results and address open issues in all aspects of computational biology. PSB 2015 marks the twentieth anniversary of PSB. Reaching a milestone year is an accomplishment well worth celebrating. It is long enough to have seen big changes occur, but recent enough to be relevant for today. As PSB celebrates twenty years of service, we would like to take this opportunity to congratulate the PSB community for your success. We would also like the community to join us in a time of celebration and reflection on this accomplishment.
View details for PubMedID 25592562
The Clinical Pharmacogenetics Implementation Consortium (CPIC) publishes genotype-based drug guidelines to help clinicians understand how available genetic test results could be used to optimize drug therapy. CPIC has focused initially on well-known examples of pharmacogenomic associations that have been implemented in selected clinical settings, publishing nine to date. Each CPIC guideline adheres to a standardized format and includes a standard system for grading levels of evidence linking genotypes to phenotypes and assigning a level of strength to each prescribing recommendation. CPIC guidelines contain the necessary information to help clinicians translate patient-specific diplotypes for each gene into clinical phenotypes or drug dosing groups. This paper reviews the development process of the CPIC guidelines and compares this process to the Institute of Medicine's Standards for Developing Trustworthy Clinical Practice Guidelines.
View details for PubMedID 24479687
Human leukocyte antigen B (HLA-B) is a gene that encodes a cell surface protein involved in presenting antigens to the immune system. The variant allele HLA-B*15:02 is associated with an increased risk of Stevens-Johnson syndrome (SJS) and toxic epidermal necrolysis (TEN) in response to carbamazepine treatment. We summarize evidence from the published literature supporting this association and provide recommendations for the use of carbamazepine based on HLA-B genotype (also available on PharmGKB: http://www.pharmgkb.org). The purpose of this article is to provide information to allow the interpretation of clinical HLA-B*15:02 genotype tests so that the results can be used to guide the use of carbamazepine. The guideline provides recommendations for the use of carbamazepine when HLA-B*15:02 genotype results are available. Detailed guidelines regarding the selection of alternative therapies, the use of phenotypic tests, when to conduct genotype testing, and cost-effectiveness analyses are beyond the scope of this document. Clinical Pharmacogenetics Implementation Consortium (CPIC) guidelines are published and updated periodically on the PharmGKB website at (http://www.pharmgkb.org).
View details for DOI 10.1038/clpt.2013.103
View details for PubMedID 23695185
The Pharmacogenomics Knowledgebase (PharmGKB) is a resource that collects, curates, and disseminates information about the impact of human genetic variation on drug responses. It provides clinically relevant information, including dosing guidelines, annotated drug labels, and potentially actionable gene-drug associations and genotype-phenotype relationships. Curators assign levels of evidence to variant-drug associations using well-defined criteria based on careful literature review. Thus, PharmGKB is a useful source of high-quality information supporting personalized medicine-implementation projects.
View details for DOI 10.1038/clpt.2012.96
View details for Web of Science ID 000309017000009
View details for PubMedID 22992668
The need for efficient text-mining tools that support curation of the biomedical literature is ever increasing. In this article, we describe an experiment aimed at verifying whether a text-mining tool capable of extracting meaningful relationships among domain entities can be successfully integrated into the curation workflow of a major biological database. We evaluate in particular (i) the usability of the system's interface, as perceived by users, and (ii) the correlation of the ranking of interactions, as provided by the text-mining system, with the choices of the curators.
View details for DOI 10.1093/database/bas021
View details for Web of Science ID 000304924100001
View details for PubMedID 22529178
Personalized medicine is expected to benefit from combining genomic information with regular monitoring of physiological states by multiple high-throughput methods. Here, we present an integrative personal omics profile (iPOP), an analysis that combines genomic, transcriptomic, proteomic, metabolomic, and autoantibody profiles from a single individual over a 14 month period. Our iPOP analysis revealed various medical risks, including type 2 diabetes. It also uncovered extensive, dynamic changes in diverse molecular components and biological pathways across healthy and diseased conditions. Extremely high-coverage genomic and transcriptomic data, which provide the basis of our iPOP, revealed extensive heteroallelic changes during healthy and diseased states and an unexpected RNA editing mechanism. This study demonstrates that longitudinal iPOP can be used to interpret healthy and diseased states by connecting genomic information with additional dynamic omics activity.
View details for DOI 10.1016/j.cell.2012.02.009
View details for Web of Science ID 000301889500023
View details for PubMedID 22424236
The mission of the Pharmacogenomics Knowledge Base (PharmGKB; www.pharmgkb.org ) is to collect, encode and disseminate knowledge about the impact of human genetic variations on drug responses. It is an important worldwide resource of clinical pharmacogenomic biomarkers available to all. The PharmGKB website has evolved to highlight our knowledge curation and aggregation over our previous emphasis on collecting primary data. This review summarizes the methods we use to drive this expanded scope of 'Knowledge Acquisition to Clinical Applications', the new features available on our website and our future goals.
View details for DOI 10.2217/BMM.11.94
View details for Web of Science ID 000298488200009
View details for PubMedID 22103613
Warfarin is a widely used anticoagulant with a narrow therapeutic index and large interpatient variability in the dose required to achieve target anticoagulation. Common genetic variants in the cytochrome P450-2C9 (CYP2C9) and vitamin K-epoxide reductase complex (VKORC1) enzymes, in addition to known nongenetic factors, account for ~50% of warfarin dose variability. The purpose of this article is to assist in the interpretation and use of CYP2C9 and VKORC1 genotype data for estimating therapeutic warfarin dose to achieve an INR of 2-3, should genotype results be available to the clinician. The Clinical Pharmacogenetics Implementation Consortium (CPIC) of the National Institutes of Health Pharmacogenomics Research Network develops peer-reviewed gene-drug guidelines that are published and updated periodically on http://www.pharmgkb.org based on new developments in the field.(1).
View details for DOI 10.1038/clpt.2011.185
View details for Web of Science ID 000295119200035
View details for PubMedID 21900891
Whole-genome sequencing harbors unprecedented potential for characterization of individual and family genetic variation. Here, we develop a novel synthetic human reference sequence that is ethnically concordant and use it for the analysis of genomes from a nuclear family with history of familial thrombophilia. We demonstrate that the use of the major allele reference sequence results in improved genotype accuracy for disease-associated variant loci. We infer recombination sites to the lowest median resolution demonstrated to date (< 1,000 base pairs). We use family inheritance state analysis to control sequencing error and inform family-wide haplotype phasing, allowing quantification of genome-wide compound heterozygosity. We develop a sequence-based methodology for Human Leukocyte Antigen typing that contributes to disease risk prediction. Finally, we advance methods for analysis of disease and pharmacogenomic risk across the coding and non-coding genome that incorporate phased variant data. We show these methods are capable of identifying multigenic risk for inherited thrombophilia and informing the appropriate pharmacological therapy. These ethnicity-specific, family-based approaches to interpretation of genetic variation are emblematic of the next generation of genetic risk assessment using whole-genome sequencing.
View details for DOI 10.1371/journal.pgen.1002280
View details for Web of Science ID 000295419100031
View details for PubMedID 21935354
Thiopurine methyltransferase (TPMT) activity exhibits monogenic co-dominant inheritance, with ethnic differences in the frequency of occurrence of variant alleles. With conventional thiopurine doses, homozygous TPMT-deficient patients (~1 in 178 to 1 in 3,736 individuals with two nonfunctional TPMT alleles) experience severe myelosuppression, 30-60% of individuals who are heterozygotes (~3-14% of the population) show moderate toxicity, and homozygous wild-type individuals (~86-97% of the population) show lower active thioguanine nucleolides and less myelosuppression. We provide dosing recommendations (updates at http://www.pharmgkb.org) for azathioprine, mercaptopurine (MP), and thioguanine based on TPMT genotype.
View details for DOI 10.1038/clpt.2010.320
View details for Web of Science ID 000287439600018
View details for PubMedID 21270794
Biological Pathway Exchange (BioPAX) is a standard language to represent biological pathways at the molecular and cellular level and to facilitate the exchange of pathway data. The rapid growth of the volume of pathway data has spurred the development of databases and computational tools to aid interpretation; however, use of these data is hampered by the current fragmentation of pathway information across many databases with incompatible formats. BioPAX, which was created through a community process, solves this problem by making pathway data substantially easier to collect, index, interpret and share. BioPAX can represent metabolic and signaling pathways, molecular and genetic interactions and gene regulation networks. Using BioPAX, millions of interactions, organized into thousands of pathways, from many organisms are available from a growing number of databases. This large amount of pathway data in a computable form will support visualization, analysis and biological discovery.
View details for DOI 10.1038/nbt.1666
View details for Web of Science ID 000281719100019
View details for PubMedID 20829833
Recent advances in high-throughput genotyping and phenotyping have accelerated the creation of pharmacogenomic data. Consequently, the community requires standard formats to exchange large amounts of diverse information. To facilitate the transfer of pharmacogenomics data between databases and analysis packages, we have created a standard XML (eXtensible Markup Language) schema that describes both genotype and phenotype data as well as associated metadata. The schema accommodates information regarding genes, drugs, diseases, experimental methods, genomic/RNA/protein sequences, subjects, subject groups, and literature. The Pharmacogenetics and Pharmacogenomics Knowledge Base (PharmGKB; www.pharmgkb.org) has used this XML schema for more than 5 years to accept and process submissions containing more than 1,814,139 SNPs on 20,797 subjects using 8,975 assays. Although developed in the context of pharmacogenomics, the schema is of general utility for exchange of genotype and phenotype data. We have written syntactic and semantic validators to check documents using this format. The schema and code for validation is available to the community at http://www.pharmgkb.org/schema/index.html (last accessed: 8 October 2007).
View details for DOI 10.1002/humu.20662
View details for Web of Science ID 000253033000002
View details for PubMedID 17994540
PharmGKB is a knowledge base that captures the relationships between drugs, diseases/phenotypes and genes involved in pharmacokinetics (PK) and pharmacodynamics (PD). This information includes literature annotations, primary data sets, PK and PD pathways, and expert-generated summaries of PK/PD relationships between drugs, diseases/phenotypes and genes. PharmGKB's website is designed to effectively disseminate knowledge to meet the needs of our users. PharmGKB currently has literature annotations documenting the relationship of over 500 drugs, 450 diseases and 600 variant genes. In order to meet the needs of whole genome studies, PharmGKB has added new functionalities, including browsing the variant display by chromosome and cytogenetic locations, allowing the user to view variants not located within a gene. We have developed new infrastructure for handling whole genome data, including increased methods for quality control and tools for comparison across other data sources, such as dbSNP, JSNP and HapMap data. PharmGKB has also added functionality to accept, store, display and query high throughput SNP array data. These changes allow us to capture more structured information on phenotypes for better cataloging and comparison of data. PharmGKB is available at www.pharmgkb.org.
View details for DOI 10.1093/nar/gkm1009
View details for Web of Science ID 000252545400160
View details for PubMedID 18032438
View details for PubMedID 15882130
To determine how genetic variations contribute the variations in drug response, we need to know the genes that are related to drugs of interest. But there are no publicly available data-bases of known gene-drug relationships, and it is time-consuming to search the literature for this information. We have developed a resource to support the storage, summarization, and dissemination of key gene-drug interactions of relevance to pharmacogenetics. Extracting all gene-drug relationships from the literature is a daunting task, so we distributed a tool to acquire this knowledge from the scientific community. We also developed a categorization scheme to classify gene-drug relationships according to the type of pharmacogenetic evidence that supports them. Our resource (http://www.pharmgkb.org/home/project-community.jsp) can be queried by gene or drug, and it summarizes gene-drug relationships, categories of evidence, and supporting literature. This resource is growing, containing entries for 138 genes and 215 drugs of pharmacogenetics significance, and is a core component of PharmGKB, a pharmacogenetics knowledge base (http://www.pharmgkb.org).
View details for Web of Science ID 000226723300159
View details for PubMedID 15360921
The crystal structures of the ribosome reveal remarkable complexity and provide a starting set of snapshots with which to understand the dynamics of translation. To augment the static crystallographic models with dynamic information present in crosslink, footprint, and cleavage data, we examined 2691 proximity measurements and focused on the subset that was apparently incompatible with >40 published crystal structures. The measurements from this subset generally involve regions of the structure that are functionally conserved and structurally flexible. Local movements in the crystallographic states of the ribosome that would satisfy biochemical proximity measurements show coherent patterns suggesting alternative conformations of the ribosome. Three different types of data obtained for the two subunits display similar "mismatching" patterns, suggesting that the signals are robust and real. In particular, there is an indication of coherent motion in the decoding region within the 30S subunit and central protuberance and surrounding areas of the 50S subunit. Directions of rearrangements fluctuate around the proposed path of tRNA translocation and the plane parallel to the interface of the two subunits. Our results demonstrate that systematic combination and analysis of noisy, apparently incompatible data sources can provide biologically useful signals about structural dynamics.
View details for Web of Science ID 000186175900001
View details for PubMedID 14561879
The publication of the crystal structures of the ribosome offers an opportunity to retrospectively evaluate the information content of hundreds of qualitative biochemical and biophysical studies of these structures. We assessed the correspondence between more than 2,500 experimental proximity measurements and the distances observed in the ribosomal crystals. Although detailed experimental procedures and protocols are unique in almost each analyzed paper, the data can be grouped into subsets with similar patterns and analyzed in an integrative fashion. We found that, for crosslinking, footprinting, and cleavage data, the corresponding distances observed in crystal structures generally did not exceed the maximum values expected (from the estimated length of the agent and maximal anticipated deviations from the conformations found in crystals). However, the distribution of distances had heavier tails than those typically assumed when building three-dimensional models, and the fraction of incompatible distances was greater than expected. Some of these incompatibilities can be attributed to the experimental methods used. In addition, the accuracy of these procedures appears to be sensitive to the different reactivities, flexibilities, and interactions among the components. These findings demonstrate the necessity of a very careful analysis of data used for structural modeling and consideration of all possible parameters that could potentially influence the quality of measurements. We conclude that experimental proximity measurements can provide useful distance information for structural modeling, but with a broad distribution of inferred distance ranges. We also conclude that development of automated modeling approaches would benefit from better annotations of experimental data for detection and interpretation of their significance.
View details for DOI 10.1017/S135583820202407X
View details for Web of Science ID 000175155500002
View details for PubMedID 12003488
View details for Web of Science ID 000181756700019
The many interactions of tRNA with the ribosome are fundamental to protein synthesis. During the peptidyl transferase reaction, the acceptor ends of the aminoacyl and peptidyl tRNAs must be in close proximity to allow peptide bond formation, and their respective anticodons must base pair simultaneously with adjacent trinucleotide codons on the mRNA. The two tRNAs in this state can be arranged in two nonequivalent general configurations called the R and S orientations, many versions of which have been proposed for the geometry of tRNAs in the ribosome. Here, we report the combined use of computational analysis and tethered hydroxyl-radical probing to constrain their arrangement. We used Fe(II) tethered to the 5' end of anticodon stem-loop analogs (ASLs) of tRNA and to the 5' end of deacylated tRNA(Phe) to generate hydroxyl radicals that probe proximal positions in the backbone of adjacent tRNAs in the 70S ribosome. We inferred probe-target distances from the resulting RNA strand cleavage intensities and used these to calculate the mutual arrangement of A-site and P-site tRNAs in the ribosome, using three different structure estimation algorithms. The two tRNAs are constrained to the S configuration with an angle of about 45 degrees between the respective planes of the molecules. The terminal phosphates of 3'CCA are separated by 23 A when using the tRNA crystal conformations, and the anticodon arms of the two tRNAs are sufficiently close to interact with adjacent codons in mRNA.
View details for Web of Science ID 000085267900007
View details for PubMedID 10688361
Considerable evidence indicates that free radical injury may underlie the pathologic changes in muscular dystrophies from mammalian and avian species. We have investigated the role of oxidative injury in muscle necrosis in mice with a muscular dystrophy due to a defect in the dystrophin gene (the mdx strain). In order to avoid secondary consequences of muscle necrosis, all experiments were done on muscle prior to the onset of the degenerative process (i.e. during the 'pre-necrotic' phase) which lasted up to 20 days of age in the muscles examined. In pre-necrotic mdx muscle, there was an induction of expression of genes encoding antioxidant enzymes, indicative of a cellular response to oxidative stress. In addition, the levels of lipid peroxidation were greater in mdx muscle than in the control. Since the free radical nitric oxide (NO*) has been shown to mediate oxidative injury in various disease states, and because dystrophin has been shown to form a complex with the enzyme nitric oxide synthase, we examined pre-necrotic mdx muscle for evidence of NO*-mediated injury by measuring cellular nitrotyrosine formation. By both immunohistochemical and electrochemical analyses, no evidence of increased nitrotyrosine levels in mdx muscle was detected. Therefore, although no relationship with NO*-mediated toxicity was found, we found evidence of increased oxidative stress preceding the onset of muscle cell death in dystrophin-deficient mice. These results lend support to the hypothesis that free radical-mediated injury may contribute to the pathogenesis of muscular dystrophies.
View details for Web of Science ID 000077605200013
View details for PubMedID 9879685
The gamma-glutamyl carboxylase and vitamin K epoxidase activities of a series of mutants of bovine vitamin K-dependent carboxylase with progressively larger COOH-terminal deletions have been analyzed. The recombinant wild-type (residues 1-758) and mutant protein carboxylases, Cbx 711, Cbx 676, and Cbx 572, representing residues 1-711, 1-676, and 1-572, respectively, were expressed in baculovirus-infected Sf9 cells. Wild-type carboxylase had a Km for the substrate Phe-Leu-Glu-Glu-Leu (FLEEL) of 0.87 mM; the carboxylation of FLEEL was stimulated 2.5-fold by proPT18, the propeptide of prothrombin. Its Km for vitamin K hydroquinone was 23 microM and the specific epoxidase activity of the carboxylase was 938 pmol vitamin KO/30 min/pmol of carboxylase. Cbx 711, which was also stimulated by proPT18, had a Km for FLEEL, a Km for vitamin K hydroquinone, and a specific epoxidase activity that was comparable to the wild-type carboxylase. In contrast Cbx 572 lacked both carboxylase and epoxidase activities. Although Cbx 676 had a normal carboxylase active site in terms of the Km for FLEEL and its stimulation by proPT18, the Km for vitamin K hydroquinone was 540 microM, and the specific epoxidase activity was 97 pmol KO/30 min/pmol of Cbx 676. The catalytic efficiencies of Cbx 676 for glutamate carboxylation and vitamin K epoxidation were decreased 15- and 400-fold, respectively, from wild-type enzyme reflecting the requirement for formation of an activated vitamin K species for carboxylation to occur. These data indicate that the truncation of COOH-terminal segments of the carboxylase had no effect on FLEEL or propeptide recognition, but in the case of Cbx 676, selectively affected the interaction with vitamin K hydroquinone and the generation of epoxidase activity. These data suggest that a vitamin K epoxidase activity domain may reside near the COOH terminus while the carboxylase active site domain resides toward the NH2 terminus.
View details for Web of Science ID A1995QL58000055
View details for PubMedID 7890642