Community Academic Profiles

Gill Bejerano

Publication Details

  • Variations on probabilistic suffix trees: statistical modeling and prediction of protein families.

    Bejerano G, Yona G. Bioinformatics. 2001; 17 (1): 23-43

    We present a method for modeling protein families by means of probabilistic suffix trees (PSTs). The method is based on identifying significant patterns in a set of related protein sequences. The patterns can be of arbitrary length, and the input sequences do not need to be aligned, nor is delineation of domain boundaries required. The method is automatic and can be applied, without assuming any preliminary biological information, with surprising success. Basic biological considerations, such as amino acid background probabilities and amino acid substitution probabilities, can be incorporated to improve performance.

    PubMedID: 11222260
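
    To illustrate the general idea behind the abstract above, here is a minimal sketch of a variable-length context model in the spirit of a probabilistic suffix tree. It is not the authors' algorithm: the function names (train_pst, next_symbol_prob, log_likelihood), the simple count threshold used to decide which contexts are kept, and the pseudo-count smoothing are all assumptions made for this example. It also assumes sequences use only the 20 standard amino acid letters.

```python
import math
from collections import defaultdict

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (assumed alphabet)


def train_pst(sequences, max_depth=3, min_count=2):
    """Count next-symbol frequencies for every context of length 0..max_depth.

    Returns a dict mapping context string -> {next symbol: count}.
    Contexts seen fewer than min_count times are dropped (a simplified
    stand-in for a real significance criterion); the empty context is kept.
    """
    counts = defaultdict(lambda: defaultdict(int))
    for seq in sequences:
        for i, symbol in enumerate(seq):
            for depth in range(max_depth + 1):
                if i - depth < 0:
                    break
                counts[seq[i - depth:i]][symbol] += 1
    return {
        ctx: dict(nxt)
        for ctx, nxt in counts.items()
        if ctx == "" or sum(nxt.values()) >= min_count
    }


def next_symbol_prob(model, history, symbol, max_depth=3, pseudo=0.5):
    """Probability of symbol given the longest retained suffix of history."""
    context = ""
    for depth in range(min(max_depth, len(history)), -1, -1):
        context = history[len(history) - depth:]
        if context in model:
            break
    nxt = model.get(context, {})
    # Pseudo-counts keep unseen symbols from getting zero probability.
    total = sum(nxt.values()) + pseudo * len(ALPHABET)
    return (nxt.get(symbol, 0) + pseudo) / total


def log_likelihood(model, seq, max_depth=3):
    """Sum of log-probabilities of each symbol given its preceding context."""
    return sum(
        math.log(next_symbol_prob(model, seq[:i], seq[i], max_depth))
        for i in range(len(seq))
    )


# Toy, made-up "family" of unaligned sequences sharing a repeated pattern.
family = ["ACDACDACD", "ACDACDAC"]
model = train_pst(family)
member_score = log_likelihood(model, "ACDACD")
outsider_score = log_likelihood(model, "WWWWWW")
```

    In this toy setup, a sequence that shares the family's pattern receives a higher log-likelihood than an unrelated one, which is the basic signal a family model of this kind relies on. No alignment of the training sequences is needed, since contexts are collected directly from the raw strings.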
