Hidden Markov Models That Use Predicted Local Structure

  • Slides: 16
Download presentation
Hidden Markov Models That Use Predicted Local Structure for Fold Recognition: Alphabets of Backbone

Hidden Markov Models That Use Predicted Local Structure for Fold Recognition: Alphabets of Backbone Geometry R Karchin, M Cline, Y Mandel. Gutfreund, K Karplus

Problem • Parent fold may have low sequence similarity • Most often, query is

Problem • Parent fold may have low sequence similarity • Most often, query is sequence-only

Proposed Solution • Try to predict local structure of target • Use local structure

Proposed Solution • Try to predict local structure of target • Use local structure prediction in fold recognition

Simple SAM Overview Query Sequence Multiple Alignment Profile HMM Search, and threshold Database Search

Simple SAM Overview Query Sequence Multiple Alignment Profile HMM Search, and threshold Database Search

Profile HMM Start Stop

Profile HMM Start Stop

Two-Track Profile HMM Start Stop AA Str AA AA AA Str Str

Two-Track Profile HMM Start Stop AA Str AA AA AA Str Str

SAM with Local Structure Query Sequence Predicted local structure Multiple Alignment Profile HMM Start

SAM with Local Structure Query Sequence Predicted local structure Multiple Alignment Profile HMM Start Stop AA Str AA Str Two-track HMM Database Search

Scoring • Profile HMM P(residue|state) = θ(AA, state) • Two-Track Profile HMM P(residue|state) =

Scoring • Profile HMM P(residue|state) = θ(AA, state) • Two-Track Profile HMM P(residue|state) = θ(AA, state)Φ(local, state) • SAM Two-Track Profile HMM P(residue|state) = θ(AA, state)Φ 0. 3(local, state)

Local Structure Alphabets DSSP-EHL STRIDE-EHL Protein Blocks ANG ALPHA TCO Hydrogen Bonding DSSP with

Local Structure Alphabets DSSP-EHL STRIDE-EHL Protein Blocks ANG ALPHA TCO Hydrogen Bonding DSSP with 6 different β-strands H-Bond and Φ, φ angles Designed from SOM of Φ, φ angles Partition of Ramachandran Dihedral angle between Calphas Dihedral angle between carbonyls

Predicting Local Structure of the Query • Alphabets very dissimilar • Use a neural

Predicting Local Structure of the Query • Alphabets very dissimilar • Use a neural network – Input: window of multiple alignment – Output: structure probabilities for a single residue

Good Structure Alphabets • Intuitively, we want – Predictability – Conservation – Better fold

Good Structure Alphabets • Intuitively, we want – Predictability – Conservation – Better fold recognition – Better alignment

Predictability • Evaluated on – Exact predictions (QN) – Overlap of structure segments (SOV)

Predictability • Evaluated on – Exact predictions (QN) – Overlap of structure segments (SOV) – Information gain • Winners: – *-EHL for precision – STR and PB for most information

Conservation • Used FSSP structural alignments • Calculated mutual information between proteins • Used

Conservation • Used FSSP structural alignments • Calculated mutual information between proteins • Used only alignments with low sequence similarity • Winners: – STR, PB

Fold Recognition Winners: All except PB (STRIDE-EHL leading, STR lagging)

Fold Recognition Winners: All except PB (STRIDE-EHL leading, STR lagging)

Alignment • Compared to DALI and CE alignments • Evaluated using a shift score

Alignment • Compared to DALI and CE alignments • Evaluated using a shift score – 1. 0 perfect match – -0. 2 is the worst (adjustable) • Winners – STR, and STRIDE gets an odd one. – All alphabets improve alignment

Conclusions • Local structure improves: – Fold recognition – Alignment when there’s little sequence

Conclusions • Local structure improves: – Fold recognition – Alignment when there’s little sequence similarity • Alignment and fold recognition are very different problems