Protein Structure Prediction II SCOP CATH gen THREADER

  • Slides: 25
Download presentation
Protein Structure Prediction II • • • SCOP CATH gen. THREADER Swiss-Model Mod. Base

Protein Structure Prediction II • • • SCOP CATH gen. THREADER Swiss-Model Mod. Base – – - Protein structure classification 3 D structure prediction A database of 3 D struc. Predict.

SCOP: Structural Classification of Proteins http: //scop. mrc-lmb. cam. ac. uk/scop/ • Based on

SCOP: Structural Classification of Proteins http: //scop. mrc-lmb. cam. ac. uk/scop/ • Based on known protein structures • Manually created by visual inspection • Hierarchical database structure: –Class, Fold, Superfamily, Family, Protein and Species

Node Parents of node Children of node

Node Parents of node Children of node

Node Parents of node Children of node

Node Parents of node Children of node

CATH: Protein Structure Classification by Class, Architecture, Topology and Homology http: //www. cathdb. info/latest/index.

CATH: Protein Structure Classification by Class, Architecture, Topology and Homology http: //www. cathdb. info/latest/index. html • Class: The secondary structure composition: mainly-alpha, mainly-beta and alpha-beta. • Architecture: The overall shape of the domain structure. Orientations of the secondary structures : e. g. barrel or 3 layer sandwich. • Topology: Structures are grouped into fold groups at this level depending on both the overall shape and connectivity of the secondary structures. • Homologous Superfamily: Evolutionary conserved structures

CATH: Protein Structure Classification by Class, Architecture, Topology and Homology

CATH: Protein Structure Classification by Class, Architecture, Topology and Homology

gen. THREADER Input sequence Type of Analysis (PSIPRED, MEMSAT, gen. THREAD) http: //bioinf. cs.

gen. THREADER Input sequence Type of Analysis (PSIPRED, MEMSAT, gen. THREAD) http: //bioinf. cs. ucl. ac. uk/psipred/psiform. html

Gen. THREADER Output

Gen. THREADER Output

Gen. THREADER Output The output sequences show some extent of sequence homology But high

Gen. THREADER Output The output sequences show some extent of sequence homology But high level of secondary structure conservation

SWISS-MODEL An automated protein modeling server. http: //swissmodel. expasy. org/

SWISS-MODEL An automated protein modeling server. http: //swissmodel. expasy. org/

SWISS-MODEL • The SWISS-MODEL algorithm can be divided into three steps: 1. Search for

SWISS-MODEL • The SWISS-MODEL algorithm can be divided into three steps: 1. Search for suitable templates: the server finds all similarities of a query sequence to sequences of known structure. It uses the BLASTP 2 program with the Ex. NRL-3 D database (a derivative of PDB database, specified for SWISS-MODEL). You get these partial results as a Swiss. Model Trace. Log file. 2. Check sequence identity with target: All templates with sequence identities above 25% are selected 3. Create the model using the Pro. Mod. II program. You get this as a Swiss. Model-Model file.

SWISS-MODEL Get PDB file by E-mail Load to J-Mol

SWISS-MODEL Get PDB file by E-mail Load to J-Mol

Homology Modeling Single Structure

Homology Modeling Single Structure

query Swiss-Model file Structures used for the homology model

query Swiss-Model file Structures used for the homology model

Comparative Modeling • Accuracy of the comparative model is related to the sequence identity

Comparative Modeling • Accuracy of the comparative model is related to the sequence identity on which it is based >50% sequence identity = high accuracy 30%-50% sequence identity= 90% modeled <30% sequence identity =low accuracy (many errors)

Mod. Base A Homology Model Database

Mod. Base A Homology Model Database

Ligand Binding Site

Ligand Binding Site

 • A Clan is defined as a group of Pfam families which share

• A Clan is defined as a group of Pfam families which share a common evolutionary origin. • They are generally different at the sequence and functional level but similar at the structure level. Histone superfamily