DOMAIN TERTIARY AND QUARTERNARY STRUCTURE OF PROTEINS Levels

Levels of protein structure organization

Between secondary and tertiary structure • Supersecondary structure: arrangement of elements of same or

Example of a b-hairpin: tryptophan zipper (1 LE 0)

Example of a b-hairpin in bovine pancreatic trypsin inhibitor– BPTI. Example of a protein

Example of a b-meander: aspectrin SH 3 domain (1 BK 2)

Helix E helix F Troponin C with four EF motifs that bind calcium ions.

The Helix-Turn-Helix motif • This motif is characteristic of proteins binding to the major

The a-helix-b-hairpin motif (zinc finger)

b-a-b Motif (very important and very frequent) Hydrophobic core between a-helix and b-sheet

Domains: classification criteria 1. Functionality (performing a biological function or role in formation and

Protein domains There is no consensus method to determine the domains and a structure

Domains: example • For Domains a recent ofform recently review evolved domainproteins insertion. frequently

Example of division of a protein into domains: human Hsp 70 chaperone

Domain identification algorithms Schultz’s method: neighborhood correlation criterion

The Go algorithm: interdomain distances are larger than intradomain distances

The Rose algorithm: based on the deviation of the long axes of the fragments

The Crippen algorithm: based on dissection of residues according to interresidue distances into clusters

Specific nodes in these dendrograms are identified as tertiary structural clusters of the protein;

The domains identified by this clustering method may not correspond to the functional domains

Classification of three-dimensional structures of protein Richardson’s classification a – a-helices are only or

SCOP classification Structural Classification Of Proteins This is a hierarchical classification scheme with the

[ http: //scop. mrc-lmb. cam. ac. uk/scop/ ]

Scop Classification Statistics 1. 75 release 38221 PDB Entries (23 Feb 2009). 110800 Domains.

CATH classification (Class (C), Architecture(A), Topology(T), Homologous superfamily (H)) Four hierarchy levels: 1. Class

• Class(C) derived from secondary structure content is assigned automatically • Architecture(A) describes

A „periodic table” of protein structures W. Taylor, Nature, 416, 6881, 657 -660 (2002)

a-helical structures ROP: two packed helices 1 rop - RASMOL

Antiparallel four-a-helix bundle 15 o twist of helix axes

The heme binding sites are located between the a-helices

G-protein coupled receptors: antiparallel 7 -helix bundles Bakteriorodopsyna – model teoretyczny Bacteriorhodopsin: theoretical model

Photosynthetic reaction center Bakteriorodopsyna – model teoretyczny 1 prc - RASMOL 1 bac, 1

a-helical structures with the Greek key topology • Tego typu domeny spotykane są przede

DNA-binding proteins 1 hdd - RASMOL 1 lmb - RASMOL

Large a-helical proteins 1 SLYa - RASMOL

Example of a protein with jellyroll topology: Carbohydrate-Binding Module Family 28 from Clostridium josui

Example of a b-barrel (red fluorescent protein; 3 NED)

Example of a b-propellor motif : Thermostable PQQ-dependent Soluble Aldose Sugar Dehydrogenase (3 DAS)

Quaternary structure is the result of subunit associations Fig. 5 -15

1 G 6 U: artificial homodimer (three-helix bundle)

Homo-multimers It is far more common to find copies of the same tertiary domain

Hetero-multimers In this case we see different tertiary domains aggregating together to form a

Sometimes, we find that several domains are found in a single enzyme complex, either

Two (further) steps in the biosynthetic pathway of tryptophan (in S. typhimirium) are catalysed

The biologically active unit is a heterotetramer comprised of 2 a and 2 b

Amyloid fibril formed from the Abeta peptide

Slides: 79

Download presentation

DOMAIN, TERTIARY, AND QUARTERNARY STRUCTURE OF PROTEINS

Levels of protein structure organization

Between secondary and tertiary structure • Supersecondary structure: arrangement of elements of same or different secondary structure into motifs; a motif is usually not stable by itself. • Domains: A domain is an independent unit, usually stable by itself; it can comprise the whole protein or a part of the protein.

Example of a b-hairpin: tryptophan zipper (1 LE 0)

Example of a b-hairpin in bovine pancreatic trypsin inhibitor– BPTI. Example of a protein with two b-hairpins: erabutoxin from whale.

Example of a b-meander: aspectrin SH 3 domain (1 BK 2)

Helix Hairpin

Alpha alpha corner (L 7. 24)

Helix E helix F Troponin C with four EF motifs that bind calcium ions. Because of high content of acidic amino-acid residues with side chains pointing inside the loop, the EF-hand motif constitutes a calciumbinding scaffold in troponin, calmodulin, etc.

The Helix-Turn-Helix motif • This motif is characteristic of proteins binding to the major DNA grove. • The proteins containing this motif recongize palindromic DNA sequences. • The second helix is responsible for nucleotide sequence recognition.

The Helix-Turn-Helix motif

The a-helix-b-hairpin motif (zinc finger)

b-a-b Motif (very important and very frequent) Hydrophobic core between a-helix and b-sheet

The Greek Key Motif

The Greek-key motif as seen in proteins

Domains: classification criteria 1. Functionality (performing a biological function or role in formation and stabilization of globular structure) 2. Solubility: • Globular proteins and protien domains (water solubke) • Membrane proteins and domains (lipid soluble) • Fibrillar protiens (insoluble) 3. Content of secondary structure • aa (parallel and antiparallel) • bb • a/b • a+b • high disulfide-bridge or metal content.

Protein domains There is no consensus method to determine the domains and a structure is often dissected into domains in an arbitrary and intuitive manner. However, the following features apply: - a domain is a potentially independent folding uint, - a domain is a potentially independent structural (and functional) unit but it is connected with the remaining part of a protein covalently, - domain sequence is often conserved and same or similar sequences can be encountered in other proteins with same domain(s), - domains often perform specific functions (e. g. , nucleotide or saccharide binding) - domain interface often functions as active center. A protein molecule can consist of a single or of several domains.

http: //pawsonlab. mshri. on. ca/

Domains: example • For Domains a recent ofform recently review evolved domainproteins insertion. frequently swapping encoded between by exons, reflecting protomers gene is not fusion of • Domains an on important level in are the. Domain hierarchical organisation oftwo the three-dimensional simpler uncommon modules. (for example For example, in the case in theofcase diphtheria of all hepatocyte toxin). growth factors andasplasminogens, structure of globular proteins, although not proteins can be described multidomain a number of kringle domains are present. structures.

Example of division of a protein into domains: human Hsp 70 chaperone

Human cystatin C: domain swapping

Domain identification algorithms Schultz’s method: neighborhood correlation criterion

The Go algorithm: interdomain distances are larger than intradomain distances

The Rose algorithm: based on the deviation of the long axes of the fragments from protein mean plane; works for continuous domains

The Crippen algorithm: based on dissection of residues according to interresidue distances into clusters Ca-Ca distances between secondary structures are represented in the form of average values termed 'proximity indices' and the secondary structural organisation is indicated in the form of dendrograms. An example is shown for the case of calmodulin.

Specific nodes in these dendrograms are identified as tertiary structural clusters of the protein; these include supersecondary structures and domains. A ratio of the average proximity indices (ignoring inter-clusteral distances) to the average of all proximity indices, weighted for the aggregation of small sub-clusters and termed the disjoint factor, is employed as a discriminatory parameter to identify automatically clusters representing individual domains. An example of domains identified in glutathione reducatase is shown below :

The domains identified by this clustering method may not correspond to the functional domains proposed. The "disjoint factor" gives a measure of the extent of interaction between domains and has been used to classify domains into one of the three types, disjoint, interacting and conjoint. Domains are classified as those with sparse inter-domain interfaces (disjoint), intermediate interactions (interacting) and elaborate interfaces (conjoint) based on the magnitude of the disjoint factor. An example of the three types is shown below :

Classification of three-dimensional structures of protein Richardson’s classification a – a-helices are only or dominant secondary-structure elements (e. g. , ferritin, myoglobin) b – b-sheets are only or dominant elements (e. g. , lipocain) a/b – contain strongly interacting helices and sheets a+b – contain weakly interacting or separated helices and sheets

SCOP classification Structural Classification Of Proteins This is a hierarchical classification scheme with the following 4 levels: 1. Families – one family is comprised by proteins related structurally, evolutionally, and functionally. 2. Superfamoilies – A superfamily is comprised by families of substantially related by structure and function. 3. Folds – Superfamilies with common topology of the main portion of the chain. 4. Classes - Groups of folds characterized by secondary structure: a (mainly a-helices), b (mainly b-sheets), a/b (a-helices and bsheets strongly interacting), a+b (a-helices and b-weakly interacting or not interacting), multidomain proteins (nonhomologous proteins with vert diverse folds).

[ http: //scop. mrc-lmb. cam. ac. uk/scop/ ]

Scop Classification Statistics 1. 75 release 38221 PDB Entries (23 Feb 2009). 110800 Domains. 1 Literature Reference (excluding nucleic acids and theoretical models) SCOP: Class Structural Classification of Proteins. Number of folds superfamilies Number of families All alpha proteins 284 507 871 All beta proteins 174 354 742 147 244 803 376 552 1055 66 66 89 58 110 123 90 1195 129 1962 219 3902 Alpha and beta proteins (a/b) Alpha and beta proteins (a+b) Multi-domain proteins Membrane and cell surface proteins Small proteins Total

CATH classification (Class (C), Architecture(A), Topology(T), Homologous superfamily (H)) Four hierarchy levels: 1. Class (Level C): according to the content of secondary structure type a, b, a&b (a/b and a+b), weakly or undefined secondary structure. 2. Architecture. (Level A) – Orientation and connection topology between secondary structure elements. 3. Topology. (Level T) – based on fold type. 4. Homoloous superfamilies. (Level H) – high homology indicating a common anscestor: - >30% sequence identity OR - > 20% sequence identiy and 60% structural homology OR - > 60% structural homology and similar domains have similar function.

• Class(C) derived from secondary structure content is assigned automatically • Architecture(A) describes the gross orientation of secondary structures, independent of connectivity. • Topology(T) clusters structures according to their topological connections and numbers of secondary structures • Homologous superfamily (H) [ http: //www. biochem. ucl. ac. uk/bsm/cath_new/ ]

A „periodic table” of protein structures W. Taylor, Nature, 416, 6881, 657 -660 (2002)

„Menagerie” of known protein folds

a-helical structures ROP: two packed helices 1 rop - RASMOL

Antiparallel four-a-helix bundle 15 o twist of helix axes

Example: ferritin 1 fha - RASMOL

2 mhr - RASMOL 1 fha - RASMOL

The heme binding sites are located between the a-helices

G-protein coupled receptors: antiparallel 7 -helix bundles Bakteriorodopsyna – model teoretyczny Bacteriorhodopsin: theoretical model 1 bac, 1 brd - RASMOL Each helix is about 20 residue long

Photosynthetic reaction center Bakteriorodopsyna – model teoretyczny 1 prc - RASMOL 1 bac, 1 brd - RASMOL

a-helical structures with the Greek key topology • Tego typu domeny spotykane są przede wszystkim w białkach globinowych. • Zbudowane są z dwóch warstw a-helis. • Kierunki helis obu wartsw są często prawie prostopadłe (upakowanie helisy metodą ortogonalną) • Domena przypomina nieco cylinder utworzony z helis skręconych w stosunku do jego osi o kąt od 0 do 45 o • Najczęściej spotykanym typem połączeń między helisami jest sekwencja +3, -1, -1, spotykana w motywie klucza greckiego 1 mba - RASMOL

DNA-binding proteins 1 hdd - RASMOL 1 lmb - RASMOL

Large a-helical proteins 1 SLYa - RASMOL

a/b horseshoe

The jellyroll topology

Example of a protein with jellyroll topology: Carbohydrate-Binding Module Family 28 from Clostridium josui Cel 5 A (3 ACI)

Example of a b-barrel (red fluorescent protein; 3 NED)

Example of a b-propellor motif : Thermostable PQQ-dependent Soluble Aldose Sugar Dehydrogenase (3 DAS)

The b-helix

Quaternary structure is the result of subunit associations Fig. 5 -15

1 RLZ (leucine zipper)

1 G 6 U: artificial homodimer (three-helix bundle)

Homo-multimers It is far more common to find copies of the same tertiary domain associating non-covalently. Such complexes are usually, though not always symmetrical. Because proteins are inherently asymmetrical objects, the multimers almost always exhibit rotational symmetry about one or more axes. The majority of the enzymes of the metabolic pathways seem to aggregate in this way, forming dimers, trimers, tetramers, pentamers, hexamers, octamers, decamers, dodecamers, (or even tetradecamers in the case of the chaperonin Gro. EL).

Hetero-multimers In this case we see different tertiary domains aggregating together to form a unit. The photoreaction centre is a good example.

Sometimes, we find that several domains are found in a single enzyme complex, either in a single polypeptide chain, or as an association of separate chains. Often the domains have related functions, for instance, where one domain will be responsible for binding, another for regulation, and a third for enzymatic activity. Cellobiohydrolase provides anexample of such a protein. It is not uncommon to find more than once the same chain in a protein complex. A good example is the F-1 ATPase.

Two (further) steps in the biosynthetic pathway of tryptophan (in S. typhimirium) are catalysed by tryptophan synthase which consists of two separate chains, designated a and b, each of which is effectively a distinct enzyme.

The biologically active unit is a heterotetramer comprised of 2 a and 2 b units. We sometimes find slightly different versions of the same protein associating. Thus, haemoglobin has both an A-chain and a B-chain, which come together to form a hetero-dimer. Two copies of this then associate to form the normal haemoglobin tetramer. Which is equivalent to an A-dimer associating with a B-dimer. Also, it can happen that two different chains associate to form a bigger secondary structure. It is the case of the pea lectin, where a very large b-sheet is nade out of strands coming from different protein chains:

Amyloid fibril formed from the Abeta peptide