Sequence VEQCCTSICSLYQL Determines 3 Dstructure Determines Function Glucose
- Slides: 99
. . . και μια φυσική προέκτασή του. . . Sequence VEQCCTSICSLYQL Determines 3 D-structure Determines Function • Glucose Uptake Pathway • Glycogen Synthesis Pathway • Formation of triglycerides
Fasta Format >gi|29848|emb|X 61622. 1|HSCDK 2 MR H. sapiens CDK 2 m. RNA ATGGAGAACTTCCAAAAGGTGGAAAAGATCGGAGAGGGCACGTACGGAGTTGTGTACAAAGCCAGAAACA AGTTGACGGGAGAGGTGGTGGCGCTTAAGAAAATCCGCCTGGACACTGAGGGTGTGCCCAGTAC TGCCATCCGAGAGATCTCTCTGCTTAAGGAGCTTAACCATCCTAATATTGTCAAGCTGCTGGATGTCATT CACACAGAAAATAAACTCTACCTGGTTTTTGAATTTCTGCACCAAGATCTCAAGAAATTCATGGATGCCT CTGCTCTCACTGGCATTCCTCTTCCCCTCATCAAGAGCTATCTGTTCCAGCTGCTCCAGGGCCTAGCTTT CTGCCATTCTCATCGGGTCCTCCACCGAGACCTTAAACCTCAGAATCTGCTTATTAACACAGAGGGGGCC ATCAAGCTAGCAGACTTTGGACTAGCCAGAGCTTTTGGAGTCCCTGTTCGTACTTACACCCATGAGGTGG TGACCCTGTGGTACCGAGCTCCTGAAATCCTCCTGGGCTCGAAATATTATTCCACAGCTGTGGACATCTG GAGCCTGGGCTGCATCTTTGCTGAGATGGTGACTCGCCGGGCCCTGTTCCCTGGAGATTCTGAGATTGAC CAGCTCTTCCGGATCTTTCGGACTCTGGGGACCCCAGATGAGGTGGTGTGGCCAGGAGTTACTTCTATGC CTGATTACAAGCCAAGTTTCCCCAAGTGGGCCCGGCAAGATTTTAGTAAAGTTGTACCTCCCCTGGATGA AGATGGACGGAGCTTGTTATCGCAAATGCTGCACTACGACCCTAACAAGCGGATTTCGGCCAAGGCAGCC CTGGCTCACCCTTTCTTCCAGGATGTGACCAAGCCAGTACCCCATCTTCGACTCTGATAGCCTTCTTGAA GCCCCCGACCCTAATCGGCTCACCCTCTCCTCCAGTGTGGGCTTGACCAGCTTGGCCTTGGGCTATTTGG ACTCAGGTGGGCCCTCTGAACTTGCCTTAAACACTCACCTTCTAGTCTTAACCAGCCAACTCTGGGAATA CAGGGGTGAAAGGGGGGAACCAGTGAAAATGAAAGGAAGTTTCAGTATTAGATGCACTTAAGTTAGCCTC CACCACCCTTTCCCCCTTCTCTTAGTTATTGCTGAAGAGGGTTGGTATAAAAATAATTTTAAAAAAGCCT TCCTACACGTTAGATTTGCCGTACCAATCTCTGAATGCCCCATAATTATTATTTCCAGTGTTTGGGATGA CCAGGATCCCAAGCCTCCTGCTGCCACAATGTTTATAAAGGCCAAATGATAGCGGGGGCTAAGTTGGTGC TTTTGAGAATTAAGTAAAACCACTGGGAGGAGTCTATTTTAAAGAATTCGGTTAAAAAATAGATC CAATCAGTTTATACCCTAGTGTTTTCCTCACCTAATAGGCTGGGAGACTGAAGACTCAGCCCGGGT GGGGGT
PIR/NBRF Format >P 1; CRAB_ANAPL ALPHA CRYSTALLIN B CHAIN CRYSTALLIN). MDITIHNPLI RRPLFSWLAP SRIFDQIFGE SPSLSPFLMR SPIFRMPSWL ETGLSEMRLE KHFSPEELKV KVLGDMVEIH GKHEERQDEH YRIPADVDPL TITSSLSLDG VLTVSAPRKQ TREEKPAIAG AQRK* (ALPHA(B)HLQESELLPA KDKFSVNLDV GFIAREFNRK SDVPERSIPI
Protein Families src-like protein tyrosine kinase - 5 in Drosophila proteome 38 tyrosine kinases 43 SH 2 domain containing 110 SH 3 domain containing
Local Similarity vav src 42 csw
Regular Expressions PROSITE Syntax: [RK]-G-{EDRKHPCG}-[AGSCI]-[FY]-[LIVA]-x-[FYM] Regular Expression: [RK] G[^EDRKHPCG] [AGSCI] [FY] [LIVA]. [FYM]
Motifs, Profiles και Patterns σε πολλαπλές στοιχίσεις PROSITE Syntax: P-A-[FW]-X-[YW]-[LV]-S-C-X(3)-[WYH]-Q-X(1 -7)-[EQ]-G-H-Y Regular Expression: PA[FW]. [YW][LV]SC. {3}[WYH]Q. {1, 7}[EQ]GHY
EGF domain –C-x-C-x(5)-G-x(2)-C
Sequence profiles are a condensed representation of multiple alignments master sequence HBA_human HBB_human MYG_phyca LGB 2_luplu GLB 1_glydi Each column of the profile pj(a) contains the amino acid frequencies in the multiple sequence alignment A C D E F G H I K L M N P Q R S T V W Y . . . . W W W G G G K E K K K D E V V V F I . . . 0. . . 0. . . 1. 0. . . 0 0 0. 2 0 0. 6 0 0 0. 2 0 0 0 0. 2 0 0 0. 6 0 0 0 0. 2 0 0 0 0. 6 0 0 G E N A A G 0. 25 0. 75 0 0 0. 25 0 0 0 0 0 0 A D H N D N N A V V I G G D A P A E E G K G 0 0 0. 2 0 0 0. 6 0 0 0 0 0. 2 0 0 0 0 0 0. 4 0 0 0. 2 0 0 0 . . . . .
Group photo of the participants at the Protein Bioinformatics and Community Resources Retreat. The name of each participant is followed by the short name of their protein resource or resources in parentheses. Back row: David Landsman (Histone database), Dan Haft (TIGRFAMS), Bernard Henrissat (CAZy), Rob Finn (Inter. Pro and Pfam), David Craik (Cono. Server and Cy. BASE), Arnaud Chatonnet (ESTHER), Neil Rawlings (MEROPS); Middle row: Amos Bairoch (ne. Xt. Prot), Gerard Manning (Kinase. com), Michael Spedding (IUPHAR), Gert Vriend (GPCRDB), Milton Saier (TCDB), Pantelis Bagos (OMPdb); Front row: Narayanaswamy Srinivasan (Kin. G), Ramanathan Sowdhamini (PASS 2), Alex Bateman (Pfam & Uni. Prot), Patsy Babbitt (SFLD), Kim Pruitt (Ref. Seq), Claire O’Donovan (Uni. Prot), Gemma Holliday (MACi. E), Nozomi Nagano (Ez. Cat. DB).
Best practices (Gert Vriend) 1. Longevity - The one rule to rule them all. Gert asks that unless you can maintain your database for at least 10 years, then do not start. 2. Users - All databases need users and citations. To gain and keep users, you need to provide query and browsing interfaces as well as someone who answers emails. 3. Befriend Nucleic Acids Research and DATABASE journals - The descriptions of your database are essential to inform new users. But it is also essential to target publications to the readership. 4. Collaborate - Your collaborators may offer an exit strategy in the future. 4 a. Be open - Nobody is going to steal your resource. 5. Give credit - There is more than 100% to go around. 6. Automate - Too much manual intervention makes for an unsustainable database leading to premature death. You need to automate roughly 90% of everything every year. 7. No new standards – Don’t invent a new standard. Use what exists. 8. Keep it simple - Google is a model interface. 9. Visibility - Be at the right conferences and be recognizable. Use the same logo and present a poster. 10. Exit strategy - At some point you will retire. Start planning early to ensure your database continues.
RECEPTOR CLASS G PROTEIN CLASS EFFECTOR FAMILY RECEPTOR FAMILY Margarita C Theodoropoulou, Pantelis G Bagos, Ioannis C Spyropoulos and Stavros J RECEPTOR SUBFAMILY Hamodrakas. "gp. DB: A database of GPCRs, G-proteins, Effectors and their interactions. " Bioinformatics. 2008 Jun 15; 24(12): 1471 -2. RECEPTOR TYPE • A publicly accessible, relational database of G PROTEIN FAMILY EFFECTOR SUBFAMILY G PROTEIN SUBFAMILY EFFECTOR TYPE G PROTEIN TYPE G-proteins and their interactions with GPCRs and effector molecules ORGANISM • gp. DB currently contains data concerning 391 G-proteins, 2738 GPCRs with known coupling preference and 1390 effectors, knowing to interact with specific G-proteins. • Classification according to a hierarchy of different classes, families, subfamilies and types, based on extensive literature search • The relational model of the database describes the known coupling specificity of the GPCRs to their respective alpha subunit of G-proteins and, also, the interaction between G-protein subfamilies and specific effector types, a unique feature not available in any other database • Full sequence information with cross-references to publicly available databases • Advanced text search, BLAST search against the database and a pattern search tool. • Approx. 50 unique visitors per month Availability: http: //bioinformatics. biol. uoa. gr/gp. DB/
Availability: http: //bioinformatics. biol. uoa. gr/Ex. Topo. DB/ • Experimental information collected from studies in the literature that report the use of biochemical methods. • Topological models of alpha-helical transmembrane proteins. • 2143 transmembrane proteins from 1833 studies. • Topological information is combined with transmembrane topology prediction (constrained predictions using HMM-TM) resulting in more reliable topological models. • Signal peptide annotation using Signal. P. • Interface that allows user-defined constrained topology prediction using HMM-TM • Blast Search against Ex. Topo. DB. Tsaousis G. N. , Tsirigos K. D. , Andrianou X. D. , Liakopoulos T. D. , Bagos P. G. , Hamodrakas S. J. Ex. Topo. DB: A database of experimentally derived topological models of transmembrane proteins. 2010, Bioinformatics, 26(19): 2490– 2492.
Availability: http: //www. ompdb. org • The biggest collection of beta barrel proteins currently available. • Started off with 85 families and 70, 000 protein sequences and currently contains 91 families and more than 400, 000 proteins. • Out of the 91 families, 15 families were built completely from scratch, 16 do not belong to the respective clan of Pfam, while 6 of them are annotated as DUF in Pfam • Each family entry contains extensive information (function of protein members, literature references, list of proteins with 3 D-structure, seed and full protein alignments) • Each database entry contains the following fields: OMPdb name, OMPdb id, Uniprot accession number, protein description and classification, sequence, species, organism name, taxonomy, links to other databases, accompanied with annotation for TM segments and signal peptides. • OMPdb follows the monthly updates of Uniprot through an semi-automated procedure. • Domain and Blast Search against OMPdb is available. • The database can be downloaded in several formats (text, FASTA, XML) through the Download page. • Approx. 350 unique visitors per month Tsirigos K. D, Bagos P. G. , Hamodrakas S. J. OMPdb: a database of β-barrel outer membrane proteins from Gramnegative bacteria. 2011, Nucleic Acids Research, 39 (Database Issue): 324– 331.
The two structural classes α-helical membrane proteins β-barrel membrane proteins
Gram-negative bacteria
Variety of structures…
Variety of structures…
and functions… • Specific and non-specific channels (porins) • Receptors for passive and active intake (Ton. B-dependent receptors, Fad. L, Tsx etc) • Adhesion molecules (Omp. X, Nsp. A, Opc. A) • Structural proteins-interactions with peptidoglycan (Omp. A) • Outer membrane enzymes (Omp. T, Omp. LA, Pag. P, Pag. L) • Protein secretion in nearly all secretory pathways (Secretins, Ushers, autotransporters, TPS etc) • Folding and assembly of membrane proteins (Omp 85/Sam 50) • Assembly of the outer membrane-LPS delivery (Imp/Ost. A)
Representative TM beta-barrels of known structure Protein name function Number of strands PDB code PFAM code Organism Omp. A Structural protein 8 1 QJP PF 01389 Escherichia coli Omp. X Adhesion 8 1 QJ 8 PF 06316 Escherichia coli Nsp. A Adhesion 8 1 P 4 T PF 02462 Neisseria Meningitidis Pag. P Enzyme 8 1 MM 4 PF 07017 Escherichia coli Pag. L Enzyme 8 2 ERV PB 038312 * Pseudomonas aeruginosa Omp. W General Porin 8 2 F 1 T PF 03922 Escherichia coli Omp. T Enzyme 10 1 I 78 PF 01278 Escherichia coli Opc. A Adhesion 10 1 K 24 PF 07239 Neisseria Meningitidis Omp. LA Enzyme 12 1 QD 5 PF 02253 Escherichia coli Nal. P Autotransporter 12 1 UYN PF 03797 Neisseria Meningitidis Tsx Transporter 12 1 TLY PF 03502 Escherichia coli Omp. G General Porin 14 2 F 1 C PB 051875 * Escherichia coli Fad. L Transporter 14 1 T 1 L PF 03349 Escherichia coli Opr. P General Porin 16 2 O 4 V PF 07396 Pseudomonas aeruginosa Omp. F General Porin 16 2 OMF PF 00267 Escherichia coli Fha. C Transporter (TPS) 16 2 QDZ PF 03865 Bordetella pertussis Porin General Porin 16 2 POR PB 028487 * Rhodobacter capsulatus Maltoporin Specific Porin 18 2 MPR PF 02264 Salmonella typhimurium Fep. A Ton. B-dependent Receptor 22 1 FEP PF 00593 Escherichia coli Bagos PG, Hamodrakas SJ. 2007, submitted
Domain Organization Concerning predictions, two issues are of importance: -Topology prediction -Discrimination
int. C/ invasin SP Lys. M Omp 85 SP P Pap. C SP Cop. B SP secretin SP Bcs. C SP Nfr. A SP Fom. A SP Som. A SP Oms 66/ omp 66 Asp 55/ 62 β-barrel P P P Ig-like C-type β-barrel TPR 1 TPR 2 TPR 1 β-barrel SLH β-barrel SP β-barrel
a) single-domain OMPs or b) multi-domain OMPs beta-barrel domain database search (BLAST) Until no new family members are found multiple alignment (Clustal. W) profile HMM (HMMER)
SRS (Sequence Retrieval System)
- Csf calculation
- Why are carbohydrates monomers drawn as a hexagon?
- God pod method normal range
- アルドヘキソース
- Ch2cl2 lewis structure
- Nucleotides of rna
- Sequence pseudocode example
- Differentiate finite sequence and infinite sequence
- Convolutional sequence to sequence learning.
- What determines baby's blood type
- What are energy roles
- Diction definition poem
- A free map determines which blocks are free, allocated.
- The three kinds of temperate marine climates all have
- The narrator determines
- Write a matrix equation that determines the loop currents
- A cpu scheduling algorithm determines an order
- This determines the heaviness of a mineral
- What did mendel conclude determines biological inheritance
- Clean price formula
- The device handler determines dash
- Red planet candy
- What is equal area criteria
- Language determines the way we think
- Different climates around the world
- Winn equation
- Are geometric sequences exponential
- A sequence is a function whose domain is
- A named sequence of statements is known as
- Sequence and series cengage
- A sequence is a function whose domain is
- Geometric sequence function
- Glucose breakdown
- Non reducing sugar examples
- Stimulus response
- Glucose indicator
- Selivanov test
- Glucose+oxygen
- Metabolism of glucose in erythrocytes
- Enzymatic methods for glucose determination
- Bial's test reagent
- Barfoed's test reagent
- Qualitative analysis of glucose
- Glucose found in
- Transport actif secondaire
- Glucose fructose galactose mannose
- Glyconeogenesis vs gluconeogenesis
- Galactose and glucose
- Fates of glucose 6 phosphate
- Oxidase test principle
- 2na + glucose/18
- Fates of glucose 6 phosphate
- Asparagine to oxaloacetate
- Glycogenolysis pathway
- Youtube
- Uridine diphosphate glucose
- Glucose introduction
- Fasting plasma glucose concentration
- Glycerol to glucose
- Foods that has glucose
- Synthesis of glucose from non-carbohydrate sources
- Glycerol to glucose
- Succinly
- Methyl red test
- Galactose and glucose
- Glucose sesselform
- 2na + glucose/18
- Pharynx
- Haworth in sesselkonformation
- Functions of carbohydrates
- Glucose synthesis from non-carbohydrate sources
- Csf interpretation
- Substrate of cellular respiration
- Chapter 6 cell energy photosynthesis and respiration
- Glucose cyclic structure
- Dextrorotatory and levorotatory
- L-glucose
- Glucose in urine causes
- Glucose alanine cycle
- Yeast respiration
- Glycosidic bond in glycogen
- Gtt graph
- Principle of molisch test
- Define molecular formula
- Glucose clinical significance
- Glycogen degradation
- Vant hoff factor of glucose
- Chu trình glucose alanin
- Water and glucose
- Bloodborne glucose
- Why is glucose polar
- Zuckerstammbaum
- Clearsafe blood control slide
- Etape glycolyse
- Transport membranaire du glucose
- Inside one osmosis bag
- Glucose cellular respiration
- When glucose spills over into the urine, it has
- Clearsafe blood control slide
- Glucose preparation