RNA Secondary Structure Prediction RNA role in organisms

  • Slides: 48
Download presentation
RNA – Secondary Structure Prediction

RNA – Secondary Structure Prediction

RNA role in organisms • The central dogma of biology: DNA RNA Protein Non

RNA role in organisms • The central dogma of biology: DNA RNA Protein Non Coding RNA • RNA as a catalyst: – Discovered in the early 1980’s.

RNA as a catalyst

RNA as a catalyst

RNA as a catalyst By en: User: Paul Atomic structure of the 50 S

RNA as a catalyst By en: User: Paul Atomic structure of the 50 S Subunit from Haloarcula marismortui. Proteins are shown in blue and the two RNA strands in orange and yellow By David S. Goodsell, CC BY 4. 0

RNA as a catalyst In eukaryotes the spliceosome performs the splicing reactions essential for

RNA as a catalyst In eukaryotes the spliceosome performs the splicing reactions essential for removing intron sequences, this process is required for the formation of mature m. RNA.

RNA structure • Primary Structure • Secondary Structure • Tertiary Structure Function

RNA structure • Primary Structure • Secondary Structure • Tertiary Structure Function

RNA secondary structure

RNA secondary structure

Terminology

Terminology

Nested base pairs • nested pseudoknots • Most base pairs in nature are nested.

Nested base pairs • nested pseudoknots • Most base pairs in nature are nested. • When non-nested base pairs occur, they are called pseudoknots.

Sequence constrained by structure • It is relatively common to find examples of RNAs

Sequence constrained by structure • It is relatively common to find examples of RNAs with a similar structure but without significant similarity. • It would be advantageous to be able to search for conserved structure in addition to sequence when searching similar RNAs.

Sequence constrained by structure • An example of a conserved structure:

Sequence constrained by structure • An example of a conserved structure:

Nussinov Jacobson Algorithm • Ruth Nussinov proposed the first dynamic programming approach for RNA

Nussinov Jacobson Algorithm • Ruth Nussinov proposed the first dynamic programming approach for RNA secondary structure prediction. • She is a Professor in the Department of Human Genetics, School of Medicine at Tel Aviv University.

Nussinov Jacobson Algorithm •

Nussinov Jacobson Algorithm •

Nussinov Jacobson Algorithm The problem: • Find the secondary structure with the maximum number

Nussinov Jacobson Algorithm The problem: • Find the secondary structure with the maximum number of paired bases Main idea: • Dynamic programming - the optimal solution to a certain sequence will be derived from the optimal solution of the sub-sequences

Nussinov Jacobson Algorithm • In order to solve our problem with DP, some assumptions

Nussinov Jacobson Algorithm • In order to solve our problem with DP, some assumptions must be made: • The assumptions: – No pseudoknots. – All base pairs are of equal energy. • Is this assumption necessary?

Nussinov Jacobson Algorithm •

Nussinov Jacobson Algorithm •

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • Or, in other words:

Nussinov Jacobson Algorithm • bifurcation

Nussinov Jacobson Algorithm • bifurcation

Nussinov Jacobson Algorithm Initialization: j i Image Source: Durbin et al. (2002) “Biological Sequence

Nussinov Jacobson Algorithm Initialization: j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm Initialization: j i Image Source: Durbin et al. (2002) “Biological Sequence

Nussinov Jacobson Algorithm Initialization: j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm Initialization: j i Image Source: Durbin et al. (2002) “Biological Sequence

Nussinov Jacobson Algorithm Initialization: j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Nussinov Jacobson Algorithm j i Image Source: Durbin et al. (2002) “Biological Sequence Analysis”

Traceback •

Traceback •

Traceback •

Traceback •

Retrieving the Structure PAIRS STACK (1, 9) j i Slide taken from Andrew Hendriks,

Retrieving the Structure PAIRS STACK (1, 9) j i Slide taken from Andrew Hendriks, CMPT 889, Selected Topics in Bioinformatics CURRENT

Retrieving the Structure PAIRS j i Slide taken from Andrew Hendriks, CMPT 889, Selected

Retrieving the Structure PAIRS j i Slide taken from Andrew Hendriks, CMPT 889, Selected Topics in Bioinformatics STACK CURRENT (2, 9) (1, 9)

Retrieving the Structure G PAIRS STACK CURRENT (2, 9) (3, 8) (2, 9) C

Retrieving the Structure G PAIRS STACK CURRENT (2, 9) (3, 8) (2, 9) C G j i Slide taken from Andrew Hendriks, CMPT 889, Selected Topics in Bioinformatics

Retrieving the Structure G G C C PAIRS STACK CURRENT (2, 9) (4, 7)

Retrieving the Structure G G C C PAIRS STACK CURRENT (2, 9) (4, 7) (3, 8) G j i Slide taken from Andrew Hendriks, CMPT 889, Selected Topics in Bioinformatics

Retrieving the Structure A G G U C C PAIRS STACK CURRENT (2, 9)

Retrieving the Structure A G G U C C PAIRS STACK CURRENT (2, 9) (5, 6) (4, 7) (3, 8) G (4, 7) j i Slide taken from Andrew Hendriks, CMPT 889, Selected Topics in Bioinformatics

Retrieving the Structure A A G G U C C PAIRS STACK CURRENT (2,

Retrieving the Structure A A G G U C C PAIRS STACK CURRENT (2, 9) (6, 6) (5, 6) (3, 8) G (4, 7) j i Slide taken from Andrew Hendriks, CMPT 889, Selected Topics in Bioinformatics

Retrieving the Structure A A A G G U C C PAIRS STACK CURRENT

Retrieving the Structure A A A G G U C C PAIRS STACK CURRENT (2, 9) - (6, 6) (3, 8) G (4, 7) j i Slide taken from Andrew Hendriks, CMPT 889, Selected Topics in Bioinformatics

Retrieving the Structure A A A G G G j i Slide taken from

Retrieving the Structure A A A G G G j i Slide taken from Andrew Hendriks, CMPT 889, Selected Topics in Bioinformatics U C C

Nussinov Jacobson Algorithm •

Nussinov Jacobson Algorithm •

Drawbacks • Doesn’t allow wobble base pairs like G-U • Doesn’t consider stacking: A-U

Drawbacks • Doesn’t allow wobble base pairs like G-U • Doesn’t consider stacking: A-U A-U U-A A-U • Hair pin sizes not considered • More advanced algorithms consider energy minimization, to solve these issues.

Free Energy (ΔG) • ΔG approximated as the sum of contributions from loops, base

Free Energy (ΔG) • ΔG approximated as the sum of contributions from loops, base pairs and other secondary structures U U 4 nt loop +5. 9 A A G G 1 nt bulge +3. 3 -1. 1 terminal mismatch of hairpin -2. 9 stack C C -2. 9 stack (special case of 1 nt bulge) A G U A C A 5′ dangle -0. 3 unstructured single strand 0. 0 A A 5′ Image Source: Durbin et al. (2002) “Biological Sequence Analysis” C A U G U -1. 8 stack -0. 9 stack -1. 8 stack -2. 1 stack 3′