GENETICS genetic mapping classical approaches to study gene

  • Slides: 64
Download presentation
GENETICS genetic mapping, classical approaches to study gene function

GENETICS genetic mapping, classical approaches to study gene function

Basic aims: • uncovering gene function understand mechanisms of morphogenesis, development, metabolism, physiology etc.

Basic aims: • uncovering gene function understand mechanisms of morphogenesis, development, metabolism, physiology etc. in connection with coordinated gene expression • breeding production of plants (organisms) with improved characteristics or their combination

Terminology Gene • segment of DNA (genetic information) that specifies a trait (is a

Terminology Gene • segment of DNA (genetic information) that specifies a trait (is a predisposition for a trait) • basic unit of heridity in living organisms • Genotype + environment + ? = phenotype • Interactions between genes/proteins (epistases – mainly metabolic and signal pathways)

Allele – form of a gene • dominant vs. recesive (codominant, …) • genesis

Allele – form of a gene • dominant vs. recesive (codominant, …) • genesis of new alleles by mutations Locus – location of a gene on a chromosome • Genetic linkage – inheriting of certain genes (their alleles) jointly, - consequence of their presence on the same chromosome (gene distance c. M = % of recombinant gametes) HOW MANY alleles of certain gene is in the genome? of alleles = from 1 up to N = ploidy (* of loci with the gene)

Genetic (linkage) and physical maps differ Physical - sequence, genes …(bp) Genetic – based

Genetic (linkage) and physical maps differ Physical - sequence, genes …(bp) Genetic – based on frequency of recombination (c. M) - varying likelyhood of recombination What sequences are recombined with low probability?

Genetics classical (direct) x reverse Direct – from a trait (phenotype) to identification of

Genetics classical (direct) x reverse Direct – from a trait (phenotype) to identification of the corresponding gene Reverse – from a gene (prediction based on the sequence) to the gene’s function (phenotype) Both approaches are based on mutant analysis: Direct g. – looking for certain change in phenotype in the mutagenized Reverse g. – population (and analysing the genetic courses) targeted mutagenesis/modification of a selected gene (and analyzing the phenotypic consequences)

Reporter genes • encode proteins that can be directly visualized or enzymes, whose activity

Reporter genes • encode proteins that can be directly visualized or enzymes, whose activity can be visualized • quantitative or qualitative assessment • promoter activity analyses, subcellular localization of proteins, optimizing transformation procedure, …. Constrains: - background (autofluorescence, natural enzyme activity within the tissue) - protein stability (mask changes in promoter activity) - …. .

Reporter genes (the most frequently used) gene product substrate detection gus. A b-glucuronidase (E.

Reporter genes (the most frequently used) gene product substrate detection gus. A b-glucuronidase (E. coli) MUG X-gluc luciferin xxx fluorescence histochemical luminiscence luc gfp luciferase (fire fly) green fluorescent protein fluorescence (gelly fish)

Fluorescent proteins: GFP, Ds. Red, m. Cherry, Eos. FP - unique tools for in

Fluorescent proteins: GFP, Ds. Red, m. Cherry, Eos. FP - unique tools for in vivo labelling - encoded by small genes Aequoria victoria - origin: sea Coelenterata (corals, gelly fish) - modified forms: - fluorescent features, p. H sensitivity - codon usage, splicing, stability, … Colour variants of GFP and Ds. Red Eos. FP – photoconvertable fluorescent protein (green to red FP)

GUS - glucuronidase Qualitative detection (X-gluc = 5 -bromo-4 -chloro-3 -indolyl glucuronide) • oxidized

GUS - glucuronidase Qualitative detection (X-gluc = 5 -bromo-4 -chloro-3 -indolyl glucuronide) • oxidized blue precipitate of the reaction product • low background!!! • slow diffusion • mostly in the fixed material Quantitative detection • GUS enzyme isolation, fluorimetric statement • highly sensitive, low background (MUG = 4 -methylumbelliferyl-beta-D-glucuronide)

Mutagenesis • Classical: – chemical m. – EMS (ethane metyl sulfonate; point mutations) –

Mutagenesis • Classical: – chemical m. – EMS (ethane metyl sulfonate; point mutations) – physical m. – RTG, gama. . . (usually short deletions) – wide spektrum of affects (regulation, interaction) – even dominant mutations, resemble natural mutations, – BUT complicated identification of mutated genes

Identification of mutated gene (responsible for the mutation) Point mutations, short deletions 1) Based

Identification of mutated gene (responsible for the mutation) Point mutations, short deletions 1) Based on genetic map and segregation analysis + chromosom walking, sequencing (time, money and labor-consuming) 2) Using NGS (next generation seq. ) - faster, moderate expensive - even in unknown genomes!!! - mixed samples (back crosses) - comparisons of frequencies of similar oligomers Nordström et al. Nature Biotech. 2013 < > <

Mutagenesis • More advanced - insertional mutagenesis – random insertions in the genome! –

Mutagenesis • More advanced - insertional mutagenesis – random insertions in the genome! – by insertion of T-DNA of Agrobacterium tumefaciens (transposons) – allows simple determination of the site of insertion mutations are tagged with the inserted sequence – various strategies for gene isolation – advanced variants of insertional mutagenesis: • activation mutagenesis • promoter, enhancer trap, …

Insertional mutagenesis – knock-out gene (native form) inserted DNA Simple insertional mutagenesis – phenotype

Insertional mutagenesis – knock-out gene (native form) inserted DNA Simple insertional mutagenesis – phenotype analysis in T 2 - insertion into the coding region - early termination of translation (truncated = usually non-functional protein) - usually recessive allele

Identification of mutated gene Insertional mutagenesis: • sequencing of the flanking region (too low

Identification of mutated gene Insertional mutagenesis: • sequencing of the flanking region (too low template concentration for direct Sanger sequencing!) Technical solution (preamplification of the target locus): • • TAIL PCR (Thermal Asymmetric Inter. Laced PCR) adaptor PCR plasmid rescue i. PCR

TAIL PCR: SP 1 SP 2 SP 3 AP SP 1 AP SP 2

TAIL PCR: SP 1 SP 2 SP 3 AP SP 1 AP SP 2 AP AP AP SP 3 AP 1. three PCR (optimized Ta) with specific primer SP 1 -3 + certain AP 2. product sequencing SP 1 -3: complementary to inserted DNA AP: arbitrary (degenerated) primer - several universal types, high P of anealing near insertion Adaptor PCR: E E SP 1 SP 2 SP 3 SAP 1. cleavage (restriction endon. , E) 2. ligation of adaptors 3. 2 -3 PCR (spec. adapt. primer + spec. primers complementary to inserted DNA) 4. product sequencing

Plasmid rescue: ori bla/npt. III E E ori bla/npt. III E plasmid 1. 2.

Plasmid rescue: ori bla/npt. III E E ori bla/npt. III E plasmid 1. 2. 3. 4. 5. cleavage (E) circularization (ligation) transformation E. coli (ori, R) multiplication in bacteria sequencing Inverse PCR: E E E 1. 2. 3. 4. E cleavage (E) circularization (ligation) PCR (alt. nested) sequencing

Mutagenesis Most advanced genome editing = targeted mutagenesis within the genome: ZFN all causing

Mutagenesis Most advanced genome editing = targeted mutagenesis within the genome: ZFN all causing double stranded break TALEN in DNA CRISPR/Cas 9 Mutation: - usually inactivation (NHEJ – non homology end joining – random local mutations) - specific change of the sequence - driven by an oligonucleotide serving as a recombination template) - using modified CAS 9 (e. g. fused with C deaminase)

Genome editing tools: ZFN, TALEN • cleavage in selected sequence by fusion proteins: -

Genome editing tools: ZFN, TALEN • cleavage in selected sequence by fusion proteins: - DNA binding domain - nuclease domain Fok 1 • designable DNA binding domains: - Zn-finger (ZF) - TALE (transcription activatorlike effector from Xanthomonas) CRISPR/Cas 9 – cleavage by Cas 9 nuclease – targeted by guide RNA

Use of mutagenesis in genetics Direct: - classical mutagenesis - insertional mutagenesis (all types

Use of mutagenesis in genetics Direct: - classical mutagenesis - insertional mutagenesis (all types – see later) Reverse: - directed mutagenesis - also random insertional mutagenesis 1) generation + characterization of random insertional mutants 2) making a collection 3) ordering of mutant in selected gene from the collection

Approaches of reverse genetics - analyses of mutants with insertion in an analyzed gene

Approaches of reverse genetics - analyses of mutants with insertion in an analyzed gene - modulation of an analyzed gene expression - promoter activity analyses - analyses of subcellular protein localization (GFP fusion) - gene modification (within/outside the genome) - TILLING, …. . - analysis of the gene polymorphism in various ecotypes (1000+1 genome project)

RG 1: Analyses of insertion mutants (usually knock-out) - publicly available (Arabidopsis, rice, …)

RG 1: Analyses of insertion mutants (usually knock-out) - publicly available (Arabidopsis, rice, …) - insertions in different sites in the genome 80 % genes (A. th. ) for inactivation – optimally 5’ exons, experiment. confirmation is needed!) – mutant selection in silico, ordering seeds from the collection Gene 1 1 Gene 2 23 4 Gene 3 56 7 = sites of T-DNA insertions in individual lines (1 -8) 8 …line number

WWW interphase Genes Insertions http: //signal. salk. edu/cgi-bin/tdnaexpress

WWW interphase Genes Insertions http: //signal. salk. edu/cgi-bin/tdnaexpress

RG 2: Modulation of gene expression - increased m. RNA/protein level (overexpression) - introduction

RG 2: Modulation of gene expression - increased m. RNA/protein level (overexpression) - introduction of a gene with a strong constitutive promoter (e. g. viral) - syn. gain-of-function mutations, ectopic expr. - decreased m. RNA/protein level by RNAi - silencing, knock-down - X knock–out, loss-of-function = complete inactivation by disruption of the gene

Decreasing expression level (TGS) Formation of si. RNA against promoter: Introduction of promoter inverted

Decreasing expression level (TGS) Formation of si. RNA against promoter: Introduction of promoter inverted repeat (IR) - „haipin transcript“ IR (DNA) RNA hairpin intramolecular base-pairing A gene to be silenced: x Promoter methylation (stop of transcription) m. RNA ds. RNA formation of si. RNA

Decreasing expression level (PTGS) A gene to be m. RNA silenced: formation of ds.

Decreasing expression level (PTGS) A gene to be m. RNA silenced: formation of ds. RNA formation of si. RNAs m. RNA slicing 1) expression of the gene in antisense orientation (in reverse) 2) expression of a hairpin construct (i. e. antisense-intron-sense) 3) expression of a gene without terminator (unterminated - ds. RNA by RDR 6) 4) ami. RNA – complex tertiary haipin RNA producing one artificial mi. RNA antisense RNA intermolecular pairing IR hairpin intramolecular pairing Rd. RP without terminator synthesis of complementary strand to the transcript by RDR

RG 3: Promoter analysis Fusion of an analyzed promoter with a reporter gene (transcription

RG 3: Promoter analysis Fusion of an analyzed promoter with a reporter gene (transcription fusion), with or without the original gene: GFP Pro gene reporter gene/fusion, e. g. Gene GFP Term - usually introduction of a new copy into the genome Results indicate: - tissue, organ, developmental specificity - responses to external factors Confirmation with other approaches advisible/needed (risk of artifacts)

Promoter fusion with GFP and GUS Arabidopsis thaliana

Promoter fusion with GFP and GUS Arabidopsis thaliana

RG 4: Formation of protein fusions - removal of stop codon, fusion in the

RG 4: Formation of protein fusions - removal of stop codon, fusion in the reading frame (= translational fusion) - functional domains, natural interaction often preserved (!)

GFP fusion proteins - Protein localization analyses, protein interactions - complementation of a knock-out

GFP fusion proteins - Protein localization analyses, protein interactions - complementation of a knock-out mutant recommended - in vivo labelling of cellular structures Golgy complex chromosomes microtubules

RG 5: Modification of the gene (outside the genome) - introduction of a modified

RG 5: Modification of the gene (outside the genome) - introduction of a modified gene copy to the knock-out mutant • gene isolation (including the native promoter) • directed mutagenesis ex vivo by PCR (primers) • selection of homozygous knock-out mutant in the gene (heterozygous mutant in the case of lethal mutations) • introduction of the modified gene (analysis of numerous lines to avoid „position-effect“ artifacts) RG 6: Gene editing in the genome (see earlier) (ZFN, TALEN, CRISPR/Cas 9 + oligonucleotides)

RG 7: TILLING isolation of mutants with point mutations in selected gene Targeting induced

RG 7: TILLING isolation of mutants with point mutations in selected gene Targeting induced local lesions in genomes • chemical mutagenesis (EMS) – seed and DNA stocks • PCR- and heteroduplex analysis-based screen for mutants in certain gene • Point mutations! (changed regulation, interactions, …)

TILLING – overview seed stocks pooled DNA stocks

TILLING – overview seed stocks pooled DNA stocks

TILLING 1. 2. 3. PCR of selected sequence from DNA stocks isolated from mutant

TILLING 1. 2. 3. PCR of selected sequence from DNA stocks isolated from mutant population Reassociation with PCR fragment from wt plant Cleavage of ss sites of heteroduplex + electrophoretic separation of endlabelled fragments

Direct genetics - selection of mutants by altered phenotype shootmeristemless agamous

Direct genetics - selection of mutants by altered phenotype shootmeristemless agamous

Mutant screens – phenotype, conditions, treatments, … - looking for outstanding individuals

Mutant screens – phenotype, conditions, treatments, … - looking for outstanding individuals

Insertional mutagenesis for gene isolation gene (native form) inserted DNA Simple insertional mutagenesis –

Insertional mutagenesis for gene isolation gene (native form) inserted DNA Simple insertional mutagenesis – phenotype analysis in T 2 - inactivation / early termination of translation (truncated protein) - insertion into promoter – often deregulated expression (up/down) - usually recessive, but can be dominant Activation mutagenesis – phenotype analysis in T 1 - inserted sequence contain promoter or enhancer that can activate expression of adjacent (otherwise inactive) gene - dominant

Insertional mutagenesis for gene isolation gene (native form) inserted DNA Promotor and enhancer-trap insertions

Insertional mutagenesis for gene isolation gene (native form) inserted DNA Promotor and enhancer-trap insertions (selection based on expression of the reporter) - phenotype analysis in T 1 generation - T-DNA with reporter gene without promoter (with minimal promoter) = reporter expression visible when the construct is inserted behind an active promoter (expression pattern depends on the promoter) - dominant - simultaneously the insertion usually causes inactivation = recessive (T 2)

Direct genetics – identification of mutated gene - the same phenotypic change can result

Direct genetics – identification of mutated gene - the same phenotypic change can result from different mutations „there are numerous ways how to build up a house incorrectly“ - allelic mutations – mutation in the same gene (x different g. ) How to find out if two homozygous mutants with the same phenotype are mutated in the same gene or not (in case of recesive mutations)? Crossing of homozygous mutants F 1 - wt phenotype = different genes (complementation, heterozygous in both genes) - mutant phenotype = allelic (same genes were mutated) = no complementation

Identification/mapping of unknown (mutated) genes („with phenotype“) by cosegregation analysis Based on genetic map

Identification/mapping of unknown (mutated) genes („with phenotype“) by cosegregation analysis Based on genetic map – procedure: 1. genetic mapping – looking for genetic markers in strong genetic linkage (necessity of dense polymorphic markers!) 2. identification of the gene - chromosom walking - sequencing (sequence comparisons)

Genetic marker morphological or molecular trait 1) with polymorphism between parental genotypes (e. g.

Genetic marker morphological or molecular trait 1) with polymorphism between parental genotypes (e. g. different ecotypes) 2) known or identifiable position in the genetic map DNA markers sequenced only if needed markers known in advance

Genetic linkage x crossing-over during meiosis 1. Cytologic event 2. Genetic result Parental chromosomes

Genetic linkage x crossing-over during meiosis 1. Cytologic event 2. Genetic result Parental chromosomes Parental Genotype (heterozygous Aa and Bb ) Locus A Locus B Meiosis Without Crossing-over Meiosis Crossing-over Gametes 1 3 2 4 Not recombinant Recombinant Not recombinant ( same as parental genotype ) Recombinant ( new )

Identification/mapping of genes Strategies for cosegragation analysis: Looking for genetic markers in the genetic

Identification/mapping of genes Strategies for cosegragation analysis: Looking for genetic markers in the genetic linkage (as strong as possible) with our gene of interest: „Gradual approaching“ Random deep screen (repeated if needed) desired gene M 1 M 2

Types of genetic markers = trait with known or identifiable position in genetic map

Types of genetic markers = trait with known or identifiable position in genetic map with polymorphism between parental genotypes (e. g. different ecotypes) • Morphological (limited number) • Molecular

Morphological markers for Arabidopsis Gene symbol Name Phenotype Location (chr. c. M) an-1 angustifolia

Morphological markers for Arabidopsis Gene symbol Name Phenotype Location (chr. c. M) an-1 angustifolia narrow leaves, crinkled siliques 1 -55. 2 ap 1 -1 apetala no petals 1 -99. 3 py pyrimidine requiring white leaves, restored by pyrimidine 2 -49. 1 er-1 erecta compact inflorescence, blunt siliques 2 -43. 5 hy 2 -1 long hypocotyl elongated hypocotyl, slender 3 -11. 5 gl 1 -1 glabra no trichomes 3 -46. 2 bp-1 brevipedicellus short pedicels, siliques bent downwards, short plant 4 -15. 0 cer 2 -2 eceriferum bright green stems, siliques bent downwards, short plant 4 -51. 9 ms 1 -1 male sterile no siliques 5 -2. 5 tt 3 -1 transparent testa yellow seeds, no anthocyanin 5 -57. 4

Types of genetic markers • Morphological (limited number) • Molecular – Isozymes (the same

Types of genetic markers • Morphological (limited number) • Molecular – Isozymes (the same activity, but different mobility in gel electrophoresis) – DNA markers – differences in DNA sequence known OR detectable position in the genome!

DNA molecular markers (= usually an electrophoretic band) • RFLP (Restriction fragment length polymorfism)

DNA molecular markers (= usually an electrophoretic band) • RFLP (Restriction fragment length polymorfism) + Southern • RAPD (Random amplified polymorphism detection) • AFLP (Amplified fragment length polymorphism)

Examples of DNA molecular markers Known position in the genome • RFLP (Restriction fragment

Examples of DNA molecular markers Known position in the genome • RFLP (Restriction fragment length polymorfism) + Southern hybridization • SSR (Simple sequence repeats) Unknown sequence and position - randomly amplified DNA segments - sequence and position determined subsequently (only for those of interest - in genetic linkage with a trait) • RAPD (Random amplified polymorphism detection) • AFLP (Amplified fragment length polymorphism)

RFLP

RFLP

RAPD

RAPD

AFLP

AFLP

Identification/mapping of genes Strategies for cosegragation analysis: Looking for genetic markers in the genetic

Identification/mapping of genes Strategies for cosegragation analysis: Looking for genetic markers in the genetic linkage (as strong as possible) with our gene of interest: „Gradual approaching“ Random deep screen desired gene M 1 M 2

„Gradual approaching“ • Crossing of different genotypes with high polymorphism (multiple differences in markers)!!!

„Gradual approaching“ • Crossing of different genotypes with high polymorphism (multiple differences in markers)!!! Basic set of genetic markers in Arabidopsis thaliana 2 -3 in every chromosomal arm

Molecular markers in Arabidopsis

Molecular markers in Arabidopsis

A, B – full linkage! A, C – free recombination AA bb cc F

A, B – full linkage! A, C – free recombination AA bb cc F 1 (heterozygote) A bc P 1 (homozygote) Aa b. B Cc gametes a BC aa BB CC P 2 (homozygote) gametes A b. C F 2 – full linkage: AB: Ab: a. B: ab 2: 1: 1: 0 AA a a a A b b BB B b A bc a BC a Bc Cosegragation analysis in F 2 generation F 2 – without linkage: AC: Ac: a. C: ac = 9: 3: 3: 1 AA AA Cc AA cc Aa CC Aa Cc Aa cc aa CC aa Cc aa CC cc

Segregation in F 2 generation (P=XXyy x xx. YY, F 1 = Xx. Yy

Segregation in F 2 generation (P=XXyy x xx. YY, F 1 = Xx. Yy – frequency of gametes depends on the linkage) gamety XY (0. 5) Xy x. Y xy (0. 5) XY (0. 5) XXYY XY (0. 25) XXYy XY (0. 5) Xx. YY XY (0. 5) Xx. Yy XY (0. 25) Xy XXYy XY (0. 5) XXyy Xy Xx. Yy XY Xxyy Xy (0. 5) x. Y Xx. YY XY (0. 5) Xx. Yy XY xx. YY x. Y xx. Yy x. Y (0. 5) xy (0. 5) Xx. Yy XY (0. 25) Xxyy Xy (0. 5) xx. Yy x. Y (0. 5) xxyy xy (0. 25) 9: 3: 3: 1 (XY: Xy: x. Y: xy) x 4, 75: 2: 2: 0, 25 no linkage = different chromosoms (arms) week genetic linkage Looking for strong linkage!

Random screen: Cosegregation with molecular markers • Crossing of different genotypes with high polymorphism

Random screen: Cosegregation with molecular markers • Crossing of different genotypes with high polymorphism (multiple differences in markers)!!! • Analysis of high number of markers at ones • Which marker A, B, C, D is linked with locus R? Fenotyp: r R fenotyp r fenotyp R

Bulked segregant analysis • Strong linkage = possibility to analyze in bulk r phenotype

Bulked segregant analysis • Strong linkage = possibility to analyze in bulk r phenotype r R phenotype R

Having two markers surrounding desired gene allows to continue „Chromosome Mutovaný gen X walking“

Having two markers surrounding desired gene allows to continue „Chromosome Mutovaný gen X walking“ Libraries of big genomic fragments from the mutant YACs, BACs = yeast (bacterial) arteficial chromosome, ~ 300 (100) kbp cosmids ( fág, 50 kbp) Looking for presence of marker sequence and for overlaps using hybridization

Direct and reverse genetics in Arabidopsis reverse direct Identification of mutation site + Tilling

Direct and reverse genetics in Arabidopsis reverse direct Identification of mutation site + Tilling – „searching“ in non-characterized collection of lines by PCR and reassociation

Marker assisted selection (MAS) Molecular marker in strong genetic linkage with certain trait can

Marker assisted selection (MAS) Molecular marker in strong genetic linkage with certain trait can be used for screening of hybrids instead of the phenotypic characterization Advantages: • Not influenced by environmental conditions • Screens of seedlings • Often simple and cheaper • Possibility to distinguish between homo- and heterozygots (using certain markers)

Other approaches to identify genes - yeast complementation x two-hybrid screen Y 2 H

Other approaches to identify genes - yeast complementation x two-hybrid screen Y 2 H – reporter expression triggered by reconstitution of transcription factor splitted into: AC DB DNA-binding (DB) activation domain (AD = polymerase-binding) DB fused to a known protein, which interactors are searched for = „bait“ AD fused to potential intractor or randomly to seqs from c. DNA library Physical interaction = reconstitution of transcription factor transcription in one cell

Yeast two-hybrid screen for protein interactors - testing predicted interactions - picking up new

Yeast two-hybrid screen for protein interactors - testing predicted interactions - picking up new interactors