Immunoglobulins Generation of Diversity Introduction Immunologist estimate that
Immunoglobulins Generation of Diversity
Introduction • Immunologist estimate that each person has the ability to produce a range of individual antibodies capable of binding to a total of well over 1010 epitopes • According to the germline theory, a unique gene encodes each antibody • Unfortunately, for this theory to be true the number of antibody genes would need to be 100 -1000 -fold greater than the entire human genome
Theories • An alternative theory, the somatic mutation theory, holds that a single germline immunoglobulin gene undergoes multiple mutations that generate immunoglobulin diversity. This scheme, however, requires an unimaginable mutation rate • The immune system has developed a much more elegant solution- the chromosomal rearrangement of separate gene segments, which employs some elements of the germline and somatic mutation theories
Gene Rearrangement • Each light and heavy chain is encoded by a series of genes occurring in clusters along the chromosome • In humans, the series of genes encoding the k light chain, λ light chain, and the heavy chain are located on chromosomes 2, 22, and 14 respectively • When a cell becomes committed to the B lymphocyte lineage, it rearranges the DNA, encoding its light and heavy chains by cutting and splicing together some of the DNA sequences, thus modifying the sequence of the variable region gene
Tonegawa’s demonstration • 1976—used restriction enzymes and DNA probes to show that germ cell DNA contained several smaller DNA segments compared to DNA taken from developed lymphocytes (myeloma cells)
k l H VH 1 VH 2 VH 65 DH 1 -------27 JH 1 -----6 Cg 1 gene 1 transcript 1 protein Antibody specificities more than 1, 000, 000 Human genome about 30, 000 genes Human Antibody genes H: chromosome 14 k: chromosome 2 l: chromosome 22
Ig gene sequencing complicated the model Structures of germline VL genes were similar for Vk, and Vl, However there was an anomaly between germline and rearranged DNA: VL CL ~ 95 aa ~ 100 aa L L V L CL ~ 208 aa Where do the extra 13 amino acids come from? L V L JL ~ 95 aa CL ~ 100 aa Extra amino acids provided by one of a small set of J or JOINING regions
Further diversity in the Ig heavy chain L VH JH DH CH Heavy chain: between up to 8 additional amino acids between JH and CH The D or DIVERSITY region Each heavy chain requires three recombination events: JH to DH, JHDH to VH and JHDH VHto CH L VL JL CL Each light chain requires two recombination events: VL to JL and VLJL to CL
Problems? 1. How is an infinite diversity of specificity generated from finite amounts of DNA? 2. How can the same specificity of antibody be on the cell surface and secreted? 3. How do V region find J regions and why don’t they join to C regions? 4. How does the DNA break and rejoin?
Diversity: Multiple germline genes Vk & Jk Loci: • 132 Vk genes on the short arm of chromosome 2 • 29 functional Vk genes with products identified • 87 pseudo Vk genes • 16 functional Vk genes - with no products identified • 25 orphans Vk genes on the long arm of chromosome 2 • 5 Jk regions Vl & Jl Loci: • 105 Vl genes on the short arm of chromosome 22 • 30 functional genes with products identified • 56 pseudogenes • 6 functional genes - with no products identified • 13 relics (<200 bp of Vl sequence) • 25 orphans on the long arm of chromosome 22 • 4 Jl regions
Diversity: Multiple Germline Genes VH Locus: JH Locus: DH Locus: • 123 VH genes on chromosome 14 • 40 functional VH genes with products identified • 79 pseudo VH genes • 4 functional VH genes - with no products identified • 24 non-functional, orphan VH sequences on chromosomes 15 & 16 • 9 JH genes • 6 functional JH genes with products identified • 3 pseudo JH genes • 27 DH genes • 23 functional DH genes with products identified • 4 pseudo DH genes • Additional non-functional DH sequences on the chromosome 15 orphan locus • reading DH regions in 3 frames functionally increases number of DH regions
Reading D segment in 3 frames Analysis of D regions from different antibodies One D region can be used in any of three frames Different protein sequences lead to antibody diversity GGGACAGGGGGC Gly. Thr. Gly Frame 1 GGGACAGGGGGC Gly. Gln. Gly Frame 2 GGGACAGGGGGC Asp. Arg. Gly Frame 3
Estimates of combinatorial diversity Using functional V, D and J genes: 40 VH x 27 DH x 6 JH = 5, 520 combinations D can be read in 3 frames: 5, 520 x 3 = 16, 560 combinations 29 Vk x 5 Jk = 145 combinations 30 Vl x 4 Jl = 120 combinations = 265 different light chains If H and L chains pair randomly as H 2 L 2 i. e. 16, 560 x 265 = 4, 388, 400 possibilities Due only to COMBINATORIAL diversity In practice, some H + L combinations are unstable. Certain V and J genes are also used more frequently than others. Other mechanisms add diversity at the junctions between genes JUNCTIONAL diversity
Problems? 1. How is an infinite diversity of specificity generated from finite amounts of DNA? Mathematically, Combinatorial Diversity can account for some diversity – how do the elements rearrange? 2. How can the same specificity of antibody be on the cell surface and secreted? 3. How do V region find J regions and why don’t they join to C regions? 4. How does the DNA break and rejoin?
Genomic organisation of Ig genes (Numbers include pseudogenes etc. ) LH 1 -123 VH 1 -123 DH 1 -27 Lk 1 -132 Vk 1 -132 Ll 1 -105 Vl 1 -105 JH 1 -9 Jk 1 -5 Cl 1 Jl 1 Cl 2 Jl 2 Cm Ck Cl 3 Jl 3 Cl 4 Jl 4
Ig light chain gene rearrangement by somatic recombination Vk Germline Rearranged 1° transcript Spliced m. RNA Jk Ck
Ig light chain rearrangement: Rescue pathway There is only a 1: 3 chance of the join between the V and J region being in frame Vk Jk Non-productive rearrangement Light chain has a second chance to make a productive join using new V and J elements Spliced m. RNA transcript Ck
Ig heavy chain gene rearrangement VH 1 -123 DH 1 -27 JH 1 -9 Cm Somatic recombination occurs at the level of DNA which can now be transcribed
RNA processing Primary transcript RNA p. As V D J 8 Cm 1 J 9 Cm 2 Cm 3 D J 8 Cm 1 h Cm 2 Cm 3 Cm 4 V m. RNA h Cm 4 AAAAA The Heavy chain m. RNA is completed by splicing the VDJ region to the C region The H and L chain m. RNA are now ready for translation VL JL VH DH JH AAAAA CL h CH AAAAA
Problems? 1. How is an infinite diversity of specificity generated from finite amounts of DNA? Combinatorial Diversity and genomic organisation can account for some diversity 2. How can the same specificity of antibody be on the cell surface and secreted? 3. How do V region find J regions and why don’t they join to C regions? 4. How does the DNA break and rejoin?
Remember These Facts? • Cell surface antigen receptor on B cells Allows B cells to sense their antigenic environment Connects extracellular space with intracellular signalling machinery • Secreted antibody functions Neutralisation Arming/recruiting effector cells Complement fixation How does the model of recombination allow for two different forms of the same protein?
The constant region has additional, optional exons Cm Primary transcript RNA Each H chain domain (& the hinge) encoded by separate exons Cm 1 h Cm 2 AAAAA Secretion coding sequence Cm 3 Polyadenylation site (secreted) p. As Polyadenylation site (membrane) p. Am Cm 4 Membrane coding sequence
Membrane Ig. M constant region DNA Cm 1 h Cm 2 Cm 3 Cm 4 Transcription 1° transcript p. Am Cm 1 Cleavage & polyadenylation at p. Am and RNA splicing m. RNA h Cm 2 Cm 3 Cm 1 h Cm 2 Cm 3 Cm 4 AAAAA Membrane coding sequence encodes transmembrane region that retains Ig. M in the cell membrane Protein Fc
Secreted Ig. M constant region DNA Cm 1 h Cm 2 Cm 3 Cm 4 Transcription 1° transcript p. As Cm 1 h Cm 2 Cm 3 Cm 4 AAAAA Cleavage polyadenylation at p. As and RNA splicing m. RNA Cm 1 h Cm 2 Cm 3 Cm 4 Protein AAAAA Secretion coding sequence encodes the C terminus of soluble, secreted Ig. M Fc
Alternative RNA processing generates transmembrane or secreted Ig
(a) Secreted & membrane forms of the heavy chain by alternative ( differential ) RNA processing of primary transcript.
Synthesis, assembly, and secretion of the immunoglobulin molecule.
Problems? 1. How is an infinite diversity of specificity generated from finite amounts of DNA? Combinatorial Diversity and genomic organisation accounts for some diversity 2. How can the same specificity of antibody be on the cell surface and secreted? Use of alternate polyadenylation sites 3. How do V region find J regions and why don’t they join to C regions? 4. How does the DNA break and rejoin?
V, D, J flanking sequences Sequencing up and down stream of V, D and J elements Conserved sequences of 7, 23, 9 and 12 nucleotides in an arrangement that depended upon the locus Vl 7 Vk 7 23 12 7 23 9 12 9 7 12 9 9 9 VH 9 D 23 7 12 9 7 Jl 7 Jk 9 23 7 JH
Gene rearrangements are made at recombination signal sequences (RSS). RSSs are heptamer-nonamer sequences Each RSS contains a conserved heptamer, a conserved nonamer and a spacer of either 12 or 23 base pairs.
There is a RSS downstream of every V gene segment, upstream of every J gene segment and flanking every D gene segment Generic light chain locus Generic heavy chain locus V D J
Recombination signal sequences (RSS) HEPTAMER - Always contiguous with coding sequence 9 VH 7 23 √ VH 9 7 12 23 7 D 7 9 9 12 7 D NONAMER - Separated from the heptamer by a 12 or 23 nucleotide spacer 7 JH 23 7 12 9 √ 9 23 7 JH 12 -23 RULE – A gene segment flanked by a 23 mer RSS can only be linked to a segment flanked by a 12 mer RSS
1. Rearrangements only occur between segments on the same chromosome. 2. A heptamer must pair with a complementary heptamer; a nonamer must pair with a complementary nonamer. 3. One of the RSSs must have a spacer with 12 base pairs and the other must be 23 base pairs (the 12/23 rule).
- RSS having a one-turn spacer can join only with RSS having a two-turn spacer : one-turn / two-turn joining rule - This ensures that V, D, J segments join in proper order & that segments of the same type do not join each other. - The enzymes recognizing RSS : recombination-activating genes. ( RAG-1, -2), lymphoid-specific gene products
Molecular explanation of the 12 -23 rule 12 -mer = one turn 23 -mer = two turns 23 V 7 Intervening DNA of any length 9 12 9 7 D J
Molecular explanation of the 12 -23 rule V 4 V 1 V 8 V 9 V 3 V 2 V 7 V 6 V 3 V 4 V 2 V 5 9 9 • Heptamers and nonamers align back-to-back V 9 V 1 7 12 -mer 7 • The shape generated by the RSS’s acts as a target for recombinases V 6 Loop of intervening DNA is excised DJ 23 -mer V 5 DJ • An appropriate shape can not be formed if two 23 -mer flanked elements attempted to join (i. e. the 12 -23 rule) V 7 V 8
Junctional diversity Mini-circle of DNA is permanently lost from the genome 9 7 V 7 12 23 9 9 23 Coding joint DJ 7 7 12 9 Signal joint VDJ Imprecise and random events that occur when the DNA breaks and rejoins allows new nucleotides to be inserted or lost from the sequence at and around the coding joint.
Non-deletional recombination V 1 V 1 V 3 V 2 7 V 2 V 3 9 23 V 4 7 V 9 DJ 9 V 4 Looping out works if all V genes are in the same transcriptional orientation 9 DJ 12 7 D J How does recombination occur when a V gene is in opposite orientation to the DJ region? 9 12 7 DJ
Non-deletional recombination V 4 and DJ in opposite transcriptional orientations J 9 12 7 D J 2. 9 12 7 D 1. 9 7 V 4 23 9 9 23 9 12 3. 23 7 V 4 4. 9 23 7 V 4 J 7 D 9 12 7 D J 9 7 V 4 12 23 7 D J 9
2. 1. 9 9 23 7 12 9 23 9 V 4 12 7 7 V 4 D J Heptamer ligation - signal joint formation 7 D J 3. V 4 9 23 9 4. 9 12 23 7 7 D J 7 7 12 9 V to DJ ligation coding joint formation V 4 D J Fully recombined VDJ regions in same transcriptional orientation No DNA is deleted
Problems? 1. How is an infinite diversity of specificity generated from finite amounts of DNA? Combinatorial Diversity and genomic organisation accounts for some diversity 2. How can the same specificity of antibody be on the cell surface and secreted? Use of alternative polyadenylation sites 3. How do V region find J regions and why don’t they join to C regions? The 12 -23 rule 4. How does the DNA break and rejoin?
Steps of Ig gene recombination 7 23 V 7 23 D J 9 12 7 V 7 23 9 9 12 7 9 9 7 23 9 12 7 V 9 D J Recombination activating gene products, (RAG 1 & RAG 2) and ‘high mobility group proteins’ bind to the RSS The two RAG 1/RAG 2 complexes bind to each other and bring the V region adjacent to the DJ region • The recombinase complex makes single stranded nicks in the DNA. The free OH on the 3’ end hydrolyses the phosphodiester bond on the other strand. • This seals the nicks to form a hairpin structure at the end of the V and D regions and a flush double strand break at the ends of the heptamers. • The recombinase complex remains associated with the break D J 9 12 7
Steps of Ig gene recombination V 7 23 9 D J 9 12 7 V D J 9 12 7 7 23 9 V A number of other proteins, (Ku 70: Ku 80, XRCC 4 and DNA dependent protein kinases) bind to the hairpins and the heptamer ends. The hairpins at the end of the V and D regions are opened, and exonucleases and transferases remove or add random nucleotides to the gap between the V and D region DNA ligase IV joins the ends of the V and D region to form the coding joint and the two heptamers to form the signal joint.
Junctional diversity: P nucleotide additions 7 23 V AT GTGACAC J DTA CACTGTG 9 9 12 7 7 7 9 23 12 9 The recombinase complex makes single stranded nicks at random sites close to the ends of the V and D region DNA. U U V TC CACAGTG AG GTGTCAC TC AG TC CACAGTG AG GTGTCAC 7 GTGACAC CACTGTG 7 V V AT AT J JDTA 9 23 12 9 The 2 nd strand is cleaved and hairpins form between the complimentary bases at ends of the V and D region. D J
V 3 V 2 V 4 CACAGTG GTGTCAC 7 GTGACAC CACTGTG 7 9 23 12 V 5 9 V AT J DTA V U TC AG V 7 TC AG AT TA U U U Heptamers are ligated by DNA ligase IV V 8 V and D regions juxtaposed V 6 D J
U TC AG AT TA Regions to be joined are juxtaposed AT TA Endonuclease cleaves single strand at random sites in V and D segment U V TC AG U V U Generation of the palindromic sequence D J The nicked strand ‘flips’ out V TC~GA AG AT TA~TA D J The nucleotides that flip out, become part of the complementary DNA strand In terms of G to C and T to A pairing, the ‘new’ nucleotides are palindromic. The nucleotides GA and TA were not in the genomic sequence and introduce diversity of sequence at the V to D join. (Palindrome - A Santa at NASA)
Junctional Diversity – N nucleotide additions V TC~GA CACTCCTTA AT AG TTCTTGCAA TA~TA V TC~GACACACCTTA V TC CACACCTTA TC~GA GTT AT AG TTCTTGCAA TA~TA TA AG C D J TTCTTGCAA TA~TA D J Terminal deoxynucleotidyl transferase (Td. T) adds nucleotides randomly to the P nucleotide ends of the singlestranded V and D segment DNA Complementary bases anneal Exonucleases nibble back free ends DNA polymerases fill in the gaps with complementary nucleotides and DNA ligase IV joins the strands
Generation of Antibody Diversity P-nucleotide and N-nucleotide addition during joining.
P and N region nucleotide alteration adds to diversity of V region • During recombination some nucleotide bases are cut from or add to the coding regions (p nucleotides) • Up to 15 or so randomly inserted nucleotide bases are added at the cut sites of the V, D and J regions (n nucleotides_ • Td. T (terminal deoxynucleotidyl transferase) a unique enzyme found only in lymphocytes • Since these bases are random, the amino acid sequence generated by these bases will also be random
Junctional Diversity V TCGACGTTATAT AGCTGCAATATA D J TTTTT Germline-encoded nucleotides TTTTT Palindromic (P) nucleotides - not in the germline TTTTT Non-template (N) encoded nucleotides - not in the germline Creates an essentially random sequence between the V region, D region and J region in heavy chains and the V region and J region in light chains.
Problems? 1. How is an infinite diversity of specificity generated from finite amounts of DNA? Combinatorial Diversity, genomic organisation and Junctional Diversity 2. How can the same specificity of antibody be on the cell surface and secreted? Use of alternative polyadenylation sites 3. How do V region find J regions and why don’t they join to C regions? The 12 -23 rule 4. How does the DNA break and rejoin? Imprecisely to allow Junctional Diversity
Why do V regions not join to J or C regions? VH DH JH C IF the elements of Ig did not assemble in the correct order, diversity of specificity would be severely compromised 2 x DIVERSITY Full potential of the H chain for diversity needs V-D-J-C joining - in the correct order 1 x DIVERSITY Were V-J joins allowed in the heavy chain, diversity would be reduced due to loss of the imprecise join between the V and D regions
Additional Degrees of Variation • Somatic hypermutation: Stimulated memory B cells accumulate small mutations on the VL or VH leading to affinity maturation to antigens that are frequently or chronically present • Isotype switching
Somatic hypermutation FR 1 Variability CDR 1 FR 2 CDR 2 FR 3 CDR 3 FR 4 100 80 60 40 20 20 40 60 80 100 120 Amino acid No. Wu - Kabat analysis compares point mutations in Ig of different specificity. What about mutation throughout an immune response to a single epitope? How does this affect the specificity and affinity of the antibody?
Somatic hypermutation leads to affinity maturation Day 6 Day 8 Day 12 Day 18 CDR 3 CDR 1 CDR 2 Clone 1 Clone 2 Clone 3 Clone 4 Clone 5 Clone 6 Clone 7 Clone 8 Clone 9 Clone 10 Cells with accumulated mutations in the CDR are selected for high antigen binding capacity – thus the affinity matures throughout the course of the response Lower affinity - Not clonally selected Deleterious mutation Beneficial mutation Higher affinity - Clonally selected Identical affinity - No influence on clonal selection Neutral mutation Hypermutation is T cell dependent Mutations focussed on ‘hot spots’ (i. e. the CDRs) due to double stranded breaks repaired by an error prone DNA repair enzyme.
Allelic Exclusion • A single B cell can express only one VL and one VH allele to the exclusion of all others • Both must be on the same member of the chromosome pair-either maternal or paternal • The restriction of VL and VH expression to a single member of the chromosome pair is termed allelic exclusion • The presence of both maternal and paternal allotypes in the serum reflects the expression of different alleles by different population of B cells
Allelic exclusion: only one chromosome is active in any one lymphocyte
Model to account for allelic exclusion: If one allele arranges nonproductively, a B cell still can rearrange the other allele productively; once a productive rearrangement( 33%) have occurred, the recombination machinery is turned off. ( the protein product acts as a signal to prevent further gene rearrangement)
Antibody isotype switching Throughout an immune response the specificity of an antibody will remain the same (notwithstanding affinity maturation) The effector function of antibodies throughout a response needs to change drastically as the response progresses. Antibodies are able to retain variable regions whilst exchanging constant regions that contain the structures that interact with cells. Organisation of the functional human heavy chain C region genes J regions Cm Cd Cg 3 Cg 1 Ca 1 Cg 2 Cg 4 Ce Ca 2
Switch regions Cm Sm Cd Cg 3 Sg 3 Cg 1 Sg 1 Ca 1 Sa 1 Cg 2 Sg 2 Cg 4 Sg 4 Ce Se Ca 2 Sa 2 • Upstream of C regions are repetitive regions of DNA called switch regions. (The exception is the Cd region that has no switch region). • The Sm consists of 150 repeats of [(GAGCT)n(GGGGGT)] where n is between 3 and 7. • Switching is mechanistically similar in may ways to V(D)J recombination. • Isotype switching does not take place in the bone marrow, however, and it will only occur after B cell activation by antigen and interactions with T cells.
7 means of generating antibody diversity
Generation of Antibody Diversity • Germ line diversity. • Combinatorial diversity. • Junctional diversity. • Somatic hypermutation ( affinity maturation)
- Slides: 63