Chip Viewer Software Chip Viewer is coded to

  • Slides: 29
Download presentation
Chip. Viewer Software • Chip. Viewer is coded to visualize and analyze the tiling

Chip. Viewer Software • Chip. Viewer is coded to visualize and analyze the tiling chip data. 1. Data visualization 2. Data normalization 3. Data mining 4. Novel gene detection and fetching 5. Expression analyses

1. Chip. Viewer: ABSTRACT The SSP tiling chips, which cover the whole Arabidopsis thaliana

1. Chip. Viewer: ABSTRACT The SSP tiling chips, which cover the whole Arabidopsis thaliana genomic sequences, are customarily designed for multi-purposes: To map transcriptional units of the Arabidopsis ORFeome, to mine and identify novel transcripts and small RNA species, to do high-throughput polymorphism detection among Arabidopsis accessions and to identify evolutionarily conserved non-coding regions in Brassica. The Chip. Viewer, a Unix/Linux based application, is coded in-house in the Salk Institute Genomic Analysis Laboratory (SIGn. AL) to qualitatively and quantitatively analyze the huge amount of data generated bye tiling chip experiments. Its X/Motif graphic user interface (GUI) allows uses to view chip expression profiles, de-convoluted chip expression profiles and histograms. An improved Match-only Integral Distribution algorithm (MOID) is introduced to call gene or genomic region absolute expression. In addition, a Relative Weight (RW), instead of normalization, is adopted to make it possible for universal comparison and analysis among different chip experiments. Furthermore, its comparative analysis tool allows for discovery of Single Feature Polymorphisms (SFPs) among Arabidopsis accessions, which will be useful markers for mapping QTLs using natural variations in the weed. A snapshot can be reached at http: //signal. salk. edu/chipviewer/

2. Chip. Viewer: Strategy of Chip Array Design (Test version)

2. Chip. Viewer: Strategy of Chip Array Design (Test version)

3. Chip. Viewer: De-Convolution of The Virtual Tiling Chip In the tiling chip, probes

3. Chip. Viewer: De-Convolution of The Virtual Tiling Chip In the tiling chip, probes are randomly allocated. Raw De-convoluted

4. Chip. Viewer: Zoom in the virtual chip

4. Chip. Viewer: Zoom in the virtual chip

5. Chip. Viewer: Histograms of Hybridization Signals in Genomic Orders Find: to find an

5. Chip. Viewer: Histograms of Hybridization Signals in Genomic Orders Find: to find an annotated gene Mining: to detect expressing regions Specially novel regions Fetch: to fetch a regional sequence

6. Chip. Viewer: Histogram: hybridization and gene annotation

6. Chip. Viewer: Histogram: hybridization and gene annotation

7. Chip. Viewer: Normalization – Linear regression normalization y=ax+b After normalization: a = 1

7. Chip. Viewer: Normalization – Linear regression normalization y=ax+b After normalization: a = 1 and b = 0 for two sets of chip data y=ax+b a b y – log (intensity of the chip to be normalized) x – log (intensity of the standard chip) a – slope b – intercept

8. Chip. Viewer: Normalization – Linear regression normalization 2 Two sets of data different

8. Chip. Viewer: Normalization – Linear regression normalization 2 Two sets of data different to each other

9. Chip. Viewer: Normalization – Distribution normalization Use a standard distribution or other chip’s

9. Chip. Viewer: Normalization – Distribution normalization Use a standard distribution or other chip’s distribution model to normalize a set of data Normalization based on the distribution of the 2 nd chip

10. Chip. Viewer: Normalization – Comparing the 2 models Linear regression model Distribution based

10. Chip. Viewer: Normalization – Comparing the 2 models Linear regression model Distribution based model

11. Chip. Viewer: Statistic Methods and Algorithms Weight – Statistical analyses a. Cramer von

11. Chip. Viewer: Statistic Methods and Algorithms Weight – Statistical analyses a. Cramer von Mises test ( two tails) b. Kolmogorov-Smirnov test ( two tails) c. Weight (one tail) n Wt = Σ (S (ƒ(i)) – F(ƒ(i))) d(ƒ(i)) 1 N Rw = Wt / Wo

12. Chip. Viewer: Weight – Statistical analyses 2 1 NULL distribution Chip distribution Di

12. Chip. Viewer: Weight – Statistical analyses 2 1 NULL distribution Chip distribution Di DKS Distribution of Selected probes Wt 0 x

13. Chip. Viewer: Relative Weight – Statistical analyses 3 1 NULL distribution Chip distribution

13. Chip. Viewer: Relative Weight – Statistical analyses 3 1 NULL distribution Chip distribution Di Dks (i) Wt Wo DKS (NULL) Distribution of Selected probes 0 Rw = Wt / Wo x

14. Chip. Viewer: Weight – Statistical analyses 4 chip DKS Samples NULL distribution chip

14. Chip. Viewer: Weight – Statistical analyses 4 chip DKS Samples NULL distribution chip distribution v p(v)

15. Chip. Viewer: Weight – Statistical analyses 4 -2 Samples chip NULL distribution chip

15. Chip. Viewer: Weight – Statistical analyses 4 -2 Samples chip NULL distribution chip distribution

16. Chip. Viewer: Detection of novel genes Use Mining button to detect the expression

16. Chip. Viewer: Detection of novel genes Use Mining button to detect the expression regions. This gene is now annotated as in the latest AGI genome. At 1 g 09645 Use Mining button to detect the expression regions.

17. Chip. Viewer: Detection of novel genes 2 From the latest AGI genome to

17. Chip. Viewer: Detection of novel genes 2 From the latest AGI genome to the future annotation ATG A T G TAA RAFL Nothing in the latest AGI genome, but a new gene At 1 g 11765 will be there in the future version.

18. Chip. Viewer: Mapping of transcriptional units of ORFeome From 2000 v At 1

18. Chip. Viewer: Mapping of transcriptional units of ORFeome From 2000 v At 1 g 09750 (MIPS) to the latest AGI At 1 g 09750 2000 v Annotation (MIPS) The latest AGI Annotation

19. Chip. Viewer: Mapping of transcriptional units of ORFeome

19. Chip. Viewer: Mapping of transcriptional units of ORFeome

20. Chip. Viewer: Mapping of transcriptional units of ORFeome Schematic of the Identification of

20. Chip. Viewer: Mapping of transcriptional units of ORFeome Schematic of the Identification of New Genes g. DNA m. RNA S (Salk) clones C (Chip) clones

21. Chip. Viewer: Identification of sm RNAs 5’GACCCAATAAGCCGCTCCGATTGGA 3’ CCB 5

21. Chip. Viewer: Identification of sm RNAs 5’GACCCAATAAGCCGCTCCGATTGGA 3’ CCB 5

22. Chip. Viewer: Evidence for Genes with Expression Tiling Arrays Regions covered Total predicted

22. Chip. Viewer: Evidence for Genes with Expression Tiling Arrays Regions covered Total predicted ORFs w/o c. DNAs or ESTs ORFs w/ evidence for expression Putative exons in intergenic regions chip 1 a Chr. 1 1. 8 -3. 0 Mb 320 110 72 2 chip 1 b Chr. 1 3. 0 -4. 2 Mb 300 104 79 6 chip 2 c Chr. 2 17. 0 -18. 2 Mb 328 117 96 1 chip 2 d Chr. 2 18. 2 -19. 5 Mb 321 100 83 3 1269 431 330 12

23. Chip. Viewer: Sense and anti-sense Discovery of Sense and anti-sense signals in Plasmodium

23. Chip. Viewer: Sense and anti-sense Discovery of Sense and anti-sense signals in Plasmodium and Arabidopsis. Sense Anti-Sense Signals of the gene PFC 0110 w in Plasmodium.

24. Chip. Viewer: Data fetching Detect Fetch Six frames Sequence

24. Chip. Viewer: Data fetching Detect Fetch Six frames Sequence

25. Chip. Viewer: Comparing multiple sets of data There are 3 sets of the

25. Chip. Viewer: Comparing multiple sets of data There are 3 sets of the Plasmodium tiling chip data respectively in red, blue and green bars. ---- The Plasmodium tiling chip is a perfect match only chip

26. Chip. Viewer: Comparing multiple sets of data (cluster tracks) There are 9 sets

26. Chip. Viewer: Comparing multiple sets of data (cluster tracks) There are 9 sets of the chip data.

27. Chip. Viewer: Outputs a. Output of de-convoluted data ( raw/normalized) b. Output of

27. Chip. Viewer: Outputs a. Output of de-convoluted data ( raw/normalized) b. Output of expression proportion – output of the signal proportion of exon, intron and intragenic regions. c. Output of expression data mining – output of the expressing regions and their weights. d. Output of all or selected probe sets and genes’ expressing data (statistical weight).

Acknowlegements Thanks to Dr. Ecker, Dr. Lim and others at the Salk Institute Genomic

Acknowlegements Thanks to Dr. Ecker, Dr. Lim and others at the Salk Institute Genomic Analysis Laboratory for their suggestions, comments and supports, as well as thanks to Dr. Winzeler and Dr. Grunenfielder from Scripps Institute. Huaming Chen