Published by Damon Anthony. Modified over 9 years ago.
Comparative Genomics
Gene Regulatory Networks (GRNs)
Anil Jegga, Biomedical Informatics
Session 2: February 24, 2012

Contact Information:
Anil Jegga
Biomedical Informatics
Room # 232, S Building 10th Floor, CCHMC
Homepage: http://anil.cchmc.org
Tel: 513-636-0261
E-mail: anil.jegga@cchmc.org

Additional exercise available at: http://anil.cchmc.org/grn.html
Session 1: Overview of GRNs (Feb 23)
a. Computational Approaches
b. Cis-Element Identification
c. Comparative Genomics
d. Regulatory region variations
e. p53 case study

Session 2: Database Session (Feb 24)
a. Genome Browsers
b. Promoter Analysis, TFBS Search
c. Co-regulated gene analysis
Session 2 (Databases/Servers), Feb 24, 2012
a. Genome Browsers
b. Promoter Analysis, TFBS Search
c. Co-regulated gene analysis
Genome Browser (http://genome.ucsc.edu)
Genome Browser Gateway choices:
1. Select clade
2. Select genome/species: you can search only one species at a time
3. Assembly: the official backbone DNA sequence
4. Position: location in the genome to examine, or a search term (gene symbol, accession number, etc.)
5. Image width: how many pixels in the display window; 5000 max
6. Configure: make fonts bigger, plus other options
Genome Browser: explore the tracks
What if I want to download promoter sequences for several genes at a time?
Genome Browser (http://genome.ucsc.edu)
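For the question of downloading promoter sequences for several genes at once, the first step is deciding which genomic interval counts as the "promoter" of each gene. The sketch below is not the Genome Browser's own procedure; it only illustrates, under stated assumptions, how strand-aware upstream windows can be computed from transcript coordinates. The gene names, coordinates, and the 1000 bp default are hypothetical, and coordinates are assumed to be 0-based, half-open (BED style).

```python
from typing import NamedTuple


class Gene(NamedTuple):
    name: str
    chrom: str
    tx_start: int  # 0-based transcript start (inclusive)
    tx_end: int    # transcript end (exclusive)
    strand: str    # "+" or "-"


def promoter_window(gene, upstream=1000, downstream=0):
    """Return a BED-style (chrom, start, end, name, strand) window around the TSS."""
    if gene.strand == "+":
        # On the plus strand the TSS is the transcript start; upstream is to the left.
        start, end = gene.tx_start - upstream, gene.tx_start + downstream
    elif gene.strand == "-":
        # On the minus strand the TSS is the transcript end; upstream is to the right.
        start, end = gene.tx_end - downstream, gene.tx_end + upstream
    else:
        raise ValueError(f"unknown strand: {gene.strand!r}")
    return (gene.chrom, max(0, start), end, gene.name, gene.strand)


# Hypothetical genes, for illustration only.
genes = [
    Gene("geneA", "chr1", 5000, 9000, "+"),
    Gene("geneB", "chr2", 20000, 26000, "-"),
]
windows = [promoter_window(g) for g in genes]
for w in windows:
    print("\t".join(str(field) for field in w))
```

The resulting intervals can then be used to request sequence from whichever browser or sequence source you prefer; for minus-strand genes the retrieved sequence still needs to be reverse-complemented.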
Other Genome Browsers: ENSEMBL (http://www.ensembl.org)
I have a promoter sequence. How do I scan it for known TFBSs?
JASPAR: http://jaspar.genereg.net
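To make the idea of scanning a promoter for known TFBSs concrete, here is a minimal sketch of position weight matrix (PWM) scanning. It is an illustration only, not JASPAR's implementation: the count matrix is a made-up 4-column example (not a real JASPAR profile), and the pseudocount, uniform background, and score threshold are assumptions. Only the forward strand is scanned.

```python
import math

# Hypothetical count matrix (one list per base, one entry per motif position).
COUNTS = {
    "A": [8, 0, 0, 1],
    "C": [0, 0, 9, 1],
    "G": [1, 10, 0, 0],
    "T": [1, 0, 1, 8],
}


def log_odds(counts, pseudocount=0.5, background=0.25):
    """Convert counts to log2-odds scores against a uniform background."""
    width = len(counts["A"])
    pwm = []
    for i in range(width):
        total = sum(counts[b][i] for b in "ACGT") + 4 * pseudocount
        pwm.append({
            b: math.log2((counts[b][i] + pseudocount) / total / background)
            for b in "ACGT"
        })
    return pwm


def scan(sequence, pwm, threshold):
    """Return (position, site, score) for forward-strand windows scoring >= threshold."""
    sequence = sequence.upper()
    width = len(pwm)
    hits = []
    for i in range(len(sequence) - width + 1):
        window = sequence[i:i + width]
        if any(base not in "ACGT" for base in window):
            continue  # skip windows containing N or other ambiguity codes
        score = sum(column[base] for column, base in zip(pwm, window))
        if score >= threshold:
            hits.append((i, window, round(score, 2)))
    return hits


pwm = log_odds(COUNTS)
hits = scan("TTAGCTGGAGCTAA", pwm, threshold=4.0)
for position, site, score in hits:
    print(position, site, score)
```

A real scan would also score the reverse complement and would pick the threshold per matrix rather than using one fixed value.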
Gene-Regulation: http://www.gene-regulation.com
An account is required (free for academic use).
How can I identify putative regulatory regions for a gene or microRNA?
I have found a miRNA enriched in my gene list, or I am interested in a specific gene, and I want to identify putative regulatory regions for that miRNA/gene.
GenomeTrafac: http://genometrafac.cchmc.org
DCODE: http://www.dcode.org/
ECR Browser: http://ecrbrowser.dcode.org/
Multispecies (not limited to pairwise comparisons)
RESOURCES - URLs: Summary

Application/Resource    URL
Genome Browser          http://genome.ucsc.edu
JASPAR                  http://jaspar.genereg.net/
Gene Regulation         http://www.gene-regulation.com
GenomeTrafac            http://genometrafac.cchmc.org
DCODE                   http://www.dcode.org/
I have a list of co-expressed mRNAs (Transcriptome)… I want to find the shared cis-elements: known and novel

Known transcription factor binding sites (TFBS)
  Conserved: oPOSSUM, DiRE
  Non-conserved: Pscan, MatInspector (*Licensed)
Unknown TFBS or novel motifs
  Conserved: oPOSSUM, Weeder-H
  Non-conserved: MEME, Weeder

1. Each of these applications supports different forms of input. Very few support probeset IDs.
2. Red font: input sequence required; these tools do not support gene symbols, gene IDs, or accession numbers. The advantage is that you can use them for scanning sequences from any species.
3. *Licensed software: we have access to the licensed version.
oPOSSUM (http://burgundy.cmmt.ubc.ca/oPOSSUM/)
Supports human and mouse

oPOSSUM (http://www.cisreg.ca/oPOSSUM)
Disadvantage: supports only human or mouse
The JASPAR PHYLOFACTS database consists of 174 profiles that were extracted from phylogenetically conserved gene upstream elements. They are a mix of known and as yet undefined motifs.
When should it be used? The profiles are useful when one expects that other factors might determine promoter characteristics and/or tissue specificity.
The Z-score statistic reflects the occurrence of the TFBS in the promoters of the co-expressed set compared to background.
The Fisher statistic reflects the proportion of genes that contain the TFBS compared to background.
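The difference between the two statistics is easier to see in code. The sketch below is a simplified illustration and not oPOSSUM's actual implementation: the Z-score is modelled here as a normal approximation to a binomial over site counts, and the Fisher statistic as a one-tailed hypergeometric test over gene counts. The function names, the assumption that the background set is disjoint from the co-expressed set, and all numbers in the example are hypothetical.

```python
import math


def z_score(sites_target, length_target, sites_background, length_background):
    """Site-level enrichment: observed vs expected TFBS occurrences in the target set."""
    rate = sites_background / length_background      # background sites per nucleotide
    expected = rate * length_target
    sd = math.sqrt(length_target * rate * (1 - rate))
    return (sites_target - expected) / sd


def fisher_one_tailed(hits_target, n_target, hits_background, n_background):
    """Gene-level enrichment: P(at least hits_target genes with a site) by chance."""
    total = n_target + n_background
    total_hits = hits_target + hits_background
    denominator = math.comb(total, n_target)
    upper = min(n_target, total_hits)
    numerator = sum(
        math.comb(total_hits, k) * math.comb(total - total_hits, n_target - k)
        for k in range(hits_target, upper + 1)
    )
    return numerator / denominator


# Hypothetical numbers: 20 sites in 10 kb of target promoters vs 100 sites in 100 kb
# of background, and 8 of 10 target genes with a site vs 200 of 1000 background genes.
z = z_score(20, 10_000, 100, 100_000)
p = fisher_one_tailed(8, 10, 200, 1000)
print(f"z = {z:.2f}, Fisher p = {p:.2e}")
```

A TFBS with many sites concentrated in a few genes can score well on the Z-score but poorly on the Fisher statistic, which is why the two are reported side by side.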
DiRE (http://dire.dcode.org/)
ECR-Browser (http://ecrbrowser.dcode.org/)
Pscan (http://159.149.109.9/pscan)
Comparing different input gene sets:
1. In the detailed output for a given matrix, you can compare the results obtained with the matrix on the gene set just submitted against the results the same matrix produced on another gene set. The latter could be a "negative" gene set (or vice versa).
2. To perform the comparison, fill in the "Compare with..." box fields with the mean, standard deviation, and sample size values of the other analysis. For the current analysis, you can find them in the "Sample Data Statistics" box or in the overall text output that can be downloaded from the main output page.
3. Warning: make sure that the values you input are correct, and especially that they were obtained by using the same matrix. Once you have clicked the "Go!" button, an output window will pop up and report whether either of the two means is significantly higher than the other, together with a confidence p-value computed with a Welch t-test.
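The comparison described above needs only the mean, standard deviation, and sample size of each analysis. As a rough sketch (not Pscan's own code), the Welch t statistic and its approximate degrees of freedom can be computed from those summary values as follows; the function name and the example numbers are hypothetical.

```python
import math


def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom from summary data."""
    v1 = sd1 ** 2 / n1  # squared standard error of the first mean
    v2 = sd2 ** 2 / n2  # squared standard error of the second mean
    t = (mean1 - mean2) / math.sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df


# Hypothetical summary statistics for the same matrix on two gene sets.
t, df = welch_t(mean1=0.80, sd1=0.10, n1=25, mean2=0.70, sd2=0.20, n2=100)
print(f"t = {t:.3f}, df = {df:.1f}")
```

Turning t and df into a p-value requires the Student t distribution, which the Python standard library does not provide; if SciPy is available, `scipy.stats.t.sf(abs(t), df)` gives the one-tailed value.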
How to fetch promoter/upstream sequence (single/multiple)?
Use the fetched promoter/upstream sequences for the following analyses.
WeederH (http://159.149.109.9/modtools/)
1. Supports a large number of species.
2. Does not support multiple-sequence (multi-FASTA) input. You have to enter each sequence separately.
3. Good for a small number of sequences where you expect a potential novel conserved motif (or one not included in the TFBS libraries).
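Because each sequence has to be entered separately, it can help to split a multi-FASTA file into individual records before pasting them in. The following is a minimal sketch of such a splitter; the function name and the example sequences are made up for illustration, and it assumes plain FASTA text with ">" header lines.

```python
def split_fasta(text):
    """Split multi-FASTA text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequence lines may wrap across several lines
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Hypothetical input, for illustration only.
example = """>geneA_promoter
ACGTACGT
TTGACA
>geneB_promoter
GGGCGG
"""
records = split_fasta(example)
for name, sequence in records:
    print(f">{name}\n{sequence}\n")
```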
Weeder (http://159.149.109.9/modtools/)
Do not use GroupWise mail when submitting a large number of sequences: the results are sent in the body of the e-mail, not as an attachment, and GroupWise truncates very long messages. Use Gmail instead. (Earlier, a link to the results page used to be sent.)
MEME (http://meme.sdsc.edu)
MEME takes as input a group of DNA or protein sequences and outputs as many motifs as requested. MEME uses statistical modeling techniques to automatically choose the best width, number of occurrences, and description for each motif.
Your MEME results consist of:
- your MEME results in HTML format
- your MEME results in XML format
- your MEME results in TEXT format
- the MAST results of searching your input sequences for the motifs found by MEME
MEME (http://meme.sdsc.edu)
TOMTOM can be used to find out whether an overrepresented motif in your sequences matches or is similar to a known TFBS.
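What "similar to a known TFBS" means can be illustrated with a toy motif comparison. The sketch below is not TOMTOM's algorithm: it assumes two motifs of equal width that are already aligned, represents each as a list of [A, C, G, T] probability columns, and averages the column-wise Pearson correlation. The function names and all three motifs are hypothetical.

```python
import math


def column_pearson(p, q):
    """Pearson correlation between two 4-element probability columns."""
    mean_p, mean_q = sum(p) / len(p), sum(q) / len(q)
    cov = sum((a - mean_p) * (b - mean_q) for a, b in zip(p, q))
    var_p = sum((a - mean_p) ** 2 for a in p)
    var_q = sum((b - mean_q) ** 2 for b in q)
    if var_p == 0 or var_q == 0:
        return 0.0  # a perfectly uniform column carries no preference to correlate
    return cov / math.sqrt(var_p * var_q)


def motif_similarity(motif1, motif2):
    """Mean column-wise correlation of two equal-width, pre-aligned motifs."""
    if len(motif1) != len(motif2):
        raise ValueError("this sketch only compares motifs of equal width")
    return sum(column_pearson(a, b) for a, b in zip(motif1, motif2)) / len(motif1)


# Hypothetical motifs: columns are [A, C, G, T] probabilities.
discovered = [[0.80, 0.10, 0.05, 0.05], [0.10, 0.10, 0.70, 0.10]]
known_close = [[0.70, 0.10, 0.10, 0.10], [0.05, 0.15, 0.75, 0.05]]
known_far = [[0.05, 0.05, 0.10, 0.80], [0.10, 0.70, 0.10, 0.10]]

print(round(motif_similarity(discovered, known_close), 3))
print(round(motif_similarity(discovered, known_far), 3))
```

A real comparison tool also tries different offsets and the reverse complement, and reports a significance value rather than a raw similarity.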
Summary
Cis-Element Finding Matrix

                               CONSERVED            NON-CONSERVED
KNOWN TFBS                     oPOSSUM, DiRE        Pscan, MatInspector*
NOVEL/UNKNOWN TFBS OR MOTIFS   oPOSSUM, WEEDER-H    MEME, WEEDER
RESOURCES - URLs: Summary

Application/Resource    URL
oPOSSUM                 http://burgundy.cmmt.ubc.ca/oPOSSUM/
DiRE                    http://dire.dcode.org/
Weeder-H                http://159.149.109.9/modtools/
Weeder                  http://159.149.109.9/modtools/
Pscan                   http://159.149.109.9/pscan
MEME                    http://meme.sdsc.edu/
MatInspector            http://www.genomatix.de/
Genome Browser          http://genome.ucsc.edu
ECR Browser             http://ecrbrowser.dcode.org

Additional exercise available at: http://anil.cchmc.org/grn.html