Deidrey Langat Shinen Lo Mahmoud Rezaei Carissa Tudor.


1 Deidrey Langat Shinen Lo Mahmoud Rezaei Carissa Tudor

2 Hypothesis Probes that are overexpressed after demethylation treatment will have a significantly higher occurrence of CpG islands around them than probes that are not overexpressed following demethylation. I.e., probe overexpression is correlated with the number of CpG islands found within the 10,000 base pairs upstream of the probe.

3 Sample

  0.010  mRNA   chr5:42836013-42826013     0
  0.021  mRNA   chr11:74594777-74584777    0
  0.021  mRNA   chr21:45130327-45120327    0
  0.022  mRNA   chr22:20444866-20434866    0
  0.023  mRNA   chr2:68477967-68467967     0
  0.023  ncRNA  chr6:160020430-160010430   0
  0.032  ncRNA  chr16:24135824-24125824    0
  0.033  ncRNA  chr2:160837124-160827124   1
  0.036  ncRNA  chr1:42738071-42728071     0
  0.036  ncRNA  chr1:144346199-144336199   0
111.544  mRNA   chr18:59323230-59313230    0
115.002  mRNA   chr4:6746670-6736670       0
118.500  mRNA   chr7:72884859-72874859     1
124.146  mRNA   chr2:172672477-172662477   2
308.952  mRNA   chr7:93354113-93344113     0
 63.900  ncRNA  chr7:41691439-41681439     0
 77.180  ncRNA  chr13:105827406-105817406  0
125.522  ncRNA  chr18:53866484-53856484    1
345.396  ncRNA  chr6:25808985-25798985     0
498.672  ncRNA  chr16:2455344-2445344      1
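To relate the sample back to the hypothesis, the rows can be split into a low and a high group and the average final-column value compared. The sketch below is only a descriptive comparison, not a significance test. It assumes the first column is the expression value and the last column is the number of CpG islands found in the window; the cutoff of 1.0 and the name mean_islands are illustrative choices, not part of the original slides.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Rows copied from the Sample slide:
# [ value, RNA type, upstream window, count in last column ]
my @rows = (
    [  0.010, 'mRNA',  'chr5:42836013-42826013',    0],
    [  0.021, 'mRNA',  'chr11:74594777-74584777',   0],
    [  0.021, 'mRNA',  'chr21:45130327-45120327',   0],
    [  0.022, 'mRNA',  'chr22:20444866-20434866',   0],
    [  0.023, 'mRNA',  'chr2:68477967-68467967',    0],
    [  0.023, 'ncRNA', 'chr6:160020430-160010430',  0],
    [  0.032, 'ncRNA', 'chr16:24135824-24125824',   0],
    [  0.033, 'ncRNA', 'chr2:160837124-160827124',  1],
    [  0.036, 'ncRNA', 'chr1:42738071-42728071',    0],
    [  0.036, 'ncRNA', 'chr1:144346199-144336199',  0],
    [111.544, 'mRNA',  'chr18:59323230-59313230',   0],
    [115.002, 'mRNA',  'chr4:6746670-6736670',      0],
    [118.500, 'mRNA',  'chr7:72884859-72874859',    1],
    [124.146, 'mRNA',  'chr2:172672477-172662477',  2],
    [308.952, 'mRNA',  'chr7:93354113-93344113',    0],
    [ 63.900, 'ncRNA', 'chr7:41691439-41681439',    0],
    [ 77.180, 'ncRNA', 'chr13:105827406-105817406', 0],
    [125.522, 'ncRNA', 'chr18:53866484-53856484',   1],
    [345.396, 'ncRNA', 'chr6:25808985-25798985',    0],
    [498.672, 'ncRNA', 'chr16:2455344-2445344',     1],
);

# Mean of the last column over rows on one side of the cutoff.
# $want_high is 1 for rows at or above the cutoff, 0 for rows below it.
sub mean_islands {
    my ($cutoff, $want_high, @data) = @_;
    my ($sum, $n) = (0, 0);
    for my $r (@data) {
        my $is_high = $r->[0] >= $cutoff ? 1 : 0;
        next if $is_high != $want_high;
        $sum += $r->[3];
        $n++;
    }
    return $n ? $sum / $n : 0;
}

my $cutoff = 1.0;    # assumed split between the low and high rows
printf "low group mean:  %.1f\n", mean_islands($cutoff, 0, @rows);
printf "high group mean: %.1f\n", mean_islands($cutoff, 1, @rows);
```

With only ten rows per group, a difference in these means is at most suggestive; a proper test on the full data set would be needed to support the hypothesis.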

4 Perl: Pseudocode
1. Open xls file and sort using Perl
2. Read in data of top and lowest expression levels based on demethylation treatment
3. Navigate to Genome Browser database with parameters set as follows:
   * Groups - "All Tracks"
   * Track - "CpG Islands"
4. Submit each of the chromosomal locations from the xls file as strings to the Genome Browser database
5. Click on 'Get output' to locate CpG islands
6. Count # of results
7. Record # of results for each sample data
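Steps 6 and 7 amount to counting how many islands overlap each upstream window. A minimal sketch of that counting step follows, assuming the island coordinates have already been obtained and are held in memory. The function name count_islands and the island coordinates are made up for illustration and are not real genome data.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Count islands on $chr that overlap the window [$win_start, $win_end].
# $islands is a reference to a list of [chromosome, start, end] entries.
sub count_islands {
    my ($chr, $win_start, $win_end, $islands) = @_;

    # accept windows given in either order (e.g. 15000-5000)
    ($win_start, $win_end) = ($win_end, $win_start) if $win_start > $win_end;

    my $count = 0;
    for my $isl (@$islands) {
        my ($i_chr, $i_start, $i_end) = @$isl;
        next unless $i_chr eq $chr;
        # two intervals overlap when each starts before the other ends
        $count++ if $i_start <= $win_end && $i_end >= $win_start;
    }
    return $count;
}

# Hypothetical island coordinates, for illustration only
my @islands = (
    ['chr1',  4800,  5200],    # straddles the window start
    ['chr1',  9000,  9500],    # fully inside the window
    ['chr1', 15001, 15500],    # just past the window end
    ['chr2',  9000,  9500],    # different chromosome
);

print count_islands('chr1', 5000, 15000, \@islands), "\n";    # prints 2
```

A linear scan is enough for a few thousand windows; for genome-wide island lists, sorting by start position per chromosome would allow a faster lookup.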

5 Perl: Step 1-Reading Excel file Code:

    #!/usr/bin/perl -w
    use strict;
    use Win32::OLE qw(in with);
    use Win32::OLE::Const 'Microsoft Excel';

    $Win32::OLE::Warn = 3;    # die on errors...

    # get already active Excel application or open new
    my $Excel = Win32::OLE->GetActiveObject('Excel.Application')
        || Win32::OLE->new('Excel.Application', 'Quit');

    # open Excel file
    my $Book = $Excel->Workbooks->Open("C:/Documents and Settings/Mahmoud/Desktop/Claremont/forfirst1000highmRNA.xls");

    # You can dynamically obtain the number of worksheets, rows, and columns
    # through the Excel OLE interface. Excel's Visual Basic Editor has more
    # information on the Excel OLE interface. Here we just use the first
    # worksheet, rows 1 through 1000 and column 55.

    # select worksheet number 1 (you can also select a worksheet by name)
    my $Sheet = $Book->Worksheets(1);

    my $newvalue = 10000;    # number of base pairs to look upstream

    foreach my $row (1..1000) {
        foreach my $col (55) {
            # skip empty cells
            next unless defined $Sheet->Cells($row,$col)->{'Value'};

            # print out the contents of a cell and the upstream range
            printf "At ($row, $col) the value is %s and we looked at upstream from %s to %s\n",
                $Sheet->Cells($row,$col)->{'Value'},
                $Sheet->Cells($row,$col)->{'Value'} - $newvalue,
                $Sheet->Cells($row,$col)->{'Value'};
        }
    }

    # clean up
    $Book->Close;

6 Perl: Step 1-Reading Excel file Output:

    At (1, 55) the value is 65370390 and we looked at upstream from 65360390 to 65370390
    At (2, 55) the value is 30769374 and we looked at upstream from 30759374 to 30769374
    At (3, 55) the value is 99484506 and we looked at upstream from 99474506 to 99484506
    At (4, 55) the value is 42484609 and we looked at upstream from 42474609 to 42484609
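Step 4 of the pseudocode needs each position as a location string rather than a printed sentence. A minimal sketch of that conversion follows, assuming the chromosome name is available alongside the position. The function name upstream_window is hypothetical, and, like the slides, it simply subtracts 10,000 without considering strand.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Build a "chrN:start-end" string covering $len base pairs before $pos.
# $len defaults to 10,000, the window size used in the slides.
sub upstream_window {
    my ($chr, $pos, $len) = @_;
    $len = 10_000 unless defined $len;

    my $start = $pos - $len;
    $start = 1 if $start < 1;    # do not run off the start of the chromosome

    return sprintf "%s:%d-%d", $chr, $start, $pos;
}

# Position taken from the first row of the Sample slide
print upstream_window('chr5', 42836013), "\n";    # chr5:42826013-42836013
```

The lower coordinate is written first here, whereas the Sample slide lists the higher coordinate first; which order a given tool accepts should be checked before submitting.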

7 Step 2-4

8 Step 5

9 FASTA Format Code:

    #!/bin/perl -w
    use strict;
    use Bio::Seq;
    use Bio::SeqIO;

    # build two example sequence objects
    my $seq_obj = Bio::Seq->new(-seq        => "aaaatgggggggggggccccgtt",
                                -display_id => "#12345",
                                -desc       => "example 1",
                                -alphabet   => "dna");

    my $seq_obj2 = Bio::Seq->new(-seq        => "aaaatgggggggggggcccccccccc",
                                 -display_id => "#12346",
                                 -desc       => "example 2",
                                 -alphabet   => "dna");

    # open sequence.fasta for writing once, then write both records
    my $seqio_obj = Bio::SeqIO->new(-file   => '>sequence.fasta',
                                    -format => 'fasta');
    $seqio_obj->write_seq($seq_obj);
    $seqio_obj->write_seq($seq_obj2);

10 FASTA Format Output:

    >#12345 example 1
    aaaatgggggggggggccccgtt
    >#12346 example 2
    aaaatgggggggggggcccccccccc
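The FASTA layout is simple enough to read back with core Perl alone: a line starting with ">" carries the ID and description, and the lines after it carry the sequence. The following is a minimal sketch under that assumption, not a replacement for Bio::SeqIO. The function name parse_fasta is hypothetical.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Split FASTA text into records of the form { id, desc, seq }.
sub parse_fasta {
    my ($text) = @_;
    my @records;
    for my $line (split /\n/, $text) {
        $line =~ s/\s+$//;           # strip trailing whitespace
        next if $line eq '';
        if ($line =~ /^>(\S+)\s*(.*)$/) {
            # header line: first word is the ID, the rest is the description
            push @records, { id => $1, desc => $2, seq => '' };
        }
        elsif (@records) {
            # sequence line: append to the most recent record
            $records[-1]{seq} .= $line;
        }
    }
    return @records;
}

# The two records shown on the output slide
my $fasta = <<'END';
>#12345 example 1
aaaatgggggggggggccccgtt
>#12346 example 2
aaaatgggggggggggcccccccccc
END

for my $rec (parse_fasta($fasta)) {
    printf "%s (%s): %d bases\n", $rec->{id}, $rec->{desc}, length($rec->{seq});
}
```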

11 Bioperl Code:

    #!/local/bin/perl -w
    use Bio::DB::GenBank;

    # connect to GenBank, retrieving entries as FASTA via a temporary file
    my $gb = Bio::DB::GenBank->new(-retrievaltype => 'tempfile',
                                   -format        => 'Fasta');

    # fetch the entry with ID AB000460
    my $seq = $gb->get_Seq_by_id("AB000460");

    print $seq->id, "\n";
    print $seq->desc(), "Sequence:\n";
    print $seq->seq(), "\n";
    exit;

12 CACAATGACATGCAGACCTGCATATTGGAGCTGGACGGAGAAACTGGGCTAATGTGACAGACAGCAACAAGAGTAAGGCAGTTGC TTCGCTATTGAGAGAAAGAACCATATGAAGAAATTTCGGCAGAGGCGGACCGGGAACCTCAGCAGCTGCAGAACTACTGGTCAGA AGTGCGCTACACGGTGCGCTGCATCTACCGCCAGGCAGGAACCCCGCTGGCAGATGACCAGGACCAGTCTCTGGTGCCTGACAA GGAGGGAGTGAAGGAGCTCGTGGATAGGCTCTGCGAGAGGGACCCCTACCAGCTGTACCAGCGTCTGGAACAGCAAGCTCGAGA GTATGTGCTGGAGATGAAGGTCCGCCTGCTCCGGCAGCTGTCGGCTGCGGCCAAGGTGAAGGCACCATCTGGCCTGCAGGGCCC GCCGCAAGCGCACCAGTTCATCTCCCTCCTGCTTGAGGAGTACGGCGCCCTCTGCCAGGCCGCACGCTCCATCAGCACCTTCCTT GGCACTCTGGAAAATGAACACTTGAAAAAGTTCCAAGTGACGTGGGAACTGCATAATAAACACCTGTTTGAAAATCTGGTCTTTTCG GAGCCACTTCTTCAGAGCAACTTGCCCGCACTGGTGTCACAGATCAGGCTAGGAACCACCACACACGACACCTGCAGTGAGGACA CATACAGTACCTTGCTGCAGAGGTACCAGCGTTCCGAGGAGGAGCTGCGCAGAGTCGCCGAGGAGTGGCTGGAGTGCCAGAAGA GGATCGACGCCTATGTCGACGAGCAGATGACAATGAAAACCAAGCAGCGCATGTTAACAGAAGACTGGGAGCTTTTTAAACAAAGA AGATTCATTGAAGAACAGTTAACCAATAAGAAAGCAGTTACTGGCGAGAACAACTTCACAGACACCATGAGGCACGTGTTATCGTC CCGGCTGAGCATGCCCGACTGCCCCAACTGCAACTACAGGAGAAGATGTGCTTGCGATGACTGCAGTCTCTCACACATCCTCACG TGTGGTATCATGGACCCCCCCGTCACTGATGACATCCACATTCACCAGCTCCCACTTCAAGTGGATCCTGCTCCTGACTATCTTGC TGAGAGGAGCCCGCCCAGTGTGTCATCTGCAAGCTCGGGGTCCGGCTCCAGCTCTCCCATCACAATTCAGCAGCACCCCAGGCT CATCCTCACAGACAGTGGCTCGGCACCAACTTTTTGTAGTGATGATGAAGATGTTGCACCATTGTCAGCCAAATTTGCTGATATTTA TCCATTGAGTAATTATGATGATACCGAGGTGGTGGCCAACATGAATGGAATCCACAGCGAATTGAATGGTGGCGGGGAAAACATGG CCCTGAAGGATGAGTCTCCTCAGATAAGCAGTACCAGCAGTAGTTCCTCAGAAGCTGATGATGAAGAAGCGGACGGCGAGAGTAG TGGGGAGCCCCCAGGGGCCCCGAAGGAAGATGGAGTGCTGGGAAGCAGGAGCCCCAGGACAGAGGAGAGCAAAGCAGACAGTC CACCCCCATCCTACCCAACACAGCAGGCTGAACAAGCTCCAAACACTTGTGAATGTCATGTTTGTAAGCAAGAAGCTTCTGGACTG ACACCATCTGCAATGACAGCCGGAGCCCTTCCTCCTGGCCATCAGTTCTTGAGCCCAGAGAAGCCCACACACCCTGCACTGCACC TTTACCCTCACATCCATGGACATGTGCCTTTGCACACTGTTCCACACCTGCCACGCCCTCTCATCCACCCCACCTTGTATGCAACG CCCCCCTTCACACACAGTAAGGCTTTACCGCCAGCACCTGTTCAGAATCACACAAATAAGCATCAGGTATTCAATGCATCTCTTCAA GACCATATTTATCCGAGCTGTTTTGGGAATACTCCAGAGTGGAATAGTTCTAAATTTATAAGTCTTTGGGGATCAGAAGTGATGAAT 
GATAAGAACTGGAATCCTGGCACTTTCTTGCCAGATACAATTTCTGGGAGTGAAATATTAGGGCCAACACTCTCAGAAACAAGACC GGAAGCCCTTCCACCTCCATCTAGCAATGAAACACCTGCAGTCTCGGATAGTAAAGAGAAAAAGAATGCTGCAAAAAAGAAATGTTT ATACAATTTCCAAGATGCTTTCATGGAAGCAAATAAAGTTGTCATGGCCACGTCATCAGCCACGTCCTCTGTGTCCTGCACAGCTAC CACAGTGCAGTCCAGCAACAGCCAGTTCAGAGTGTCATCCAAGAGACCTCCTTCAGTAGGTGACGTGTTTCATGGCATCAGCAAG GAGGACCACAGACACTCGGCCCCAGCCGCCCCGAGGAATAGCCCCACGGGCTTGGCCCCCCTCCCAGCGCTCTCGCCTGCTGC GCTGTCACCTGCTGCGCTCTCACCTGCCTCCACACCTCACCTTGCAAATCTTGCAGCCCCATCATTCCCCAAAACAGCAACCACAA CTCCTGGGTTTGTGGACACACGCAAGAGTTTCTGTCCTGCACCCCTACCCCCGGCCACAGATGGCTCCATTAGCGCCCCTCCAAG TGTCTGCAGTGACCCTGACTGCGAAGGGCACCGCTGCGAGAATGGTGTCTACGACCCACAGCAGGATGATGGGGACGAGAGTGC AGATGAGGACAGCTGCTCTGAGCACAGCTCCAGCACCTCGACCTCCACCAACCAGAAGGAGGGCAAGTACTGCGACTGCTGCTAC TGCGAATTCTTTGGGCACGGCGGGCCTCCAGCTGCACCAACAAGTAGAAATTATGCAGAAATGAGGGAAAAGCTTCGCTTACGGC TGACCAAGAGGAAAGAGGAGCAACCTAAAAAAATGGACCAGATCTCAGAAAGGGAAAGCGTCGTTGACCATCGGAGGGTGGAGGA TTTGTTGCAGTTTATAAATAGCTCCGAAACCAAACCAGTGAGCAGCACGCGTGCAGCGAAGCGAGCAAGGCATAAGCAAAGGAAG CTGGAGGAGAAAGCTCGCCTAGAAGCAGAGGCCAGGGCCCGGGAGCACCTGCACCTCCAGGAGGAGCAGAGGCGGCGGGAGG AGGAGGAGGATGAGGAAGAAGAGGAGGATCGTTTCAAGGAGGAATTTCAGCGGCTTCAGGAGCTTCAGAAGCTAAGAGCTGTAAA AAAGAAGAAGAAGGAGAGGCCAAGTAAAGACTGCCCCAAGTTGGACATGCTCACTAGAAATTTCCAGGCAGCAACAGAGTCTGTTC CTAACTCTGGAAACATCCACAATGGCTCACTAGAGCAAACTGAAGAACCAGAAACCTCTTCTCACTCCCCATCCAGGCATATGAAC CACTCAGAGCCCAGGCCAGGGCTAGGGGCTGATGGGGATGCTGCAGACCCCGTCGACACCAGAGACTCCAAATTTCTCCTCCCC AAGGAGGTGAATGGGAAGCAGCATGAGCCACTCTCTTTTTTCTTCGACATCATGCAGCACCATAAAGAAGGAAATGGCAAGCAGAA GCTGAGGCAGACCAGCAAGGCCAGCAGCGAGCCAGCGAGGAGGCCCACAGAGCCCCCCAAGGCCACAGAGGGGCAGTCCAAG CCCCGGGCCCAGACTGAGTCAAAGGCTAAGGTGGTCGACCTCATGTCCATCACAGAGCAGAAAAGAGAGGAGAGAAAAGTCAACA GTAATAACAATAACAAAAAGCAGCTGAACCACATCAAGGACGAAAAGTCAAACCCAACCCCTATGGAGCCCACCTCTCCCGGTGAG CATCAGCAGAACAGCAAGCTGGTGCTGGCAGAGTCCCCTCAGCCAAAGGGCAAGAACAAGAAAAATAAGAAGAAGAAAGGAGACA GAGTCAACAATTCAATTGATGGAGTTTCGCTCTTGTTGCCCAGTCTGGGGTACAATGGTGCAATCTTGGCTCACTGCAACCTTCGC 
CTCCCAGGTTCAAGCGATTGTGCTGCCTCAGCCTCCCAAGTAGTTGGAATTACAGATGATGTCTTTCTACCTAAAGATATTGACCTA GACAGTGTGGATATGGATGAGACAGAGAGGGAAGTGGAATATTTCAAAAGGTTCTGCTTGGATTCTGCTAGACAGACCCGACAAAG ACTGTCTATCAACTGGTCCAATTTTAGCTTGAAAAAAGCCACCTTTGCTGCCCACTGAATGAGGACTCCCTGGAGAGGGACACGCG AGAGGCAGGCCAGGCTGCACCACCCCAAGAGCCACGCCCCTCGCTGGCGCCCCAGAGCCGTGGTGCTTGCCAAGGGCTGTGCG GAGCTGGTGCTGCCTGAAACCCCAGACCGAGAAGTTGATGCTCGGCCCACGCCGTTAGCTCGTGTGCGTGTAGTCTGTGCGTGA GACTCCTTCGATTGTAGCTCTGTGCTGTCGGATTGGAACAGTAGTTCCCGCCAAGTCCTCCCACCACCGCGGCCTCGGAGGCCTG GGCCGTGGCCAGATAGGAGTTTGCATCATCCACGTGGCTCCGTTGCCTCTGCATTGCGCCCTGTCCTGTCATGTGTCCTCACCGG GGTATCGGCCGTCACTCAGCTCTCCTGTGCCCCTGCGTCTCACCCTAGGCGGGCTGGGCGGGGCAGGCCTCCTTTGTTCTCCAC AATCTACTGTCTCCGAGTGTACACGTTGCGCTGTTTGTGTTTGATCCCCCTGACTTGTAGCCAGCTTGTGTAAGATCCCTTGCAGAA CGAGAAAGTTAAAAACAAGCCCACCCAGTACTCACACCATCAAGTCTGTTATAGAGTGTACGACTGTATTAACACGGAGGCCTGCC TGGCTACTTTTTTAACATATTGTTAAGTAATATTAAAATCATGTCTTTCTTTTTGAAAGATG

13 Thank you!

