Using Isoform-Sensitive Microarrays to Study Different Modes of Alternative Splicing Christina Zheng Ares Lab RNA Club September 14, 2006
Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions
Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions
Splicing Arrays Used to assay and identify splicing changes associated with different biological conditions –muscle specific alternative splicing –alternative splicing associated with nonsense mediated decay The first splicing array was made in yeast –Clark et al. Science 2002 Mammalian splicing arrays – –Johnson et. al. Science 2003 – –Pan et. al. Mol. Cell 2004 –Li et. al. Cancer Research 2006 –Le et. al. Nucleic Acids Research 2004 –Sugnet et. al. PLoS 2006
Affymetrix Mouse Splicing Array 5 X mer probes probes are grouped intro probesets (6-10 probes) gene probesets - 8 – 10 probes placed in common regions exon probesets exon-exon junction probesets – 6 probesets across 30 nucleotides 15,000+ genes Sugnet et al. PLoS Comput. Bio inflexible probe selection greater chance of cross-hyb
Splicing Arrays – AS events All exon-exon junctions of human mRNA RefSeq – –Johnson et. al. Science 2003 Focused on simple cassette exon events – –Pan et. al. Mol. Cell 2004 Focused on simple AS events with two isoforms –Le et. al. Nucleic Acids Research 2004 –Ule et al. Nature Genetics 2005 –Sugnet et. al. PLoS 2006 –Li et. al. Cancer Research 2006 Skip to include ratio –one measurement for each event –not applicable to more complicated modes of AS
Difficulties with Splicing Arrays Greater potential of probe cross-hybridization –inflexibility in probe selection due to location of events exon probes – restricted to the alternative exon exon-exon junction probes – restricted to exon-exon junction Alternative splicing (AS) events –identifying/annotating them –analyzing different modes of AS more complex with a greater number of isoforms
Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions
Probe Remapping Tools used to remap onto the May 2004 mouse assembly –GMAP Wu et al. Bioinformatics 2005 –BLAT –home-made junction database used GMAP to align all mRNA and EST from unigene made a database of sequences and genomic coordinates of all exon-exon junctions Remapped probes –uniquely mapped 25mer: –multiple hits: (cross-hyb to other genes) –not mapped 25mer: missed exon-exon junction SNPs changed from old mouse assembly to new
Remapping Probes
Potential Cross-hybridization Potential cross-hybridization –BLAST ~400,000 uniquely mapped probes Cutoff for the level of similarity to other genes –how do different levels of similarity affect probe intensity? –took probes which only hit 2 genes hit 25nt to one gene hit at different level to another (24nt, 23nt, 22nt ….) –choose a cutoff based on the how the probe behavior in each class
Probe Analysis
Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions
Analysis of Affy Splicing Array Previous work –Ule et al. Nature Genetics 2005 –Sugnet et al. PLoS 2006 Focused on simple cassette exon events and or simple two isoform events Using a variation of skip to include ratio Array was designed with more complicated events
Splicing Event Probe Groupings Annotated AS events –exonwalk identifies and annotates events, no matter how complicated the event Mapped the probes onto annotated events 3418 AS events: –1 isoform: 2002 –2 isoforms: 892 –3 isoforms: 182 –4 isoforms: 95 –5 isoforms: 44 –6 or more isoforms: 203
Isoform Ratio Isoform 1Isoform 2 Isoform 3 Isoform Ratio (IR) = isoform i isoform isoform = isoform1+isoform2+isoform3 isoform1 isoform isoform2 isoform isoform3 isoform
Isoform Ratio Significance Analysis of Microarrays (SAM) –identify statistically significant IRs –based on a modified t test - ‘relative difference’, s = standard deviation; s 0 = small positive constant, s = standard deviation; s 0 = small positive constant q value - min false discovery rate (FDR) Storey J. Roy. Stat. Soc. Ser. B 2002 – (FDR) –the minimum FDR incurred for calling a specific isoform significant –analogous to p-value for false positive rate –can use a q-value as a specific cutoff much like a p-value x t - x c s+s 0 # of false positives # of significant isoforms
Isoform Ratio Identifying muscle specific AS events – C2C12 myoblast differentiation system Run samples on Affymetrix mouse splicing array C2C12 stem cells differentiate stem cells myo-tubule formation isolate control RNAisolate test RNA
Analysis Pipeline Background correction, normalization, and probe summarization –RMA (Irizarry et al. Biostatistics 2003) Grouping probesets into splicing events –mapping probesets onto annotated AS events –calculating IR Grouping probesets into genes –average of all probesets within a gene Use SAM (Tusher et al. PNAS 2001) to test significance differences between test and control –q-value (min false discovery rate) Storey J. Roy. Stat. Soc. Ser. B 2002 Display results on dataviewer
Splicing Array Dataviewer
GeneViewer
Muscle Specific AS events DnaJ (Hsp40) homolog Coro6, actin binding protein upregulated
AAAAA Multiple rounds of normal translation STOP Ribosome includeskip AAAAA STOP EJC Stop codon is in last exon EJC Premature stop codon (PTC) >50nt AAAAA STOP EJC AAAAA NMD STOP Ribosome EJC Example: PTB Isoform Expression Connection between AS and nonsense-mediated decay (NMD) Block NMD and assay for changes in individual isoform changes
Isoform Expression Isoform 1Isoform 2 Isoform 3 Isoform Expression (IE) = log (isoform i) – log (gene) gene = probes in gene log (isoform1) – log(gene)log (isoform2) – log(gene)log (isoform3) – log(gene)
Analysis Pipeline Background correction, normalization, and probe summarization –RMA (Irizarry et al. Biostatistics 2003) Grouping probesets into splicing events –mapping probesets onto predefined AS events –calculating IE Grouping probesets into genes –average of all probesets within a gene Use SAM (Tusher et al. PNAS 2001) to test the significance between test and control –q-value (min false discovery rate) Display results on dataviewer
AS associated with NMD SAT1 - spermidine/spermine N1-acetyl transferase 1 –down regulates polyamine levels in the cell –the inclusion of an alternative exon throws it out of frame NMD –block NMD under conditions which SAT1 is needed polyamine and polyamine analog (BENSPM) expect inclusion of the exon the be repressed –missed by previous analysis methods because this event is an example of having probes for only one of the isoforms
Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions
Probe cross-hybridization –18bp cross-hyb level – behavior of exon probes vs exon-exon junction probes Different modes of AS –better classification of the more complicated modes Future Directions
Acknowledgements Ares Lab Manny Ares John-Paul Donohue Leslie Grate Roland Nagel Julie Ni Lily Shiue Charles Sugnet
Splicing Arrays Clark et. al. Science nt probes each intron-containing gene in yeast Splice Junction (SJ) Index = log - log (SJ mut ) (SJ wt ) (EX mut ) (EX wt ) Normalize out gene expression