Background & Motivation Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Exploring Alternative Splicing Features.

Slides:



Advertisements
Similar presentations
Particle swarm optimization for parameter determination and feature selection of support vector machines Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen,
Advertisements

Optimization of SVM Parameters for Promoter Recognition in DNA Sequences Robertas Damaševičius Software Engineering Department, Kaunas University of Technology.
Two short pieces MicroRNA Alternative splicing.
Ab initio gene prediction Genome 559, Winter 2011.
15-20 september WABI031 A Method to Detect Gene Structure and Alternative Splice Sites by Agreeing ESTs to a Genomic Sequence Paola Bonizzoni Graziano.
1 Alternative Splicing. 2 Eukaryotic genes Splicing Mature mRNA.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
Alternative splicing and evolution Daniel Jeffares.
Optimized Numerical Mapping Scheme for Filter-Based Exon Location in DNA Using a Quasi-Newton Algorithm P. Ramachandran, W.-S. Lu, and A. Antoniou Department.
Bioinformatics Alternative splicing Multiple isoforms Exonic Splicing Enhancers (ESE) and Silencers (ESS) SpliceNest Lecture 13.
Lecture 12 Splicing and gene prediction in eukaryotes
A Multivariate Biomarker for Parkinson’s Disease M. Coakley, G. Crocetti, P. Dressner, W. Kellum, T. Lamin The Michael L. Gargano 12 th Annual Research.
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
Progress report Yiming Zhang 02/10/2012. All AS events in ASIP Intron retention Exon skipping Alternative Acceptor site NAGNAG AltA Alternative Donor.
Signposts for translation initiation: An illustration of formulating a research project Xuhua Xia
Prediction model building and feature selection with SVM in breast cancer diagnosis Cheng-Lung Huang, Hung-Chang Liao, Mu- Chen Chen Expert Systems with.
Xuhua Xia Signposts for translation initiation: An illustration of formulating a research project Xuhua Xia.
Queensland University of Technology CRICOS No J Using a Beagle to sniff for Bacterial Promoters Stefan R. Maetschke, Michael Towsey and James M.
Assignment 2: Papers read for this assignment Paper 1: PALMA: mRNA to Genome Alignments using Large Margin Algorithms Paper 2: Optimal spliced alignments.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
Iowa State University Department of Computer Science Artificial Intelligence Research Laboratory Research supported in part by a grant from the National.
1 Transcript modeling Brent lab. 2 Overview Of Entertainment  Gene prediction Jeltje van Baren  Improving gene prediction with tiling arrays Aaron Tenney.
Fea- ture Num- ber Feature NameFeature description 1 Average number of exons Average number of exons in the transcripts of a gene where indel is located.
Exploring Alternative Splicing Features using Support Vector Machines Feature for Alternative Splicing Alternative splicing is a mechanism for generating.
Computational Genomics and Proteomics Lecture 8 Motif Discovery C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E.
Sackler Medical School
Parsing A Bacterial Genome Mark Craven Department of Biostatistics & Medical Informatics University of Wisconsin U.S.A.
Gene Prediction: Similarity-Based Methods (Lecture for CS498-CXZ Algorithms in Bioinformatics) Sept. 15, 2005 ChengXiang Zhai Department of Computer Science.
Motif discovery and Protein Databases Tutorial 5.
From Genomes to Genes Rui Alves.
Journal report: High Resolution Model of Transcription Factor- DNA Affinities Improve In Vitro and In Vivo Binding Predictions Paper by: Phadera Gius,
Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
A Non-EST-Based Method for Exon-Skipping Prediction Rotem Sorek, Ronen Shemesh, Yuval Cohen, Ortal Basechess, Gil Ast and Ron Shamir Genome Research August.
Genes and Genomes. Genome On Line Database (GOLD) 243 Published complete genomes 536 Prokaryotic ongoing genomes 434 Eukaryotic ongoing genomes December.
RBP1 Splicing Regulation in Drosophila Melanogaster Fall 2005 Jacob Joseph, Ahmet Bakan, Amina Abdulla This presentation available at
Pre-mRNA secondary structures influence exon recognition Michael Hiller Bioinformatics Group University of Freiburg, Germany.
How can we find genes? Search for them Look them up.
Comparative Genomics Methods for Alternative Splicing of Eukaryotic Genes Liliana Florea Department of Computer Science Department of Biochemistry GWU.
Research about Alternative Splicing recently 楊佳熒.
Feature Extraction Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and.
Splice Site Recognition in DNA Sequences Using K-mer Frequency Based Mapping for Support Vector Machine with Power Series Kernel Dr. Robertas Damaševičius.
JIGSAW: a better way to combine predictions J.E. Allen, W.H. Majoros, M. Pertea, and S.L. Salzberg. JIGSAW, GeneZilla, and GlimmerHMM: puzzling out the.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
TOX680 Unveiling the Transcriptome using RNA-seq Jinze Liu.
Typically, classifiers are trained based on local features of each site in the training set of protein sequences. Thus no global sequence information is.
Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features 王荣 14S
Motif Search and RNA Structure Prediction Lesson 9.
PRABINA KUMAR MEHER SCIENTIST DIVISION OF STATISTICAL GENETICS INDIAN AGRICULTURAL STATISTICS RESERARCH INSTITUTE INDIAN COUNCIL OF AGRICULTURAL RESEARCH.
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine 朱林娇 14S
MicroRNA Prediction with SCFG and MFE Structure Annotation Tim Shaw, Ying Zheng, and Bram Sebastian.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Final Report (30% final score) Bin Liu, PhD, Associate Professor.
Finding genes in the genome
Nawanol Theera-Ampornpunt, Seong Gon Kim, Asish Ghoshal, Saurabh Bagchi, Ananth Grama, and Somali Chaterji Fast Training on Large Genomics Data using Distributed.
Machine Learning and Data Mining: A Math Programming- Based Approach Glenn Fung CS412 April 10, 2003 Madison, Wisconsin.
1 Gene Finding. 2 “The Central Dogma” TranscriptionTranslation RNA Protein.
Regulation of Gene Expression
bacteria and eukaryotes
The Transcriptional Landscape of the Mammalian Genome
How to forecast solar flares?
Disease risk prediction
Babak Alipanahi1, Andrew Delong, Matthew T Weirauch & Brendan J Frey
Ab initio gene prediction
Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen, Zne-Jung Lee
Volume 3, Issue 4, Pages (April 2013)
Introduction to Alternative Splicing and my research report
Gene Structure.
Gene Structure.
Presentation transcript:

Background & Motivation Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Exploring Alternative Splicing Features using Support Vector Machines Jing Xia 1, Doina Caragea 1, Susan J. Brown 2 1 Computing and Information Sciences Kansas State University, USA 2 Bioinformatics Center Kansas State University, USA Jan

Background & Motivation Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Outline 1 Background & Motivation 2 Problem & Feature Construction Problem Definition Data Set Feature Construction 3 Experiments Design & Results Experimental Design Experimental Results 4 Conclusions and Future Work Conclusion

Background & Motivation Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Alternative Splicing exonintronexonintronexon DNA 5’UTRGTAGGTAG3’UTR Splicing: important step Trasncription TSS ATG during gene expression exonintron exon intron exon pre−mNRA cap Variable splicing process 5’UTRGU AGGT AG 3’UTR (Alternative splicing) one Splicing AUG gene -> many proteins mRNA Translation protein Genes expression: genes to pro- teins

Background & Motivation Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Alternative Splicing pre−mRNA Gene Splicing: important step during gene expression Alternative Splicing Variable splicing process (Alternative splicing) one transcript isoforms gene -> many proteins Proteins One genes to many proteins

Background & Motivation Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Patterns of Alternative Splicing Constitutively Spliced Exon (CSE) Alternatively Spliced Exon (ASE) Exon skipping (most CSE ASE frequent) exon1exon2exon3exon4 Alternative 5’ splice sites Alternative 3’ splice sites Intron retention Mutually exclusive Here, focus on predicting alternatively spliced exons (ASE) and constitutively spliced exons (CSE) based on SVM

Alternative splicing Wet lab experiments finding AS is time Traditionally, align EST to genome alignments (limited to amount of EST available to the genome) Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Background & Motivation

Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Identifying Alternative Splicing in genome Transcripts Alternative splicing Wet lab experiments finding AS is time genomic DNA consuming Traditionally, align EST to genome alignments (limited to amount of EST available to the genome) Alternative 3’ Exon Exon Skipping

Background & Motivation Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Identifying Alternative Splicing in genome Alternative splicing Wet lab experiments finding AS is time consuming Traditionally, align EST to genome alignments (limited to amount of EST available to the genome) Use machine learning algorithms that to predict AS at the genome level

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Problem Definition Problem Definition: given an exon, can we predict it as alternatively spliced exons (ASE) or constitutively spliced exons (CSE)? Constitutively Spliced Exon (CSE) Alternatively Spliced Exon (ASE) CSE ASE CSE exon1exon2exon3exon4

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Problem Addressed and Our Approach Problem Definition predict alternatively spliced exons (ASE) vs constitutively spliced exons (CSE) Use Support Vector Machine (SVM) Task:Two-class (ASE and CSE) classification problem Need:Training data set containing labeled examples (ASE & CSE) Learning: Train classifier with training data Application: Predict unknown ASE Need features to represent ASEs & CSEs

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Problem Addressed and Our Approach Problem Definition predict alternatively spliced exons (ASE) vs constitutively spliced exons (CSE) Use Support Vector Machine (SVM) Task:Two-class (ASE and CSE) classification problem Need:Training data set containing labeled examples (ASE & CSE) Learning: Train classifier with training data Application: Predict unknown ASE Need features to represent ASEs & CSEs

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Problem Addressed and Our Approach Problem Definition predict alternatively spliced exons (ASE) vs constitutively spliced exons (CSE) Use Support Vector Machine (SVM) Task:Two-class (ASE and CSE) classification problem Need:Training data set containing labeled examples (ASE & CSE) Learning: Train classifier with training data Application: Predict unknown ASE Need features to represent ASEs & CSEs

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Problem Addressed and Our Approach Problem Definition predict alternatively spliced exons (ASE) vs constitutively spliced exons (CSE) Use Support Vector Machine (SVM) Task:Two-class (ASE and CSE) classification problem Need:Training data set containing labeled examples (ASE & CSE) Learning: Train classifier with training data Application: Predict unknown ASE Need features to represent ASEs & CSEs

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Problem Addressed and Our Approach Problem Definition predict alternatively spliced exons (ASE) vs constitutively spliced exons (CSE) Use Support Vector Machine (SVM) Task:Two-class (ASE and CSE) classification problem Need:Training data set containing labeled examples (ASE & CSE) Learning: Train classifier with training data Application: Predict unknown ASE Need features to represent ASEs & CSEs

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Data Set Published data set from the model organism, C. elegans (worm) Includes alternatively spliced exons (ASE) and constitutively spliced exons (CSE) Contains 487 ASEs and 2531 CSEs 100-base local sequences around splice sites Example of data set ASE GTACTATAGCGTGCTG....ACCGTTCGTACTCGCT ASE ATACTATAGCGTCTTG....ACCGATCGTACACGCT CSE GTACTATAGCGTCTTG....ACCGATCGTACTCGCT AG exon GT AG − −

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Data Set Published data set from the model organism, C. elegans (worm) Includes alternatively spliced exons (ASE) and constitutively spliced exons (CSE) Contains 487 ASEs and 2531 CSEs 100-base local sequences around splice sites Example of data set ASE GTACTATAGCGTGCTG....ACCGTTCGTACTCGCT ASE ATACTATAGCGTCTTG....ACCGATCGTACACGCT CSE GTACTATAGCGTCTTG....ACCGATCGTACTCGCT AG exon GT AG − −

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Data Set Published data set from the model organism, C. elegans (worm) Includes alternatively spliced exons (ASE) and constitutively spliced exons (CSE) Contains 487 ASEs and 2531 CSEs 100-base local sequences around splice sites Example of data set ASE GTACTATAGCGTGCTG....ACCGTTCGTACTCGCT ASE ATACTATAGCGTCTTG....ACCGATCGTACACGCT CSE GTACTATAGCGTCTTG....ACCGATCGTACTCGCT AG exon GT AG − −

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Data Set Published data set from the model organism, C. elegans (worm) Includes alternatively spliced exons (ASE) and constitutively spliced exons (CSE) Contains 487 ASEs and 2531 CSEs 100- base local sequences around splice sites Previous work: Motifs captured and identified by kernel G. Ratch et al., Length of exons and flanking introns Sorek et al. Our work: Exploit more biologically significant features Use several additional approaches to derive features

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Data Set Published data set from the model organism, C. elegans (worm) Includes alternatively spliced exons (ASE) and constitutively spliced exons (CSE) Contains 487 ASEs and 2531 CSEs 100- base local sequences around splice sites Previous work: Motifs captured and identified by kernel G. Ratch et al., Length of exons and flanking introns Sorek et al. Our work: Exploit more biologically significant features Use several additional approaches to derive features

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Feature List Several features known to be biologically important Strength of splice sites (SSS) Motif features Intronic splicing regulator (ISR) Motifs derived from local sequences (MAST) Exonic splicing enhancer (ESE) Reduced set of motif features based on locations of motifs on secondary structure (MAST-R) Optimal folding energy (OPE) Basic sequence features (BSF)

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work SSS: Strength of Splice Site CGAG exon AGGTAAGT We consider all splice sites CGAG exon AGGTAAGT ∑ exon GGAG AGGTAGGT score = logF(X i ) F(X), CGAG exon AGGTTAGT i CCAG exon AGGTAAGT where X ∈ {A,U,G,C}. i ∈ {−3,+7} −3 +7 −26 +2 for 3’ splice sites (3’ss) and 3’ ss 5’ ss i ∈ {−26,+2} for 5’ splice sites (5’ss).

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Motif: sequence pattern that occurs repeatedly in group of sequences Intronic Splicing Regulator: identified in Kabat et al. MAST: derived by MEME using [-100,+100] sequence Exon Splicing Enhancers: based on two assumption ISR exon Illustration of ISR dispersed among sequences

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Motif: sequence pattern that occurs repeatedly in group of sequences Intronic Splicing Regulator: identified in Kabat et al. MAST: derived by MEME using [-100,+100] sequence Exon Splicing Enhancers: based on two assumption Example: a 20-base motif derived from sequences around splice sites

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Motif: sequence pattern that occurs repeatedly in group of sequences Intronic Splicing Regulator: identified in Kabat et al. MAST: derived by MEME using [-100,+100] sequence Exon Splicing Enhancers: based on two assumption more frequent in exons than in introns more frequent in exons with weak splice sites than in exons with strong splice sites ISR MAST ESE Motifs - dispersed among exons and introns

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Pre-mRNA secondary structures motif influence exon recognition AUCCAUGGGCCGGAUGUGACGGUAGUAGGGUAUACGUCACAUAGGCUUCCUCUCAUGA Located at different structure Secondary structure: derived from Mfold filter motifs using secondary structure Loop Stem Optimal Folding Energy: stability of RNA secondary structure

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work Pre-mRNA secondary structures motif influence exon recognition AUCCAUGGGCCGGAUGUGACGGUAGUAGGGUAUACGUCACAUAGGCUUCCUCUCAUGA Located at different structure Secondary structure: derived from Mfold filter motifs using secondary structure Loop Stem Optimal Folding Energy: stability of RNA secondary structure

Background & Motivation Problem Definition Problem & Feature Construction Data Set Experiments Design & Results Feature Construction Conclusions and Future Work GC content (G & C ratio),= A+U+G+C, characteristics of sequence Sequence length Length of exons and length of exons’ flanking introns frames of stop codons Summary of features Motif features Secondary structure Strength of splice sites Sequence features

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Experimental Design List of previous defined features as SVM input Combination of different features to represent ASEs & CSEs split1split2split3split4split5 Tune SVM parameters to train (kernel linear, RBF.., Cost C) 20%80% 5−fold cross validation Choose parameters with best cross-validation (CV) accuracy Test trained SVM on testing ASEs & CSEs

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Experimental Design List of previous defined features as SVM input Combination of different features to represent ASEs & CSEs split1split2split3split4split5 Tune SVM parameters to train (kernel linear, RBF.., Cost C) 20%80% 5−fold cross validation Choose parameters with best cross-validation (CV) accuracy Test trained SVM on testing ASEs & CSEs

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Experimental Design List of previous defined features as SVM input Combination of different features to represent ASEs & CSEs split1split2split3split4split5 Tune SVM parameters to train (kernel linear, RBF.., Cost C) 20%80% 5−fold cross validation Choose parameters with best cross-validation (CV) accuracy Test trained SVM on testing ASEs & CSEs

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Experimental Design List of previous defined features as SVM input Combination of different features to represent ASEs & CSEs split1split2split3split4split5 Tune SVM parameters to train (kernel linear, RBF.., Cost C) 20%80% 5−fold cross validation Choose parameters with best cross-validation (CV) accuracy Test trained SVM on testing ASEs & CSEs

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Experimental Design List of previous defined features as SVM input Combination of different features to represent ASEs & CSEs split1split2split3split4split5 Tune SVM parameters to train (kernel linear, RBF.., Cost C) 20%80% 5−fold cross validation Choose parameters with best cross-validation (CV) accuracy Test trained SVM on testing ASEs & CSEs

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Experimental results Results of alternatively spliced exon classification. All features, including ISR motifs, are used. CCross Validation ScoreTest score fp 1%AUC %fp 1%AUC% Split Split Split Split Split

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Experimental results Mixed-Feas (85.55%) Base-Feas(78.78%) False Positive Rate Comparison of ROC curves obtained using basic features only and basic features plus other mixed features (except conserved ISR motifs). Models trained using 5-fold CV with C = 1.

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Experimental results AUC score comparison between data sets with secondary struc- tural features and data sets without secondary structural fea- tures

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Motif Evaluation Intersection between motifs derived from sequences & intronic splicing regulators

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Motif Evaluation Conserved ESE in metazoans (animals), Human and Mouse

Background & Motivation Problem & Feature ConstructionExperimental Design Experiments Design & ResultsExperimental Results Conclusions and Future Work Motif Evaluation Comparison with A. thaliana

Background & Motivation Problem & Feature Construction Conclusion Experiments Design & Results Conclusions and Future Work Conclusions Alternative splicing (AS) events can be found using transcripts Machine learning effectively used for prediction of AS events Identified features informative in predicting AS Explored comparatively comprehensive feature sets from biological point of view

Background & Motivation Problem & Feature Construction Conclusion Experiments Design & Results Conclusions and Future Work Conclusions Alternative splicing (AS) events can be found using transcripts Machine learning effectively used for prediction of AS events Identified features informative in predicting AS Explored comparatively comprehensive feature sets from biological point of view

Background & Motivation Problem & Feature Construction Conclusion Experiments Design & Results Conclusions and Future Work Conclusions Alternative splicing (AS) events can be found using transcripts Machine learning effectively used for prediction of AS events Identified features informative in predicting AS Explored comparatively comprehensive feature sets from biological point of view

Background & Motivation Problem & Feature Construction Conclusion Experiments Design & Results Conclusions and Future Work Conclusions Alternative splicing (AS) events can be found using transcripts Machine learning effectively used for prediction of AS events Identified features informative in predicting AS Explored comparatively comprehensive feature sets from biological point of view

Background & Motivation Problem & Feature Construction Conclusion Experiments Design & Results Conclusions and Future Work Future Work Apply this approach to specific organism Identify motifs more accurately Refine relationships between features (2nd Structure:w and motifs) Learn other types of AS events (not only skipped exons) adapted from "Detection of Alternative Splicing Events Using Machine Learning"

Background & Motivation Problem & Feature Construction Conclusion Experiments Design & Results Conclusions and Future Work Thank you for your attention! Questions? Related work RASE Acknowledgement data set from Dr. Ratsch’s FML group projects/RASE/altsplicedexonsplits.tar.gz Dr. Caragea’s MLB group Dr. Brown’s Bininformatics Center at KSU