Download presentation
Presentation is loading. Please wait.
Published byRosamond Tate Modified over 9 years ago
2
Show & Tell Limsoon Wong KRDL Datamining: Turning Biological Data into Gold
3
Show & Tell Jonathan’s rules: Blue or Circle Jessica’s rules: All the rest What is Datamining? Whose block is this? Jonathan’s blocks Jessica’s blocks
4
Show & Tell What is Datamining? Question: Can you explain how?
5
Show & Tell What are the Benefits? To the patient: Better drug, better treatment To the pharma: Save time, save cost, make more $ To the scientist: Better science
6
Show & Tell The Datamining Process
7
Show & Tell Epitope Prediction TRAP-559AA MNHLGNVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYSE EVCNDQVDLYLLMDCSGSIRRHNWVNHAVPLAMKLIQQLN LNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRS LLSTNLPYGRTNLTDALLQVRKHLNDRINRENANQLVVIL TDGIPDSIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNR FLVGCHPSDGKCNLYADSAWENVKNVIGPFMKAVCVEVEK TASCGVWDEWSPCSVTCGKGTRSRKREILHEGCTSEIQEQ CEEERCPPKWEPLDVPDEPEDDQPRPRGDNSSVQKPEENI IDNNPQEPSPNPEEGKDENPNGFDLDENPENPPNPDIPEQ KPNIPEDSEKEVPSDVPKNPEDDREENFDIPKKPENKHDN QNNLPNDKSDRNIPYSPLPPKVLDNERKQSDPQSQDNNGN RHVPNSEDRETRPHGRNNENRSYNRKYNDTPKHPEREEHE KPDNNKKKGESDNKYKIAGGIAGGLALLACAGLAYKFVVP GAATPYAGEPAPFDETLGEEDKDLDEPEQFRLPEENEWN
8
Show & Tell Epitope Prediction Results Prediction by our ANN model for HLA-A11 29 predictions 22 epitopes 76% specificity 1 66 100 Rank by BIMAS Number of experimental binders 19 (52.8%) 5 (13.9%) 12 (33.3%) Prediction by BIMAS matrix for HLA-A*1101
9
Show & Tell Gene Expression Analysis Clustering gene expression profiles Classifying gene expression profiles find stable differentially expressed genes
10
Show & Tell Gene Expression Analysis Results The Discovery System Correlation test Voter selection Class prediction
11
Show & Tell Protein Interaction Extraction “What are the protein-protein interaction pathways from the latest reported discoveries?”
12
Show & Tell Protein Interaction Extraction Results Rule-based system for processing free texts in scientific abstracts Specialized in extracting protein names extracting protein-protein interactions
13
Show & Tell Transcription Start Prediction
14
Show & Tell Transcription Start Prediction Results
15
Show & Tell Medical Record Analysis Looking for patterns that are valid novel useful understandable
16
Show & Tell Medical Record Analysis Results DeEPs, a novel “emerging pattern’’ method Beats C4.5, CBA, LB, NB, TAN in 21 out of 32 UCI benchmarks Works for gene expressions
17
Show & Tell Under the Hood Artificial neural network Neighbourhood analysis Non-linear analysis Template matching Emerging pattern Hidden markov models Bayesian inference Decision tree induction ...
18
Show & Tell Behind the Scene Epitope Prediction Vladimir Brusic Judice Koh Seah Seng Hong Zhang Guanglan Yu Kun Transcription Start Prediction Vladimir Bajic Seah Seng Hong Gene Expression Analysis Zhang Louxin Zhang Zhuo Zhu Song Medical Records Li Jinyan Protein Interaction Extraction Ng See Kiong Zhang Zhuo
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.