Knowledge Discovery in Biomedicine
Limsoon Wong
Institute for Infocomm Research

Plan
Knowledge discovery in brief
Eg 1: Optimizing treatment of childhood ALL
Eg 2: Predicting survival of patients with DLBC lymphoma
Concluding remarks
Copyright © 2004 by Limsoon Wong

Knowledge Discovery in Brief

What is Knowledge Discovery?
Jonathan’s rules: Blue or Circle
Jessica’s rules: All the rest
[Figure: Jonathan’s blocks; Jessica’s blocks]
Whose block is this?

What is Knowledge Discovery?
Question: Can you explain how?
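The block example can be written down as an explicit classification rule. The sketch below is only an illustration of the rule stated on the slide (Jonathan: blue or circle; Jessica: all the rest); the function name and the string encoding of colour and shape are assumptions made here, not part of the talk.

```python
def owner(colour: str, shape: str) -> str:
    """Apply the slide's rule: Jonathan owns blocks that are blue or circular,
    Jessica owns all the rest. The string encoding is an assumption."""
    if colour == "blue" or shape == "circle":
        return "Jonathan"
    return "Jessica"


# A red circle satisfies "circle", a red square satisfies neither condition.
print(owner("red", "circle"))
print(owner("red", "square"))
```

Knowledge discovery is the reverse task: given only labelled blocks, recover a rule like this one from the data.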

Steps of Knowledge Discovery
Training data gathering
Feature generation
– k-grams, colour, texture, domain know-how, ...
Feature selection
– Entropy, χ², CFS, t-test, domain know-how, ...
Feature integration (some classifiers/learning methods)
– SVM, ANN, PCL, CART, C4.5, kNN, ...
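The steps above can be sketched end to end on toy data. This is a minimal illustration, assuming a two-class problem, a Welch t-statistic for feature selection, and kNN for feature integration; the data values, function names, and parameter choices are hypothetical and are not the pipeline used in the studies cited in the talk.

```python
import math
from statistics import mean, variance


def t_statistic(a, b):
    """Welch t-statistic between two lists of values (one list per class)."""
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se if se > 0 else 0.0


def select_features(X, y, k):
    """Feature selection: rank features by |t| between the two classes, keep top k."""
    labels = sorted(set(y))
    scores = []
    for j in range(len(X[0])):
        a = [row[j] for row, lab in zip(X, y) if lab == labels[0]]
        b = [row[j] for row, lab in zip(X, y) if lab == labels[1]]
        scores.append((abs(t_statistic(a, b)), j))
    return [j for _, j in sorted(scores, reverse=True)[:k]]


def knn_predict(X, y, query, features, k=3):
    """Feature integration: majority vote among the k nearest training samples,
    using Euclidean distance over the selected features only."""
    def dist(row):
        return math.sqrt(sum((row[j] - query[j]) ** 2 for j in features))
    nearest = sorted(zip(X, y), key=lambda pair: dist(pair[0]))[:k]
    votes = [lab for _, lab in nearest]
    return max(set(votes), key=votes.count)


# Hypothetical training data: 6 samples, 3 features; only feature 0 separates the classes.
X = [
    [5.1, 1.0, 0.3],
    [4.9, 0.2, 0.9],
    [5.3, 0.7, 0.5],
    [1.0, 0.9, 0.4],
    [1.2, 0.1, 0.8],
    [0.8, 0.6, 0.6],
]
y = ["A", "A", "A", "B", "B", "B"]

selected = select_features(X, y, k=1)
print(selected)
print(knn_predict(X, y, [5.0, 0.5, 0.5], selected))
```

Selecting features before classification keeps the distance computation from being dominated by uninformative measurements, which matters when there are thousands of genes and few samples.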

Knowledge Discovery for Optimizing Treatment of Childhood ALL
Image credit: Yeoh et al, 2002

Childhood ALL
Major subtypes: T-ALL, E2A-PBX, TEL-AML, BCR-ABL, MLL genome rearrangements, Hyperdiploid>50
Different subtypes respond differently to the same Tx (treatment)
Over-intensive Tx
– Development of secondary cancers
– Reduction of IQ
Under-intensive Tx
– Relapse
The subtypes look similar
Conventional diagnosis
– Immunophenotyping
– Cytogenetics
– Molecular diagnostics
Unavailable in most ASEAN countries

Single-Test Platform of Microarray & Knowledge Discovery
training data collection
feature generation
feature selection
feature integration
Image credit: Affymetrix
Copyright © 2004 by Jinyan Li and Limsoon Wong

Impact
Conventional Tx: intermediate intensity to all
=> 10% suffer relapse
=> 50% suffer side effects
=> costs US$150m/yr
Our optimized Tx:
– high intensity to 10%
– intermediate intensity to 40%
– low intensity to 50%
costs US$100m/yr
High cure rate of 80%
Less relapse
Fewer side effects
Save US$51.6m/yr

Knowledge Discovery for Predicting Survival of Patients with DLBC Lymphoma
Image credit: Rosenwald et al, 2002

Diffuse Large B-Cell Lymphoma
DLBC lymphoma is the most common type of lymphoma in adults
Can be cured by anthracycline-based chemotherapy in 35 to 40 percent of patients
=> DLBC lymphoma comprises several diseases that differ in responsiveness to chemotherapy
Intl Prognostic Index (IPI)
– age, “Eastern Cooperative Oncology Group” performance status, tumor stage, lactate dehydrogenase level, sites of extranodal disease, ...
Not good for stratifying DLBC lymphoma patients for therapeutic trials
=> Use gene-expression profiles to predict outcome of chemotherapy?

Knowledge Discovery from Gene Expression of “Extreme” Samples
[Flow diagram: “extreme” sample selection, then knowledge discovery from gene expression. Labels: 240 samples; 80 samples; 26 long-term survivors; 47 short-term survivors; 7399 genes; 84 genes]
T is long-term if S(T) < 0.3
T is short-term if S(T) > 0.7
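The two threshold rules on this slide can be expressed as a small labelling function. The sketch assumes S(T) is a numeric score for sample T on a 0 to 1 scale and that samples falling between the thresholds are simply left unlabelled; the function names and the example scores are hypothetical.

```python
def survival_label(score, low=0.3, high=0.7):
    """Label a sample from its score S(T) using the slide's two thresholds."""
    if score < low:
        return "long-term"
    if score > high:
        return "short-term"
    return None  # between the thresholds: not an "extreme" sample


def select_extreme(scores):
    """Keep only the samples that receive a label; returns {sample_id: label}."""
    labelled = {sid: survival_label(s) for sid, s in scores.items()}
    return {sid: lab for sid, lab in labelled.items() if lab is not None}


# Hypothetical scores, for illustration only.
scores = {"p1": 0.12, "p2": 0.55, "p3": 0.91, "p4": 0.30}
print(select_extreme(scores))
```

Note that the inequalities are strict, so a score of exactly 0.3 or 0.7 stays unlabelled under this reading.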

Kaplan-Meier Plot for 80 Test Cases
p-value of log-rank test: <
Risk score thresholds: 0.7, 0.5, 0.3
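For readers unfamiliar with the plot named on this slide, the following is a minimal sketch of the standard Kaplan-Meier product-limit estimator. It is a generic textbook computation on made-up follow-up data, not the analysis behind the slide, and it does not implement the log-rank test.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier estimate of the survival function.

    times:  follow-up duration for each patient
    events: 1 if death was observed at that time, 0 if the patient was censored
    Returns a list of (time, survival probability) at each distinct death time.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all patients sharing the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            surv *= 1 - deaths / n_at_risk
            curve.append((t, surv))
        # Deaths and censored patients both leave the risk set after time t.
        n_at_risk -= removed
    return curve


# Hypothetical follow-up data (e.g. in years), for illustration only.
times = [1, 2, 2, 3, 4, 5]
events = [1, 1, 0, 1, 0, 1]
for t, s in kaplan_meier(times, events):
    print(t, round(s, 4))
```

Plotting one such curve per risk group gives the kind of figure the slide refers to; a log-rank test then compares the curves.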

Improvement Over IPI
(A) IPI low, p-value =
(B) IPI intermediate, p-value =

Merit of “Extreme” Samples
(A) W/o sample selection (p = 0.38)
(B) With sample selection (p = 0.009)
No clear difference in the overall survival of the 80 samples in the validation group of the DLBCL study if no training sample selection is conducted

Knowledge Discovery for a Few Other Biomedical Applications

Predict Epitopes, Find Vaccine Targets
Vaccines are often the only solution for viral diseases
Finding & developing effective vaccine targets (epitopes) is a slow and expensive process
Develop systems to recognize protein peptides that bind MHC molecules
Develop systems to recognize hot spots in viral antigens

Recognize Functional Sites, Help Scientists
Effective recognition of initiation, control, & termination of biological processes is crucial to speeding up & focusing scientific experiments
Data mining of biological sequences to find rules to recognize & understand functional sites
Dragon’s 10x reduction of TSS (transcription start site) recognition false positives

Understand Proteins, Fight Diseases
Understanding the function & role of a protein needs organised info on interaction pathways
Such info is often reported in scientific papers but is seldom found in structured databases
Knowledge extraction system to process free text
– extract protein names
– extract interactions

Benefits of Bioinformatics
To the patient:
– Better drug, better treatment
To the pharma:
– Save time, save cost, make more $
To the scientist:
– Better science

References
A. Yeoh et al, “Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling”, Cancer Cell, 1: , 2002
A. Rosenwald et al, “The use of molecular profiling to predict survival after chemotherapy for diffuse large B-cell lymphoma”, NEJM, 346: , 2002
H. Liu et al, “Selection of patient samples and genes for outcome prediction”, Proc. CSB2004, pages

Any Questions?

To be presented 10/10/04, am, Raffles Convention Centre, NHG-IBM Symposium