Complex Sentence Processor

Presentation transcript:

Complex Sentence Processor: Using Link Grammar to simplify complex sentences
Deepthi Chidambaram, 12/24/2018

Problem Statement
- Extraction of gene-gene interactions from unstructured biomedical text.
- Corpus: biomedical abstracts (curated text, rich in interactions, freely available).
- Approach: verb-based extraction. Example: "John played the pipes." The verb is the crux of the sentence.
- Interactions in a noun phrase are also extracted (detailed by Toufeeq).

Sentences in abstracts
Interactions are specified in 'creative' ways:
- HMBA inhibits MEC-1 cell proliferation.
- GBMs commonly overexpress the oncogenes EGFR and PDGFR, and contain mutations and deletions of tumor suppressor genes PTEN and TP53.
- Protein kinase B (PKB) has emerged as the focal point for many signal transduction pathways, regulating multiple cellular processes such as glucose metabolism, transcription, apoptosis, cell proliferation, angiogenesis, and cell motility.

Problems that come up
- Anaphora resolution [Anaphora]
  - Pronominals: It activates HMBA.
  - Sortal anaphora: Both enzymes are phosphorylated.
  - Event anaphora: This reaction acts in a mediated environment.
- Multiple interactions: complex sentences
  - Most of the tumor-suppressive properties of Pten are dependent on its lipid phosphatase activity, which inhibits the phosphatidylinositol-3'-kinase (PI3K)/Akt signaling pathway through dephosphorylation of phosphatidylinositol-(3,4,5)-triphosphate

Our solution: Pronoun resolution
- Pronouns in abstracts are third person: it, itself, them, themselves.
- Replace each pronoun with the first noun group that matches it in number.
- References made without pronouns are handled by Link Grammar.

Pronoun Resolution: walkthrough
Before: Ku loads onto dsDNA ends and it can diffuse along the DNA in an energy-independent manner.
After: Ku loads onto dsDNA ends and Ku can diffuse along the DNA in an energy-independent manner.
Before: When breast cancers were examined for NGAL mRNA and protein levels, they were found to exhibit heterogeneous expression.
After: When breast cancers were examined for NGAL mRNA and protein levels, breast cancers were found to exhibit heterogeneous expression.
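The replacement rule in the walkthrough can be sketched in Python as follows. This is only an illustration, not the module described in the slides: the function name `resolve_pronouns`, the `(text, number)` noun-group input, and the `"sg"`/`"pl"` labels are assumptions made for the example, and noun groups are supplied by hand rather than detected automatically. "they" is included because it appears in the walkthrough.

```python
import re

# Third-person pronouns, grouped by grammatical number.
SINGULAR = {"it", "itself"}
PLURAL = {"they", "them", "themselves"}

PRONOUN_PATTERN = re.compile(r"\b(?:it|itself|they|them|themselves)\b", re.IGNORECASE)


def resolve_pronouns(sentence, noun_groups):
    """Replace each pronoun with the first noun group of the same number.

    noun_groups is a hypothetical, hand-supplied list of (text, number)
    pairs in order of appearance, where number is "sg" or "pl".
    """
    def first_match(number):
        for text, group_number in noun_groups:
            if group_number == number:
                return text
        return None

    def substitute(match):
        word = match.group(0)
        number = "sg" if word.lower() in SINGULAR else "pl"
        replacement = first_match(number)
        # Leave the pronoun untouched when no noun group agrees in number.
        return replacement if replacement is not None else word

    return PRONOUN_PATTERN.sub(substitute, sentence)


example = resolve_pronouns(
    "Ku loads onto dsDNA ends and it can diffuse along the DNA "
    "in an energy-independent manner.",
    [("Ku", "sg"), ("dsDNA ends", "pl"), ("the DNA", "sg")],
)
print(example)
```

Picking the first agreeing noun group keeps the rule simple, but it ignores gender, distance, and whether the candidate precedes the pronoun, so it should be read as a baseline heuristic.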

Complex Sentence Structures
- Independent clauses with connectives
- Many dependent clauses with one independent clause, with or without connectives
- Multiple agents and goals in a single clause
Examples:
- Gene14 binds to Gene15 in response to 1-b-Gene16 or methylmethanesulfonate; this interaction does not require Gene17-Gene18-Gene19.
- Gene57-Gene58-Gene59-Gene60 is blocked by Gene61, which binds to Gene62-Gene63-Gene64-Gene65.
- Gene96 or Gene97 competes with Gene98 for binding to Gene99 and Gene100 or Gene101 stimulates Gene102-Gene103-Gene104 in vitro in the absence of Gene105.

Our Solution: Complex Sentences
- Identify clauses in complex sentences.
- Build simple sentences from the clauses.
- Tool used: Link Grammar Parser [Link]
- Clause format: Subject | Verb | Object | Modifying phrase (Adverbial Phrase / Prepositional Phrase)
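As a rough illustration of the "build simple sentences from the clauses" step, the Python sketch below joins the four clause fields back into one sentence. The function name `build_sentence` and its behavior (skipping empty fields, capitalizing the first letter, appending a period) are assumptions for the example; the slides do not say how the original tool assembles its output.

```python
def build_sentence(subject, verb, obj="", modifier=""):
    """Join clause fields (Subject | Verb | Object | Modifying phrase)
    into a simple sentence, skipping any field that is empty."""
    parts = [p.strip() for p in (subject, verb, obj, modifier) if p and p.strip()]
    if not parts:
        return ""
    text = " ".join(parts)
    # Capitalize only the first character so gene names keep their casing.
    return text[0].upper() + text[1:] + "."


print(build_sentence("Gene102", "is replaced", "", "by Gene103"))
print(build_sentence("Gene103", "is absent", "", "in quiescent cells"))
```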

CSP: Goal
Input: Upon growth factor stimulation of quiescent cells, Gene100 declines late in Gene101 and Gene102 is replaced by Gene103, which is absent in quiescent cells.
Output:
- Upon growth factor stimulation of quiescent cells, Gene100 declines late in Gene101.
- Gene102 is replaced by Gene103.
- Gene103 is absent in quiescent cells.

Complex Sentence Processor
Input:
E|18|Upon growth factor stimulation of quiescent cells, Gene100 declines late in Gene101 and Gene102 is replaced by Gene103, which is absent in quiescent cells.
C|2|In Gene11-Gene12, Gene13 stimulates Gene14-Gene15-Gene16-Gene17.
Output (CSP):
E|18|upon growth factor stimulation of quiescent cells , Gene100|declines||late#in Gene101#|
E|18|Gene102|is replaced||by Gene103 , which#|
E|18|Gene103 |is absent||in quiescent cells#|
C|2|in Gene11-Gene12 , Gene13|stimulates|Gene14-Gene15-Gene16-Gene17#||
Output fields:
Subject | Verb | Objects | Modifying Phrases
Upon… | declines | | late # in Gene101#
…
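The pipe-delimited output records can be read back with a few lines of Python. This is a sketch based only on the sample records shown on this slide: the `Clause` class and `parse_clause` function are hypothetical names, and because the slides do not explain the first two fields (for example `E` and `18`), they are kept as opaque strings called `tag` and `number`.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Clause:
    tag: str              # first field, e.g. "E" or "C" (meaning not stated in the slides)
    number: str           # second field, e.g. "18" (meaning not stated in the slides)
    subject: str
    verb: str
    objects: List[str]
    modifiers: List[str]


def _split_hash(field):
    """Split a '#'-terminated list such as 'late#in Gene101#' into items."""
    return [item.strip() for item in field.split("#") if item.strip()]


def parse_clause(record):
    """Parse one record of the form tag|number|subject|verb|objects|modifiers|."""
    parts = record.split("|")
    if len(parts) < 6:
        raise ValueError("expected at least 6 '|'-separated fields: %r" % record)
    tag, number, subject, verb, objects, modifiers = parts[:6]
    return Clause(
        tag=tag.strip(),
        number=number.strip(),
        subject=subject.strip(),
        verb=verb.strip(),
        objects=_split_hash(objects),
        modifiers=_split_hash(modifiers),
    )


clause = parse_clause(
    "E|18|upon growth factor stimulation of quiescent cells , Gene100"
    "|declines||late#in Gene101#|"
)
print(clause.verb, clause.objects, clause.modifiers)
```

Keeping objects and modifying phrases as lists reflects the `#` separators in the samples, where one clause can carry several modifying phrases.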

CSP: Data Flow
Diagram labels: Pronoun Resolution module; Prolog; Abstracts; Gene Tagger; Pre-Processor; Link Grammar, Java; Complex Sentence Processor; Sentence database

Illustration

Partial List of References
[Link] Sleator, D. and Temperley, D. (1991). Parsing English with a Link Grammar. Carnegie Mellon University Computer Science technical report CMU-CS-91-196, October 1991.
[Kohn] Kohn, K. W. (1999). "Molecular Interaction Map of the Mammalian Cell Cycle Control and DNA Repair Systems." Molecular Biology of the Cell 10: 2703-2734.
[Locuslink] Pruitt, K. D. and D. R. Maglott (2001). "RefSeq and LocusLink: NCBI gene-centered resources." Nucleic Acids Res 29(1): 137-140. (http://www.ncbi.nlm.nih.gov/LocusLink/)
[Anaphora] Castaño, J., Zhang, J., Pustejovsky, J. Anaphora Resolution in Biomedical Literature.