Semantic natural language understanding with Spark, UIMA & machine-learned ontologies David Talby @davidtalby Claudiu Branzan @melcutz.

Slides:



Advertisements
Similar presentations
Personalized Navigation in the Semantic Web: An Enhanced Faceted Browser Michal Tvarožek FIIT STU BA.
Advertisements

Introduction to Information Retrieval
NLP Highlights GS Savova And team. Medication CEM template associatedCode Change_status Conditional Dosage Duration End_date Form Frequency Generic Negation_indicator.
© Copyright 2012 STI INNSBRUCK Apache Lucene Ioan Toma based on slides from Aaron Bannert
The Survey of Well-being of Young Children (SWYC)
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Properties of Text CS336 Lecture 3:. 2 Information Retrieval Searching unstructured documents Typically text –Newspaper articles –Web pages Other documents.
Pet Fish and High Cholesterol in the WHI OS: An Analysis Example Joe Larson 5 / 6 / 09.
CASE Tools And Their Effect On Software Quality Peter Geddis – pxg07u.
Logic Programming for Natural Language Processing Menyoung Lee TJHSST Computer Systems Lab Mentor: Matt Parker Analytic Services, Inc.
TERMINOLOGICAL SERVER Proposal for short term complementary homeworks SEMINOP 28/10/2010.
STAT!Ref ® IT! Powered by Teton Data Systems. Presentation Goals: What is STAT!Ref®? Where is STAT!Ref? How to STAT!Ref® it!
Tips: The more you read in English, the better you will do in this part of the exam. ( ) Try reading: newspaper articles, sports magazines,
Intralab Workshop - Reactome CMAP Chang-Feng Quo June 29 th, 2006.
Open Health Natural Language Processing Consortium (OHNLP)
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
CTAKES The clinical Text Analysis and Knowledge Extraction System.
Intel Framework. What is intelligence? Intel framework defines intelligence as an atomic bit of data with associated metadata Things you want to know.
MedSearch Vaishnav Janardhan COMS E6125 Web-Enhanced Information Management.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Institute for Personal Robots in Education (IPRE)‏ CSC 170 Computing: Science and Creativity.
IBM Research © Copyright IBM Corporation 2005 | A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture Youssef Drissi,
IDS 1 Extended Keyword Index & Improved Search for Semantic e-Catalog 이동주.
Mayo cTAKES: UIMA Type System
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
MedKAT Medical Knowledge Analysis Tool December 2009.
Kaiser Permanente Convergent Medical Terminology (CMT) Using Oxford RDFox and SNOMED for Quality Measures.
Open Health Natural Language Processing Consortium
Semantic Wiki: Automating the Read, Write, and Reporting functions Chuck Rehberg, Semantic Insights.
© NCSR, Frascati, July 18-19, 2002 CROSSMARC big picture Domain-specific Web sites Domain-specific Spidering Domain Ontology XHTML pages WEB Focused Crawling.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
University of Sheffield, NLP Introduction to Text Mining Module 4: Development Lifecycle (Part 1)
The Complete Health History QUESTIONS ????????????????
Myths about flu and the flu vaccine
A quick reference to literature searches
WELCOME TO SHIPLEY MEDICAL PRACTICE
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Phenotyping youth depression
How to Use the Nursing Diagnosis Handbook: A Guide to Planning Care
WELCOME TO SHIPLEY MEDICAL PRACTICE
Learning to Personalize Query Auto-Completion
Kaiser Permanente Convergent Medical Terminology (CMT)
M. Lee Chambliss MD MSPH Associate Professor
Hypertension.
DATABASE SEARCH & REVIEW GETTING STARTED GUIDE FOR EMIS WEB USERS
Semantic natural language understanding at scale using Spark, machine-learned annotators and deep-learned ontologies David Claudiu Branzan.
Build smarter bots and devices by connecting to the Microsoft Graph
Monogenic Disorders Case Studies
Treatment and Prevention of Cold and Flu
RAI and MDS Chapter 16 Red book.
Getting Started With Your Medical Practicum Experience
Electronic Health Records
Query Processing B.Ramamurthy Chapter 12 11/27/2018 B.Ramamurthy.
EHR System Function and Information Model (EHR-S FIM is based on EHR-S FM R2.0) CP.1.2 Manage Allergy, Intolerance and Adverse Reaction List aka DC
Influenza Take Action to Prevent the Flu! Allison Ormond, RN
10B be going to (plans) + – I’m going to have a holiday next month.
10B be going to (plans) + – I’m going to have a holiday next month.
Good Drugs Like Medicine
Temple Ready within an Hour of Collection Capture
Using text mining methods to detect a clinical infection
Search Engine Architecture
CERNER MILLENNIUM Abstracting Chart Data Into Unity: Immunizations Excerpt This presentation reviews strategies for moving immunization data from legacy.
Chaitali Gupta, Madhusudhan Govindaraju
Psychological Disorders
A framework for ontology Learning FROM Big Data
Scratching through the ages
FTForm Plus QDE Form Designer


Flu vaccine is free for anyone, six months of age and older, who live, work or attend school in Ontario. The Flu and You October 2019.
Presentation transcript:

Semantic natural language understanding with Spark, UIMA & machine-learned ontologies David Talby @davidtalby Claudiu Branzan @melcutz

The problem Who is at risk for sepsis? Who needs to be vaccinated? Who fits this clinical trial? Who on this protocol did not have this side effect? Who is getting meds they’re allergic to?

At the beginning, there was search Scalable & robust Indexing pipeline Tokenizers & analyzers Synonyms, spellers & Auto-suggest File formats & header boosting Rankers, link & reputation boosting

Then there was semantic search “cheap red prom dresses” “laptops under $500” “italian restaurants near me that deliver” “captain america civil war tonight” “nba scores” Dictionary Based Attribute Extraction Dell - XPS 15.6 4K Ultra HD Touch-Screen Laptop - Intel Core i5 - 8GB Memory - 256GB Solid State Drive - Silver Machine Learned Attribute Extraction If you go for the ambience, you'll be disappointed. If you go for good, inexpensive and authentic Mexican food, then you're in the right place.

And Then, you need to understand language Prescribing sick days due to diagnosis of influenza. Positive Jane complains about flu-like symptoms. Speculative Jane may be experiencing some sort of flu episode. Possible Jane’s RIDT came back negative for influenza. Negative Jane is at high risk for flu if she’s not vaccinated. Conditional Jane’s older brother had the flu last month. Family history Jane had a severe case of flu last year. Patient history

Language gets complex & domain specific Joe expressed concerns about the risks of bird flu. Nothing Joe shows no signs of stroke, except for numbness. Double Negative Nausea, vomiting and ankle swelling negative. Compound Patient denies alcohol abuse. Speculative Allergies: Penicillin, Dust, Sneezing. Compound (it gets worse – in reality a lot of text isn’t valid English)

Let’s build this! The input The processing The output (patient records) The processing framework The output The query engines ASSERTIONS ASSERTIONS

First Demo: Annotators & Assertions

Machine learned annotators Sometimes, it’s easier to just code an annotation’s business logic Grammatical Patterns If … then … Direct Inferences Age < 18 ==> Child Lookups RIDT (lab test) But sometimes it’s easier to learn it from examples: Under-diagnosed conditions Flu Depression Implied by Context relevant labs normal

Second Demo: Machine Learned Annotator

What about expanding & updating ontologies? Word2Vec

Let’s build this too! Ontology

Third Demo: Ontology Enrichment

Summary & applications Who is at risk for sepsis? Who needs to be vaccinated? Who fits this clinical trial? Who on this protocol did not have this side effect? Who is getting meds they’re allergic to?

Free notebooks: https://github.com/Atigeo/nlp_demo @melcutz @davidtalby

In case the live demo gets cold feet on stage appendix In case the live demo gets cold feet on stage