Extracting BI-RADS Features from Portuguese Clinical Texts H. Nassif, F. Cunha, I.C. Moreira, R. Cruz- Correia, E. Sousa, D. Page, E. Burnside, and I.

Slides:



Advertisements
Similar presentations
Golan.O, Sperber.F, Shalmon.A, Weinstein.I, Gat.A
Advertisements

LABELING TURKISH NEWS STORIES WITH CRF Prof. Dr. Eşref Adalı ISTANBUL TECHNICAL UNIVERSITY COMPUTER ENGINEERING 1.
View Learning: An extension to SRL An application in Mammography Jesse Davis, Beth Burnside, Inês Dutra Vítor Santos Costa, David Page, Jude Shavlik &
MINING FEATURE-OPINION PAIRS AND THEIR RELIABILITY SCORES FROM WEB OPINION SOURCES Presented by Sole A. Kamal, M. Abulaish, and T. Anwar International.
Mining External Resources for Biomedical IE Why, How, What Malvina Nissim
Automatic Report Generation from Ontologies: the MIAKT Approach Kalina Bontcheva, Yorick Wilks Department of Computer Science University of Sheffield.
Omphalos Session. Omphalos Session Programme Design & Results 25 mins Award Ceremony 5 mins Presentation by Alexander Clark25 mins Presentation by Georgios.
COMPUTATIONAL INTELLIGENCE FOR THE DETECTION AND CLASSIFICATION OF MALIGNANT LESIONS IN SCREENING MAMMOGRAPHY DATA E. Panourgias,
Computer Aided Diagnosis: CAD overview
Retrieval of Similar Electronic Health Records using UMLS Concept Graphs Laura Plaza and Alberto Díaz Universidad Complutense de Madrid.
· Information gathering · Data analysis · Decision making · “ Human life is too important to be left to a computer “ Patients receive the best treatment.
Supporting e-learning with automatic glossary extraction Experiments with Portuguese Rosa Del Gaudio, António Branco RANLP, Borovets 2007.
1 Health Text Lexical Processing Mojtaba Sabbagh.
Automating Breast Ultrasound Workflow DICOM Compatible Automated Segmentation Digital Work Sheet Documentation Point and Click Lesion Characteristics Previous.
A Computer Aided Detection System For Digital Mammograms Based on Radial Basis Functions and Feature Extraction Techniques By Mohammed Jirari Shanghai,
Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.
Active Learning with Support Vector Machines
ISP 121 Statistics That Deceive. Simpson’s Paradox It’s a well accepted rule of thumb that the larger the data set, the better Simpson’s Paradox demonstrates.
Associative Learning in Hierarchical Self Organizing Learning Arrays Janusz A. Starzyk, Zhen Zhu, and Yue Li School of Electrical Engineering and Computer.
1 Automated Feature Abstraction of the fMRI Signal using Neural Network Clustering Techniques Stefan Niculescu and Tom Mitchell Siemens Medical Solutions,
A Computer Aided Detection System For Mammograms Based on Asymmetry and Feature Extraction Techniques By Mohammed Jirari Benidorm, Spain Sept 9th, 2005.
February 13, 1997CWU B.Kovalerchuk1 DESIGN OF CONSISTENT SYSTEM FOR RADIOLOGISTS TO SUPPORT BREAST CANCER DIAGNOSIS.
Automatic Detection And Classification Of Microcalcifications In Digital Mammograms Institute for Brain and Neural Systems Brown University Providence.
Breast Imaging Made Brief and Simple
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
AJCC Staging Moments AJCC TNM Staging 7th Edition Breast Case #2 Contributors: Stephen B. Edge, MD Roswell Park Cancer Institute, Buffalo, New York David.
Uncovering Age-Specific Invasive and DCIS Breast Cancer Rules Using Inductive Logic Programming Houssam Nassif, David Page, Mehmet Ayvaci, Jude Shavlik,
BI-RADS By Nina Zahedi MD.
Bayesian Network for Predicting Invasive and In-situ Breast Cancer using Mammographic Findings Jagpreet Chhatwal1 O. Alagoz1, E.S. Burnside1, H. Nassif1,
Marion C.W. Henry, MD Yale University
ENDA MOLLOY, ELECTRONIC ENG. FINAL PRESENTATION, 31/03/09. Automated Image Analysis Techniques for Screening of Mammography Images.
Andreea Bodnari, 1 Peter Szolovits, 1 Ozlem Uzuner 2 1 MIT, CSAIL, Cambridge, MA, USA 2 Department of Information Studies, University at Albany SUNY, Albany,
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Age Stratified Risk Prediction of Invasive versus In-situ Breast Cancer: A Logistic Regression Model Mehmet Ayvaci 1,2 Oguzhan Alagoz 1,Jagpreet Chhatwal.
DR (MRS) AUGUSTINA BADU-PEPRAH MB Ch B, FWACS RADIOLOGIST KATH.
Introduction to Breast Imaging BREAST RAD LAB Directions: Please answer all the questions prior to interactive conference. 1.
Integrating Machine Learning and Physician Knowledge to Improve the Accuracy of Breast Biopsy Inês Dutra University of Porto, CRACS & INESC-Porto LA Houssam.
How do we Collect Data for the Ontology? AmphibiaTree 2006 Workshop Saturday 11:30–11:45 J. Leopold.
Open Information Extraction using Wikipedia
How will you approach the 35-year old, with a 2x2x2cm, firm, mobile, well-circumscribed non-tender mass on her R breast?
Managed by UT-Battelle for the Department of Energy Learning Cue Phrase Patterns from Radiology Reports Using a Genetic Algorithm Robert M. Patton, Ph.D.
Relevance Detection Approach to Gene Annotation Aid to automatic annotation of databases Annotation flow –Extraction of molecular function of a gene from.
Presenter: Shanshan Lu 03/04/2010
Unsupervised Learning of Visual Sense Models for Polysemous Words Kate Saenko Trevor Darrell Deepak.
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
Chapter 23: Probabilistic Language Models April 13, 2004.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
Information Extraction for Clinical Data Mining: A Mammography Case Study H. Nassif, R. Woods, E. Burnside, M. Ayvaci, J. Shavlik and D. Page University.
The CALMA project A CAD tool in breast radiography A.Ceccopieri, Padova
TRAINING PACKAGE The User Action Framework Reliability Study July 1999.
Automatic extraction of BI-RADS breast tissue composition classes from mammography reports Bethany Percha (Stanford) Houssam Nassif (U. Wisconsin) Jafi.
Maximum Entropy techniques for exploiting syntactic, semantic and collocational dependencies in Language Modeling Sanjeev Khudanpur, Jun Wu Center for.
THIRD CLASSIFICATION OF MICROCALCIFICATION STAGES IN MAMMOGRAPHIC IMAGES THIRD REVIEW Supervisor: Mrs.P.Valarmathi HOD/CSE Project Members: M.HamsaPriya( )
1 CS 430: Information Discovery Lecture 21 Interactive Retrieval.
國立雲林科技大學 National Yunlin University of Science and Technology Intelligent Database Systems Lab 1 Self-organizing map for cluster analysis of a breast cancer.
Dr. Julia Flukinger Breast Radiologist, Director Breast MRI, Advanced Radiology May 21, 2106.
Case (II) Chemoresistant group. Case 6. F/41 Rt. breast cancer, 3 cycle NAC for 2 months (a) Indistinct margined, irregular shaped, hyperdense mass in.
SCORE AS YOU LIFT (SAYL) A Statistical Relational Learning Approach to Uplift Modeling Houssam Nassif 1, Finn Kuusisto 1, Elizabeth S. Burnside 1, David.
BI-RADS classification in mammography
Example 4: (A,B) Standard CC and MLO views of the right breast in this screening mammogram for a 60-year-old woman who never had any prior mammograms.
Variable Appearances of Ductal Carcinoma In Situ Calcifications on Digital Mammography, Synthesized Mammography, and Tomosynthesis: A Pictorial Essay 
Current Status of Breast Ultrasound
The efficacy of using CAD for detection of
Introduction Task: extracting relational facts from text
Automatic Extraction of BI-RADS Features from Cross-Institution and Cross-Language Free-Text Mammography Reports Houssam Nassif, Terrie Kitchner, Filipe.
Avoiding Pitfalls in Mammographic Interpretation
Hierarchical, Perceptron-like Learning for OBIE
By Hossein Hematialam and Wlodek Zadrozny Presented by
Marion C.W. Henry, MD Yale University
Presentation transcript:

Extracting BI-RADS Features from Portuguese Clinical Texts H. Nassif, F. Cunha, I.C. Moreira, R. Cruz- Correia, E. Sousa, D. Page, E. Burnside, and I. Dutra University of Wisconsin – Madison, and University of Porto, Portugal

The American Cancer Society, Cancer Facts & Figures 2009.

Impression (free text) Mammogram Radiologist Structured Database Predictive Model Benign Malignant

BI-RADS Lexicon Concepts

Lobular ShapeOval ShapeObscured Margin… Report 1010… Report 2101… …………… Example In the right breast, an approximately 1.0 cm mass is identified in the right upper slightly inner breast. This mass is noncalcified and partially obscured and lobulated in appearance. Concepts

Nassif 09

Syntax Analyzer Tokenize sentences Discard punctuation Keep stop words Stem words

Nassif 09

Information from Lexicon Translate lexicon into Portuguese Lexicon specifies synonyms: Eg: Equal density, Isodense Lexicon allows for ambiguous wording: TextConcept indistinct margin indistinct calcificationamorphous calcification indistinct imagenot a concept

Nassif 09

Experts Provide domain specific information – Synonyms: Oval, Ovoid – Acronyms, abbreviations – Domain idiosyncrasies Interact with and modify semantic rules

Nassif 09

Concept Finder Regular expression rules Extract concepts from text Rule formation: – Initial rules based on lexicon – Rules refined by experts

Rule Generation Example 1 Aim: Regional Distribution Concept Lexicon specifies the word “regional” Initial rule: presence of the word “regional” Run on training set, experts see results Many false positives: – “regional medical center”, “regional hospital” Rule refined by experts: – “regional.* !(medical|hospital)”

Rule Generation Example 2 Aim: Skin Thickening Concept Lexicon specifies “skin thickening” Try “skin” and “thickening” in same sentence – “skin retraction and thickening” – “thickening of the overlying skin” – “A BB placed on the skin overlying a palpable focal area of thickening in the upper outer right breast” Experts suggest “skin” and “thickening” in close proximity

Scope Scope: distance between two words Start with a large scope: – assess number of true and false positives Move to smaller scopes: – assess number of false negatives Check precision and recall estimates Experts decide on the best distance

Nassif 09

Negation Detector Negation triggers (Mutalik 01, Gindl 08): – “não” (not) when not preceded by “onde” (where) – “sem” (without) – “nem” (nor). Precedes or appears within the subsentence Establish negation scope “without evidence of suspicious cluster of microcalcifications”

Dataset Training set: 1,129 reports, unlabeled Testing set: 153 pairs, labeled by radiologist – Basic screening report – Detailed diagnostic report Perform three refinement passes – Double blind, based on lexicon – Refine rules – Refine manual labeling and rules

Results

Conclusion Out of 48 disputed cases, parser correctly classified 25 (52.1%) First Portuguese BI-RADS extractor – Discovers features missed or misclassified – Similar performance to manual annotation Method portable to other languages