Use of Active Learning for Selective Annotation of Training Data in a Supervised Classification System for Digitized Histology Scott Doyle 1, Michael Feldman.

Slides:



Advertisements
Similar presentations
Dept of Biomedical Engineering, Medical Informatics Linköpings universitet, Linköping, Sweden A Data Pre-processing Method to Increase.
Advertisements

CHOICE Pathology Informatics 2010 Boston, Massachusetts DataReady ® : A Deployable Data Management and Integration System for Large-scale Cancer Repositories.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
A Novel Approach for Recognizing Auditory Events & Scenes Ashish Kapoor.
FUZZ-IEEE 2009 Methods of Interpretation of a Non-stationary Fuzzy System for the Treatment of Breast Cancer Xiao-Ying Wang 1, Jonathan M. Garibaldi 1,
Localization of Diagnostically Relevant Regions of Interest in Whole Slide Images Ezgi Mercan 1, Selim Aksoy 2, Linda G. Shapiro 1, Donald L. Weaver 3,
16 November 2004Biomedical Imaging BMEN Biomedical Imaging of the Future Alvin T. Yeh Department of Biomedical Engineering Texas A&M University.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Principal Component Analysis
By Russell Armstrong Supervisor Mrs Wei Ji Diagnosis Analysis of Lung Cancer by Genome Expression Profiles.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
Empirically Assessing End User Software Engineering Techniques Gregg Rothermel Department of Computer Science and Engineering University of Nebraska --
Spatial Semi- supervised Image Classification Stuart Ness G07 - Csci 8701 Final Project 1.
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
Feature Selection and Its Application in Genomic Data Analysis March 9, 2004 Lei Yu Arizona State University.
Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples Committee: Eugene Fink Lihua Li Dmitry B. Goldgof Hong Tang.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
MACHINE LEARNING. What is learning? A computer program learns if it improves its performance at some task through experience (T. Mitchell, 1997) A computer.
February 13, 1997CWU B.Kovalerchuk1 DESIGN OF CONSISTENT SYSTEM FOR RADIOLOGISTS TO SUPPORT BREAST CANCER DIAGNOSIS.
Automatic Detection And Classification Of Microcalcifications In Digital Mammograms Institute for Brain and Neural Systems Brown University Providence.
Introduction to machine learning
Introduction to Data Mining Engineering Group in ACL.
Remote Sensing Laboratory Dept. of Information Engineering and Computer Science University of Trento Via Sommarive, 14, I Povo, Trento, Italy Remote.
Selective Sampling on Probabilistic Labels Peng Peng, Raymond Chi-Wing Wong CSE, HKUST 1.
3 ème Journée Doctorale G&E, Bordeaux, Mars 2015 Wei FENG Geo-Resources and Environment Lab, Bordeaux INP (Bordeaux Institute of Technology), France Supervisor:
Ensembles of Classifiers Evgueni Smirnov
Detection of Prostate Cancer from Whole-Mount Histology Images Using Markov Random Fields James P. Monaco 1, John E. Tomaszewski 2, Michael D. Feldman.
: Chapter 1: Introduction 1 Montri Karnjanadecha ac.th/~montri Principles of Pattern Recognition.
Thien Anh Dinh1, Tomi Silander1, Bolan Su1, Tianxia Gong
1 Mining with Noise Knowledge: Error Awareness Data Mining Xindong Wu Department of Computer Science University of Vermont, USA; Hong Kong Polytechnic.
Trust-Aware Optimal Crowdsourcing With Budget Constraint Xiangyang Liu 1, He He 2, and John S. Baras 1 1 Institute for Systems Research and Department.
1 Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Computer Science Department, Stanford University.
Page 1 Ming Ji Department of Computer Science University of Illinois at Urbana-Champaign.
Developing a Validated Tool For Evaluating Whole Slide Images.
Efficient Processing of Pathological Images Using the Grid: Computer-Aided Prognosis of Neuroblastoma B. Barla Cambazoglu Ohio State University Department.
Automatic Detection and Segmentation of Robot-Assisted Surgical Motions presented by Henry C. Lin Henry C. Lin, Dr. Izhak Shafran, Todd E. Murphy, Dr.
Lecture 10: 8/6/1435 Machine Learning Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Image Analysis for Neuroblastoma Classification: Hysteresis Thresholding for Nuclei Segmentation Metin Gurcan 1, PhD Tony Pan 1, MS Hiro Shimada 2, MD,
Hierarchical Annotation of Medical Images Ivica Dimitrovski 1, Dragi Kocev 2, Suzana Loškovska 1, Sašo Džeroski 2 1 Department of Computer Science, Faculty.
Copyright © 2010 Siemens Medical Solutions USA, Inc. All rights reserved. Hierarchical Segmentation and Identification of Thoracic Vertebra Using Learning-based.
ImArray - An Automated High-Performance Microarray Scanner Software for Microarray Image Analysis, Data Management and Knowledge Mining Wei-Bang Chen and.
CSE 8331 Spring CSE 8331 Spring 2010 Image Mining Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University.
Prostate Cancer CAD Michael Feldman, MD, PhD Assistant Professor Pathology University Pennsylvania.
Color correction of pathological images *Tokiya Abe,Pinky A. Bautista, Yukako Yagi, John Gilbertson **Yuri Murakami, Masahiro Yamaguchi, Nagaaki Ohyama.
SVM-based techniques for biomarker discovery in proteomic pattern data Elena Marchiori Department of Computer Science Vrije Universiteit Amsterdam.
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University Automatic 3D Image Segmentation of Internal Lung Structures.
Fuzzy Machine Learning Methods for Biomedical Data Analysis
Ivica Dimitrovski 1, Dragi Kocev 2, Suzana Loskovska 1, Sašo Džeroski 2 1 Faculty of Electrical Engineering and Information Technologies, Department of.
COP5992 – DATA MINING TERM PROJECT RANDOM SUBSPACE METHOD + CO-TRAINING by SELIM KALAYCI.
Multiple Instance Learning for Sparse Positive Bags Razvan C. Bunescu Machine Learning Group Department of Computer Sciences University of Texas at Austin.
A Framework for a Fully Automatic Karyotyping System E. Poletti, E. Grisan, A. Ruggeri Department of Information Engineering, University of Padova, Italy.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Developing outcome prediction models for acute intracerebral hemorrhage patients: evaluation of a Support Vector Machine based method A. Jakab 1, L. Lánczi.
THIRD CLASSIFICATION OF MICROCALCIFICATION STAGES IN MAMMOGRAPHIC IMAGES THIRD REVIEW Supervisor: Mrs.P.Valarmathi HOD/CSE Project Members: M.HamsaPriya( )
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Active, Semi-Supervised Learning for Textual Information Access Anastasia Krithara¹, Cyril Goutte², Massih-Reza Amini³, Jean-Michel Renders¹ Massih-Reza.
Automatic Classification for Pathological Prostate Images Based on Fractal Analysis Source: IEEE TRANSACTIONS ON MEDICAL IMAGING, VOL. 28, NO. 7, JULY.
Automated GrowCut for Segmentation of Endoscopic Images Farah Deeba, Francis M. Bui, Khan A. Wahid Department Of Electrical And Computer Engineering University.
Chang-Tsun Li BEng in Electrical Engineering: Chung Cheng Inst. Tech. (CCIT), National Defence University, Taiwan MSc in Computer Science : Naval Postgraduate.
Debesh Jha and Kwon Goo-Rak
In Search of the Optimal Set of Indicators when Classifying Histopathological Images Catalin Stoean University of Craiova, Romania
Mammogram Analysis – Tumor classification
Source: Procedia Computer Science(2015)70:
Deepak Kumar1, Chetan Kumar1, Ming Shao2
Region and Shape Extraction
Volume 5, Issue 6, Pages e3 (December 2017)
Three steps are separately conducted
Multiple Organ detection in CT Volumes - Week 3
Presented By: Firas Gerges (fg92)
Presentation transcript:

Use of Active Learning for Selective Annotation of Training Data in a Supervised Classification System for Digitized Histology Scott Doyle 1, Michael Feldman 2, John Tomaszewski 2, Anant Madabhushi 1 1 Department of Biomedical Engineering, Rutgers, The State University of New Jersey 2 Department of Surgical Pathology, University of Pennsylvania

Outline  Background  Digital Prostate Histopathology  Supervised Classification  Active Learning  Methodology  Active Learning  Data Description  Experimental Setup  Experimental Results  Concluding Remarks

Prostate Cancer Detection  ~1 million biopsies per year in USA  tissue samples per biopsy  80% benign diagnosis  Large amount of data to analyze

Computer-Aided Diagnosis  Identifies regions of interest / suspicion  Quantitative  Automated  Reduces variability  Supervised classification system Doyle, S., Feldman, M., Tomaszewski, J., Madabhushi, A. “A Hierarchical Computer-aided Classification Scheme for Automated Detection of Prostatic Adenocarcinoma from Digitized Histology,” APIII 2006

Supervised Classification  Expert segmentation for training  Histopathology:  Expensive, time-consuming to annotate  Cost per training sample is high

Supervised Classification  Random training inefficient  Possible redundancy with existing training  No guarantee of improved accuracy

Solution: Active Learning  Choose training samples intelligently, not randomly  Increased accuracy per training sample  Forced choice of training, maximized accuracy  Useful where:  Large amount of unlabeled data  Annotations are expensive  Ideally suited for histopathology data

Active Learning Classifier Performance Accuracy # of Training Samples Random Learning Active Learning

Previous Work  Liu [2004], Vogiatzis and Tsapatsoulis [2006]  Gene microarray data  Yao, et al [2008]  Content-based image retrieval  Little work done in histopathology with Active Learning

Outline  Background  Digital Prostate Histopathology  Supervised Classification  Active Learning  Methodology  Active Learning  Data Description  Experimental Setup  Experimental Results  Concluding Remarks

Build Classifier Active Learning Methodology Cancer Non-cancer Uncertain Classification Obtained from pathologist Training Data Labeled Unlabeled Build Classifier Classify Unlabeled Training

Active Learning Methodology Uncertain Classification Informative Samples Certain Classification Uninformative Obtain Expert LabelsCombine With Original Set Eliminate, labeling these adds no information + Identify Informative Regions

Active Learning Methodology Generate New ClassifierNew Training Set

Feature Extraction Cancer Region Original Image Feature Images

Classification Feature ImagesC4.5 Decision Tree Doyle, S., Madabhushi, A., Feldman, M., Tomaszeweski, J.: A Boosting Cascade for Automated Detection of Prostate Cancer from Digitized Histology, MICCAI, Lecture Notes in Computer Science, Vol. 4191, pp , “Random Forest” [Brieman, 2001] Majority voting determines classification

Image Data Description  27 H&E stained digital biopsy samples  Data breakdown:  Initial Training Set  Unlabeled Training Set  Testing Set  Active Learning drawn from Unlabeled Training  Groups rotated so all images are tested

Classification  Three training groups evaluated:  Initial set:  Active Learning set:  Random Learning set: Initial Training Active Learning Initial Training Random Learning Initial Training + +

Outline  Background  Digital Prostate Histopathology  Supervised Classification  Active Learning  Methodology  Active Learning  Data Description  Experimental Setup  Experimental Results  Concluding Remarks

Results: Qualitative Original ImageRandom LearningActive Learning

Results: Qualitative Random Learning Active Learning

Results: Qualitative Original ImageRandom LearningActive Learning

Results: Qualitative Active LearningRandom Learning

Quantitative Evaluation

Outline  Background  Digital Prostate Histopathology  Supervised Classification  Active Learning  Methodology  Active Learning  Data Description  Experimental Setup  Experimental Results  Concluding Remarks

Concluding Remarks  Maximize classification accuracy by choosing training intelligently  Efficiently obtain annotations  Make the most use of “training budget”  Build Active Learning into clinical applications  Online training correction / modification  User feedback

Acknowledgements  The Coulter foundation (WHCF )  New Jersey Commission on Cancer Research  The National Cancer Institute (R21CA , R03CA )  The US Department of Defense (427327)  The Society for Medical Imaging and Informatics