Applying Deep Neural Network to Enhance EMPI Searching

Slides:



Advertisements
Similar presentations
Brian Merrick CS498 Seminar.  Introduction to Neural Networks  Types of Neural Networks  Neural Networks with Pattern Recognition  Applications.
Advertisements

1 Lecture 5: Automatic cluster detection Lecture 6: Artificial neural networks Lecture 7: Evaluation of discovered knowledge Brief introduction to lectures.
Connectionist models. Connectionist Models Motivated by Brain rather than Mind –A large number of very simple processing elements –A large number of weighted.
Artificial Neural Networks (ANNs)
Text Classification: An Implementation Project Prerak Sanghvi Computer Science and Engineering Department State University of New York at Buffalo.
Machine Learning Motivation for machine learning How to set up a problem How to design a learner Introduce one class of learners (ANN) –Perceptrons –Feed-forward.
Introduction to Machine Learning Approach Lecture 5.
Radial-Basis Function Networks
Ranga Rodrigo April 5, 2014 Most of the sides are from the Matlab tutorial. 1.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
CHAPTER 12 ADVANCED INTELLIGENT SYSTEMS © 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang.
Artificial Neural Networks (ANN). Output Y is 1 if at least two of the three inputs are equal to 1.
Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009.
DeepFont: Large-Scale Real-World Font Recognition from Images
NEURAL NETWORKS FOR DATA MINING
Artificial Intelligence Techniques Multilayer Perceptrons.
1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.
EE459 Neural Networks Examples of using Neural Networks Kasin Prakobwaitayakit Department of Electrical Engineering Chiangmai University.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
C - IT Acumens. COMIT Acumens. COM. To demonstrate the use of Neural Networks in the field of Character and Pattern Recognition by simulating a neural.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Artificial Intelligence CIS 342 The College of Saint Rose David Goldschmidt, Ph.D.
Chapter 6 Neural Network.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Kim HS Introduction considering that the amount of MRI data to analyze in present-day clinical trials is often on the order of hundreds or.
語音訊號處理之初步實驗 NTU Speech Lab 指導教授: 李琳山 助教: 熊信寬
Machine Learning Artificial Neural Networks MPλ ∀ Stergiou Theodoros 1.
A Presentation on Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and it’s Application By Sumanta Kundu (En.R.No.
Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation Dr.G.M.Nasira R. Vidya R. P. Jaia Priyankka.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.
Introduction to Machine Learning, its potential usage in network area,
Detecting Web Attacks Using Multi-Stage Log Analysis
Brief Intro to Machine Learning CS539
Big data classification using neural network
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Definition CASE tools are software systems that are intended to provide automated support for routine activities in the software process such as editing.
Deep Learning Amin Sobhani.
DeepFont: Identify Your Font from An Image
Extreme Learning Machine
School of Computer Science & Engineering
Alan D. Mead, Talent Algorithms Inc. Jialin Huang, Amazon
Can Computer Algorithms Guess Your Age and Gender?
Week 12 Option 3: Database Design
Lots of news about Artificial Intelligence in the press
Self organizing networks
What is Pattern Recognition?
Basic Intro Tutorial on Machine Learning and Data Mining
CSE 4705 Artificial Intelligence
Prof. Carolina Ruiz Department of Computer Science
Artificial Intelligence Chapter 3 Neural Networks
network of simple neuron-like computing elements
INTRODUCTION.
Cluster Validity For supervised classification we have a variety of measures to evaluate how good our model is Accuracy, precision, recall For cluster.
Zip Codes and Neural Networks: Machine Learning for
Artificial Intelligence Chapter 3 Neural Networks
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Artificial Intelligence Chapter 3 Neural Networks
The ultimate in data organization
Artificial Intelligence Chapter 3 Neural Networks
Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton
Information Retrieval and Web Design
Machine Learning for Space Systems: Are We Ready?
Deep Learning with Botanical Specimen Images
THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU
Artificial Intelligence Chapter 3 Neural Networks
Machine Learning.
Prof. Carolina Ruiz Department of Computer Science
Presentation transcript:

Applying Deep Neural Network to Enhance EMPI Searching Stanley Liang Supervisor: Prof. Jimmy Huang School of information technology

What is EMPI EMPI is the acronym of enterprise master patient index EMPI is a database used in a healthcare organization to maintain the data consistency of a variety of EHR information on the patients seen and managed within its various departments. An identical patient has is likely to have different information regarding the same domain in different information systems. EMPI plays a role as a universal identifier or label of an identical patient is the complex EHR system.

Measure of Entry Errors Incorrect search results are most likely to be caused by entry typos Edit distance, or Levenshtein distance, is a way to measure the dissimilarity of two words by counting the minimum number of changes to make them the same Edit distances is commonly applied to natural language processing for automatic spelling correction. To retrieve the due EHR information, we apply the similar idea to EMPI searching to classify the typos into three categories: Match – no error Minor error – 1 ~ 3 errors Major error – 4 or more errors A deep neural network model is applied in this task. The algorithm can be implemented either by MATLAB or by C#.

Deep Learning Deep learning generally means a neural network having more than two hidden layers for performing either supervised or unsupervised learning Deep learning is commonly applied to pattern recognition in unstructured data, e.g. text, photos, videos, and sound Deep learning is capable of discovering intricate structure in large data sets by the back-propagation algorithm Deep learning provides extremely accurate predictions

The Feedforward-Backpropagation learning Cycle

Experiment Environments MATLAB MATLAB 2016b Neural Network Toolbox 20,000 EMPI Records with random errors C# & Visual Studio Accord.NET Framework

Implementing Deep Learning in EMPI Information Retrieval The manual input queries by the users usually have edit errors. Error in text editing: Stanley  Stanly Logical error: Female Male Date error: 2016-11-02 2012-10-02 Problem: if we use SQL query only, the machine cannot learn because SQL only use equal search and range search. If the patient is in the system but the query is wrong, it is difficult to retrieve the information or at high cost of computing time Task Provide adaptive search augmented by deep learning implicit search with good tolerance of occasional manual entry errors Improve system efficiency

Algorithm Implementation Levenshtein Distance Deep Learning

Data set for testing Category Number of records Label description No error 6,000 [1 0 0] Totally correct Minor error 1,000 [0 1 0] ED=1 in a single column randomly, Total=1 ED=2 in a single column randomly, Total=2 ED=3 in a single column randomly, Total=3 ED=1 in two single columns randomly, Total=2 ED=1 in three single columns randomly, Total=2 ED=1 in a random column and ED=2 in another column randomly, Total=3 Major error 2,000 [0 0 1] ED>=4 in a random column, Total>=4 ED>=4 in total in two random columns, Total>=4 ED>=4 in total in three random columns, Total>=4

Ten-fold cross-validation

Confusion Matrix Green Cells represent correct classification The accuracy is 97.4% in this case Class 1: match Class 2: minor error Class 3: major error

Conclusion The deep neural network classifier has stable performance on this work with the average accuracy at 96.3% We conclude that deep learning provides a good artificial intelligence solution for the EMPI query classification to enhance the overall system performance We expect that artificial intelligence (AI) powered by deep learning will play an important role in a variety of healthcare decision making process

The End of Presentation Thank you