Research in Health Informatics: Bangladesh Perspective Dr. Abu Sayed Md. Latiful Hoque Professor, Dept. of CSE, BUET Workshop on Health Data Analytics.

Slides:



Advertisements
Similar presentations
Data warehousing and Data mining – an overview Dr. Suman Bhusan Bhattacharyya MBBS, ADHA, MBA.
Advertisements

Faculty of Computer Science © 2006 CMPUT 605February 11, 2008 A Data Warehouse Architecture for Clinical Data Warehousing Tony R. Sahama and Peter R. Croll.
Civil and Environmental Engineering Carnegie Mellon University Sensors & Knowledge Discovery (a.k.a. Data Mining) H. Scott Matthews April 14, 2003.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Dr. Tahar Kechadi Dr. Joe Carthy
Data Mining By Archana Ketkar.
Data Mining – Intro.
1 Data and Knowledge Management. 2 Data Management: A Critical Success Factor The difficulties and the process Data sources and collection Data quality.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Data Mining: A Closer Look
Data Mining.
CIT 858: Data Mining and Data Warehousing Course Instructor: Bajuna Salehe Web:
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Enterprise systems infrastructure and architecture DT211 4
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
Mining Large Data at SDSC Natasha Balac, Ph.D.. A Deluge of Data Astronomy Life Sciences Modeling and Simulation Data Management and Mining Geosciences.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
1 Data Mining Books: 1.Data Mining, 1996 Pieter Adriaans and Dolf Zantinge Addison-Wesley 2.Discovering Data Mining, 1997 From Concept to Implementation.
Data Mining Techniques As Tools for Analysis of Customer Behavior Lecture 2:
2 Outline of the presentation Objectives, Prerequisite and Content Brief Introduction to Lectures Discussion and Conclusion Objectives, Prerequisite and.
1 SHIM 413 Database Applications for Healthcare Fall 2006 Slides by H. T. Bao.
Chapter 1 Introduction to Data Mining
2015年10月18日星期日 2015年10月18日星期日 2015年10月18日星期日 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Data Preprocessing Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2010.
Guest Lecture Introduction to Data Mining Dr. Bhavani Thuraisingham September 17, 2010.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
1 Knowledge Discovery from DataBases (KDD) A.K.A. Data Mining & by other names as well Carlo Zaniolo UCLA CS Dept.
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
January 17, 2016Data Mining: Concepts and Techniques 1 What Is Data Mining? Data mining (knowledge discovery from data) Extraction of interesting ( non-trivial,
Advanced Database Concepts
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
2016年6月12日星期日 2016年6月12日星期日 2016年6月12日星期日 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
July 7, 2016 Data Mining: Concepts and Techniques 1 1.
There is an inherent meaning in everything. “Signs for people who can see.”
The Application of Data Mining in Telecommunication by Wang Lina February 2003.
1-1 TITLE PRESENTATION:HEALTHCARE GROUP MEMBER: CHUAH XUE LI(212176) ONG SEAT NEE(212133) STIN2063 MACHINE LEARNING.
Intro to MIS – MGS351 Databases and Data Warehouses
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining Motivation: “Necessity is the Mother of Invention”
Introduction to Data Mining
Chapter 13 The Data Warehouse
MIS 451 Building Business Intelligence Systems
Introduction C.Eng 714 Spring 2010.
Data and Applications Security Introduction to Data Mining
Data Warehouse.
Chapter 13 – Data Warehousing
Data Warehouse and OLAP
CSE591: Data Mining by H. Liu
Data Warehousing and Data Mining
Smart Portal To Protect Child Online
C.U.SHAH COLLEGE OF ENG. & TECH.
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Warehousing Data Mining Privacy
Data Mining: Concepts and Techniques
CSE591: Data Mining by H. Liu
Data Warehouse and OLAP
Presentation transcript:

Research in Health Informatics: Bangladesh Perspective Dr. Abu Sayed Md. Latiful Hoque Professor, Dept. of CSE, BUET Workshop on Health Data Analytics

Outline 2  Motivation  Introduction  Health Informatics, Data, and Problems  Research on the Development of Data Marts and Warehouses and Applications  Research on Data Mining for better Health Services  Conclusions

3 Motivation for Developing National Health Data Warehouse (NHDW) of Bangladesh 3  Bangladesh needs to develop NHDW for  Better Healthcare Delivery  Better Health related Research  Better Health Monitoring and Administration  Developed Countries like US, UK or Australia already developed DWs

Define National Reference level In Bangladesh, rule of thumbs is for Woman > 15 years, non pregnant Hb> 11 Good; Hb >=10.0 ok, Less than 10 medication needed. 4 WHO's Hemoglobin thresholds used to define anemia [29] 4 Motivation (2)

5 Motivation (3) National health trend analysis 5

Motivation (4) Location Dependency analysis Finding Impact of different region of Bangladesh on the test result Arsenic effect on Chandpur; Forecasting of disease, virus or epidemic Juice of Palm tree of which location is more suspicious for Nipah virus 6

Motivation (5) Redundant/ Fraud Testing Awareness If For a Costly Test T 3 : Age(X, Negative (X, T 3 ) [Support =70%, Confidence=95%] National awareness can be developed not to perform the test at initial level for Young Males. 7

Introduction  What Is Data Mining?  Data mining (knowledge discovery from data)  Extraction of interesting ( non-trivial, implicit, previously unknown and potentially useful) patterns or knowledge from huge amount of data  Alternative names  Knowledge discovery (mining) in databases (KDD), knowledge extraction, data/pattern analysis, data archeology, data dredging, information harvesting, business intelligence, etc.

Knowledge Discovery (KDD) Process  This is a view from typical database systems and data warehousing communities  Data mining plays an essential role in the knowledge discovery process Data Cleaning Data Integration Databases Data Warehouse Task-relevant Data Selection Data Mining Pattern Evaluation

Research on Health Data 10  Using effective data mining tools & algorithms possible to produce useful information from Health dataset Applications of Health Data Mining

Patient Visit Cycle in Bangladesh 11

Patient Visit Cycle (Cont’) 12

Patient Visit Cycle (Cont’) 13

Patient Visit Cycle (Cont’) 14

Patient Visit Cycle (Cont’) 15

Existing NHDW Overview Now DGHS has: 33 aggregated data sets Most health programs are included Data entered up-to 4, 501 Union level facility 3 individual data sets Data entered from 13,000 community clinic 16

Research on Health Data: Challenges 17  Architecture of National Health DW of Bangladesh  Preprocessing  Missing Value Problem  Noisy Data Problem  Data Transformation  Record Linkage  No Standard Patient Identification in data  Missing Value Problem  Noisy Data Problem  Privacy Preservation

18 Proposed Architecture of NHDW, Bangladesh

19 Sample Fact and Dimension Tables of NHDW for Pathological Data

Designing NHDW Data Cube 20

21 Snapshot of Pathological Data

Our Approach: Patient Identification Technique based on Secured Record Linkage (PITSRL) 22

Patient De-Identification with Linkage Preservation (PDLP) Technique 23

Dataset Quality 24

Research on Specialized Group: Diabetic Food Intake Insulin Type & Dose Exercise Stress Blood Glucose Level time  (t+PH) t+PH Historical Dataset Knowledge Acquirement Current Time t Conceptual View of BGL Prediction Model

Scopes of Future Research Improvement of data quality:  A dataset with sufficient amount of variations is indispensible.  The quality of dataset depends on accuracy of estimation and accumulation of life style data (diet, insulin, exercise, stress etc.)  An intelligent entry module can be developed for estimating effects of emotional events on blood glucose level.

Scopes of Future Research Development of Prediction Technique: A model can be developed with neural network based classification techniques for predicting glucose concentration for improved performance. Development of Therapy Optimizer: Development of an inexpensive, hand-held ANN diabetes therapy system for patients, which will incorporate a BGL monitoring device.

Conclusions 28  Health Data Warehouse is very essential for a Nation for better healthcare delivery  Main challenges are proper framework development, data preprocessing (e.g., cleaning, missing value imputation), record linkage establishment and privacy preservation  We have proposed a national framework for integration of enormous, diverse health data to facilitate knowledge discovery.  We have performed some preprocessing on health data such as cleaning, normalization etc.

Conclusions (2) 29  Proper Record Linkage and Privacy Preservation are two big issues for Integrated health systems  We have developed Patient Identification Technique based on Secured Record Linkage (PITSRL) for Privacy Preserved Record Linkage  For a noisy health dataset of patients, we achieved 87% sufficient record linkage key.  For a training dataset of 100 patient records, PITSRL achieved 100% accuracy of identifying unique and duplicate patients.

Conclusions (3) 30  Data mining technique can be used for the prediction of severity of diseases e.g., diabetic, blood pressure etc.  Specialized patient management system need to be developed for special care for critical patients

Thank You 31