Big Data Meets Medical Physics Dosimetry Fumbeya Marungo, Hilary Paisley, John Rhee Dr. Todd McNutt Dr. Scott Robertson.

Slides:



Advertisements
Similar presentations
بسم الله الرحمن الرحيم ﴿و قل رب زدنى علماً﴾ صدق الله العظيم.
Advertisements

Data warehousing and Data mining – an overview Dr. Suman Bhusan Bhattacharyya MBBS, ADHA, MBA.
The Statisticians Role in Pharmaceutical Development
Mapping Studies – Why and How Andy Burn. Resources The idea of employing evidence-based practices in software engineering was proposed in (Kitchenham.
IEC Substation Configuration Language and Its Impact on the Engineering of Distribution Substation Systems Notes Dr. Alexander Apostolov.
EGEE NA3 Planning Prof. Malcolm Atkinson Director John Murison Training Manager CERN 13 th November 2003.
2015/6/1Course Introduction1 Welcome! MSCIT 521: Knowledge Discovery and Data Mining Qiang Yang Hong Kong University of Science and Technology
Business Information Systems Research Project in Information Systems (IS4401)
CS 501: Software Engineering Fall 2000 Lecture 4 Management I: Project Management.
Requirements Engineering Processes
SwE 313 Introduction to Rational Unified Process (RUP)
Business Intelligence Dr. Mahdi Esmaeili 1. Technical Infrastructure Evaluation Hardware Network Middleware Database Management Systems Tools and Standards.
Sylnovie Merchant, Ph.D MIS 210 Fall 2004 Lecture 1: The Systems Analyst Project Management MIS 210 Information Systems I.
Software Verification and Validation (V&V) By Roger U. Fujii Presented by Donovan Faustino.
COMPUTERS AND RADIATION THERAPY/ONCOLOGY Lydia Eichler CIS 109.
Page 1 ISMT E-120 Desktop Applications for Managers Introduction to Microsoft Access.
INFO 637Lecture #31 Software Engineering Process II Launching & Strategy INFO 637 Glenn Booker.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Chapter 11 Databases.
Prediction of Regional Tumor Spread Using Markov Models Megan S. Blackburn Monday, April 14, 2008.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
How Can Engineering Take Data Sciences from Ideas to Action
PowerPoint 2003 – Level 1 Computer Concepts Cathy Horwitz April 25, 2011.
Abdominal Aortic Aneurysm Dr. Andrea Bucchi School of Engineering – University of Portsmouth 9 th June 2014 University of Portsmouth – School of Engineering.
The McGraw-Hill Companies, Inc Information Technology & Management Thompson Cats-Baril Chapter 3 Content Management.
RUP Implementation and Testing
Multi-agent Research Tool (MART) A proposal for MSE project Madhukar Kumar.
U NIVERSITY OF D ELAWARE C OMPUTER & I NFORMATION S CIENCES D EPARTMENT Optimizing Compilers CISC 673 Spring 2011 Course Overview John Cavazos University.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Computers in Healthcare Jinbo Bi Department of Computer Science and Engineering Connecticut Institute for Clinical and Translational Research University.
WXGE 6103 Digital Image Processing Semester 2, Session 2013/2014.
© 2012 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the U.S.
SQL Queries Relational database and SQL MySQL LAMP SQL queries A MySQL Tutorial and applications Database Building Assignment.
1/36 1 Project Management What works, what doesn’t Michael Smith, TeraTech, Inc
Introduction University of Sunderland CIT304 Harry R Erwin, PhD.
Lecture 3 Managing the Development Project SFDV Principles of Information Systems.
Project Plan for nSite Central Michael Dunn Ryan Sessions Kyle Kerrigan.
Slide 1 Project Management Chapter 4. Slide 2 Objectives ■ Become familiar with estimation. ■ Be able to create a project workplan. ■ Become familiar.
INFO 637Lecture #101 Software Engineering Process II Review INFO 637 Glenn Booker.
Risk Factor Analysis - A New Qualitative Risk Management Tool
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Advanced Compilers CMPSCI 710 Spring 2004 Lecture 1 Emery Berger University of Massachusetts,
©Ian Sommerville 2000 Slide 1 Project management l Organising, planning and scheduling software projects l Objectives To introduce software project management.
CS 501: Software Engineering Fall 1999 Lecture 6 Management I: Project Management.
System Changes and Interventions: Registry as a Clinical Practice Tool Mike Hindmarsh Improving Chronic Illness Care, a national program of the Robert.
1 These courseware materials are to be used in conjunction with Software Engineering: A Practitioner’s Approach, 5/e and are provided with permission by.
Ad Hoc Graphical Reports Ad Hoc Graphical Reports Copyright © Team #4 CSCI 6838 Spring CSCI Research Project and Seminar Team# 4 (
Mike Hindmarsh Improving Chronic Illness Care California Chronic Care Learning Communities Initiative Collaborative February 2, 2004 Oakland, CA Clinical.
Project Management. Introduction  Project management process goes alongside the system development process Process management process made up of three.
Problem Query image by content in an image database.
Team X Review. The members of Team X propose to develop a therapeutic activity system using a Microsoft Surface unit and a Windows 7 desktop. By working.
Now what? 1.  I have short-listed projects I am interested in  I know the types of projects I would like to pursue  I have an idea of the resources.
Books Visualizing Data by Ben Fry Data Structures and Problem Solving Using C++, 2 nd edition by Mark Allen Weiss MATLAB for Engineers, 3 rd edition by.
1 Seattle University Master’s of Science in Business Analytics Key skills, learning outcomes, and a sample of jobs to apply for, or aim to qualify for,
Collage Score Card & Software defect prediction
Modern Systems Analysis and Design Third Edition
Project Management and Information Security
Advanced Compilers CMPSCI 710 Spring 2003 Lecture 1
CS 540 Database Management Systems
Courtesy from Dr. McNutt
Data Mining: Concepts and Techniques Course Outline
Data Warehousing and Data Mining
CS & CS Capstone Project & Software Development Project
Finance & Planning Committee of the San Francisco Health Commission
Welcome! Knowledge Discovery and Data Mining
Optimizing Compilers CISC 673 Spring 2009 Course Overview
Modern Systems Analysis and Design Third Edition
Planning for Design Project
Getting Started with Microsoft Azure Machine Learning
Presentation transcript:

Big Data Meets Medical Physics Dosimetry Fumbeya Marungo, Hilary Paisley, John Rhee Dr. Todd McNutt Dr. Scott Robertson

Topic  Radiation dosimetry is planning the placement and intensity of radiation doses for oncology treatment.  Often, planning uses many simplifying assumptions and does not include additional data within the patient’s record. Uniformity assumption leads to a simplified view of side effect risk Images courtesy of Dr. Todd McNutt, Dr. Scott Robertson

Goal  Use “Big Data” analytics techniques to create a toxicity (side effect) risk model(s) by exploring the diverse information within the Oncospace database.  Ultimately, patients benefit from the lessons of previous outcomes. Big Data + Medical Physics = Safer, more effective treatments Images courtesy of Dr. Todd McNutt, Dr. Scott Robertson

Importance and Relevance  Example – Irradiation of the parotid gland can lead to xerostomia, i.e. sever e dry mouth.  The uniformity assumption does not account for the gland’s complex structure. Big Data + Medical Physics = Safer, more effective treatments Images courtesy of Dr. Todd McNutt, Dr. Scott Robertson

Importance and Relevance  Example – Irradiation of the parotid gland can lead to xerostomia, i.e. sever e dry mouth.  Oncospace has 3-D dosage data. Big Data + Medical Physics = Safer, more effective treatments Images courtesy of Dr. Todd McNutt, Dr. Scott Robertson

Technical Approach  We use a standard interactive Data Mining and Knowledge Discovery Process (Cios et al. 2002).  Early and late portions of the process requires a great deal of input from mentors. Image from (Cios et al. 2002)

Technical Approach  Understanding the data, and the problem domain are key:  The data model chart is just the beginning.  Beware of terms of art. The chart tells how, not what, to query Images courtesy of Dr. Todd McNutt, Dr. Scott Robertson Not quite a histogram

Technical Approach  Primary Technologies:  Microsoft SQL Server – Database server for Oncospace and Data Sandbox.  Weka – Open-source data mining software.  Git – Source control software.  Matlab – Scientific software  Java 7 – Weka, and Matlab are native Java applications.  Javadoc – Software documentation  Secondary/Optional:  Python  Groovy and/or Jython --- Java-platform scripting languages

Deliverables: Block Diagram OncospaceB Data Cleaning Data Sandbox Data Mining Algorithm Result Generalization Deliverables: Min Expected Max Data Preparation

Deliverables  Minimum:  A data mining algorithm(s) that is callable from Matlab that accepts a treatment plan and additional clinical data and outputs a risk measure for a specific toxicity.  Software that cleans and transforms data from Oncospace into a format acceptable to the algorithm.  Performance assessment of the algorithm.  Expected:  Algorithm meets acceptable performance levels.  Maximum:  Generalize process to create risk measure on one or more additional toxicities.

Key Dates (Feb-Mar)

Key Dates (Apr-May)

Tasks and Critical Dependencies Task No.TaskDurationStartEndCritical Dependencies 1Select Project3 days28-Jan-1430-Jan-14None 2Project Planning Presentation1 day11-Feb-14 None 3Project Planning Report1 day17-Feb-14 None 4Project Planning11 days3-Feb-1417-Feb-14None 5Setup Development Environment6 days6-Feb-1413-Feb-14None 6Literature Review14 days11-Feb-1428-Feb-14Input from mentors 7Database Access1 day13-Feb-14 Input from mentors, Support JHH IT 8Target Database Access1 day13-Feb-14 Support JHH IT 9Develop Target Database14 days17-Feb-146-Mar-14Input from mentors 10Meeting with mentors1 day20-Feb-14 11Begin Preparing Paper Seminar10 days20-Feb-145-Mar-14Task 6, Input from mentors 12Data Clensing and Preprocessing9 days24-Feb-146-Mar-14Task 9, Input from mentors 13Meeting with mentors1 day27-Feb-14 14Paper Presentation1 day6-Mar-14 Task 11 15Data Reduction and Transformation14 days6-Mar-1425-Mar-14Task 12 16Meeting with mentors1 day10-Mar-14 17Meeting with mentors1 day14-Mar-14 18Data Mining11 days13-Mar-1427-Mar-14Task 15, Input from mentors 19Check Point Presentation1 day18-Mar-14 20Assess Models16 days20-Mar-1410-Apr-14Task 18, Input from mentors 21Writing Report37 days20-Mar-149-May-14Task 20 22Integrate Software17 days10-Apr-142-May-14Task 20 23Work on Poster21 days11-Apr-149-May-14Task 20 24Poster Day1 day9-May-14 Task 23

Management Plan (Feb-Mar)

Management Plan (Apr-May)

Dependencies  Critical Dependencies:  Must be done. Team’s project manager responsible for insuring these are met.  Non-critical Dependencies:  Not necessary, but will speed progress.  Allotment of ~$750 per team member for book, software licenses, etc.  On site workstation(s)

Team Member Responsibilities  General Responsibilities:  Semi-weekly meeting.  Ad hoc meetings as necessary.  Roles:  Fumbeya Marungo, Team Lead.  Hilary Paisley, Project Manager.  John Rhee, Software Engineer.  Task Responsibilities:  A “Surgical Team” approach (Brooks, 1995). Tasks are assigned and agreed upon during the regular meetings.

Initial Reading List  Quantitative Analyses of Normal Tissue Effects in the Clinic (QUANTEC). Bentzen et al  Use of Normal Tissue Complication Probability Models in the Clinic. Marks et al  Uniqueness of medical data mining. Cios et al  Novel approaches to improve the therapeutic index of HN RT. Buettner et al  Volume effects and region-dependent radio-sensitivity of the parotid gland. Konings et al  Predictive data mining in clinical medicine - Current issues and guidelines. Bellazzi et al  Vision Automation and advanced computing in clinical radiation oncology. Moore et al  Mythical Man-Month, The: Essays on Software Engineering, Anniversary Edition. Brooks 1995.

Thank You  Dr. Todd McNutt, Mentor  Dr. Scott Robertson, Mentor  Dr. Russell Taylor, Instructor  CIS II Classmates…