Disease Diagnostics Data Analysis

Slides:



Advertisements
Similar presentations
Request Dispatching for Cheap Energy Prices in Cloud Data Centers
Advertisements

SpringerLink Training Kit
Luminosity measurements at Hadron Colliders
From Word Embeddings To Document Distances
Choosing a Dental Plan Student Name
Virtual Environments and Computer Graphics
Chương 1: CÁC PHƯƠNG THỨC GIAO DỊCH TRÊN THỊ TRƯỜNG THẾ GIỚI
THỰC TIỄN KINH DOANH TRONG CỘNG ĐỒNG KINH TẾ ASEAN –
D. Phát triển thương hiệu
NHỮNG VẤN ĐỀ NỔI BẬT CỦA NỀN KINH TẾ VIỆT NAM GIAI ĐOẠN
Điều trị chống huyết khối trong tai biến mạch máu não
BÖnh Parkinson PGS.TS.BS NGUYỄN TRỌNG HƯNG BỆNH VIỆN LÃO KHOA TRUNG ƯƠNG TRƯỜNG ĐẠI HỌC Y HÀ NỘI Bác Ninh 2013.
Nasal Cannula X particulate mask
Evolving Architecture for Beyond the Standard Model
HF NOISE FILTERS PERFORMANCE
Electronics for Pedestrians – Passive Components –
Parameterization of Tabulated BRDFs Ian Mallett (me), Cem Yuksel
L-Systems and Affine Transformations
CMSC423: Bioinformatic Algorithms, Databases and Tools
Some aspect concerning the LMDZ dynamical core and its use
Bayesian Confidence Limits and Intervals
实习总结 (Internship Summary)
Current State of Japanese Economy under Negative Interest Rate and Proposed Remedies Naoyuki Yoshino Dean Asian Development Bank Institute Professor Emeritus,
Front End Electronics for SOI Monolithic Pixel Sensor
Face Recognition Monday, February 1, 2016.
Solving Rubik's Cube By: Etai Nativ.
CS284 Paper Presentation Arpad Kovacs
انتقال حرارت 2 خانم خسرویار.
Summer Student Program First results
Theoretical Results on Neutrinos
HERMESでのHard Exclusive生成過程による 核子内クォーク全角運動量についての研究
Wavelet Coherence & Cross-Wavelet Transform
yaSpMV: Yet Another SpMV Framework on GPUs
Creating Synthetic Microdata for Higher Educational Use in Japan: Reproduction of Distribution Type based on the Descriptive Statistics Kiyomi Shirakawa.
MOCLA02 Design of a Compact L-­band Transverse Deflecting Cavity with Arbitrary Polarizations for the SACLA Injector Sep. 14th, 2015 H. Maesaka, T. Asaka,
Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,
Fuel cell development program for electric vehicle
Overview of TST-2 Experiment
Optomechanics with atoms
داده کاوی سئوالات نمونه
Inter-system biases estimation in multi-GNSS relative positioning with GPS and Galileo Cecile Deprez and Rene Warnant University of Liege, Belgium  
ლექცია 4 - ფული და ინფლაცია
10. predavanje Novac i financijski sustav
Wissenschaftliche Aussprache zur Dissertation
FLUORECENCE MICROSCOPY SUPERRESOLUTION BLINK MICROSCOPY ON THE BASIS OF ENGINEERED DARK STATES* *Christian Steinhauer, Carsten Forthmann, Jan Vogelsang,
Particle acceleration during the gamma-ray flares of the Crab Nebular
Interpretations of the Derivative Gottfried Wilhelm Leibniz
Advisor: Chiuyuan Chen Student: Shao-Chun Lin
Widow Rockfish Assessment
SiW-ECAL Beam Test 2015 Kick-Off meeting
On Robust Neighbor Discovery in Mobile Wireless Networks
Chapter 6 并发:死锁和饥饿 Operating Systems: Internals and Design Principles
You NEED your book!!! Frequency Distribution
Y V =0 a V =V0 x b b V =0 z
Fairness-oriented Scheduling Support for Multicore Systems
Climate-Energy-Policy Interaction
Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,
Ch48 Statistics by Chtan FYHSKulai
The ABCD matrix for parabolic reflectors and its application to astigmatism free four-mirror cavities.
Measure Twice and Cut Once: Robust Dynamic Voltage Scaling for FPGAs
Online Learning: An Introduction
Factor Based Index of Systemic Stress (FISS)
What is Chemistry? Chemistry is: the study of matter & the changes it undergoes Composition Structure Properties Energy changes.
THE BERRY PHASE OF A BOGOLIUBOV QUASIPARTICLE IN AN ABRIKOSOV VORTEX*
Quantum-classical transition in optical twin beams and experimental applications to quantum metrology Ivano Ruo-Berchera Frascati.
The Toroidal Sporadic Source: Understanding Temporal Variations
FW 3.4: More Circle Practice
ارائه یک روش حل مبتنی بر استراتژی های تکاملی گروه بندی برای حل مسئله بسته بندی اقلام در ظروف
Decision Procedures Christoph M. Wintersteiger 9/11/2017 3:14 PM
Limits on Anomalous WWγ and WWZ Couplings from DØ
Presentation transcript:

Disease Diagnostics Data Analysis Dr James A. Covington Bio-Medical Sensors Laboratory School of Engineering

Sensors@Warwick

Motivation… Point of care Rapid Patient acceptable Low-cost Simple Hospitals/Home Developing countries

The Biological Solution

Artificial Olfaction Invented at Warwick in the early 1980s replicate the human nose Non-invasive, real-time Immediate sample introduction Portable/small form factor Can be used away from the lab Easy to use/understand No specialised services (gas lines etc.). Dr George Dodd

Electronic Nose Operation Array of sensors with different broad sensitivity e.g. Alcohols Operate by measuring change in resistance/capacitance/frequency e- e- e-

Electronic Nose for Medicine sensor array DRmax / Rb time DRmax  Example Sensor Response Sample in Sample out Response Time Many sensors within the electronic nose respond to different odours within the sample. These responses are then processed The air from around an area of interest is sampled

Understanding your problem… Disease Diagnostics Data Analysis

What is the medical question? Is there any difference between these two? Which one is the same as the standard?

Nature of the test…

What are the issues? Who took the sample? How did they take the sample? When did they take the sample? How old is the sample? Did they take it in a different room? When did the person last eat? Do they have perfume on? When was the room last cleaned? Understand how your sample is collected

University Hospital Coventry & Warwickshire Diseases investigated… Bile acid malabsorption Bladder/prostate Cancer Clostridium difficile Coeliac's disease Colorectal Cancer Crohns disease /Ulcerative colitis Diabetes Hepatic encephalopathy Irritable bowel syndrome Liver disease Obesity Pelvic radiation Pre-term labour Tuberculosis Brain Cancer/Schizophrenia Liver disease Wound infections Lung diseases Metabolic diseases Eye infections Ear/Nose/Throat Bacterial infections i.e. MRSA & C-Diff Application of ‘Smell Technology’ Gastrointestinal diseases

Understanding your Machine… Electronic noses in Medicine

Important Test Conditions

Traditional Electronic Nose Array of discrete Sensors Most employ metal-oxides Non-linear response with gas concentration Change in resistance can be defined as: 𝑅 𝑆 =𝐴 C −𝛼 Where Rs is the sensor resistance, A is a constant and alpha is the slope of the Rs curve

Typical Sensor Responses Generates around 1000 data points per sensor Feature reduction is required

Potential Features…

And also… And anything else you can think of…

Ion Mobility Spectrometry - FAIMS Used in chemical warfare detection Applications for military or home security

Sample Collection… FAIMS creates datasets of 52,254 data points for one scan Usually three full scans are taken Feature reduction is critical… Urine Breath Stool

Wavelet transformation Discrete Wavelet transform Raw Data Data in 1D Andrea S. Martinez-Vernon 2016

Wavelet transform At each level in the above diagram the signal is decomposed into low and high frequencies. Due to the decomposition process the input signal must be a multiple of 2n where is the number of levels.

Feature heat-map Coeliac Disease 1 in 100 in the UK are affected, with many undiagnosed Urine samples used, 20 CD patients and 27 controls Heat map of different features Clear differences in the dataset

Understanding your Data processing… Disease Diagnostics Data Analysis

Multivariate data processing techniques

Unsupervised - PCA example In PCA, we are interested to find the directions (components) that maximize the variance in our dataset Linear separation

Supervised - LDA example LDA determines a suitable subspace to distinguish between patterns that belong to different classes

Classifiers - Traditional K-NN Two common are k-Nearest Neighbour and neural networks (with various learning methods) Multi-output solution

Medical binary classifiers Single-output solution…many options… Support vector machine: Samples are assigned into a new feature space to maximise the difference between the two groups Random Forrest: Creates a series of decision trees which vote together to create a classification Sparse logistic regression: Fits a model to the data and then uses this model to predict unknown samples (non-linear) But also… Neural network Gaussian processes

Medical binary classifiers II Support vector machines Random forest

FAIMS Medical Data Remove “zero” values (padding) Wilcoxon rank-sum test for (with cross-validation) in turn It is a non-parametric statistical hypothesis test used when comparing two related samples, matched samples, or repeated measurements on a single sample to assess whether their population mean ranks differ (i.e. it is a paired difference test). Then keep only the features with the lowest p-value Normally n=2 is sufficient

Coeliac's from urine Box whisker plot of probabilities Boxes show interquartile range Data created by Sparse Logistic regression Sensitivity/Specificity of 85%

What does you medic want you to give them? Electronic noses in Medicine

Sensitivity and Specificity Sensitivity: measures the proportion of positives that are correctly identified Specificity measures the proportion of negatives that are correctly identified True positive: Sick people correctly identified as sick False positive: Healthy people incorrectly identified as sick True negative: Healthy people correctly identified as healthy False negative: Sick people incorrectly identified as healthy

IBD in Breath Graphical plot that illustrates the performance of a binary classifier system as its discrimination threshold is varied. Plots the positive rate (sensitivity) against the false positive rate (specificity) at various threshold settings. 76 IBD patients / 22 Controls Random Forrest Classifier Sensitivity: 74% Specificity: 75%

C.Diff from Stool 213 stool samples All suspected of C.diff 71 confirmed cases 10 fold cross-validation AUC = 0.93 (95% CI: 0.85,1) Sensitivity: 92% Specificity: 86%

Conclusions… Electronic noses have been around for more than 20 years Developed at Warwick, there have been a range of classification approaches applied to them Critical understanding of the medical problem is needed before processing data Multiple methods applied to classification – with mixed results Future maybe a machine in every GP and/or home