EEG Event Classification Using Deep Learning

Slides:



Advertisements
Similar presentations
M. Tabrizi: Seizure Detection May, A COMPARATIVE ANALYSIS OF NONLINEAR FEATURES FOR AN HMM-BASED SEIZURE DETECTION SYSTEM Masih Tabrizi and Joseph.
Advertisements

An Approach to ECG Delineation using Wavelet Analysis and Hidden Markov Models Maarten Vaessen (FdAW/Master Operations Research) Iwan de Jong (IDEE/MI)
Northeast Regional Epilepsy Group Christos Lambrakis M.D. 1.
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Artifact (artefact) reduction in EEG – and a bit of ERP basics CNC, 19 November 2014 Jakob Heinzle Translational Neuromodeling Unit.
Corpus Development EEG signal files and reports had to be manually paired, de-identified and annotated: Corpus Development EEG signal files and reports.
EEG findings in patients with Neurological Disorders Instructor: Dr. Gharibzadeh By: Fahime Sheikhzadeh.
Standard electrode arrays for recording EEG are placed on the surface of the brain. Detection of High Frequency Oscillations Using Support Vector Machines:
Seizure prediction by non- linear time series analysis of brain electrical activity Ilana Podlipsky.
Abstract EEGs, which record electrical activity on the scalp using an array of electrodes, are routinely used in clinical settings to.
Automatic Labeling of EEGs Using Deep Learning M. Golmohammadi, A. Harati, S. Lopez I. Obeid and J. Picone Neural Engineering Data Consortium College of.
Data Processing Machine Learning Algorithm The data is processed by machine algorithms based on hidden Markov models and deep learning. They are then utilized.
Analysis of Temporal Lobe Paroxysmal Events Using Independent Component Analysis Jonathan J. Halford MD Department of Neuroscience, Medical University.
Abstract The emergence of big data and deep learning is enabling the ability to automatically learn how to interpret EEGs from a big data archive. The.
THE TUH EEG CORPUS: The Largest Open Source Clinical EEG Corpus Iyad Obeid and Joseph Picone Neural Engineering Data Consortium Temple University Philadelphia,
THE TUH EEG CORPUS: A Big Data Resource for Automated EEG Interpretation A. Harati, S. López, I. Obeid and J. Picone Neural Engineering Data Consortium.
Exploration of Instantaneous Amplitude and Frequency Features for Epileptic Seizure Prediction Ning Wang and Michael R. Lyu Dept. of Computer Science and.
The Goal Use computers to aid physicians in diagnosis of neurological diseases, particularly epilepsy Detect pathological events in real time Currently,
TUH EEG Corpus Data Analysis 38,437 files from the Corpus were analyzed. 3,738 of these EEGs do not contain the proper channel assignments specified in.
Wong, Gardner, Krieger, Litt (2006) Zack Dvey-Aharon, March 2008.
Big Mechanism for Processing EEG Clinical Information on Big Data Aim 1: Automatically Recognize and Time-Align Events in EEG Signals Aim 2: Automatically.
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
Automated Interpretation of EEGs: Integrating Temporal and Spectral Modeling Christian Ward, Dr. Iyad Obeid and Dr. Joseph Picone Neural Engineering Data.
Abstract Automatic detection of sleep state is important to enhance the quick diagnostic of sleep conditions. The analysis of EEGs is a difficult time-consuming.
Demonstration A Python-based user interface: Waveform and spectrogram views are supported. User-configurable montages and filtering. Scrolling by time.
Market: Customer Survey: 57 clinicians from academic medical centers and community hospitals, and 44 industry professionals. Primary Customer Need: 70%
Spoken Language Group Chinese Information Processing Lab. Institute of Information Science Academia Sinica, Taipei, Taiwan
Abstract Automatic detection of sleep state is an important queue in accurate detection of sleep conditions. The analysis of EEGs is a difficult time-consuming.
Improved EEG Event Classification Using Differential Energy A.Harati, M. Golmohammadi, S. Lopez, I. Obeid and J. Picone Neural Engineering Data Consortium.
Date of download: 6/28/2016 Copyright © 2016 American Medical Association. All rights reserved. From: Similarity of Lateralized Rhythmic Delta Activity.
The Neural Engineering Data Consortium Mission: To focus the research community on a progression of research questions and to generate massive data sets.
Constructing Multiple Views The architecture of the window was altered to accommodate switching between waveform, spectrogram, energy, and all combinations.
Constructing Multiple Views The architecture of the window was altered to accommodate switching between waveform, spectrogram, energy, and all combinations.
Descriptive Statistics The means for all but the C 3 features exhibit a significant difference between both classes. On the other hand, the variances for.
Big data classification using neural network
Scalable EEG interpretation using Deep Learning and Schema Descriptors
[Ran Manor and Amir B.Geva] Yehu Sapir Outlines Review
F Onorati1, G Regalia1, C Caborni1, R Picard1,2. 1
CLASSIFICATION OF SLEEP EVENTS IN EEG’S USING HIDDEN MARKOV MODELS
QRS Detection Linda Henriksson 1.
G. Suarez, J. Soares, S. Lopez, I. Obeid and J. Picone
Enhanced Visualizations for Improved Real-Time EEG Monitoring
THE TUH EEG SEIZURE CORPUS
Epileptic Seizure Prediction
Human Language Technology Research Institute
RECURRENT NEURAL NETWORKS FOR VOICE ACTIVITY DETECTION
Data Mining Classification: Alternative Techniques
Enhanced Visualizations for Improved Real-Time EEG Monitoring
N. Capp, E. Krome, I. Obeid and J. Picone
Optimizing Channel Selection for Seizure Detection
EEG Recognition Using The Kaldi Speech Recognition Toolkit
Seizure freedom with VNS monotherapy: A case report
GATED RECURRENT NETWORKS FOR SEIZURE DETECTION
Big Data Resources for EEGs: Enabling Deep Learning Research
AN ANALYSIS OF TWO COMMON REFERENCE POINTS FOR EEGS
E. von Weltin, T. Ahsan, V. Shah, D. Jamshed, M. Golmohammadi, I
Improved EEG Event Classification Using Differential Energy
Vinit Shah, Joseph Picone and Iyad Obeid
Automatic Interpretation of EEGs for Clinical Decision Support
feature extraction methods for EEG EVENT DETECTION
LECTURE 42: AUTOMATIC INTERPRETATION OF EEGS
Northeast Regional Epilepsy Group Christos Lambrakis M.D.
LECTURE 41: AUTOMATIC INTERPRETATION OF EEGS
Machine Learning for Visual Scene Classification with EEG Data
Human Language Technology Research Institute
EEG Event Classification Using Deep Learning
A Dissertation Proposal by: Vinit Shah
Deep Residual Learning for Automatic Seizure Detection
Multi-UAV to UAV Tracking
Shengcong Chen, Changxing Ding, Minfeng Liu 2018
Presentation transcript:

EEG Event Classification Using Deep Learning Vinit Shah and Joseph Picone Neural Engineering Data Consortium Temple University

What is an EEG ? Electroencephalography (EEG) is a popular tool used to diagnose brain related illnesses. Scalp Electroencephalogram (EEG) monitoring is a non-invasive and convenient method to assess electrical activity from brain. Interpretation of EEGs is challenging, and its accurate annotation requires extensive training. Diagnosis is performed considering the factors such as patient’s video recordings, medical history, age, environmental & physiological changes, etc.

Seizure Morphologies Electrographic seizures can be detected either by observing epileptiform activities or by observing artifacts related to specific types of seizures. There are multiple types of seizures (i.e. tonic-clonic/grand mal, absence/petit mal, complex-partial) Interpretation of focal seizures require temporal as well as sufficient level of spatial information. Typically interpreters look for epileptiform activity such as spike and wave discharges and its evolution over time. Easy seizures show clear evolution in signal’s frequency and amplitude.

Epileptiform Activity Inconclusive Segments Some EEG records are very challenging and show wide spread epileptiform features along with artifacts (i.e. Shivers). Obscured patterns as such could also make interpretation inconclusive. There are specific rules implemented by American Clinical Neurophysiology Society (ACNS) for diagnosis of patients with epilepsy. Accurate onset and offset detection of an ictal is, in many cases, subjective which encourages us to use Any-Overlap method for scoring. Shivers Epileptiform Activity

Spectral Properties of an Ictal Seizure Seizures usually occur within the range of 2.5 to 25 Hz. Seizure duration can rand from 3 seconds to days. Generalized seizures are easy to spot due to their high energy in specific frequency bands. Waxing-waning patterns (e.g., bursts) can be mistakenly identified as ictal.

Artifacts and Medications Pose Challenges Artifacts such as chewing resemble features of tonic-clonic and complex-partial seizures. This is usually disambiguated from the context/history of the record. Medication makes it more difficult. Subtle seizures such as extremely focal, low energy seizures are widespread in ICU patients due to medication.

Feature Extraction TUH EEG Seizure Corpus (v1.2.1) was used: Version 1.2.1 Training set Evaluation set w/seiz Total Patients 119 265 38 50 Sessions 182 583 98 239 Epochs (sec.) 76,517 (6.4%) 1,196,381 (100.0%) 55,765 (9.0%) 618,096 (100.0%) TUH EEG Seizure Corpus (v1.2.1) was used: The database consists of clinical data with many types of seizures and lots of artifacts. World’s largest open source repository of EEG data. Standard frequency domain features are used: 10 frames per second 0.2 second analysis window 9 base features including energy, differential energy, and 7 cepstral coefficients. 1st and 2nd derivatives are used for most features. Total dimension: 26.

Baseline System: CNN-LSTM Three baseline systems: CNN-LSTM system Channel based LSTM networks A Kaldi multipass system with P-norm and MLP networks. CNN-LSTM Architecture: Each sample is a 21 sec. long window for all 22 channels. Trained with constant learning rate with a kernel size of (3, 3). Heuristic postprocessing approaches are applied which include a threshold for output probabilities and seizure events of a certain duration. An Adam optimizer is used. CNN-LSTM Model

Baseline System: LSTM Channel based LSTM Architecture: Each sample is a 7 sec. long window with right/left context (splice width) of 11 frames (1.1 sec.). Each channel is processed individually so that the model only learns spike/sharp and wave discharges. Trained with annealing learning rate after CV loss is stagnated for 3 consecutive epochs. SGD optimizer with nestrov momentum is used. A small CNN-LSTM-MLP model is used for postprocessing followed by heuristic postprocessing. LSTM Model

Baseline System: Kaldi Kaldi baseline systems: Kaldi multipass systems with P-norm (Dan’s DNN (nnet2) implementation) Kaldi multipass systems with MLP networks (TF implementation). Kaldi P-norm fast: Fixed Affined Component / LDA is applied to decorrelate splice window of 11 (Left/Right context of 5). Training is performed for 20 epochs with annealing learning rate with last 5 epochs with constant minimum lr. P-norm Input dim = 2000 & Output dim = 400. Preconditioned SGD is used which is a matrix valued learning rate. Kaldi DNN (TF): Tensorflow’s MLP network with 3 hidden layers is implemented. Priors, decision tree and alignments from Kaldi’s LDA-MLLT systems are used for acoustic modeling.

Evaluation Metrics Any Overlap method (OVLP): Any overlap method is a permissive method which looks for the detection of an event within a proximity of the reference. This metric tend to produce higher sensitivities since only isolated events are considered as false alarms. Multiple overlapping events detected in bursts are also counted as detection. Performance measures are calculated in terms of Sensitivity and Specificity (or false alarms per 24 hours): Sensitivity = ( True Positives / (True Positives + False Negatives) ) Specificity = (True Negatives / (True Negatives + False Positives) ) False Alarm rate = Rate of ( ( # Target False Positives / Total duration ) × (60 × 60 × 24) )

Performance Kaldi models use word boundary information during event classification. Kaldi’s best performance is only ~50% sensitivity @ 2.58 FAs/24 hours (with word boundaries). Channel-based LSTM network outperforms other models with no word boundary information. DNN Model Sensitivity (%) FA/24 Hours CNN-LSTM 30.8% 6.7 LSTM 40.3% 5.8 Kaldi P-Norm 60.1% 25.7 Kaldi MLP 49.8% 2.6 ROC curve for the target/seizure class. The region of interest is when the FA is low (< 10 FAs/24 hours).

Decoding and Error Analysis Kaldi lattices were used during decoding. Lattice-1best, lattice-push and lattice-to-post were used to obtain decoding results. Each of which uses word boundaries. Kaldi has a crude energy based automatic segmentation approach which is not adequate for segmentation of EEGs. Hypothesis Transcriptions Reference Transcriptions

Posterior Distribution LSTM network (AutoEEG) Decoding and Error Analysis Performance of the DNN-HMM models on seizures with different durations is quite similar. Decoded transcription probabilities are very high compared to any non-Kaldi models we have developed. Due to the binary classification problem, LM seems to flip the correctly detected class at the beginning and end of the record. Posterior Distribution P-Norm DNN (Kaldi) LSTM network (AutoEEG)