Improved EEG Event Classification Using Differential Energy

Slides:



Advertisements
Similar presentations
M. Tabrizi: Seizure Detection May, A COMPARATIVE ANALYSIS OF NONLINEAR FEATURES FOR AN HMM-BASED SEIZURE DETECTION SYSTEM Masih Tabrizi and Joseph.
Advertisements

Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Corpus Development EEG signal files and reports had to be manually paired, de-identified and annotated: Corpus Development EEG signal files and reports.
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
THE TUH EEG CORPUS: A Big Data Resource for Automated EEG Interpretation A. Harati, S. López, I. Obeid and J. Picone Neural Engineering Data Consortium.
Ashish Uthama EOS 513 Term Paper Presentation Ashish Uthama Biomedical Signal and Image Computing Lab Department of Electrical.
Representing Acoustic Information
Abstract EEGs, which record electrical activity on the scalp using an array of electrodes, are routinely used in clinical settings to.
Automatic Labeling of EEGs Using Deep Learning M. Golmohammadi, A. Harati, S. Lopez I. Obeid and J. Picone Neural Engineering Data Consortium College of.
Analysis of Temporal Lobe Paroxysmal Events Using Independent Component Analysis Jonathan J. Halford MD Department of Neuroscience, Medical University.
Abstract The emergence of big data and deep learning is enabling the ability to automatically learn how to interpret EEGs from a big data archive. The.
THE TUH EEG CORPUS: The Largest Open Source Clinical EEG Corpus Iyad Obeid and Joseph Picone Neural Engineering Data Consortium Temple University Philadelphia,
THE TUH EEG CORPUS: A Big Data Resource for Automated EEG Interpretation A. Harati, S. López, I. Obeid and J. Picone Neural Engineering Data Consortium.
Jacob Zurasky ECE5526 – Spring 2011
Data Acquisition An EEG measurement represents a difference between the voltages at two electrodes. The signal is usually displayed using a montage which.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
The Goal Use computers to aid physicians in diagnosis of neurological diseases, particularly epilepsy Detect pathological events in real time Currently,
TUH EEG Corpus Data Analysis 38,437 files from the Corpus were analyzed. 3,738 of these EEGs do not contain the proper channel assignments specified in.
Speech Communication Lab, State University of New York at Binghamton Dimensionality Reduction Methods for HMM Phonetic Recognition Hongbing Hu, Stephen.
Big Mechanism for Processing EEG Clinical Information on Big Data Aim 1: Automatically Recognize and Time-Align Events in EEG Signals Aim 2: Automatically.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 27,
Automatic Speech Recognition A summary of contributions from multiple disciplines Mark D. Skowronski Computational Neuro-Engineering Lab Electrical and.
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
Automated Interpretation of EEGs: Integrating Temporal and Spectral Modeling Christian Ward, Dr. Iyad Obeid and Dr. Joseph Picone Neural Engineering Data.
Abstract Automatic detection of sleep state is important to enhance the quick diagnostic of sleep conditions. The analysis of EEGs is a difficult time-consuming.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
Demonstration A Python-based user interface: Waveform and spectrogram views are supported. User-configurable montages and filtering. Scrolling by time.
Market: Customer Survey: 57 clinicians from academic medical centers and community hospitals, and 44 industry professionals. Primary Customer Need: 70%
By Sarita Jondhale 1 Signal preprocessor: “conditions” the speech signal s(n) to new form which is more suitable for the analysis Postprocessor: operate.
Abstract Automatic detection of sleep state is an important queue in accurate detection of sleep conditions. The analysis of EEGs is a difficult time-consuming.
Improved EEG Event Classification Using Differential Energy A.Harati, M. Golmohammadi, S. Lopez, I. Obeid and J. Picone Neural Engineering Data Consortium.
The Neural Engineering Data Consortium Mission: To focus the research community on a progression of research questions and to generate massive data sets.
Constructing Multiple Views The architecture of the window was altered to accommodate switching between waveform, spectrogram, energy, and all combinations.
Constructing Multiple Views The architecture of the window was altered to accommodate switching between waveform, spectrogram, energy, and all combinations.
Results from Mean and Variance Calculations The overall mean of the data for all features was for the REF class and for the LE class. The.
Descriptive Statistics The means for all but the C 3 features exhibit a significant difference between both classes. On the other hand, the variances for.
Scalable EEG interpretation using Deep Learning and Schema Descriptors
BioSignal Analytics Inc.
Automated Identification of Abnormal Adult EEG
Spectral and Temporal Modulation Features for Phonetic Recognition Stephen A. Zahorian, Hongbing Hu, Zhengqing Chen, Jiang Wu Department of Electrical.
LECTURE 11: Advanced Discriminant Analysis
ARTIFICIAL NEURAL NETWORKS
Spoken Digit Recognition
Presentation on Artificial Neural Network Based Pathological Voice Classification Using MFCC Features Presenter: Subash Chandra Pakhrin 072MSI616 MSC in.
CLASSIFICATION OF SLEEP EVENTS IN EEG’S USING HIDDEN MARKOV MODELS
G. Suarez, J. Soares, S. Lopez, I. Obeid and J. Picone
Enhanced Visualizations for Improved Real-Time EEG Monitoring
THE TUH EEG SEIZURE CORPUS
Enhanced Visualizations for Improved Real-Time EEG Monitoring
N. Capp, E. Krome, I. Obeid and J. Picone
Optimizing Channel Selection for Seizure Detection
EEG Recognition Using The Kaldi Speech Recognition Toolkit
Big Data Resources for EEGs: Enabling Deep Learning Research
AN ANALYSIS OF TWO COMMON REFERENCE POINTS FOR EEGS
E. von Weltin, T. Ahsan, V. Shah, D. Jamshed, M. Golmohammadi, I
Ala’a Spaih Abeer Abu-Hantash Directed by Dr.Allam Mousa
Digital Systems: Hardware Organization and Design
AUDIO SURVEILLANCE SYSTEMS: SUSPICIOUS SOUND RECOGNITION
Automatic Interpretation of EEGs for Clinical Decision Support
feature extraction methods for EEG EVENT DETECTION
LECTURE 42: AUTOMATIC INTERPRETATION OF EEGS
LECTURE 41: AUTOMATIC INTERPRETATION OF EEGS
Govt. Polytechnic Dhangar(Fatehabad)
EEG Event Classification Using Deep Learning
Deep Residual Learning for Automatic Seizure Detection
EEG Event Classification Using Deep Learning
Presenter: Shih-Hsiang(士翔)
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

Improved EEG Event Classification Using Differential Energy - My name is MG and on behalf of my colleagues in NEDC I’d like to say one time more welcome to Temple University. Today I am going to talk about improved EEG event classification using differential energy. - Feature extraction for automatic classification of EEG signals typically relies on time frequency representations of the signal including techniques like wavelets and cepstral-based filter banks. - In or study we applied feature extraction methods of speech processing on EEG signal. Today I present a comparison of a variety of approaches for feature extraction. - Additionally, I present a new simple feature named as differential energy and demonstrate that a combination of these approaches with differential energy can decrease the overall error rate more than 24% and improves our ability to discriminate between signal events and background noise. Harati, M. Golmohammadi, S. Lopez, I. Obeid and J. Picone Neural Engineering Data Consortium Temple University

The TUH EEG Corpus The TUH EEG Corpus contains over 28,000 sessions collected from 15,000+ patients over a period of 14 years at Temple University Hospital. - We have used the TUH EEG Corpus for evaluation. TUH EEG is the world’s largest publicly available database of clinical EEG data, comprising more than 28,000 EEG records and over 15,000 patients. - It represents the clinical data from Temple University Hospital’s Department of Neurology since 2002 and is an ongoing data collection project. - The raw signals consist of multichannel recordings that vary between 20 and 128 channels sampled at a minimum of 250 Hz using a 16-bit A/D converter. The data is stored as EDF files. In our study, we have resampled all the data to a common sample frequency of 250 Hz.

AutoEEG: Automatic Interpretation of EEGs AutoEEG is a hybrid system that uses three levels of processing to achieve high performance event detection on clinical data: P1: hidden Markov models P2: Deep Learning P3: Language Model In our lab we have been developing a predictive analytic tool named as AutoEEG to analyze EEG signal automatically. Physicians are spending 4 to 6 hours a day reading and scanning EEGs. AutoEEG can save time and money for them and increase their accuracy. The first stage is using a conventional HMM. In the second stage the temporal and spatial context of EEGs are analyzed using a deep learning method and here we are using SdA. In the third stage, we are using a language mode that is provided in consultation with our board certified neurologists at Temple University Hospital.

AutoEEG: Clinically-Relevant Signal Events Six events of interest based on multiple iterations with board certified neurologists: Clinical data contains many artifacts that can easily be interpreted as spikes, such as patient movement. Manual review of the data is a time-consuming process, especially for long-term EEG recordings. In ICU applications, real-time response is critical. Performance requirements for this application is a detection rate above 95% with a false alarm rate below 5%. For ICU applications, the false alarm rate must be extremely low (e.g., 1/8 FA/hr.). The system detects three events of clinical interest - SPSW events are epileptiform transients that are typically observed in patients with epilepsy. - PLED events are indicative of EEG abnormalities and often manifest themselves with repetitive spike or sharp wave discharges that can be focal or lateralized over one hemisphere. These signals display quasi-periodic behavior. - GPED events are similar to PLEDs, and manifest themselves as periodic short-interval diffuse discharges, periodic long-interval diffuse discharges and suppression-burst patterns according to the interval between the discharges. Triphasic waves, which manifest themselves as diffuse and bilaterally synchronous spikes with bifrontal predominance, typically at a rate of 1-2 Hz, are also included in this class. artifacts (ARTF) are recorded electrical activity that is not of cerebral origin, such as those due to the equipment, patient behavior or the environment; eye movement (EYEM) are common events that can often be confused with a spike; background (BCKG) is used for all other signals.

AutoEEG: Feature Extraction Goal: Calibrate traditional approaches to feature extraction based on cepstral coefficients, energy and derivatives. Mel Frequency Cepstral Coefficients (MFCCs): A linear frequency scale is used for EEGs. We have shown wavelets have very little advantage over MFCCs on the TUH EEG (which contradicts the literature). Introduced a new differential energy term that accentuates the differences between quasi-periodic signals and spikes. - Here I will review some well-know feature extraction methods in SR that we applied them on EEGs. Our goal is to calibrate traditional approaches and introduce differential energy. The most important feature extraction method that we used in our system is Mel Frequency Cepstral Coefficient (MFCC). You can see a block diagram of MFCC along with energy feature and their derivatives. Our window size is 200 ms and frame size 100 ms. It is common in the MFCC approach to compute cepstral coefficients by computing a high resolution fast Fourier Transform, then we apply the mel filterbank to the power spectra and sum the energy in each filter. Take the DCT of the log filterbank energies. Keep DCT coefficients 2-8, discard the rest. - Our experiments with Wavelets have shown very little advantage over MFCCs on the TUH EEG Corpus. - Unlike speech recognition which uses a mel scale for reasons related to speech perception, we use a linear frequency scale for EEGs, since there is no physiological evidence that a log scale is meaningful.

AutoEEG: Feature Extraction Time Domain energy an overlapping analysis window (a 50% overlap was used here) to ensure a smooth trajectory Frequency Domain energy provides a smoother, more stable estimate of the energy that leverages the cepstral representation of the signal. Derivatives of features: There are two types of energy terms that are often used: time domain and frequency domain. Time domain energy is a straightforward computation using the log of the sum of the squares of the windowed signal. We use an overlapping analysis window (a 50% overlap was used here) to ensure a smooth trajectory of this features. - The energy of the signal can also be computed in the frequency domain by computing the sum of squares of the oversampled filter bank outputs after they are downsampled. This form of energy is commonly used in speech recognition systems because it provides a smoother, more stable estimate of the energy that leverages the cepstral representation of the signal. However, the virtue of this approach has not been extensively studied for EEG processing. A typical value for N is 9 (corresponding to 0.9 secs) for the first derivative in EEG processing, and 3 for the second derivative.

AutoEEG: Feature Extraction Differential energy: To improve differentiation between transient pulse-like events (e.g., SPSW events) and stationary background noise This simple feature has proven to be surprisingly effective. We typically use a 0.9 sec window for this calculation. - In order to improve differentiation between transient pulse-like events (e.g., SPSW events) and stationary background noise, we have introduced a differential energy term that attempts to model the long-term change in energy. - This term examines energy over a range of M frames centered about the current frame, and computes the difference between the maximum and minimum over this interval. - This simple feature has proven to be surprisingly effective.

Experiments: The Curse of Dimensionality The use of differential features raises the dimension of a typical feature vector from 9 (e.g., 7 cepstral coefficients, frequency domain energy and differential energy) to 27. The training set contains segments from 359 sessions while the evaluation set was drawn from 159 sessions. - Dimensionality is something we must always pay attention to in classification systems and here, the training set was designed to provide a sufficient number of examples to train statistical models such as HMMs. The use of differential features raises the dimension of a typical feature vector from 9 (e.g., 7 cepstral coefficients, frequency domain energy and differential energy) to 27. There must be sufficient training data to support this increase in dimensionality. - The training set contains segments from 359 sessions while the evaluation set was drawn from 159 sessions. A distribution of the frequency of occurrence of the 6 types of events in the training and evaluation set is shown in Table 1. Note that some classes, such as SPSW, occur much less frequently in the actual corpus than common events such as BCKG. In fact, 99% of the data is assigned to the class BCKG, so special care must be taken to build robust classifiers for the non-background classes.

Experiments: Absolute Features 6-way classification: SPSW, GPED, PLED, EYEM, ARTF, BCKG 4-way classification: collapse the 3 background classes into one category, resulting in 4 classes – SPSW, GPED, PLED and BACKG. 2-way classification: collapse SPSW, GPED and PLED into a target class (TARG) and (ARTF, EYEM and BCKG) into a background class (BCKG). The first series of experiments was run on a simple combination of features. A summary of these experiments is shown in Table. Cepstral-only features were compared with several energy estimation algorithms. It is clear that the combination of frequency domain energy and differential energy provides a substantial reduction in performance. However, note that differential energy by itself (system no. 4) produces a noticeable degradation in performance.

Experiments: Differential Features Derivatives reduced the error rate on the 6-way task by 4% absolute (systems no. 1, 6 and 11). However, the improvement for a system using differential energy is much less pronounced (systems no. 5, 10 and 15). Eliminate the second derivative for differential energy (no. 16): - A second set of experiments were run to evaluate the benefit of using differential features. These experiments are summarized in Table. - The addition of the first derivative adds about 4% absolute in performance. However, when differential energy is introduced, the improvement in performance drops to only 4% absolute. - The story is somewhat mixed for the use of second derivatives. On the base cepstral feature vector, second derivatives reduce the error rate on the 6-way task by 4% absolute (systems no. 1, 6 and 11). However, the improvement for a system using differential energy is much less pronounced (systems no. 5, 10 and 15). - In fact, it appears that differential energy and derivatives do something very similar. Therefore, we evaluated a system that eliminates the second derivative for differential energy. This system is labeled no. 16 in Table 3. We obtained a small but significant improvement in performance over system no. 10.

Analysis: DET Curves DET curves for systems nos. 1, 5, 10, and 15 are shown below. The addition of first derivatives provides a measurable improvement in performance while second derivatives are less beneficial. Differential energy provides a substantial improvement over the base cepstral features. A DET curve analysis of feature extraction systems that compares absolute and differential features. The addition of first derivatives provides a measurable improvment in performance while second derivatives are less beneficial.

Analysis: Deep Learning and Language Modeling System no. 15: 7 cepstral coefficients 1 f_energy 1 d_energy 9 deltas for all 9 delta-deltas for all Total: 27 features System no. 16: 7 cepstral coefficients 1 f_energy 1 d_energy 9 deltas for all 8 delta-deltas (cepstral + f_energy only) Total: 26 features

Summary Established valuable baselines for standard feature extraction approaches for EEG signals. Traditional feature extraction methods used in speech recognition are relevant to EEGs. The use of a novel differential energy feature improved performance for absolute features, but that benefit diminishes as first and second order derivatives are included. There is benefit to using derivatives and there is a small advantage to using frequency domain energy. There are no significant benefits to using wavelets and other time- frequency representations. Future work will focus on new feature extraction methods based on principles of deep learning, discriminative training and nonparametric Bayesian models. The database will be significantly expanded over the next year and evolved into new tasks (e.g., real-time seizure detection).

Brief Bibliography [1] T. Yamada and E. Meng, Practical Guide for Clinical Neurophysiologic Testing: EEG. Philadelphia, Pennsylvania, USA: Lippincott Williams & Wilkins, 2009. [2] A. Harati, S. Lopez, I. Obeid, M. Jacobson, S. Tobochnik, and J. Picone, “THE TUH EEG CORPUS: A Big Data Resource for Automated EEG Interpretation,” in Proceedings of the IEEE SPMB, 2014, pp. 1–5. [3] J. Picone, “Continuous Speech Recognition Using Hidden Markov Models,” IEEE ASSP Mag., vol. 7, no. 3, pp. 26–41, Jul. 1990. [4] S. Sanei and J. A. Chambers, EEG signal processing. Hoboken, New Jersey, USA: Wiley- Interscience, 2008. [5] J. Lyons, “Mel Frequency Cepstral Coefficient (MFCC) tutorial,” Practical Cryptography, 2015 (available: http://practicalcryptography. com/miscellaneous/machine-learning/guide-mel- frequency-cepstral-coefficients-mfccs/. [6] S. Davis and P. Mermelstein, “Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences,” IEEE Trans. ASSP, vol. 28, no. 4, pp. 357–366, 1980. [7] P. Garrit, et al., “Wavelet Analysis for Feature Extraction on EEG Signals,” presented at Temple University CoE Res. Exp. for UG Conf., 2015 (available at http://www.isip.piconepress.com/publications/ unpublished/conferences/2015/summer_of_code/wavelets/).