Dimensionality Reduction for fMRI Brain Imaging Data Leman Akoglu Carnegie Mellon University, Computer Science Department Abstract Functional Magnetic.

Slides:

Advertisements

Similar presentations

1 Challenge the future Multi-scale mining of fMRI data with hierarchical structured sparsity – R. Jenatton et al, SIAM Journal of Imaging Sciences, 2012.

Advertisements

SVM—Support Vector Machines

Bilge Mutlu, Andreas Krause, Jodi Forlizzi, Carlos Guestrin, and Jessica Hodgins Human-Computer Interaction Institute, Carnegie Mellon University Robust,

fMRI data analysis at CCBI

Predictive Modeling of Spatial Properties of fMRI Response Predictive Modeling of Spatial Properties of fMRI Response Melissa K. Carroll Princeton University.

Hidden Process Models with applications to fMRI data Rebecca Hutchinson Oregon State University Joint work with Tom M. Mitchell Carnegie Mellon University.

Hidden Process Models: Decoding Overlapping Cognitive States with Unknown Timing Rebecca A. Hutchinson Tom M. Mitchell Carnegie Mellon University NIPS.

Region of Interests (ROI) Extraction and Analysis in Indexing and Retrieval of Dynamic Brain Images Researcher: Xiaosong Yuan, Advisors: Paul B. Kantor.

1 Learning fMRI-Based Classifiers for Cognitive States Stefan Niculescu Carnegie Mellon University April, 2003 Our Group: Tom Mitchell, Luis Barrios, Rebecca.

1 Classifying Instantaneous Cognitive States from fMRI Data Tom Mitchell, Rebecca Hutchinson, Marcel Just, Stefan Niculescu, Francisco Pereira, Xuerui.

Hidden Process Models Rebecca Hutchinson Tom M. Mitchell Indrayana Rustandi October 4, 2006 Women in Machine Learning Workshop Carnegie Mellon University.

7/17/2002 Greg Grudic: Nonparametric Modeling 1 High Dimensional Nonparametric Modeling Using Two-Dimensional Polynomial Cascades Greg Grudic University.

1 Automated Feature Abstraction of the fMRI Signal using Neural Network Clustering Techniques Stefan Niculescu and Tom Mitchell Siemens Medical Solutions,

Multi-voxel Pattern Analysis (MVPA) and “Mind Reading” By: James Melrose.

Modeling fMRI data generated by overlapping cognitive processes with unknown onsets using Hidden Process Models Rebecca A. Hutchinson (1) Tom M. Mitchell.

Measuring Blood Oxygenation in the Brain. Functional Imaging Functional Imaging must provide a spatial depiction of some process that is at least indirectly.

Learning to Identify Overlapping and Hidden Cognitive Processes from fMRI Data Rebecca Hutchinson, Tom Mitchell, Indra Rustandi Carnegie Mellon University.

Jeff Howbert Introduction to Machine Learning Winter Machine Learning Feature Creation and Selection.

HELSINKI UNIVERSITY OF TECHNOLOGY LABORATORY OF COMPUTER AND INFORMATION SCIENCE NEURAL NETWORKS RESEACH CENTRE Variability of Independent Components.

Comparison of Boosting and Partial Least Squares Techniques for Real-time Pattern Recognition of Brain Activation in Functional Magnetic Resonance Imaging.

U NIVERSITY OF M ASSACHUSETTS A MHERST Department of Computer Science 2011 Predicting Solar Generation from Weather Forecasts Using Machine Learning Navin.

Boris Babenko Department of Computer Science and Engineering University of California, San Diego Semi-supervised and Unsupervised Feature Scaling.

8/25/05 Cognitive Computations Software Tutorial Page 1 SNoW: Sparse Network of Winnows Presented by Nick Rizzolo.

Fuzzy Entropy based feature selection for classification of hyperspectral data Mahesh Pal Department of Civil Engineering National Institute of Technology.

Classification and Ranking Approaches to Discriminative Language Modeling for ASR Erinç Dikici, Murat Semerci, Murat Saraçlar, Ethem Alpaydın 報告者：郝柏翰 2013/01/28.

Keystroke Recognition using WiFi Signals

Neural systems supporting the preparatory control of emotional responses Tor D. Wager, Brent L. Hughes, Matthew L. Davidson, Melissa Brandon, and Kevin.

1 Preliminary Experiments: Learning Virtual Sensors Machine learning approach: train classifiers –fMRI(t, t+  )  CognitiveState Fixed set of possible.

Reduction of Training Noises for Text Classifiers Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan.

Contrasts & Inference - EEG & MEG Himn Sabir 1. Topics 1 st level analysis 2 nd level analysis Space-Time SPMs Time-frequency analysis Conclusion 2.

Mining Social Network for Personalized Prioritization Language Techonology Institute School of Computer Science Carnegie Mellon University Shinjae.

Why to reduce the number of the features? Having D features, we want to reduce their number to n, where n

D. M. J. Tax and R. P. W. Duin. Presented by Mihajlo Grbovic Support Vector Data Description.

ECE 8443 – Pattern Recognition LECTURE 08: DIMENSIONALITY, PRINCIPAL COMPONENTS ANALYSIS Objectives: Data Considerations Computational Complexity Overfitting.

MLE’s, Bayesian Classifiers and Naïve Bayes Machine Learning Tom M. Mitchell Machine Learning Department Carnegie Mellon University January 30,

Paired Sampling in Density-Sensitive Active Learning Pinar Donmez joint work with Jaime G. Carbonell Language Technologies Institute School of Computer.

Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.

Chapter 3: Maximum-Likelihood Parameter Estimation l Introduction l Maximum-Likelihood Estimation l Multivariate Case: unknown , known  l Univariate.

Hand Motion Identification Using Independent Component Analysis of Data Glove and Multichannel Surface EMG Pei-Jarn Chen, Ming-Wen Chang, and and Yi-Chun.

1/18 New Feature Presentation of Transition Probability Matrix for Image Tampering Detection Luyi Chen 1 Shilin Wang 2 Shenghong Li 1 Jianhua Li 1 1 Department.

C O R P O R A T E T E C H N O L O G Y Information & Communications Neural Computation Machine Learning Methods on functional MRI Data Siemens AG Corporate.

© Devi Parikh 2008 Devi Parikh and Tsuhan Chen Carnegie Mellon University April 3, ICASSP 2008 Bringing Diverse Classifiers to Common Grounds: dtransform.

1 Modeling the fMRI signal via Hierarchical Clustered Hidden Process Models Stefan Niculescu, Tom Mitchell, R. Bharat Rao Siemens Medical Solutions Carnegie.

Data Mining and Decision Support

Effects of Verbal Working Memory Load on Corticocorical Connectivity Modeled by Path Analysis of Functional Magnetic Resonance Imaging Data Honey et al.

FMRI and Behavioral Studies of Human Face Perception Ronnie Bryan Vision Lab

By Semaj and Hannah The thing that is difficult about his job is dealing with doctors.

Control of a humanoid robot using EEG. Problem EEG is low bandwidth Hard to exercise fine grained control.

1 Bernard Ng 1, Arash Vahdat 2, Ghassan Hamarneh 3, Rafeef Abugharbieh 1 Contact 1 Biomedical Signal and Image Computing Lab,

Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.

Using Asymmetric Distributions to Improve Text Classifier Probability Estimates Paul N. Bennett Computer Science Dept. Carnegie Mellon University SIGIR.

The general linear model and Statistical Parametric Mapping II: GLM for fMRI Alexa Morcom and Stefan Kiebel, Rik Henson, Andrew Holmes & J-B Poline.

A Novel Assessment Tool for Alzheimer's and Frontotemporal Dementias Jeanyung Chey 1,2, Hyun Song 2, Jungsuh Suk 1, & Minue J. Kim 3 The Proportional Reasoning.

Learning to Decode Cognitive States from Brain Images Tom Mitchell et al. Carnegie Mellon University Presented by Bob Stark.

Evaluating Hypotheses

Chapter 3: Maximum-Likelihood Parameter Estimation

Boosted Augmented Naive Bayes. Efficient discriminative learning of

Perceptrons Lirong Xia.

Hidden Process Models with applications to fMRI data

Machine Learning Feature Creation and Selection

A Similarity Retrieval System for Multimodal Functional Brain Images

Principal Component Analysis

Keystroke Recognition using Wi-Fi Signals

School of Computer Science, Carnegie Mellon University

Adaptive multi-voxel representation of stimuli, rules and responses

Amy E. Skerry, Rebecca Saxe Current Biology

What is The Optimal Number of Features

CAMCOS Report Day December 9th, 2015 San Jose State University

A Data Partitioning Scheme for Spatial Regression

Perceptrons Lirong Xia.

Presentation transcript:

Dimensionality Reduction for fMRI Brain Imaging Data Leman Akoglu Carnegie Mellon University, Computer Science Department Abstract Functional Magnetic Resonance Imaging (fMRI) is a very powerful instrument to collect data about activity in the human brain. Like in many empirical sciences, this new method has led to a flood of new data. Motivation: If appropriate analysis tools can be developed given the big amount of data produced, fMRI technology offers revolutionary approaches to the study of human brain functioning. For example, if cognitive states of the brain could be decoded, medical diagnosis of Alzheimer’s, Dementia, Brain Tumors or Schizophrenia would be possible given the fMRI brain activity of a human subject. Limitations: (1) sparse data (tens of training examples per human subject), (2) noisy data (3) extremely high dimensional (up to 10 5 ) feature space. Objectives: (1) Figure out powerful dimensionality reduction methods in order to make “learning” easier and faster. (2) Find best informative features in order to increase classification accuracy. FEATURE SELECTION METHODS Discrim Train a separate classifier for each voxel. Each voxel has 16 features ( 8-sec intervals) The accuracy of each single-voxel classifier over the training data is regarded as the measure of discriminating power. Pick top n most discriminating voxels. EXPERIMENT RESULTS Picture versus Sentence case study Active Score each voxel based on how active it is relative to the fixation (rest) condition. Pick top n most active voxels. ActiveThenDiscrim Select most active m voxels. Train a separate classifier for each of m active voxels. Pick top n most discriminating active voxels. DiscrimAndActive Train a separate classifier for each voxel. Select top n most ‘discriminating’ voxels. Select top n voxels with highest activity score. Pick the subset of voxels in the intersection (most active AND discriminating voxels) *Time-SeriesAvg Group those voxels time-series of which are highly correlated. Correlation measure is covariance. Average time series of voxels in the same group to form new supervoxels. *Time-SeriesMost Determine the most effective voxel. Find those voxels time-series of which is not correlated to that of the most effective voxel (informative voxels). Drop voxels with time-series highly correlated to that of the most effective voxel (reduce redundancy). Feature selectionAvgErrABCDEF All (~5000) Active(120) Discrim(120) ActiveThenDiscrim (nToKeep=120, nActive=2000) DiscrimAndActive (nDiscrim=120, nActive=2000) ActiveTSavg(240) DiscrimTSavg(120) ActiveThenDiscrimTSavg (nToKeep=120, nActive=2000) ActiveTSmost(120) DiscrimTSmost(120) ActiveThenDiscrimTSmost (nToKeep=120, nActive=2000) Feature selection Average error 1NN3NN9NNSVM Active (nToKeep) (120)0.2854(240)0.3000(480) (240) Discrim (nToKeep) (120)0.2417(120)0.2042(120) (120) ActiveThenDiscrim (nToKeep, nActive) (240,1000) (120,1000) (120,2000) (120,1000) DiscrimAndActive (nDiscrim, nActive) (120,2000) (120,3000) (120,3000) (120,3000) All (~5000) CONCLUSIONS  Brain cognitive state classification is possible (better than random classification accuracies).  Error decreases considerably when feature selection is used for all types of classifiers.  Discrimination-based method outperforms activity-based method. But, Discrim is computationally more expensive than Active. It is also prone to overfitting as its performance is evaluated on training data.  ActiveThenDiscrim outperforms Active and its accuracy is very close to that of Discrim, but is computationally less demanding, which makes it a good alternative.  DiscrimAndActive outperforms Active and well approximates the error rates of Discrim, just like ActiveThenDiscrim. But, it is computationally as demanding as Discrim. Still, it could be a good alternative for feature selection as it reduces the number of voxels significantly.  For the time-series methods, the number of features are further reduced, almost halved. Still, accuracy results are very close to those without applying time-series methods. These methods come with extra computational cost, but can be employed when high dimensionality is a problem as it makes learning difficult, increasing the number of parameters to be estimated consecutive trials for 6 human subjects - fMRI images every 500 msec - rest (fixation) periods for zero-signal-data - find a mapping function f : fMRI-sequence(t 0,t 0 +8)  { Picture, Sentence } 1 st stimulus (picture): 4 secs Rest(fixation) period: 4 secs 2 nd stimulus (sentence) : 4secs