Dr. J Bruce Morton & Daamoon Ghahari

Slides:



Advertisements
Similar presentations
FMRI Methods Lecture 10 – Using natural stimuli. Reductionism Reducing complex things into simpler components Explaining the whole as a sum of its parts.
Advertisements

Covariance Matrix Applications
Lecture 3: A brief background to multivariate statistics
EigenFaces.
Pattern Recognition for the Natural Sciences Explorative Data Analysis Principal Component Analysis (PCA) Lutgarde Buydens, IMM, Analytical Chemistry.
1er. Escuela Red ProTIC - Tandil, de Abril, 2006 Principal component analysis (PCA) is a technique that is useful for the compression and classification.
Principal Component Analysis
Principal Component Analysis
Principal Component Analysis
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL FADTTS: Functional Analysis of Diffusion Tensor Tract Statistics Hongtu Zhu, Ph.D. Department of Biostatistics.
Correlation. The sample covariance matrix: where.
A vector can be interpreted as a file of data A matrix is a collection of vectors and can be interpreted as a data base The red matrix contain three column.
SVD(Singular Value Decomposition) and Its Applications
Summarized by Soo-Jin Kim
Chapter 2 Dimensionality Reduction. Linear Methods
Presented By Wanchen Lu 2/25/2013
Next. A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University.
A biologist and a matrix The matrix will follow. Did you know that the 20 th century scientist who lay the foundation to the estimation of signals in.
The Multiple Correlation Coefficient. has (p +1)-variate Normal distribution with mean vector and Covariance matrix We are interested if the variable.
Some matrix stuff.
ADHD By: Kourtni, Chelsea, and Aaron. What is ADHD? ADHD stands for Attention deficit hyperactivity disorder ADHD is a problem with inattentiveness, over-activity,
Working with Students with Attention Deficit Hyperactivity Disorder ADHD/ADD.
By: Holly J. Braden   ADHD can lead to problems with:  Organization  Becoming Easily Frustrated  Social Interactions  This can affect a child’s.
SINGULAR VALUE DECOMPOSITION (SVD)
Distances Between Genes and Samples Naomi Altman Oct. 06.
1 Sample Geometry and Random Sampling Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
Principal Components: A Mathematical Introduction Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia University.
Reduces time complexity: Less computation Reduces space complexity: Less parameters Simpler models are more robust on small datasets More interpretable;
EIGENSYSTEMS, SVD, PCA Big Data Seminar, Dedi Gadot, December 14 th, 2014.
Advanced Computer Graphics Spring 2014 K. H. Ko School of Mechatronics Gwangju Institute of Science and Technology.
SGPP: Spatial Gaussian Predictive Process Models for Neuroimaging Data Yimei Li Department of Biostatistics St. Jude Children’s Research Hospital Joint.
Principal Component Analysis (PCA).
Character legibility: which details are important, and how might it be measured? John Hayes, Jim Sheedy, Yu-Chi Tai, Vinsunt Donato, David Glabe.
Attention Deficit Disorder (ADD) By: Joe, Kyle, and Nate.
Attention-Deficit/Hyperactivity Disorder: What you need to know
Differences in SPECT Perfusion in Children and Adolescents with ADHD
Principal component analysis (PCA)
Factor and Principle Component Analysis
Principal Component Analysis (PCA)
Department of Psychiatry, Washington University School of Medicine
CSSE463: Image Recognition Day 27
Linear Algebra Review.
Linear Algebra review (optional)
Exploring Microarray data
Attention-Deficit/ Hyperactivity Disorder
Lecture 8:Eigenfaces and Shared Features
Lecture: Face Recognition and Feature Reduction
Sleep and Adhd The Link between Parent and Child Sleep Disturbances in Children with Attention Deficit Hyperactivity Disorder Dr. Martin Efron The Child.
Where do I find information about…..?
Principal Component Analysis (PCA)
What is development? Domains of development
Principal Component Analysis
Dynamic graphics, Principal Component Analysis
A Similarity Retrieval System for Multimodal Functional Brain Images
Principal Component Analysis (PCA)
Detecting Gray Matter Maturation via Tensor-based Surface Morphometry
Recitation: SVD and dimensionality reduction
Matrix Algebra and Random Vectors
CSSE463: Image Recognition Day 25
Outline Singular Value Decomposition Example of PCA: Eigenfaces.
Maths for Signals and Systems Linear Algebra in Engineering Lectures 13 – 14, Tuesday 8th November 2016 DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR)
CSSE463: Image Recognition Day 25
Lecture 13: Singular Value Decomposition (SVD)
Principal Component Analysis
Linear Algebra review (optional)
Restructuring Sparse High Dimensional Data for Effective Retrieval
Lecture 16. Classification (II): Practical Considerations
White Matter by Diffusion MRI following Methylphenidate Treatment: A Randomized Control Trial in Males with Attention-Deficit/Hyperactivity Disorder In.
Presentation transcript:

Hypothesis-free search for biomarkers in a large imaging data set using SVD Dr. J Bruce Morton & Daamoon Ghahari Cognitive Development & Neuroimaging Lab Methods Lunch June 24th 2019

Outline Current methods of diagnosing ADHD & other neuropsychiatric disorders The search for relevant brain biomarkers ABCD & large neuroimaging consortiums What is SVD and how can it be used in the search for biomarkers Combining SVD with General Linear Modeling Demonstration of SVD with simulated data

Current Method of Diagnosing ADHD Attention-deficit/hyperactivity disorder (ADHD) is one of the most commonly diagnosed childhood behavioral disorders, affecting approximately 5% of school-age children Currently ADHD is characterized based on atypical behavioural profiles such as consistent inattention, impulsivity and hyperactivity reported by parents, caregivers, and school teachers. So in other words the diagnosis of ADHD remains reliant on classification systems derived from clusters of symptoms that are subjectively reported rather than etiology or neurobiology. Thus, the quest for functional or structural brain biomarkers which support the behavioural characterization and account for behavioural variability within ADHD is really important.

The Search for Relevant Brain Biomarkers ? So while it is true that in recent decades we have witnessed marked advances in identifying biological correlates for neuropsychiatric conditions such as ADHD, our understanding of the underlying neurobiology still remains insufficient to aid in the diagnosis or characterization of the condition. In particular, one hurdle in the face of identifying relevant brain features is the availability of large-scale imaging data, which would allow us deploy data driven approaches and test empirically grounded hypothesis.

ABCD & Large Neuroimaging Consortiums > 11,000 children Luckily, scientists all around the world have realized the value of developing and contributing to open-science consortiums. One such example is the ABCD consortium which stands for Adolescent Brain Cognitive Development Currently, the ABCD Study is the largest long- term study of brain development and child health in the United States, with over 21 institutions involved. The most recent release of data contains over 11,000 children in the “baseline” collection who range from 9-12 years old, with future releases containing follow up metrics

ABCD & Large Neuroimaging Consortiums The really cool thing about ABCD is the sheer amount data they have available The ABCD protocol contains a comprehensive set of physical, cognitive, social, emotional, environmental, behavioral, and academic assessments, as well as multimodal neuroimaging (DTI, RSI, resting state MRI, task-based MRI, and structural MRI) and biospecimen collection, so hair and saliva samples, for hormonal, genetic, environmental exposure, and substance use analysis. The study releases completely unprocessed raw dicoms, as well as processed data in a matrix form

ABCD & Large Neuroimaging Consortiums White Matter Metrics FA MD Tract Volumes Features Participants This is data which has gone through standard preprocessing, including motion and displacement correction, registration, ROI segmentation using , and so forth -- all the details of their pipeline are released in their documentation as well as quality control metrics for every subject you could imagine how data released in this matrix form would be very appealing for a project if you are a graduate student who doesn’t have access to a supercomputing facility or doesnt have 12 years to run dicoms from 11,000 subjects through your pipeline So what I have is a matrix, M, with participants as rows and features as columns-- lets say that you are interested in white matter metrics, such as Fractional Anisotropy, Mean Diffusivity, White matter volumes, etc. you could populate the matrix with functional/structural features that you are interested in The issue now becomes -- how can I work with possibly 1000s of features across 11,000 participants in an effort to characterize brain biomarkers of ADHD? And this is where SVD can be really useful.

Singular Value Decomposition is essentially a method which can be used to reduce the dimensionality in the original data matrix M The result of SVD is three component matrices, U, SIGMA and the transpose of V. U is a matrix containing the left-singular vectors or “left eigenvectors” of M The diagonal of Sigma contains non-zero singular values or eigenvalues of M in a descending order The transpose of V contains the right singular vectors or “right eigenvectors” of M.

We can then take the diagonal eigenvalues of SIGMA and display them as a scree-plot There are various methods, but basically you can identify the “elbow” of the scree plot, which is the point where adding additional components doesn’t really add crucial information to our reconstruction & select the top x eigenvectors which account for the most variance in the data matrix Now we can truncate our eigenvector matrices based on the number of eigenvalues we want to keep.

So then if we take our top eigenvectors, and multiply it against the original data matrix we can get a matrix containing component factor loadings-- which are essentially the relative weightings of each of those components in the reconstruction of that subject Or in other words, a measure of how well every subject can be mapped onto a specific eigenvector based on their white matter features

We can then take the matrix of component factor loadings and model it as the dependent variable in a general linear model, where we have a regressor matrix, which can contain some independent variables such as CBCL ADHD symptomatology, age sex, and so on. This will essentially allows us to determine how aberrant patterns in brain phenotypes as computed using SVD relate to ADHD symptom severity and other metrics which were not included in our original data matrix.

Simulation Code To illustrate this and to give everyone a more intuitive understanding we have some simulation code.