Presentation is loading. Please wait.

Presentation is loading. Please wait.

Returning Back …. A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University Prof. Matt Hibbs Jackson Labs.

Similar presentations


Presentation on theme: "Returning Back …. A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University Prof. Matt Hibbs Jackson Labs."— Presentation transcript:

1 Returning Back …

2 A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University Prof. Matt Hibbs Jackson Labs

3  Quality control → set of 3041 genes  Combine gene volumes into a large matrix  Decompose the voxel x gene matrix using singular value decomposition (SVD) voxels modes xx genes s.v.’s M ≈M ≈ “weight” spatial pattern gene pattern Large-scale Correlation

4 N=271 before we get to 90% of the variance N=67 before we get to 80% of the variance Principal modes (SVD) Cerebral cortex Olfactory areas Hippocampus Retrohippocampal Striatum Pallidum Thalamus Hypothalamus Midbrain Pons Medulla Cerebellum All LH brain voxels plotted as projections on first 3 modes

5 Interpreting gene modes Spatial modes are easily visualized. Attempt to annotate eigenmodes using Gene Ontology (GO) annotations: Each GO term partitions gene list into two subsets: IN genes: Genes annotated by that GO term OUT genes: Genes not annotated by that GO term Each singular vector associates each subset above with a set of amplitudes Compare these amplitudes, asking whether ‘IN’ genes have larger magnitudes than ‘OUT’ genes use K-S test to test whether the amplitude distributions are different

6 In this low dimensional space Cerebellum and striatum separated - GABAergic interneurons and glutamatergic projection neurons in adult mouse forebrain Other regions are clustered in greatly reduced space, but with considerable overlap Anatomical regions do not in general correspond directly to individual SVD modes Clustering of gene expression profiles in very low dimensional subspace groups voxels drawn from same brain regions

7 Component Annotations Distinctly high amplitude in the dentate gyrus of the hippocampus. Enhanced specificity for the cerebellum, Particularly prominent in the cerebellum and the striatum. Decomposition extracts correlated structure in expression profiles that corresponds to anatomical subdivision

8 Once again

9 Gene clustering? Genes are somewhat less separable - and less categorical Build gene-gene similarity graph partition, color code each point…

10 K-Means Segmentation What does gene expression tell about regional brain organization ? Use simple cluster analysis. K-means clustering:  Dimensionality reduced (to 271) by truncating SVD  Assign one of K labels to each voxel  All voxels assigned the same label have more similar expression profiles than voxels with different labels  Similarity defined by Euclidean distance Data-driven parcellation of mouse brain anatomy (level of granularity determined by K)

11 K-means clustering results

12 Spatially Contiguous Clusters K=2 – clusters separates cerebral cortex hippocampus (gray) from other areas (white) K = 8 – cerebellum/striatum clearly segmented, cortex is subdivided into distinct layers K = 16 - thalamus has its own cluster; cortical layers further differentiated, midbrain separated from hindbrain Large K – More anatomical details observed; separation of caudoputamen from the nucleus accumbens; display laminar and areal patterns in cortex

13 Clustering in Cerebral Cortex K = 40 (masked) ARA Area masks Divides aud/vis areas from somatosensory areas Laminar clusters broken into distinct groups along anterior– posterior direction (bottom) at border between auditory & somatosensory areas Validation

14 Relevant Questions Determine, for a given structure, at what value of K it emerges as its own cluster ? Relative prioritization of anatomical areas based on expression pattern similarity Dominant clustering of gene expression along cortical layers consistent with those of Ng et al.

15 Compare with Reference Atlas Reference atlases here are “flat” parcellations with 12 or 94 regions Similarity index (S) ranges from 0-1 Overlap saturating at K > 30 Clusters for large K are subdivisions of those for low K

16 Compare with Reference Atlas K=12 Clusters 1, 2, 3, and 4 together the cerebral cortex Cluster 11 largely corresponds thalamus Cluster 9 is wholly contained in the cerebellum Cluster 10 in the striatum.

17 Classification of Region Membership Supervised learning using linear discriminant (25% test set, 10-fold cross-validation) 94.5% correct overall

18 What Next ? Size of voxels large relative to individual cell bodies Voxels will contain a mixture of several cell types. Unique expression signature for discrete brain locations with different combinations of cell types. Spatial co-expression indicator of functionally-related or interacting genes

19 Localization of expression Normalized Expression Energy Voxels Non-localized expression pattern Well-localized expression pattern Kullback-Leibler (KL) divergence from (spatial) uniformity

20 summed thresholded Gene Localization Select most localized genes (KL > ~1.56) to further analyze Threshold voxels based on intensity histogram of summed expressions Remaining LH mask (6102 voxels) essentially excludes cerebral cortex

21 Voxel Uniformity in Gene Space Measure KL divergence from uniform density across gene space at each voxel Brighter color indicates lower KL divergence (more uniform expression across genes) Note cortex is generally more uniform than subcortical areas And middle cortical layers are notably more uniform than superficial and deepest layers

22 “Expression diversity” Expression diversity across gross structures Expression diversity across cortical layers and areas Average KL divergence across all voxels in a particular anatomical region

23 Construct a bipartite graph with N (200) genes in vertex set V 1 and M (~6000) mask voxels in V 2  Edges are expression levels of each gene at each voxel Apply graph partitioning methods to cut graph into connected components  Components contain both voxels and genes  Here we used the isoperimetric algorithm (Grady and Schwartz, 2006). GENES VOXELS Biclustering Genes & Voxels V1V1 V2V2 Can we group genes that are each highly localized to common brain regions (sets of voxels)?

24 What is Biclustering ? Finding submatrices in an n x m matrix that follow a desired pattern* Row/column order need not be consistent between different biclusters.

25

26

27

28

29 Bicluster properties For any submatrix C IJ where I and J are a subsets of genes and conditions, the mean squared residude score is A bicluster is a submatrix C IJ that has a low mean squared residue score. Biclustering of Expression data: Cheng and Church, RECOMB 2001

30 Cheng and Church Greedy Approach Finds a submatrix that minimizes MSR Biclusters (a) and (b) fits the definition of MSR

31 Biclustering Localized Genes 40 genes 29 genes Resulting voxel clusters correspond well to individual anatomical regions, w/ functionally relevant gene lists 97% of energy in the cerebellum Highly localized to ventricle system

32 Biclustering Localized Genes 30 genes 11 genes Results shown are for 13 biclusters 69% of energy in dentate gyrus, 20% Ammon’s horn 99% of energy in thalamus

33 Cell-type expression model Hypothesis: do genes emerging from these biclusters represent preferential “markers” of cell types localized to the corresponding regions? Cell-type specific microarray data are available (Okaty et al., 2009; 2011) to help answer this question Compare microarray profiles of these cell types with voxel- based transcriptomic data from ABA  2131 overlapping genes (with high quality ABA data)

34 Cell-type based expression Spatial patterns reflect organization within brain regions A B C D (A)Granule cells (B) Purkinje cells (C) Stellate cells (D) mature oligodendrocytes

35 Biclusters Cell Types Highly localized genes emerging from bi- clusters (usually) show selective expression in local cell types CP bi-cluster Cb bi-cluster

36 Heritable “Disease Networks” Online Mendelian Inheritance in Man (OMIM) –Contains records of genetic basis for ~4000 disorders –Manually curated 94 unique entities that are of neurological / neuropsychiatric interest and intersect our gene set 1.For each disorder, calculate the mean expression pattern across orthologs of implicated genes (MGI orthology) 2.Calculate a distance matrix between disorders by computing the pairwise cosine distance between expression profiles 3.Cluster disorders using hierarchical cluster analysis

37 OMIM Disease Clusters Complete linkage clustering

38 Lhx1 Autism Candidate For a given gene list, embed expression similarity in 2D space Ex: ASD candidate genes from Wigler lab (CSHL) (16 genes in high quality coronal data set) Calculate cosine distance matrix, and apply metric MDS Provide sub-groupings based on expression locus Fgd3 Cb MapT Doc2a Ctx Ptpdc1

39 Next ? timeTR... -0.5 0 0.5 1 Component Spatial components 024681012 1 2 3 4 fMRI


Download ppt "Returning Back …. A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University Prof. Matt Hibbs Jackson Labs."

Similar presentations


Ads by Google