PDbase : A database of Parkinson’s Disease-related genes and genetic variation using substantia nigra ESTs Jin Ok Yang Korean BioInformation Center (KOBIC)

Slides:



Advertisements
Similar presentations
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Advertisements

Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
1 Computational Molecular Biology MPI for Molecular Genetics DNA sequence analysis Gene prediction Gene prediction methods Gene indices Mapping cDNA on.
Bioinformatics Needs for the post-genomic era Dr. Erik Bongcam-Rudloff The Linnaeus Centre for Bioinformatics.
Gene ontology & hypergeometric test Simon Rasmussen CBS - DTU.
Gene Expression And Regulation Bioinformatics January 11, 2006 D. A. McClellan
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Gene Discovery & Genome Browsing
PHL 437/Pharmacogenomics Fourth Lecture (Parkinson’s disease) By Abdelkader Ashour, Ph.D. Phone:
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Parkinson’s Disease Pathology 430/826 The Molecular Basis of Disease Neurological Genetics 9 th March 2015 Dr. John Rossiter
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Doug Brutlag 2011 Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University School of Medicine Genomics, Bioinformatics.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Doug Brutlag 2011 Next Generation Sequencing and Human Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University.
>>> Korean BioInformation Center >>> KRIBB Korea Research institute of Bioscience and Biotechnology GS2PATH: Linking Gene Ontology and Pathways Jin Ok.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
EnrichNet: network-based gene set enrichment analysis Presenter: Lu Liu.
PutidaNET :Interactome database service and network analysis of Pseudomonas putida KT2440 (P. putida KT2440) Korean BioInformation Center (KOBIC) Seong-Jin,
Ranking-Aware Integration and Explorative Search of Distributed Bio-Data Dipartimento di Elettronica e Informazione NETTAB 2012 Integrated Bio-Search November.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Parkinson’s Disease superKAT :).
DONNA MAGLOTT, PH.D. PRO AND MEDICAL GENETICS RESOURCES AT NCBI.
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
Copyright OpenHelix. No use or reproduction without express written consent1.
Korea BioInformation Center Byoung-Chul Kim
 Parkinson Disease (PD) is a disorder of the brain that causes a variety of movement problems.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Respective contributions of MIAME, GeneOntology and UMLS for transcriptome analysis Fouzia Moussouni, Anita Burgun, Franck Le Duff, Emilie Guérin, Olivier.
 Parkinson’s Disease (PD) -progressive neurodegenerative disease affecting motor ability -third most common neurologic disorder of older adults.
EB3233 Bioinformatics Introduction to Bioinformatics.
Lecture 11. Topics in Omic Studies (Cancer Genomics, Transcriptomics and Epignomics) The Chinese University of Hong Kong CSCI5050 Bioinformatics and Computational.
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
By: Alejandro Navarro and Andrea Ors. Content Introduction What is Parkinson's disease? What causes the disease? History Main symptoms Treatment Statistics.
Exploring and Exploiting the Biological Maze Zoé Lacroix Arizona State University.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Supplementary Figure 1: Comparison of the results obtained from three widely used databases (namely AmiGO, ArrayExpress and GeneCards) with that from HypoxiaDB.
second most common neurodegenerative disorder progressive loss of muscle control trembling of the limbs and head while at rest stiffness, slowness, and.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
By Katelyn Chaimson and Sean Guyot
Α-synuclein transgenic mouse models of Parkinson’s disease Michelle Maurer December 2015.
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
BLAST Sequences queried against the nr or grass databases. GO ANALYSIS Contigs classified based on homology to known plant or fungal genes Next.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Pathogenesis and pathology of parkinsonism
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Faculdade de Medicina da Universidade de Coimbra Curso de Medicina 1º Ano Ano lectivo 2009/2010.
Emma Hahs, Brooke Armistead, Sarah Brown, Sok Kean Khoo Department of Cell and Molecular Biology, College of Liberal Arts and Sciences, Grand Valley State.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Using DNA Subway in the Classroom Genome Annotation: Red Line.
 Facilities Open House Functional Genomics Facility Molishree Joshi, Ph.D. 6/1/2015 Contact Information:
Parkinson’s Disease.
The Transcriptional Landscape of the Mammalian Genome
 The human genome contains approximately genes.  At any given moment, each of our cells has some combination of these genes turned on & others.
“The effects of chronic changes to the functioning of the nervous system due to interference to neurotransmitter function, illustrated by the role of Dopamine.
Alpha-synuclein in Parkinson's disease
Pick a Gene Assignment 4 Requirements
Parkinson’s Common neurodegenerative disorder ~10-20/1000
Access to Sequence Data and Related Information
Ensembl Genome Repository.
Next Generation Sequencing and Human Genome Databases
EXTENDING GENE ANNOTATION WITH GENE EXPRESSION
Presentation transcript:

PDbase : A database of Parkinson’s Disease-related genes and genetic variation using substantia nigra ESTs Jin Ok Yang Korean BioInformation Center (KOBIC)

Abbreviation PD: Parkinson’s Disease SN: Substantia Nigra

Parkinson’ Disease (PD) PD  Neurodegenerative movement disorder  Late-onset neurological disorder, after the age of 50 Symptoms : slowness of movement, rest tremor, rigidity, anxiety, depression, disturbance in balance, autonomic disturbance Degeneration of dopaminergic (DA) neurons in Substantia Nigra (SN) : loss of pigmented neurons in the pars compacta of the SN

Pathology & Diagnosis of PD  Degeneration of Dopaminergic Neurons in Substantia Nigra  Lewy body : pathologic hallmark of PD, cytoplasmic inclusion body PET scan Normal PD Lewy Body

Background The substantia nigra (SN) is important resource to understand the mechanism of the PD causation The needs for the resources to provide information of comprehensive PD-related genes and genetic variations We present a consolidated PD database, called PDbase, to capture wide spectrum of molecular events

PDbase database PDbase – A comprehensive PD-related genes and genetic variation database – Contains 2,678 genes and 870,468 SNPs from 1) SN ESTs and 2) public disease- related databases – Provides biological function of the PD-related genes including alternative splicing events, SNPs located in gene structure, mitochondrial proteins, micro-RNA elements, biological pathways, and PPI networks Related work MDPD (The Mutation Database for Parkinson’s Disease) – 202 genes extracted from 576 publications and manually examined by biomedical researchers based on population studies – It provides the PD-related genetic variation effects such as risk factor or ethnic group PDGene – 40~80 PD genetic association studies – PD-related genes and risk factors from association studies

PDbase construction: SN EST discovery and computational analysis

** Information of SN samples - normal SN tissue ; male caucasian 81 yrs of age, died congestive heart failure, negative for HIV 1/2, HBV and HCV. - PD’s SN ; male caucasian 60 yrs of age, diagnosed with PD, died from a gun shot wound, negative for HIV 1/2, HBV and HCV. Global approaches  Useful in the analysis of complex biological phenomena, including certain human diseases.  Helpful to examine general gene expression in the transcriptome. Substantia Nigra (SN) ESTs collection

Cell and Tissue Banking Parkinson’s disease : - PD’s SN Tissues - Normal SN Tissues High-diverse cDNA library Full-length cDNA library Normalized cDNA library Large-scale cDNA sequencing Automatic colony picking Automatic plasmid DNA prep Automatic reaction mixing Base call, Editing, and Clustering Phrep & Phred CAP3 Bioinformatics Group UniGene collection process Human Cell & Tissue cDNA library (Normalized, Full-length) cDNA library (Normalized, Full-length) Picking & Gridding Automatic DNA Extraction Automatic DNA Extraction Workstation for Sequencing Reaction Workstation for Sequencing Reaction PCR Purification of Reaction Mixture Purification of Reaction Mixture Run on Auto-Sequencer Data Editing & Assembly UniGene & Fl-length cDNA Database UniGene & Fl-length cDNA Database cDNA Chip ProteinProtein

Source Library Name Library TypeReads UniGene #217* GeneClusters SN normal tissue Substantia NigraB6NSN0Full-length2, B6NSN0n1Full-length, Normalized PD’ SN tissue Substantia NigraB7PSN0Full-length2, B7PSN0n1Full-length, Normalized SN cDNA Libraries Summary *Number of clusters and genes in NCBI UniGene build#217 contributed by our EST sequences

Alternative Splicing events AS eventsNumber of Genes associated with AS event Alternative starts119 Alternative ends61 Retained intron64 Cassette exon45 Double cassette exon9 Alternative 3’ exon32 Alternative 5’ exon19 We discovered SN ESTs from Full- length cDNA libraries based on oligo- capping methods

Significant differences in gene expression

PDbase system

PDbase Merged PD-related Gene information Alternative splicing events (UniGene isoforms) Gene regulation Gene Ontology Biological Pathways Protein-protein Interaction Normal SN ESTs PD’ SN ESTs Homologous genes (BLAST) Differential expression (Audic algorithm) PD-related genes dbSNP PD-related SNPs mitoDat HGNC Refseq Uniprot Mapping (BLAST) HGMD GAD OMIM UMLS Disease Gene & Protein dbSNP PD-related SNPs

Web interface 1 2

#Gene Symbol DescriptionMore Information

Query Results – SN EST statistics – Gene information – Genetic variation information – Gene regulation – Gene Ontology (GO) – Biological pathways: BioCarta and KEGG – Protein-protein interaction network

Results_1 for the selected gene

Results_2 for the selected gene

FT L *Partners  FTH1  MAP3K12  GADD45A  PTN  MYOC  SMAD9  KNG1  TAF10  MPHOSPH6  MPP6  PUNC

Conclusion PDbase – Provides comprehensive information about Parkinson’s Disease-related genes and genetic variation – highlights to contain not only public resources, but also un-reported PD target genes using normal and PD’s SN ESTs – Helpful in analysis of complex biological phenomena including human brain diseases because of including several genes, genetic variations, expression, and network – available at