Gene Discovery & Genome Browsing


Gene Discovery & Genome Browsing Background – myself NsGene – DTU satellite Parkinson Disease (Affymetrix GeneChip) Analysis of fetal brain tissue Genome browsing Introduction to the UCSC BLAT browser

Background Thomas Nordahl Petersen Chemist, Ph.D. in protein crystallography, University of Copenhagen Computational Scientist, SBI-AT Prediction of protein structure, secondary structure, fold recognition, homology modeling Bioinformatics - Gene discovery, NsGene Develop novel cell and gene based products for the treatment of neurological diseases.

ECT Products ECT for Parkinson’s Disease Growth of cells in a capsule matrix The therapeutic protein is released directly in the relevant brain area Safe delivery across the blood-brain barrier The Michael J. Fox Foundation granted US $3 million to support a clinical “proof-of-concept” (May 2004)

Factor Products Identification of novel genes by use of bioinformatics Neublastin (GDNF family – potent neuroprotective effects) Scanning the human genome or assembled protein sets for different features of interest

A case study Affymetrix GeneChip experiments Fetal brain tissue Search for Parkinson related gene(s)

Parkinson Disease Degenerative central nervous system (CNS) disorder

Parkinson Disease Loss of dopamine producing brain cells

Parkinson’s Disease Dopamine from Substantia nigra activates neurons in Striatum/Basal ganglia Important for initiation of movement

Cure for Parkinson’s Disease ? Parkinson disease may be cured provided that new dopamine producing cells replace the dead ones. Dopamine producing brain cells from aborted foetuses have been transplanted into the brains of Parkinson patients and in some cases cured the disease. Brain tissue from approx. 6 foetuses was needed. Major ethical problems ! Search for a protein drug is the only valid option

Parkinson Disease Dopamine producing cells Dopaminergic neurons can be found in the ventral part of the mesencephalon (VM) from approximately 6 weeks of age. No dopaminergic neurons can be found in the neighbouring dorsal part (DM). Dopaminergic differentiation is studied by using GeneChips to compare the expression profiles of VM and DM

Fetal brain tissue Midbrain mesencephalon Dm: - Dopamine producing cells Vm: + Dopamine producing cells Aborted foetus brain tissue – Karolinska hospital Foetuses of age 6-10 weeks, 2 cases

Midbrain mesencephalon Dm: - Dopamine producing cells Vm: + Dopamine producing cells Dopamine producing cells at the interface ? Isolate the two samples (Vm/Dm) RNA purification + amplification Affymetrix GeneChip analysis

GenePublisher (program by Steen Knudsen) Scale, normalize the Affymetrix GeneChip experiments

   A1     A2     A2     B1     B2     B2    P-value
  319    315    314     44     48     38    1.26e-07
  314    334    327    443    434    444    6.55e-05
 1980   1974   1973   1801   1785   1763    6.77e-05
  123    123    126     87     88     93    8.01e-05
  103    101    104     77     78     73    0.000112
  107    107    111     79     77     82    0.000124
  128    123    117    189    184    196    0.000142
  179    179    186    145    147    149    0.000191
   78     77     79     86     87     87    0.000202
   96     90     93    136    129    138    0.000215
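To make the table concrete, here is a minimal Python sketch that computes a log2 fold change and a Welch t statistic for its first row. It is not GenePublisher's own procedure, which the slides do not describe; the function names are illustrative, and the slides do not say which of the groups A and B corresponds to Vm or Dm.

```python
import math
from statistics import mean, variance

def log2_fold_change(a, b):
    """log2 of mean(b) / mean(a); negative means lower signal in group B."""
    return math.log2(mean(b) / mean(a))

def welch_t(a, b):
    """Welch's t statistic for two groups of replicate intensities.

    Assumption: a plain two-sample test on the scaled values; the test
    actually used to obtain the P-values in the table is not stated.
    """
    standard_error = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / standard_error

# First row of the table: three A columns and three B columns.
group_a = [319, 315, 314]
group_b = [44, 48, 38]

print("log2 fold change (B vs A):", round(log2_fold_change(group_a, group_b), 2))
print("Welch t statistic:", round(welch_t(group_a, group_b), 1))
```

A large absolute t statistic together with a large fold change is what puts a row near the top of a list sorted by P-value.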

Volcano plot: P-value versus log2 fold change
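The coordinates behind such a plot can be sketched as follows, using three rows from the GenePublisher table. The cut-offs (an absolute log2 fold change of at least 1 and a P-value below 0.001) are hypothetical and chosen only for illustration; the slides do not state which thresholds were applied.

```python
import math
from statistics import mean

# (A replicates, B replicates, P-value) copied from three rows of the table.
rows = [
    ([319, 315, 314], [44, 48, 38], 1.26e-07),
    ([314, 334, 327], [443, 434, 444], 6.55e-05),
    ([78, 77, 79], [86, 87, 87], 0.000202),
]

def volcano_point(a, b, p_value):
    """Return (x, y) = (log2 fold change of B over A, -log10 P-value)."""
    return math.log2(mean(b) / mean(a)), -math.log10(p_value)

# Hypothetical thresholds, not taken from the slides.
MIN_ABS_LOG2_FC = 1.0
MAX_P_VALUE = 0.001

points = [volcano_point(a, b, p) for a, b, p in rows]
flags = [
    abs(x) >= MIN_ABS_LOG2_FC and p < MAX_P_VALUE
    for (x, _), (_, _, p) in zip(points, rows)
]

for (x, y), flagged in zip(points, flags):
    print(f"log2FC={x:+.2f}  -log10(P)={y:.2f}  flagged={flagged}")
```

Genes far from zero on the x axis and high on the y axis are the ones that are both strongly and reliably changed.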

Assigning Affymetrix GeneChip probes to a protein sequence ~20,000 probes on each of the A/B Affymetrix chips. The probes are normally not a part of a protein sequence. Figure: each Affymetrix probe is matched by BLAST to a Unigene sequence (cDNA, 5’ to 3’), and the Unigene sequence is matched by BLAST to an IPI protein sequence, so the probe-to-protein assignment is inferred.
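The two-step assignment can be sketched as a chain of lookups. Every identifier below is a hypothetical placeholder, and the two dictionaries stand in for the BLAST results; the schema of the internal database is not given in the slides.

```python
# Hypothetical BLAST results: probe -> Unigene cluster, cluster -> IPI protein.
probe_to_unigene = {
    "probe_0001": "cluster_A",
    "probe_0002": "cluster_B",
    "probe_0003": "cluster_C",
}
unigene_to_ipi = {
    "cluster_A": "protein_1",
    "cluster_B": "protein_2",
    # cluster_C has no protein hit in this toy example.
}

def infer_protein(probe_id):
    """Follow probe -> Unigene -> IPI; return None if either step has no hit."""
    cluster = probe_to_unigene.get(probe_id)
    if cluster is None:
        return None
    return unigene_to_ipi.get(cluster)

for probe in ("probe_0001", "probe_0003", "probe_9999"):
    print(probe, "->", infer_protein(probe))
```

Going through the cDNA is what makes the link possible when the probe itself lies outside the protein-coding part of the transcript.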

Internal database

Signal Peptide prediction

Conclusion – so far The most up-regulated genes include several ‘known’ genes like dopamine transporter (good positive control) The most interesting genes are the ‘unknowns’ that were up-regulated in Vm. Further analysis is ongoing. Roland JR et al., Exp Neur (2006) Vol 198,2,427-437 “Identification of novel genes regulated in the developing human ventral mesencephalon”

BLAT The Blast Like Alignment Tool. Smith-Waterman (1981): local alignment (best alignment). FASTA (1988): fast alignment. BLAST programs (1990, 1997): query seq versus an indexed database. MegaBLAST (2000): fast alignment of DNA sequences. SSAHA (2001): maps sequence reads to genome.
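As a reference point for the faster heuristics in this list, a compact Smith-Waterman local alignment might look like the sketch below. The scoring values (match 2, mismatch -1, linear gap -2) are assumptions chosen for illustration, not values from the slides.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Return (best score, aligned part of a, aligned part of b)."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    # Fill the matrix; scores never drop below zero (local alignment).
    for i in range(1, rows):
        for j in range(1, cols):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            score[i][j] = max(0, diag, score[i - 1][j] + gap, score[i][j - 1] + gap)
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the best cell until a zero score is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
        if score[i][j] == diag:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))

print(smith_waterman("TTTACGTAAA", "GGACGTCC"))
```

The full matrix costs time proportional to the product of the two sequence lengths, which is why the later tools in the list rely on indexing and seeding instead.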

BLAT Blast Like Alignment Tool Very fast searches (MySQL database) Handles introns in RNA/DNA alignments Data for more than 30 genomes (human, mouse, rat…) Figure: exon, intron, exon, with the splice sites marked
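A toy illustration of index-based seeding shows why introns appear as jumps between seed hits when an mRNA is aligned to a genome. This is a sketch under simplifying assumptions (a tiny in-memory index of non-overlapping 4-mers and made-up sequences), not BLAT's actual implementation.

```python
from collections import defaultdict

def build_index(genome, k):
    """Index the non-overlapping k-mers of the genome by start position."""
    index = defaultdict(list)
    for pos in range(0, len(genome) - k + 1, k):
        index[genome[pos:pos + k]].append(pos)
    return index

def find_seeds(index, query, k):
    """Look up every overlapping k-mer of the query; return (query_pos, genome_pos)."""
    hits = []
    for qpos in range(len(query) - k + 1):
        for gpos in index.get(query[qpos:qpos + k], []):
            hits.append((qpos, gpos))
    return hits

# Made-up sequences: two 8-base exons separated by a 16-base intron.
exon1, intron, exon2 = "ACGTACGA", "GTAAGTTTTTTTCCAG", "TTGACCTA"
genome = exon1 + intron + exon2
mrna = exon1 + exon2  # spliced transcript without the intron

seeds = find_seeds(build_index(genome, 4), mrna, 4)
for qpos, gpos in seeds:
    print(f"query {qpos:2d} -> genome {gpos:2d}  (offset {gpos - qpos})")
```

The offset between genome and query position stays constant within an exon and increases by the intron length at the splice junction, which is the signal a spliced aligner can follow.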

BLAT genome Browser http://genome.ucsc.edu/

BLAT genome Browser Using a search term or position, e.g. Chr1:10,234-11,567
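A position string of this form can be parsed into a chromosome name and two integer coordinates. The sketch below is a hypothetical helper, not part of any UCSC tool; normalising the prefix to lower-case "chr" and rejecting a start larger than the end are assumptions made here.

```python
import re

# Chromosome name, then start and end, each optionally with thousands separators.
POSITION = re.compile(r"^(chr\w+):([\d,]+)-([\d,]+)$", re.IGNORECASE)

def parse_position(text):
    """Parse e.g. 'Chr1:10,234-11,567' into ('chr1', 10234, 11567)."""
    match = POSITION.match(text.strip())
    if match is None:
        raise ValueError(f"not a genomic position: {text!r}")
    chrom = "chr" + match.group(1)[3:]  # assumption: lower-case the prefix only
    start = int(match.group(2).replace(",", ""))
    end = int(match.group(3).replace(",", ""))
    if start > end:
        raise ValueError(f"start is larger than end: {text!r}")
    return chrom, start, end

print(parse_position("Chr1:10,234-11,567"))
```

The length of the requested region then follows directly from the two coordinates.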

BLAT genome Browser http://genome.ucsc.edu/

BLAT genome Browser Using a protein or DNA sequence

BLAT genome Browser

BLAT genome Browser ”Details”

BLAT genome Browser

BLAT genome Browser ”Browser”

BLAT genome Browser Zoom in ’base’

BLAT genome Browser ”Browser” Trembl SwissProt Hypothetical

BLAT genome Browser ”Description & Page Index”

BLAT genome Browser ”Description & Page Index”

BLAT genome Browser ”Tracks”

BLAT genome Browser ”SNPs”

BLAT genome Browser ”SNPs”

Gene Sorter

Gene Sorter