Major Databases/Portals

Slides:



Advertisements
Similar presentations
INTERNATIONAL HEALTHCARE STANDARDS LANDSCAPE
Advertisements

Mining Brain-Related Transcription Factor- Disease Relationships for Novel Linkages Unified Medical Language System (UMLS) Specialised Databases Atlas.
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
Protein databases Morten Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.
CS 425/625 Software Engineering System Models
Biological Databases Notes adapted from lecture notes of Dr. Larry Hunter at the University of Colorado.
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
IST Computational Biology1 Information Retrieval Biological Databases 2 Pedro Fernandes Instituto Gulbenkian de Ciência, Oeiras PT.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
Mouse Genome Informatics November 2008 Paul Szauter MGI User Support.
Claire O’Donovan EMBL-EBI. In UniProtKB, we aim to provide… o A high quality protein sequence database A non redundant protein database, with maximal.
An Introduction to Bioinformatics Molecular Biology Databases.
Course Module: Introduction to Bioinformatics – CS 2001 July CS Databases.
1 Betsy L. Humphreys, MLS Betsy L. Humphreys, MLS National Library of Medicine National Library of Medicine National Institutes of Health National Institutes.
Bioinformatics for biomedicine
Linking Diseases and Genes through Informatics Knowledge Bases and Ontologies Joyce A. Mitchell, Ph.D. National Library of Medicine University of Missouri.
PhenCode Linking Human Mutations to Phenotype. PhenCode Brings the deep information on genotypes and phenotypes in locus specific databases (LSDBs) into.
Resurrecting SOWG BS, Baltimore, CTS Ontology Workshop April
GENOME-CENTRIC DATABASES Daniel Svozil. NCBI Gene Search for DUT gene in human.
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
DONNA MAGLOTT, PH.D. PRO AND MEDICAL GENETICS RESOURCES AT NCBI.
DAY 1c: Accessing Completed Genomes 1. UCSC Genome Bioinformatics 2. Ensembl 3. NCBI Genomic Biology.
Online Mendelian Inheritance in Man (OMIM): What it is & What it can do for you Knowledge Management & Eskind Biomedical Library January 27, 2012 helen.
EMBL-EBI EMBL-EBI EMBL-EBI What is the EBI's particular niche? Provides Core Biomolecular Resources in Europe –Nucleotide; genome, protein sequences,
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
Protein Information Resource Protein Information Resource, 3300 Whitehaven St., Georgetown University, Washington, DC Contact
Asp/IEETA Health-Grid Workshop Brussels 20 th September 2002 A. Sousa Pereira Univ. Aveiro - IEETA.
SCRIPPS GENOME ADVISER Galina Erikson Senior Bioinformatics Programmer The Scripps Translational Science Institute Scripps Translational Science Institute.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Adding ORDR Rare Diseases Terms to MeSH Tree Structure Stephen C. Groft, Pharm. D. Director, office of Rare Diseases Research National Institutes of Health.
Search Functions Simple Search Advanced Search.
Personalized genomics
PhenCode Connecting Genotype and Phenotype. HbVar: Hemoglobin variants and thalassemia mutations Began as Prof. Titus Huisman’s Syllabus of Hemoglobin.
Welcome to the Protein Database Tutorial. This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
Published Standards for Molecular Laboratories Alexis B. Carter, MD, FCAP Director of Pathology Informatics Diplomate, American Board of Pathology, Molecular.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
YOUR FUTURE STARTS WITH HOPE YOUR FUTURE STARTS WITH HOPE Genome Biology & Applied Bioinformatics Human Genome Mehmet Tevfik DORAK, MD PhD.
Ontologies in biology, biomedicine and disease genetics.
Genomic Definition by Reference Mapping
Introduction to Genes and Genomes with Ensembl
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Introduction to Bioinformatics
Genomic Analysis: GWAS
Use cases for molecular pathology/genetics
Week-6: Genomics Browsers
CSE 182 Project.
Mehmet Tevfik DORAK, MD PhD 2nd Practical Bioinformatics Course
Genetic Testing for the Clinician
Interpretation Next Generation Sequencing (Bench Clinic)
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
Archives and Information Retrieval
Bioinformatic Tools for Epigenetic Research
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
Functional Annotation of the Horse Genome
Content and Labeling of Tests Marketed as Clinical “Whole-Exome Sequencing” Perspectives from a cancer genetics clinician and clinical lab director Allen.
Genomes and Their Evolution
Deep Phenotyping for Deep Learning (DPDL): Progress Report
Introduction to Bioinformatics
Linking Genetic Variation to Important Phenotypes
Ensembl Genome Repository.
Biological Databases BI420 – Introduction to Bioinformatics
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
BIOL 3020 GENOMIC BIOLOGY LABORATORY 05A
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
By Stitziel, Tseng, Pervouchine, Goddeau, Kasif, Liang
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
Virtual Association Studies
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

Major Databases/Portals Genome Biology & Applied Bioinformatics Major Databases/Portals Mehmet Tevfik DORAK, MD PhD

Schedule

Outline Genome browsers eQTL / ncRNA / CNV browsers ENCODE, RegulomeDB, HaploReg IHEC Protein databases Immunoinformatics databases Cancer genome databases BioMarts GWAS databases Disease Databases Comparative Toxicogenomics Database (CTDbase)

Major Bioinformatics Portals NCBI https://www.ncbi.nlm.nih.gov EBI (EMBL) http://www.ebi.ac.uk SIB http://www.isb-sib.ch/services-infrastructure/databases-software

Major Bioinformatics Portals: NCBI http://www.ncbi.nlm.nih.gov/guide/all/#databases_

Major Bioinformatics Portals: EMBL http://www.ebi.ac.uk/services http://www.ebi.ac.uk/services/all

Major Bioinformatics Portals: SIB

Major Genome Browsers https://genome.ucsc.edu/cgi-bin/hgGateway http://grch37.ensembl.org/Homo_sapiens/Info/Index http://swissregulon.unibas.ch/fcgi/sr/swissregulon

Genome Browsers

eQTL Browsers GTEx Blood eQTL Browser Chicago eQTL Browser SCAN

ncRNA Browsers

CNV Browsers

ENCODE, HaploReg, RegulomeDB https://www.encodeproject.org http://www.broadinstitute.org/mammals/haploreg/haploreg_v4.php http://regulomedb.org

IHEC Portal Link: http://epigenomesportal.ca/ihec

Protein Databases

ImmunoInformatics Databases

BioMarts

GWAS Databases

Disease Databases Disease Connect CoreMine

Disease Databases: Disease Connect Link: http://disease-connect.org

Disease Databases: Disease Connect Link: http://disease-connect.org/

Disease Databases: CoreMine Link: http://www.coremine.com

Disease Databases: CoreMine Link: http://www.coremine.com/medical/#search?ids=519616&tt=8191&org=hs&i=519616

Gene x Environment Interactions Database Link: http://ctdbase.org

Other Databases Link: http://www.ncbi.nlm.nih.gov/pubmed/26553330 Ac, open to academics, but commercial entities require license; CA, controlled access; CCDS, Consensus CDS; CGH, comparative genome hybridization; Com, commercial; Cs, restricted to consortium members; dbSNP, Database of Single Nucleotide Polymorphism; DO, Disease Ontology87; EFO, Experimental Factor Ontology; EoM, Elements of Morphology115; GWAS, genome-wide association study; HGNC, HUGO Gene Nomenclature Committee; HGVS, Human Genome Variation Society; HPO, Human Phenotype Ontology; ICD, International Classification of Diseases; LRG, Locus Reference Genomic; LSDB, Locus-specific database; MDB, variation (mutation) database; MedDRA, Medical Dictionary for Regulatory Activities; MeSH, Medical Subject Headings; MM, MatchMaking (finding similar patients); MPO, Mammalian Phenotype Ontology; NCI, National Cancer Institute; NEMDB, national and ethnic variation (mutation) databases; NIH, National Institutes of Health; OMIM, Online Mendelian Inheritance in Man; P, public; RG, access restricted to a specific research group; SNoMED, Systematized Nomenclature of Medical Terms; SO, Sequence Ontology; UMLS, Unified Medical Language System; VariO, Variation Ontology; VCF, Variant Call Format; WES, whole-exome sequencing; WGS, whole-genome sequencing; XML, Extensible Markup Language. *Further details are available at the websites of the databases. Owing to space constraints, many important databases had to be omitted. Databases are divided into major categories, but it is recognized that many of the databases can be considered to belong to multiple categories. Scope refers to the major focus of the database. If applicable, information is given about the number of items currently contained in the database. Standards indicates the major terminologies or ontologies that the database uses to annotate and organize data. Data entry indicates whether the data in the database are primarily entered by curators or by users of the database, and if natural language processing (NLP) is employed to gather data. Note that for data access policies, databases with more than one access level tend to provide more information as data access becomes stricter.

Other Databases Link: http://www.ncbi.nlm.nih.gov/pubmed/26553330 Ac, open to academics, but commercial entities require license; CA, controlled access; CCDS, Consensus CDS; CGH, comparative genome hybridization; Com, commercial; Cs, restricted to consortium members; dbSNP, Database of Single Nucleotide Polymorphism; DO, Disease Ontology87; EFO, Experimental Factor Ontology; EoM, Elements of Morphology115; GWAS, genome-wide association study; HGNC, HUGO Gene Nomenclature Committee; HGVS, Human Genome Variation Society; HPO, Human Phenotype Ontology; ICD, International Classification of Diseases; LRG, Locus Reference Genomic; LSDB, Locus-specific database; MDB, variation (mutation) database; MedDRA, Medical Dictionary for Regulatory Activities; MeSH, Medical Subject Headings; MM, MatchMaking (finding similar patients); MPO, Mammalian Phenotype Ontology; NCI, National Cancer Institute; NEMDB, national and ethnic variation (mutation) databases; NIH, National Institutes of Health; OMIM, Online Mendelian Inheritance in Man; P, public; RG, access restricted to a specific research group; SNoMED, Systematized Nomenclature of Medical Terms; SO, Sequence Ontology; UMLS, Unified Medical Language System; VariO, Variation Ontology; VCF, Variant Call Format; WES, whole-exome sequencing; WGS, whole-genome sequencing; XML, Extensible Markup Language. *Further details are available at the websites of the databases. Owing to space constraints, many important databases had to be omitted. Databases are divided into major categories, but it is recognized that many of the databases can be considered to belong to multiple categories. Scope refers to the major focus of the database. If applicable, information is given about the number of items currently contained in the database. Standards indicates the major terminologies or ontologies that the database uses to annotate and organize data. Data entry indicates whether the data in the database are primarily entered by curators or by users of the database, and if natural language processing (NLP) is employed to gather data. Note that for data access policies, databases with more than one access level tend to provide more information as data access becomes stricter.

Other Databases Link: http://www.ncbi.nlm.nih.gov/pubmed/26553330 Ac, open to academics, but commercial entities require license; CA, controlled access; CCDS, Consensus CDS; CGH, comparative genome hybridization; Com, commercial; Cs, restricted to consortium members; dbSNP, Database of Single Nucleotide Polymorphism; DO, Disease Ontology87; EFO, Experimental Factor Ontology; EoM, Elements of Morphology115; GWAS, genome-wide association study; HGNC, HUGO Gene Nomenclature Committee; HGVS, Human Genome Variation Society; HPO, Human Phenotype Ontology; ICD, International Classification of Diseases; LRG, Locus Reference Genomic; LSDB, Locus-specific database; MDB, variation (mutation) database; MedDRA, Medical Dictionary for Regulatory Activities; MeSH, Medical Subject Headings; MM, MatchMaking (finding similar patients); MPO, Mammalian Phenotype Ontology; NCI, National Cancer Institute; NEMDB, national and ethnic variation (mutation) databases; NIH, National Institutes of Health; OMIM, Online Mendelian Inheritance in Man; P, public; RG, access restricted to a specific research group; SNoMED, Systematized Nomenclature of Medical Terms; SO, Sequence Ontology; UMLS, Unified Medical Language System; VariO, Variation Ontology; VCF, Variant Call Format; WES, whole-exome sequencing; WGS, whole-genome sequencing; XML, Extensible Markup Language. *Further details are available at the websites of the databases. Owing to space constraints, many important databases had to be omitted. Databases are divided into major categories, but it is recognized that many of the databases can be considered to belong to multiple categories. Scope refers to the major focus of the database. If applicable, information is given about the number of items currently contained in the database. Standards indicates the major terminologies or ontologies that the database uses to annotate and organize data. Data entry indicates whether the data in the database are primarily entered by curators or by users of the database, and if natural language processing (NLP) is employed to gather data. Note that for data access policies, databases with more than one access level tend to provide more information as data access becomes stricter.

… Looking forward ….. http://www.dorak.info

Course material: http://www.dorak.info/genbiol/course.html