Online Resources for Psychiatric Genetics 19 th March, 2009 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library.

Slides:



Advertisements
Similar presentations
The Human Genome Project Main reference: Nature (2001) 409,
Advertisements

NCBI/WHO PubMed/Hinari Course NCBI Literature Databases: PubMed Background.
Introduction to PubMed® (pubmed.gov)
NCBI data, sliding window programs and dot plots Sept. 25, 2012 Learning objectives-Become familiar with OMIM and PubMed. Understand the difference between.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Genes, Proteins and Literature Searching Ansuman Chattopadhyay, PhD Molecular Biology Information Services Health Sciences Library System University of.
Searching Pubmed Database استخدام قاعدة المعلومات Pubmed د. سيناء عبد المحسن العقيل قسم الصيدلة الإكلينيكية برنامج مهارات البحث العلمي.
1.
The National Center for Biotechnology Information (NCBI) a primary resource for molecular biology information Database Resources.
Fatchiyah, PhD Dept Biology UB Fatchiyah.lecture.ub.ac.id
Pharmacy Information Resources TTUHSC Preston Smith Library presents Rev. 08/2014.
Single Nucleotide Polymorphisms Jennifer Lyon Eskind Biomedical Library May 1, 2009 CRC Workshop Series.
Outline to SNP bioinformatics lecture
NATIONAL LIBRARY OF MEDICINE The PubMed ID and Entrez, PubMed and PubMed Central Edwin Sequeira National Center for Biotechnology Information June 21,
Evidence-Based Information Retrieval in Bioinformatics
The NLM Controlled Vocabulary Medical Subject Headings (MeSH) PubMed for Trainers, Spring 2015 U.S. National Library of Medicine (NLM) and NLM Training.
Literature Informatics Beyond PubMed: Next Generation Literature Searching Carrie Iwema, PhD, MLS 24 th August 2011.
Medical Genetics 2 nd February 2010 Carrie Iwema, PhD, MLS Information Specialist in Molecular Biology Health Sciences Library System University of Pittsburgh.
Archives and Information Retrieval
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Informatics: A Primer Mary Lou Klem, PhD, MLIS Health Sciences Library System University of Pittsburgh.
SNP Resources: Finding SNPs, Databases and Data Extraction Debbie Nickerson NIEHS SNPs Workshop.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Medical Subject Headings (MeSH)
Biology in Silico : Online Tools for “Omics” WCRC RETREAT SEPTEMBER 6TH, 2014 ANSUMAN CHATTOPADHYAY, PHD HEAD, MOLECULAR BIOLOGY INFORMATION SERVICE HEALTH.
Genetic Variations Resources May 15, 2013 Ansuman Chattopadhyay, PhD, Head Molecular Biology Information Service Health Sciences Library System University.
Introductory Overview
DbSNP: the NCBI database of genetic variation S. T. Sherry, M.H. Ward, M. Kholodov, J. Baker, L. Phan, E. M. Smigielski and K. Sirotkin, Nucleic Acids.
University of Nebraska-Lincoln PhD. in Biochemistry Protein synthesis initiation in eukaryotic system Vanderbilt University School.
InfoBoosters: Connecting Texts with Databases BOOST BOX, DEC 9, 2014 NATIONAL NETWORK OF LIBRARIES OF MEDICINE, MIDDLE ATLANTIC REGION ANSUMAN CHATTOPADHYAY,
Datamining MEDLINE for Topics and Trends in Dental and Craniofacial Research William C. Bartling, D.D.S. NIDCR/NLM Fellow in Dental Informatics Center.
How to do a literature search Saharuddin Ahmad Aida Jaffar Department of Family Medicine.
Presented by: Andrew McMurry Boston University Bioinformatics Children’s Hospital Informatics Program Harvard Medical School Center for BioMedical Informatics.
X-ray crystallography NMR cryoEM Experimental approaches for structural biology.
Bioinformatics and medicine: Are we meeting the challenge?
PubMed and other Online Tools Michele R. Tennant, Ph.D., M.L.I.S. Health Science Center Libraries/ U.F. Genetics Institute GMS 6014 January.
NCBI’s Bioinformatics Resources Michele R. Tennant, Ph.D., M.L.I.S. Health Science Center Libraries U.F. Genetics Institute January 2015.
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
Searching PubMed® NCBI, NLM Resources, Micromedex -GSBS TTUHSC Preston Smith Library presents Rev. 08/17/14.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
MEDLINE for Medical Research Juliet Ralph and César Pimenta Hilary Term 2007.
We have displayed the Browse publisher drop down menu. This You have full access to: list for an institution where all the material is included in the.
DONNA MAGLOTT, PH.D. PRO AND MEDICAL GENETICS RESOURCES AT NCBI.
Online Mendelian Inheritance in Man (OMIM): What it is & What it can do for you Knowledge Management & Eskind Biomedical Library January 27, 2012 helen.
Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist in Molecular Biology.
Going Against Goliath 23 rd May 2010 Katrina Kurtz, MLIS Carrie Iwema, PhD, MLS Ansuman Chattopadhyay, PhD Health Sciences Library System University of.
NCBI Literature Databases: PubMed
RESEARCH – DOING AND ANALYSING Gavin Coney Thomson Reuters May 2009.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
From the Advanced Search page of the Cochrane Library, we have clicked on the Cochrane Reviews: By Topic hyperlink. This has displayed the Topics for Cochrane.
Bioinformatics and Computational Biology
Copyright OpenHelix. No use or reproduction without express written consent1.
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
Information retrieval and sliding window programs April 5, 2011 Hand in Homework #1. Homework #2 due Tuesday, April 12. Learning objectives- Understand.
Notes: Human Genome (Right side page)
Medical Subject Headings (MeSH)
MEDLINE®/PubMed® PubMed for Trainers, Fall 2015 U.S. National Library of Medicine (NLM) and NLM Training Center An introduction.
NCBI PubMed NCBI Literature Databases: PubMed Session #1, April 28, 2005 Session #2, April 29, 2005 Ho Chi Minh City, VietNam.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
GUIDE. P UB M ED
MeSH: Medical Subject Headings Anne Allen, Heather Braum, Paula Davidson, Ellen Rose LI 804: Organization of Information.
Entrez, dbSNP, GEO, OMIM & LinkOut JanPlan Entrez Distributed by NCBI in 1991 on CD-ROM Included linked nodes: GenBank & PDB Translated GenBank,
Biology in silico: Creation of an online bioinformatics.
The National Library of Medicine and its databases
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Lívia Vasas, PhD 2018 The National Library of Medicine and its databases Mozilla Firefox/Google Chrome Lívia Vasas, PhD.
Genomes and Their Evolution
Beyond PubMed--Next Generation Literature Searching
PubMed.
Presentation transcript:

Online Resources for Psychiatric Genetics 19 th March, 2009 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University of Pittsburgh

How to Search PubMedIdentify Disease Causing Genes Find Genes and Proteins Centered Information Predict Disease Causing SNPs

University of Nebraska-Lincoln PhD. in Biochemistry Protein synthesis initiation in eukaryotic system Vanderbilt University School of Medicine, Nashville Research Fellow Epidermal Growth Factor (EGF) mediated signal transduction Cellomics Inc., Pittsburgh Knowledge Engineer Information Specialist in Molecular Biology and Genetics HSLS, University of Pittsburgh Ansuman Chattopadhyay, Ph.D Present Head, Molecular Biology Information Service HSLS, University of Pittsburgh

Molecular Biology Information Service

HSLS Molecular Biology Information Service WorkshopsWebsite Software Licensing Bioinformatics Consultations

In silico Support Literature Search Sequence Analysis Omics Data Analysis

HSLS Molecular Biology Information Service

Literature Informatics US National Library of Medicine (NLM) Medline Database Medical Subject Heading PubMed Based Literature Informatics Tools  ClusterMed  GoPubMed  NovoSeek  EtBlast PubMed MESH Database ClusterMedGoPubMed

PubMed ( MEDLINE is the largest component of PubMed the freely accessible online database of biomedical journal citations and abstracts created by the U.S. National Library of Medicine (NLM®). 5,200 journals published in the United States and more than 80 other countries Contains citations indexed 1949 to the present.

PubMed Search Stats

PubMed Display: Abstract Plus

PubMed Display: Medline

Medical Subject Headings (MeSH) The U.S. National Library of Medicine's controlled vocabulary (thesaurus) Arranged in a hierarchical manner called the MeSH Tree Structures Updated annually

MeSH Vocabulary Headings  over 24,000 representing concepts found in the biomedical literature (Body Weight, Kidney, Radioactive Waste) Subheadings  attached to headings to describe a specific aspect of a concept (adverse effects, metabolism, diagnosis, therapy) Supplementary Concept Records  over 172,000 terms in a separate chemical thesaurus -updated weekly (cordycepin, valspodar, tacrolimus binding protein 4) Publication Types (Letter, Review, Randomized Controlled Trial)

MeSH Tree Structure A. Anatomy B. Organisms C. Diseases D. Chemical and Drugs E. Analytical, Diagnostic and Therapeutic Techniques and Equipment F. Psychiatry and Psychology G. Biological Sciences H. Physical Sciences I. Anthropology, Education, Sociology and Social Phenomena J. Technology and Food and Beverages K. Humanities L. Information Science M. Persons N. Health Care V. Publication Characteristics Z. Geographic Locations

MeSH Indexing Source: NLM

MeSH Indexing Genes/Chemicals MeSH Terms

Information Overload 197K Breast Cancer 82K Schizophrenia 6.4K BRCA1 48 K p53 5,200 Journals

PubMed Search Retrieve published articles on  Dengue outbreaks  Dengue outbreaks in India  Dengue outbreaks outside India including  Statistical and numerical data on dengue outbreaks

PubMed Search: Dengue Outbreaks 1

PubMed Search: Dengue Outbreaks MESH DATABASE Dengue 2

PubMed Search: Building Query

PubMed Search: Building Query

PubMed Search: Building Query

PubMed Search: Building Query 6

PubMed Search: Building Query 7

PubMed Search: Building Query 8 9

PubMed Search: Building Query 10

PubMed Search: Building Query

PubMed Search: Building Query TermBooleanTermBooleanTerm# of Papers DengueANDoutbreaks687 DengueANDoutbreaksANDIndia109 DengueANDoutbreaksNOTIndia578 DengueANDOutbreaks/ Statistics and numerical data 82

Search Result Clustering

Search Result Clustering

Vivisimo ClusterMed

Query Details DengueANDOutbreaksANDIndia109

ClusterMed Topics

Clustering in PubMed: ClusterMed organizes the long list of results returned by PubMed into hierarchical folders with meaningful categories, allowing researchers to hone in on the most relevant results quickly. does this on-the-fly without requiring any pre- processing, using terms taken from the brief descriptions in the search results. with the folders, users can discover themes, view related articles, and drill down from the topic hierarchy

Topical Clusters Authors Institutions Pub Dates

GoPubMed Developed by Transinsight

GoPubMed DengueANDOutbreaksNOTIndia578

GoPubMed Authors Journals Pub Dates DengueANDOutbreaksNOTIndia578

PubMed Advanced Search

Literature Search Workflow PubMed MESH Database ClusterMedGoPubMed

PubMed Query with Boolean Retrieve articles focused on molecular genetics of Schizophrenia published in past five years: ("schizophrenia"[Majr] AND (("genetics, medical"[MeSH Terms] OR ("genetics"[All Fields] AND "medical"[All Fields]) OR "medical genetics"[All Fields] OR ("medical"[All Fields] AND "genetics"[All Fields])) OR ("genotype"[MeSH Terms] OR "genotype"[All Fields]) OR "genetics"[Subheading] AND ("genetics"[Subheading] OR "genetics"[All Fields] OR "genetics"[MeSH Terms])) AND ("2004/03/13"[PDat] : "2009/03/11"[PDat])

Research on Optimal Search Strategies

PubMed Pre-Build Queries

PubMed Query with Boolean Retrieve articles focused on molecular genetics of Schizophrenia published in past five years: ("schizophrenia"[Majr] AND (("genetics, medical"[MeSH Terms] OR ("genetics"[All Fields] AND "medical"[All Fields]) OR "medical genetics"[All Fields] OR ("medical"[All Fields] AND "genetics"[All Fields])) OR ("genotype"[MeSH Terms] OR "genotype"[All Fields]) OR "genetics"[Subheading] AND ("genetics"[Subheading] OR "genetics"[All Fields] OR "genetics"[MeSH Terms])) AND ("2004/03/13"[PDat] : "2009/03/11"[PDat])

GoPubMed Analysis

GoPubMed Search Result Statistics

Go PubMed Authors Network

GoPubMed World Map of Publications

My NCBI: an Automated Notification Tool Save your searches at MyNCBI and set up an notification on new publication based on your search query

My NCBI

My NCBI 4

NovoSeek : a Search Engine for Biomedical Literature in Medline and US Grants

NovoSeek Search: Alzheimer Clinical Trials

Literature Search Workflow LITERATURELITERATURE SEARCHSEARCH Authors Topics Global Map Authors Networks Journals Genes

eTBLAST ABSTRACT

eTBLAST

eTBLAST Analysis

eTBLAST Analysis on Scientific Papers

Growth of Molecular Databases Source: Nodal Point Blog 2008: og : 1170

Molecular Databases Nucleic Acids Research: Oxford Journals  Annual Databases Issue  Annual Web Server Issue Journals  Bioinformatics  BMC Bioinformatics

HSLS OBRC

Molecular Databases Catalog HSLS OBRC 2379 links to databases and software ~5000 hits/day

Online Bioinformatics Resources Collection

HSLS OBRC

HSLS OBRC

Bioinformatics Resources on Psychiatric Disorders

Disease Gene Databases

Identify Disease Causing Genes US CDC’s HuGE Navigator  Gene Prospector NIH GAD  A Catalog of Published Genome-Wide Association Studies  m m

Identify Disease Causing Genes The Human Genome Epidemiology Network (HuGENet™) : The Human Genome Epidemiology Network (HuGENet™) a voluntary, international collaboration focused on assessing the role of human genome variation in health and disease at the population level. maintained a database of published, population-based epidemiologic studies of human genes extracted and curated from PubMed. HuGE Navigator provides access to a continuously updated knowledge base in human genome epidemiology  information on population prevalence of genetic variants  gene-disease associations  gene-gene and gene- environment interactions  evaluation of genetic tests

Informatics Tools for Human Epidemiology

Finding Disease Causing Genes

Gene to Diseases

Gene/Protein Centered Information NCBI Gene Database Your Favorite Gene – SigmaAldrich Millipore Pathways

US NCBI Databases

SNP Genomic Sequence Expression Profile Interacting Partners 3D Structure mRNA Sequence Chromosomal Localization Disease Amino acid Sequence Homologous Sequences US NCBI : Entrez Gene

Your Favorite Gene Search

Your Favorite Gene Display

Millipore Pathway Search

Single Nucleotide Polymorphisms Scientists expect that comparison of genomic sequences taken from two unrelated individuals will reveal that they are 99.9% identical. The 0.1% difference is due to genetic variations, and mainly one form of variation called single nucleotide polymorphisms These polymorphisms are considered one of the key factors that makes each and every one of us different and can have a major impact on how we respond to diseases; environmental insults such as bacteria, viruses and chemicals; and drugs and other therapies. This makes genetic variations of great value for biomedical research and for developing pharmaceutical products or medical diagnostics

Human Genetic Variations Primarily two types of genetic mutation events create all forms of variations: Single base mutation which substitutes one nucleotide for another -Single Nucleotide Polymorphisms (SNP) Insertion or deletion of one or more nucleotide(s) -Tandem Repeat Polymorphisms -Insertion/Deletion Polymorphisms

Single Nucleotide Polymorphism Single nucleotide polymorphisms (SNP) are DNA sequence variations that occur when a single nucleotide (A,T,C,or G) in the genome sequence is altered. SNPS are the most common class of polymorphisms. example:

Estimated Numbers of Human Genetic Variations SNPs appear at kb average intervals, considering the size of entire human genome, which is 3X10 7 bp, the total number scales up to 5-10 million. In silico estimation of potentially polymorphic VNTR are over 100,000 across the human genome. The short insertion/deletions are very difficult to quantify and the number is likely to fall in between SNPs and VNTR

Polymorphisms and Disease Markers Very few of these polymorphisms show direct impact on deleterious phenotype. The non-disease-causing polymorphisms when mapped to the genome, may serve as markers to identify and map other genes that do cause disease when mutated. If these non-disease-causing variations are found to be inherited with a particular trait, but do not cause the trait, they may provide evidence of where the trait's gene is located in the genome.

SNPs and Mutations Terminology for variation at a single nucleotide position is defined by allele frequency.  A single base change, occurring in a population at a frequency of >1% is termed a single nucleotide polymorphism (SNP)  When a single base change occurs at <1% it is considered to be a mutation

Life Cycle of SNPs and Mutations

Classification of SNPs

Coding SNPs

Genetic Variations Databases dbSNP  Online Mendelian Inheritance in Man (OMIM)  International HapMap Project  Genome Variation Server (Seattle SNPs) 

dbSNP URL: The Single Nucleotide Polymorphism database (dbSNP) is a public- domain archive for a broad collection of simple genetic polymorphisms This collection of polymorphisms includes:  Single-base nucleotide substitutions (also known as single nucleotide polymorphisms or SNPs)  Small-scale multi-base deletions or insertions (also called deletion insertion polymorphisms or DIPs)  Microsatellite repeat variations (also called short tandem repeats or STRs)

dbSNP Data Type The SNP database has two major classes of content:  Submitted data: original observations of sequence variation: Submitted SNPs (SS) with ss# (ss )  Computed data: content generated during the dbSNP "build" cycle by computation on original submitted data: Reference SNP Clusters (Ref SNP) with rs# (rs )

Functional Analysis of SNPs

SNPs and the Structure of a Gene

Decision Tree for SNP Analysis

Exonic Splicing Enhancer/Silencer

Functional Analysis of SNPs A gene variant primarily found in African Americans, that slightly increases the risk for developing an irregular heartbeat, known as arrhythmia. The variant occurs in the cardiac sodium channel gene SCN5A which results a change of amino acid at the position of 1102 from serine to tyrosine (S To Y). Can you predict the effect of this non-synonymous SNP (rs ). Answer

SNP Gene View for SCN5A

Multiple Sequence Alignment

Amino Acids Comparison NCBI Amino Acid Explorer

Compare Amino Acids Properties Amino Acid Properties Table:

Amino Acids Substitution Preference

Tools for Amino Acid Substitution Effect Prediction SIFT  PolyPhen  SNPs3D  pMUT 

Comparison of AAS prediction tools Pauline C. Ng and Steven Henikoff, Annu. Rev. Genomics Hum. Genet :61–80

Tools on Functional SNP Analysis Search.HSLS MolBio link meta?v%3aproject=BioInfoTools&v%3afile=viv_B7AUre&v%3aframe=list&v%3astate=root%7cN891&id=N891&act ion=list& meta?v%3aproject=BioInfoTools&v%3afile=viv_B7AUre&v%3aframe=list&v%3astate=root%7cN891&id=N891&act ion=list& FASTSNP -- an always up-to-date and extendable service for SNP function analysis and prioritization F-SNP: computationally predicted functional SNPs for disease association studies.

FASTSNP Analysis

FASTSNP Analysis

F-SNP : A Collection of Functional SNPs Specifically Prioritized for Disease Association studies

F-SNP : A Collection of Functional SNPs Specifically Prioritized for Disease Association Studies

Summary Literature Search Disease Causing Genes Genes and Proteins Information Genes and Proteins Information SNPs Functional Analysis

Thank you!