Introduction to PubChem BioAssay

Presentation transcript:

Introduction to PubChem BioAssay
Yanli Wang
MCBIOS, Little Rock, Arkansas
March 24, 2017

Outline
• Overview of PubChem
• Access, search, data retrieval, use
• Hands-on practice
Notes: at PubChem we have several goals; public deposition system; various sources

Access, search, data retrieval, use…

PubChem Access & Search …
• Entrez
• PubChem Home
• PubChem FTP
• Structure Search
• Public search engines (Google etc.)
• Bioactivity analysis tools, data view
• PUG-REST
• E-utils

PubChem Data Access
• Interfaces
  – text/numeric search
  – fielded/range search
  – precomputed relationships
    • 2-D, 3-D, identity groups
    • related bioassays
    • hierarchical classification
  – inter-database links
    • biomedical literature, MeSH
    • protein, gene, 3D structure
    • pathways, taxonomy, OMIM
  – external resource links
• Tools
  – bioactivity analysis
    • SAR analysis
    • across assay/target comparison
    • mine related datasets
  – chemical structure analysis
    • structure normalization
    • 2D, 3D similarity search
    • structure clustering
  – data download
  – FTP site
  – programmatic utilities

PubChem Access & Search … Entrez BioAssay Classification Browser PubChem FTP

Search PubChem with Entrez …
• Three PubChem databases
• Free text search
• Search by indexed field
• Advanced search for complex queries
• Refine/combine searches
• Entrez links (Related Data)
• Access from Gene, PubMed, etc.
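The Entrez searches listed on this slide can also be issued programmatically through NCBI E-utilities. The following is a minimal sketch, assuming the standard `esearch.fcgi` endpoint and the Entrez database names `pcsubstance`, `pccompound`, and `pcassay` for the three PubChem databases. The helper name `esearch_url` is hypothetical, and the example only builds the request URL; it does not send it.

```python
from urllib.parse import urlencode

# Base URL of NCBI E-utilities (assumed standard endpoint).
EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"

# Entrez names assumed for the three PubChem databases.
PUBCHEM_DBS = ("pcsubstance", "pccompound", "pcassay")


def esearch_url(db, term, retmax=20):
    """Build an ESearch URL for a free-text or fielded Entrez query.

    `esearch_url` is a hypothetical helper for illustration only.
    """
    if db not in PUBCHEM_DBS:
        raise ValueError(f"not a PubChem Entrez database: {db}")
    params = {"db": db, "term": term, "retmax": retmax, "retmode": "json"}
    return f"{EUTILS}/esearch.fcgi?{urlencode(params)}"


# Free-text search of the BioAssay database.
print(esearch_url("pcassay", "nicotinic acetylcholine receptors"))
```

The returned URL can be opened in a browser or fetched with any HTTP client; the JSON response lists the matching record identifiers.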

Goal #1 Collect bioactivity data for a compound …

Bioactivity data for androgen … https://pubchem.ncbi.nlm.nih.gov/assay/bioactivity.html?cid=5995
How to get here?
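Besides the web page, the bioactivity summary for a compound can be requested through PUG-REST. This is a sketch under the assumption that the `assaysummary` operation of the PUG-REST compound domain is the appropriate call; the function names are hypothetical, and `fetch_json` performs a network request, so it is defined but not called here.

```python
import json
from urllib.request import urlopen

# Base URL of the PUG-REST service.
PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def compound_assay_summary_url(cid, fmt="JSON"):
    """Build a PUG-REST URL for the assay summary of one compound (by CID)."""
    return f"{PUG_REST}/compound/cid/{int(cid)}/assaysummary/{fmt}"


def fetch_json(url, timeout=30):
    """Fetch a URL and parse the body as JSON (requires network access)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# CID 5995 is the compound used on the slide.
print(compound_assay_summary_url(5995))
```

Calling `fetch_json(compound_assay_summary_url(5995))` would retrieve the table of assays in which the compound was tested, subject to the service being reachable.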

Goal #2 Collect bioactivity data for a gene …

Bioactivity data for alpha4 nicotinic acetylcholine receptor … https://pubchem.ncbi.nlm.nih.gov/assay/bioactivity.html?geneid=11438
How to get here?
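A programmatic counterpart for the gene-centric goal is to ask PUG-REST for the assay identifiers (AIDs) whose target is a given gene. The sketch below assumes the `assay/target/geneid/.../aids` form of the PUG-REST assay domain; the helper name is hypothetical and only the URL is constructed.

```python
# Base URL of the PUG-REST service.
PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def assay_ids_for_gene_url(gene_id, fmt="JSON"):
    """Build a PUG-REST URL listing AIDs whose target matches an NCBI Gene ID.

    Assumes the 'assay/target/geneid' input form; hypothetical helper.
    """
    return f"{PUG_REST}/assay/target/geneid/{int(gene_id)}/aids/{fmt}"


# Gene ID 11438 is the one used in the slide's bioactivity URL.
print(assay_ids_for_gene_url(11438))
```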

PubChem Access & Search … Entrez BioAssay Classification Browser PubChem FTP

Entrez Substance & Compound …
• Chemical name
• Synonym
• Chemical property
• Links: tools, cross-references, related data

Search Androgen …

Search result …

Access tools …

Rule-of-five and bioactivity data subsets …

Subset with annotations …

Links, related data, cross-references …

Specific search using indexed field …

Specific search using indexed field … follow-up

Androgen page … https://pubchem.ncbi.nlm.nih.gov/compound/5995 (Goal #1)

Entrez PubChem BioAssay …
• Assay name, description, protocol
• Target information (protein name, gene symbol)
• Annotations, comments
• Depositor information
• Chemical name of tested substance
• Links: cross-references, related data

Search “nicotinic acetylcholine receptors” …

Search rattus …

Access search history …

Search history …

Combine search query …
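Combining earlier searches from the history has an E-utilities equivalent: previous result sets stored on the Entrez history server are referenced as `#1`, `#2`, and so on in a new query term. The sketch below assumes both searches were run with `usehistory=y` in the same `WebEnv` session; the helper name and the placeholder `WebEnv` value are hypothetical.

```python
from urllib.parse import urlencode

# Base URL of NCBI E-utilities (assumed standard endpoint).
EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def combine_history_url(db, web_env, query_keys, operator="AND"):
    """Build an ESearch URL that combines stored searches by query key.

    `web_env` is the WebEnv string returned by earlier ESearch calls made
    with usehistory=y; `query_keys` are the query_key numbers to combine.
    """
    if operator not in ("AND", "OR", "NOT"):
        raise ValueError(f"unsupported Boolean operator: {operator}")
    term = f" {operator} ".join(f"#{key}" for key in query_keys)
    params = {"db": db, "term": term, "WebEnv": web_env, "usehistory": "y"}
    return f"{EUTILS}/esearch.fcgi?{urlencode(params)}"


# Combine search #1 (e.g. "nicotinic acetylcholine receptors") with #2 (e.g. "rattus").
print(combine_history_url("pcassay", "WEBENV_PLACEHOLDER", [1, 2]))
```

The `#` characters are percent-encoded as `%23` in the resulting URL, which is why the term looks different from what is typed into the Entrez search box.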

Advanced search …

Complex query …

Index fields …

BioAssay links to many other databases
• PubMed: research data
• OMIM
• Protein: assay target
• BioSystems: a pathway database
• Gene: assay target
• MeSH: drug annotation
• Nucleotide: assay target (Nucleotide: AID 1637)
• GEO
• Taxonomy
• Structure: a mirror of the Protein Data Bank (PDB)
• CDD: conserved protein family domains
• Depositor links
Notes: to this end; hosted by NCBI; providing additional annotation

Access BioAssay from Entrez Gene …

Search “nicotinic acetylcholine receptors” in Gene …

Retrieve genes associated with BioAssay …

Search history …

Retrieve assays targeting nicotinic acetylcholine receptors …

Gene page for …

Links of BioAssay data targeting CHRNA4 …
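The Gene-to-BioAssay link followed on this slide corresponds to the E-utilities ELink call, which maps records in one Entrez database to linked records in another. A minimal sketch, assuming the `elink.fcgi` endpoint with `dbfrom=gene` and `db=pcassay`; the helper name is hypothetical and no specific link name is assumed.

```python
from urllib.parse import urlencode

# Base URL of NCBI E-utilities (assumed standard endpoint).
EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def gene_to_bioassay_links_url(gene_id):
    """Build an ELink URL from an Entrez Gene record to linked BioAssay records."""
    params = {
        "dbfrom": "gene",     # source database
        "db": "pcassay",      # target database (PubChem BioAssay)
        "id": int(gene_id),   # NCBI Gene ID
        "retmode": "json",
    }
    return f"{EUTILS}/elink.fcgi?{urlencode(params)}"


# Gene ID 11438 is the one used in the slide's bioactivity URL.
print(gene_to_bioassay_links_url(11438))
```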

Bioactivity data for CHRNA4 … Goal #2

Entrez search summary …
• General free text search
• Specific search by indexed field
• Advanced search for complex queries via the Limits page
• Refine/combine searches with Boolean operations
• Entrez links (Related Data)
• Access from Gene, PubMed, etc.

PubChem Access & Search … Entrez BioAssay Classification Browser PubChem FTP

BioAssay Tools …

PubChem Access & Search … Entrez BioAssay Classification Browser PubChem FTP

PubChem BioAssay FTP … ftp://ftp.ncbi.nlm.nih.gov/pubchem/Bioassay/
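For bulk download, assay data on the FTP site are grouped into archives that each cover a range of AIDs. The sketch below computes which archive would contain a given AID. It rests on two assumptions that should be checked against the site's README: that archives cover blocks of 1000 AIDs named with zero-padded seven-digit bounds (such as `0000001_0001000.zip`), and that CSV data live under a `CSV/Data/` subdirectory. The function names are hypothetical.

```python
# Root of the BioAssay FTP area, as given on the slide.
FTP_ROOT = "ftp://ftp.ncbi.nlm.nih.gov/pubchem/Bioassay/"


def bioassay_archive_name(aid, block=1000):
    """Return the assumed archive file name covering the given AID."""
    if aid < 1:
        raise ValueError("AID must be a positive integer")
    start = ((aid - 1) // block) * block + 1   # first AID in the block
    end = start + block - 1                    # last AID in the block
    return f"{start:07d}_{end:07d}.zip"


def bioassay_csv_archive_url(aid):
    """Return the assumed FTP URL of the CSV archive containing the AID."""
    # 'CSV/Data/' is an assumed subdirectory layout, not taken from the slide.
    return f"{FTP_ROOT}CSV/Data/{bioassay_archive_name(aid)}"


# AID 1637 is the assay mentioned on the database-links slide.
print(bioassay_csv_archive_url(1637))
```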

PubChem references
• Bolton E, Wang Y, et al. PubChem: Integrated Platform of Small Molecules and Biological Activities. Chapter 12 in Annual Reports in Computational Chemistry, Volume 4. Elsevier: Oxford, UK; 2008, pp. 217-240.
• Wang Y, et al. PubChem BioAssay: 2017 update. Nucleic Acids Res. 2017, 45(D1):D1075-1082.
• Li Q, Cheng T, Wang Y*, Bryant SH*. PubChem as a public resource for drug discovery. Drug Discov Today. 2010, 15(23-24):1052-1057.
• Pan Y, Cheng T, Wang Y*, Bryant SH*. Pathway Analysis for Drug Repositioning Based on Public Database Mining. J Chem Inf Model. 2014, 54:407-418.
• Cheng T, Pan Y, Hao M, Wang Y*, Bryant SH*. PubChem Applications in Drug Discovery: a Bibliometric Analysis. Drug Discov Today. 2014, 19(11):1751-6.