Problems from last section

Slides:



Advertisements
Similar presentations
Beyond PubMed and BLAST: Exploring NCBI tools and databases Kate Bronstad David Flynn Alumni Medical Library.
Advertisements

Genomic Innovations- Orthology Paralogy. Genomic innovation.
Finding regulatory modules from local alignment - Department of Computer Science & Helsinki Institute of Information Technology HIIT University of Helsinki.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
Tutorial 7 Genome browser. Free, open source, on-line broswer for genomes Contains ~100 genomes, from nematodes to human. Many tools that can be used.
Sequence Analysis MUPGRET June workshops. Today What can you do with the sequence? What can you do with the ESTs? The case of SNP and Indel.
Visualization of genomic data Genome browsers. How many have used a genome browser ? UCSC browser ? Ensembl browser ? Others ? survey.
Bioinformatics and Phylogenetic Analysis
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
Genomic Database - Ensembl Ka-Lok Ng Department of Bioinformatics Asia University.
How to access genomic information using Ensembl August 2005.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD Robert J. Livingston, PhD NIEHS Variation Workshop January 30-31, 2005.
Sequence/Structure Alignment Resources from NCBI Steve Bryant Protein Data Bank Rutgers University November 19, 2005.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD SeattleSNPs Variation Workshop March 20-21, 2006.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Data retrieval BioMart Data sets on ftp site MySQL queries of databases Perl API access to databases Export View.
Doug Brutlag 2011 Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University School of Medicine Genomics, Bioinformatics.
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Doug Brutlag 2011 Next Generation Sequencing and Human Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University.
Tri-I Bioinformatics Workshop: Public data and tool repositories Alex Lash & Maureen Higgins Bioinformatics Core Memorial Sloan-Kettering Cancer Center.
NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.
Copyright OpenHelix. No use or reproduction without express written consent 2 Overview of Genome Browsers Materials prepared by Warren C. Lathe, Ph.D.
is accessible at: The following pages are a schematic representation of how to navigate through ALE-HSA21.
UCSC Genome Browser 1. The Progress 2 Database and Tool Explosion : 230 databases and tools 1996 : first annual compilation of databases and tools.
Copyright OpenHelix. No use or reproduction without express written consent1.
COURSE OF BIOINFORMATICS Exam_31/01/2014 A.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
BIOINFORMATIK I UEBUNG 2 mRNA processing.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
NCBI FieldGuide NCBI Molecular Biology Resources March 2007 Using Entrez.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
P HYLO P AT : AN UPDATED VERSION OF THE PHYLOGENETIC PATTERN DATABASE CONTAINS GENE NEIGHBORHOOD Presenter: Reihaneh Rabbany Presented in Bioinformatics.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
How do we represent the position specific preference ? BID_MOUSE I A R H L A Q I G D E M BAD_MOUSE Y G R E L R R M S D E F BAK_MOUSE V G R Q L A L I G.
Bioinformatics and Computational Biology
Cool BaRC Web Tools Prat Thiru. BaRC Web Tools We have.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
Copyright OpenHelix. No use or reproduction without express written consent1.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Introduction to Bioinformatics - Tutorial no. 5 MEME – Discovering motifs in sequences MAST – Searching for motifs in databanks TRANSFAC – the Transcription.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
Genomes at NCBI. Database and Tool Explosion : 230 databases and tools 1996 : first annual compilation of databases and tools lists 57 databases.
Welcome to the combined BLAST and Genome Browser Tutorial.
NCBI: something old, something new. What is NCBI? Create automated systems for knowledge about molecular biology, biochemistry, and genetics. Perform.
Visualization of genomic data Genome browsers. How many have used a genome browser ? UCSC browser ? Ensembl browser ? Others ? survey.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
COURSE OF BIOINFORMATICS Exam_30/01/2014 A.
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
Getting GO annotation for your dataset
NCBI Molecular Biology Resources
University of Pittsburgh
Functional Annotation of the Horse Genome
GEP Annotation Workflow
Visualization of genomic data
Access to Sequence Data and Related Information
Visualization of genomic data
gene-CENTRIC database
There are four levels of structure in proteins
Searching the NCBI Databases
Ensembl Genome Repository.
Next Generation Sequencing and Human Genome Databases
with the Ensembl Genome Browser
Public data and tool repositories Section 2 Genome Browsers
Welcome to the GrameneMart Tutorial
Gene Safari (Biological Databases)
Presentation transcript:

Public data and tool repositories Section 2 Survey of analysis tools and tutorials

Problems from last section Query Entrez Gene with the following two queries separately and then explain the differences between the two results using a logical NOT operation: tyrosine kinase[Gene Ontology] AND human[Organism] cd00192[Domain Name] AND human[Organism] Retrieve the APP gene record from NCBI and use the Display dropdown menu to display Conserved Domain Links. Use the ids of the listed domains to query Entrez Gene for records with the same domains. Use the SNP Geneview link at NCBI to identify coding SNPs in the APP gene. Which SNP is missing from this display which was present in the Ensembl APP protein record? Use the Homologene link at NCBI to identify possible functional orthologs for human APP. How does this list compare to the Ensembl list of orthologs that we reviewed previously?

Review of last section example: human APP gene NCBI Entrez databases Constructing queries Gene, Nucleotide and Protein RefSeq UCSC Genome Browser Finding genes Displaying data tracks Comparing data from different sources EBI/Ensembl Viewing Genes, Transcripts, Exons, Proteins and SNPs Common id and data formats

This section Protein structure visualization/analysis example Promoter/enhancer analysis example More information

Amyloid Precursor Protein (APP) G-protein coupled receptor that binds heparin and laminin ß-secretase amyloid fibril amyloid plaque DAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIA -secretase Ex: Viewing the structure of an amyloid fibril

Other structure tools Structure visualization. Free applications: RasMol Cn3D VMD Structure prediction servers/applications CASP: Critical Assessment of Techniques for Protein Structure Prediction General method: Sequence similarity search to identify closest homolog with known structure Fit to homolog’s known structure, minimizing some constraint

APP Upstream Region 15kb Ex: Extracting and aligning human and mouse APP upstream regions

Promoter/enhancer analysis approaches Same gene, multiple species Assumed evolutionary conservation of non-coding regions Can use pairwise or multiple alignment method Examples: Precomputed: UCSC conservation tracks Dynamic: eg, rVista Different genes, same species Typical output as co-expressed clusters from microarray data Looking for over-represented, small binding sites Much better results if looking for a pattern or clustering of multiple sites Motif-finding algorithm, eg, MEME

Tutorials NCBI EBI UCSC Field Guide Information and tutorials Science Primer EBI 2Can Tutorials UCSC Genome Browser User’s Guide

Next week’s sections John Major Genome Browsers genome build process, ongoing and complete genome projects genome browsers of Ensembl, UCSC and NCBI Mapviewer Bulk downloads how bulk bioinformatics data might be useful common data formats retrieving data