Andy Conley 3/26/2012

Presentation transcript:

Andy Conley 3/26/2012 1

James Kent. Know that name. He is one of the greatest, perhaps the greatest, bioinformatics programmers ever. He was deeply involved in the assembly of the public human genome project. If you were in the fall class, you compiled the James Kent Source tree. Almost all his. He speaks nothing but the truth. 2

“Genome browsers facilitate genomic analysis by presenting alignment, experimental and annotation data in the context of genomic DNA sequences.” Melissa S Cline & James W Kent, 2009 Genome browsers aggregate data 3

Clicking on any of these takes you to a page full of details. CDKN2A 4

Tracks are any kind of genomic information: genes, transposable element insertions, transcription factor binding sites, sites prone to recombination, conservation of genomic sequences. Extremely important in modern times are tracks displaying ChIP-seq or RNA-seq data. 5

Arguably the most advanced genome browser, it is much more than a tool for looking at genomes. It integrates a huge amount of data for each gene it displays. UCSC also has a graphical front end for downloading from its huge backend database. 6

It hosts the ENCODE project, one of the largest, probably the largest, assemblies of functional genomic data. It lets you jump between orthologous regions in different genomes: CDKN2A. It’s a massive, massive database backend of over 6500 tables. 7

It’s really, really, really hard to install. It’s impossible to understand unless you’ve tried to do it. The UCSC genome browser works so well for the genomes that it has because it is so very, very specialized for those genomes. Each track in the UCSC browser has been lovingly crafted. 8

A ridiculous number of genomes. They’re going to be coming out even faster in the next year or two, then faster after that. Things like the new PacBio providing longer reads should make assembling eukaryotic genomes easier. 9

You can’t load them/annotate them by hand – it all has to be automated. The UCSC guys do it for the human genome because it’s the human genome. They’re all different from each other. You have to have some easily deployable storage/display method for your data. 10

There are a number of choices out there for a genome browser. There are really just 2 big ones: UCSC, and GMOD & GBrowse. We already discussed why you don’t use the UCSC browser for projects. 11

Generic – It can handle any organism. Model Organism – Not really, whatever genome. Database – Not really a database, but there is a database in it. GMOD just sounds good. gmod.org 12

A simple, easily deployable method for storing, viewing and editing genomic data. GMOD has many, many parts. Some of the big ones: Apollo – Eww. Chado – A mechanism for storing genomic data. GBrowse – A genome browser. 13

Probably (definitely) the most commonly used of the GMOD components. It is a simple but extensible platform for displaying genomic data. It is maintained mostly by this man: Scott Cain. 14

Many projects use GBrowse as their genome viewer 15

WormBase is to the C. elegans genome what the UCSC browser is to the human and mouse genomes. It is huge. 16

FlyBase hosts many Drosophila genomes, though not with the depth of WormBase. WormBase is really at the top of non-UCSC browsers in its depth of information. This makes sense, given that nematodes are so heavily studied and very easy to work with. 17

The result of the first couple years of the class. Currently maintained by Lee Katz at the CDC. 18

19

This shows genes that we thought were horizontally transferred. Darker genes had more programs indicating that they were horizontally transferred. 20

We had a track of virulence factors in the first year. Clicking on any of them took you to details for the gene, a link to VFDB, etc. 21

You can alter how tracks are shown in other ways: add and remove tracks, change the link that appears over a feature in the genome. 22

One big, important thing: “Genome browsers facilitate genomic analysis by presenting alignment, experimental and annotation data in the context of genomic DNA sequences.” Melissa S Cline & James W Kent, 2009 Genome browsers, in short, aggregate data. 23

My rotifer transcriptome browser. It doesn’t have to be a genome. Not super exciting from this view. Just the predicted coding region of an assembled contig (mRNA). 24

25

The relative ordering of things in a genome. Just a few years ago, this was not available in GBrowse; it is now. This could easily work for comparing different bacterial species. 26

27

28

Are genome browsers useful? 29

We deal with huge volumes of data. The fall class will recall my hatred of GUIs. We want high-throughput. Genome browsers give you none of this. None. 30

I spent quite a bit of time in undergrad doing bench work for Dr. Nils Kroger across the street. I worked with these little guys. Fascinating creatures. I cared about three genes: Sil1, Sil2, Sil3. The day the genome browser came out changed the game. 31

Still pretty useful. My main uses: 1. Make sure my data are correct. Are my intersections between genes and transposable element insertions correct? 2. Download hosted data. 3. Make nice pictures. 4. Like a biologist, get information about specific genes. 32
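
The kind of intersection I then check by eye in the browser can be sketched in a few lines. This is only an illustration, not the pipeline I actually used: the Feature type, the function names and all the coordinates are made up, and it assumes 1-based inclusive coordinates like GFF.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """A hypothetical genomic interval (1-based, inclusive, as in GFF)."""
    seqid: str
    start: int
    end: int
    name: str


def overlaps(a, b):
    """True if two features share at least one base on the same sequence."""
    return a.seqid == b.seqid and a.start <= b.end and b.start <= a.end


def intersect(genes, insertions):
    """Return (gene name, insertion name) pairs for every overlap.

    Brute force over all pairs: fine for a spot check, too slow for a
    whole genome, where a sorted sweep or interval tree would be used.
    """
    return [(g.name, t.name) for g in genes for t in insertions if overlaps(g, t)]


# Made-up example data.
genes = [Feature("chr1", 100, 500, "geneA"), Feature("chr1", 900, 1200, "geneB")]
insertions = [Feature("chr1", 450, 470, "TE1"), Feature("chr2", 950, 960, "TE2")]
hits = intersect(genes, insertions)
print(hits)
```

If the browser shows TE1 sitting inside geneA and nothing on geneB, the computed list and the picture agree.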

How useful is it really? It really depends on who you ask. It’s really for biologists: they find the browser, search for their favorite gene and get some details about it. Once again, data aggregation. 33

They were super excited about it. They use it all the time. It is like magic to them. If you were to show an iPhone to somebody from 1975, it would be pretty much the same thing. Almost. 34

Will it ever be the greatest genome browser? No. That will always be the UCSC browser. Will it remain the easiest to install for some time? Probably. Will you get the best return on time spent? Yep. Synteny is horribly conserved in Haemophilus, so avoid GBrowse_syn for this class, but do keep it in mind. 35

Genome browsers: allow navigation of the genome; show genomic features, whatever they are; show annotations; show comparisons. 36

GBrowse, and all of GMOD, use GFF (Generic Feature Format) files. Most of it is pretty simple: chromosome (contig), start, stop, strand, id. The last column is what’s important. It lets you put whatever information about the feature you want in there. It’s a very flexible format. 37
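
To make the format concrete, here is a minimal sketch of reading one GFF line. It assumes the GFF3 flavor, which has nine tab-separated columns (seqid, source, type, start, end, score, strand, phase, attributes) with the last column written as key=value pairs separated by semicolons. The function name and the example record are made up for illustration, and this is not how GBrowse itself loads files.

```python
from urllib.parse import unquote

# The nine GFF3 columns, in order.
GFF_COLUMNS = ("seqid", "source", "type", "start", "end",
               "score", "strand", "phase", "attributes")


def parse_gff3_line(line):
    """Parse one GFF3 feature line into a dict (hypothetical helper)."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != len(GFF_COLUMNS):
        raise ValueError(f"expected 9 tab-separated columns, got {len(fields)}")
    record = dict(zip(GFF_COLUMNS, fields))
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])

    # The flexible last column: semicolon-separated key=value pairs,
    # with special characters percent-encoded.
    attributes = {}
    for pair in record["attributes"].split(";"):
        if not pair:
            continue
        key, _, value = pair.partition("=")
        attributes[key] = unquote(value)
    record["attributes"] = attributes
    return record


# Made-up example record.
example = ("contig1\tprediction\tgene\t1200\t2300\t.\t+\t.\t"
           "ID=gene0001;Name=exampleGene;Note=made-up%20example")
rec = parse_gff3_line(example)
print(rec["seqid"], rec["start"], rec["end"], rec["strand"], rec["attributes"])
```

The sketch skips comment lines, directives and comma-separated multi-valued attributes, all of which a real loader has to handle.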

Thanks for listening 38