Less is more Approaches to biologist-driven analysis and next-generation sequencing data Paul Gordon Genome Canada Bioinformatics Platform University of.

Slides:



Advertisements
Similar presentations
Bioinformatics (and Systems Biology?) in Biomedical Research Donald Dunbar Systems Biology Club 30th November 2005.
Advertisements

 3.a.1 – DNA, and in some cases RNA, is the primary source of heritable information (19.2).  3.c.3 – Viral replication results in genetic variation,
Breaking Barriers: getting biologists involved in everyday data integration using Moby Paul Gordon Genome Canada Bioinformatics Platform University of.
By, Mackenzie Pabst Viruses; Section 18-1.
Bioinformatics at Molecular Epidemiology - new tools for identifying indels in sequencing data Kai Ye
A Systematic approach to the Large-Scale Analysis of Genotype- Phenotype correlations Paul Fisher Dr. Robert Stevens Prof. Andrew Brass.
The Golden Age of Biology DNA -> RNA -> Proteins -> Metabolites Genomics Technologies MECHANISMS OF LIFE Health Care Diagnostics Medicines Animal Products.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Slide 1 of 24 Copyright Pearson Prentice Hall 14–3 Human Molecular Genetics 14-3 Human Molecular Genetics.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Prions Fact or Science Fiction?. Stanley Prusiner, 1982 Born in Des Moines, Ia. Suggested that spongiform encephalopathies in animals and humans are caused.
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
Genetics: From Genes to Genomes
The Microbiome and Metagenomics
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Computational Methods to study Sequencing data -Meenakshi Sharma.
WebGBrowse A Web Server for GBrowse Configuration Ram Podicheti B.V.Sc. & A.H. (D.V.M.), M.S. Staff Scientist – Bioinformatics Center for Genomics and.
Metagenomic Analysis Using MEGAN4
Copyright Pearson Prentice Hall
Copyright Pearson Prentice Hall 14–3 Human Molecular Genetics 14-3 Human Molecular Genetics.
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
U.S. Dept. Agriculture Agricultural Research Service Rob Griesbach Technology Transfer Coordinator
Ethics of Biotechnology. CLONING What is CLONING? Creating new and identical organisms using biotechnology.
A New Oklahoma Bioinformatics Company. Microarray and Bioinformatics.
Helping scientists collaborate BioCAD. ©2003 All Rights Reserved.
Integration and analysis of multi-type high-throughput data for biomolecular knowledge discovery Dr. Erik Bongcam-Rudloff SGBC-SLU Uppsala, Sweden.
Slide 1 of 24 Copyright Pearson Prentice Hall Biology.
Agent-based methods for translational cancer multilevel modelling Sylvia Nagl PhD Cancer Systems Science & Biomedical Informatics UCL Cancer Institute.
Implementing computational analysis through Web services Arnaud Kerhornou CRG/INB Barcelona - BioMed Workshop IRB November 2007.
Genomes To Life Biology for 21 st Century A Joint Initiative of the Office of Advanced Scientific Computing Research and Office of Biological and Environmental.
Bacteria Growth. Answer on the page 90 of your notebook using short sentences. Have you have had a cold or stomach “bug?” Did anyone else in your family.
1 Limitations of BLAST Can only search for a single query (e.g. find all genes similar to TTGGACAGGATCGA) What about more complex queries? “Find all genes.
Analyzing Time Course Data: How can we pick the disappearing needle across multiple haystacks? IEEE-HPEC Bioinformatics Challenge Day Dr. C. Nicole Rosenzweig.
An overview of Bioinformatics. Cell and Central Dogma.
A collaborative tool for sequence annotation. Contact:
An approach to carry out research and teaching in Bioinformatics in remote areas Alok Bhattacharya Centre for Computational Biology & Bioinformatics JAWAHARLAL.
COMPUTATIONAL BIOLOGIST DR. MARTIN TOMPA Place of Employment: University of Washington Type of Work: Develops computer programs and algorithms to identify.
Exploring and Exploiting the Biological Maze Zoé Lacroix Arizona State University.
Viral Cycles: Lytic Lysogenic
Prions “Scrapie” “mad cow disease” Nobel Prize 1997
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Viruses and bacteria are the simplest biological systems - microbial models where scientists find life’s fundamental molecular mechanisms in their most.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Slide 1 of 24 Copyright Pearson Prentice Hall 14–3 Human Molecular Genetics 14-3 Human Molecular Genetics.
High throughput biology data management and data intensive computing drivers George Michaels.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
Cell Biology Topic 1.1. Cell Theory All organisms are composed of one or more cells. Cells are the smallest units of life. All cells come from pre-existing.
Genetics, Viruses and Bacteria. Quick review of Genetics Mendel ◦ Law of segregation: Mendel’s first law, stating that each allele in a pair separates.
Introduction to Bioinformatics
Copyright Pearson Prentice Hall
Statistical Applications in Biology and Genetics
14-3 Human Molecular Genetics
Copyright Pearson Prentice Hall
Genomes Learning Goal: To explore the important applications of genome research. Success Criteria: I know I am succeeding when I can… explain the significance.
New genes can be added to an organism’s DNA.
Functional Annotation of the Horse Genome
14-3 Human Molecular Genetics
14-3 Human Molecular Genetics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Biology Biology.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Copyright Pearson Prentice Hall
Copyright Pearson Prentice Hall
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Volume 12, Issue 6, Pages (December 2007)
Presentation transcript:

Less is more Approaches to biologist-driven analysis and next-generation sequencing data Paul Gordon Genome Canada Bioinformatics Platform University of Calgary

What am I doing here? Next Generation Sequencing Next Generation Web Future challenges Genome Canada Bioinformatics Platform

Better tech: less DNA, more sequence 44μm 70nm

PhytoMetaSyn

Sprockets: Hierarchical Gene Models from ESTs Developed in collaboration with BASF Plant Sciences

Genozymes

Hydrocarbon Metagenomics

Exploring gene expression patterns CAVEman Java 3D-based, world-first complete 3D human body atlas (adult male) – 2,335 organs, hierarchical organization following Terminologia Anatomica Numerous applications involving mapping of genetic and disease data More information: Patient MRI stack mapped onto atlas and registered by landmarks Pharmacokinetics visualization (Absorption-distribution-metabolism- excretion of Aspirin)

Basic Research Archaeal UV-light response Large-scale human genome organization ING-protein interactions (cancer and ageing-rated proteins)

Research Applications Kidney transplants: improved rejection diagnostics in Edmonton Mad cow disease/chronic wasting disease: live diagnostics Desulf.: mechanisms of oil pipeline corrosion and its prevention

DNA Diagnostics Discovery for Mad Cow PreclinicalClinicalPreinoculation Controls Control animal #6 Ball toy Photo: S. Czub, CFIA Lethbridge

Next-gen Motif finding (elk dataset) 61 blood samples 107 million base pairs 432 billion pairwise alignments ( ) mers or smaller Uninfected Infected 3 universal Infected Thousands of animal coverage/timepoint combos (CPU intensive) Decypher hardware accelerator

Motif Results

↑ EVI1 ↑PLZF Retrovirus PrP sc (+?) ↓PLZF-controlled genes Infectious agent Circulating Nucleic Acids Endogenous Retrovirus? Consistent with protein-only evidence… Neurovirulent? (e.g. M.L. Labat 1999) Possible mode of action? Virus particles? ~25nm PrP Amyloid fibres Vacuole Manuelidis et al, PNAS 2007 Protected promoters (Motifs A & B) Feedback PrP Integration Nucleoprotein complexes Cell death CNA Export Carp et al., EMBO J., 2006 Leblanc et al., EMBO J Stengel et al., Biochem. Biophys. Res. Commun Lee et al., Biochem. Biophys. Res. Commun Etc. Activation

Better tech: less input, more results Better tech: less DNA, more sequence Generate Manuscript Now Generate Manuscript Now

Where are we at? Bioinformatics Web Emerging Technologies Life Sciences Semantic Web Source: Gartner Inc.

How software works… Functions/ Rules Parameters/Input Results/ Output (article, allele,…) (Gene name, DNA sequence, QTL…)

The problem with the Web Once you label me, you negate me. Søren Kierkegaard 1998 Now

Bluejay Comparative genomics BioMoby linking Waypoints Gene expression integration

The task at hand (biologist) Sequencer Data File (Binary) ACCGT… Known Proteins BLAST Report (related proteins) (computer scientist)

DNASequence NCBI_gi Sequence_Alignment

Audience God Amoeba Taverna self-starters Willing to take training Capable but fearful Self-perception of computer skills

The need for shoehorns The current vision of the Semantic Web intends to create a new structure starting up with no reference to its vast, functioning, but more primitive predecessor … things just don’t happen like that

All the Web as Workflows Seahawk Proxied Web page Drag ‘n’ drop Seahawk prompting

What’s Ahead? The more a man learns, the more he realizes how little he knows

Semantic Web

Take home messages As tech improves, we can ask better questions We will need shoehorns to access existing resources for the foreseeable future