Integration and analysis of multi-type high-throughput data for biomolecular knowledge discovery Dr. Erik Bongcam-Rudloff SGBC-SLU Uppsala, Sweden.

Slides:



Advertisements
Similar presentations
Biology Day 1.
Advertisements

Walter Lab: Gut microbiome and its interactions with metabolic disease
LESSON 1: What is Genetic Research? PowerPoint slides to accompany Using Bioinformatics : Genetic Research.
CSU IDRC Next Generation Sequencing Core Genomic Sequencing Services.
Module 12 Human DNA Fingerprinting and Population Genetics p 2 + 2pq + q 2 = 1.
Use of the genomic data o Reconstruction of metabolic properties o Nature’s Microbiome o NGS in Population Genetics.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Lesson Overview 1.1 What Is Science?.
Next-generation sequencing
AI and Bioinformatics From Database Mining to the Robot Scientist.
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey What is Metagenomics?  Traditional microbial genomics 
living organisms According to Presence of cell The non- cellular organism The cellular organisms According to Type the Eukaryotes the prokaryotes human.
Greg Phillips Veterinary Microbiology
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Topics in Computational Biology (COSI 230a) Pengyu Hong 09/02/2005.
The Microbiome and Metagenomics
Professionals and patients need clean clear knowledge for decision making just as they need clean clear water for hand washing Water may look clear.
Microbial Genomes Features Analysis Role of high-throughput sequencing Yeast - the eukaryotic model microbe Databases –TIGR CMR –NCBI Microbial Genomes.
GENETIC ENGINEERING B-4.9. GENETIC ENGINEERING GENETIC ENGINEERING IS THE PROCESS OF SPECIFIC GENES IN AN ORGANISM IN ORDER TO ENSURE THAT THE ORGANISM.
Molecular Microbial Ecology
CSE 6406: Bioinformatics Algorithms. Course Outline
KEY CONCEPT Biology is the study of all forms of life.
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
What is Science? A way of learning and thinking about the natural world using experimentation to make conclusions Scientists collect information, look.
Bioinformatics and medicine: Are we meeting the challenge?
Lesson Overview Lesson Overview Studying the Human Genome Lesson Overview 14.3 Studying the Human Genome.
Biology and You Section 1: Themes of Biology
Finish up array applications Move on to proteomics Protein microarrays.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
What will mapping the genome mean to the industry? Clare A. Gill, Ph.D. Department of Animal Science Texas A&M University.
The Human Genome Project. The Human Genome Project (HGP) was started in 1989 and finished in Its goals included: Identifying positions on chromosomes.
713 Lecture 15 Host metagenomics. Progression of techniques Culture based –Use phenotypes and genotypes to ID Non-culture based, focused on 16S rDNA –Clone.
Bioinformatics field of science in which biology, computer science, and information technology merge to form a single discipline.
Chapter 21 Eukaryotic Genome Sequences
1 Biology and You-Chapter 1. 2 I. Themes of Biology A. Living Organisms have certain characteristics in common. 1. Biology is the study of life.
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
WMU CS 6260 Parallel Computations II Spring 2013 Presentation #1 about Semester Project Feb/18/2013 Professor: Dr. de Doncker Name: Sandino Vargas Xuanyu.
Analyzing Time Course Data: How can we pick the disappearing needle across multiple haystacks? IEEE-HPEC Bioinformatics Challenge Day Dr. C. Nicole Rosenzweig.
A collaborative tool for sequence annotation. Contact:
An approach to carry out research and teaching in Bioinformatics in remote areas Alok Bhattacharya Centre for Computational Biology & Bioinformatics JAWAHARLAL.
Introduction to Biotechnology Chapter 13. What is biotechnology? “ Any technique that uses living organisms or their products to make or modify a product,
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
__________________________________________________________________________________________________ Fall 2015GCBA 815 __________________________________________________________________________________________________.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
High throughput biology data management and data intensive computing drivers George Michaels.
Big Data in Indian Agriculture D. Rama Rao Director, NAARM.
DNA Technology. Please pick up notes on the front desk.
DNA Technology and its Applications. Objective  AKS 9. Analyze how biological traits are passed on to successive generations  9g. Examine the use of.
Biotechnology You Will Learn About… Transformation Cloning DNA Fingerprinting by Restriction Fragment Length Polymorphism (RFLP) What is the name of the.
Genomics and the Growing World Steve Rounsley Dow Agrosciences.
1 General Biology Chapter 11 Gene Technology. 2 Genetic Engineering Recombinant DNA is made when a DNA fragment is put into the DNA of a vector Gel electrophroesis.
Boundless Lecture Slides Free to share, print, make copies and changes. Get yours at Available on the Boundless Teaching Platform.
Metagenomic Species Diversity.
Joslynn Lee – Data Science Educator
CBTTC Expansion in China through BGI/CNBC
Toward Next Generation Biodiversity Research
생물정보학 Bioinformatics.
Genomes and Their Evolution
H = -Σpi log2 pi.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Microbiome studies for microbial disease pathogenesis research
Dissemination of the mcr-1 colistin resistance gene
Gut Microbiome Studies
Lesson Overview 1.1 What Is Science?.
Toward Accurate and Quantitative Comparative Metagenomics
Presentation transcript:

Integration and analysis of multi-type high-throughput data for biomolecular knowledge discovery Dr. Erik Bongcam-Rudloff SGBC-SLU Uppsala, Sweden

Biologists modus operandi Observing a phenomenon that is in some way interesting or puzzling. Making a guess as to the explanation of the phenomenon. Making a guess as to the explanation of the phenomenon. Devising a test to show how likely this explanation is to be true or false. Devising a test to show how likely this explanation is to be true or false. Carrying out the test, and, on the basis of the results, deciding whether the explanation is a good one or not. In the latter case, a new explanation will (with luck) 'spring to mind' as a result of the first test. Carrying out the test, and, on the basis of the results, deciding whether the explanation is a good one or not. In the latter case, a new explanation will (with luck) 'spring to mind' as a result of the first test. http : // http : //

The Observed phenomenon

Selection of test times

But was is the real event?

Sometimes you could be lucky Positive “Positive” results are used “negative” rejected Why? Only positive results are publishable “Positive” results are used “negative” rejected Why? Only positive results are publishable

Next Generation techniques

New challenges 1 TB data

Gbases produced at Sanger

World NGS Map

But this is wonderful! Or? Sequence without knowledge connected to it is worth: 0 The deluge of data produced by these hordes of machines worldwide demand automatic workflows Complete new systems to shuffle data around Storage of never used amounts Machines with gigantic amounts of RAM

COSTS

PROBLEMS NOmenclature Publishing culture Moving target development Old ways of work and resistance to changes in culture

Publishing culture as example We get tax payers money, we pay publishers to publish, the publishers sell the articles and obtain the copy rights To connect knowledge to sequences we need automatic methods, workflows, text mining. Most of this is limited by close database systems. Only available is PubMed. But PubMed has only short abstracts. NO information about conditions, M&M etc We need to change this culture

The BLAST analogy... By far the most used tool by biologists Not possible if databases were not Open Access and freely searchable Imagine if Nucleotide and Protein databases followed the life science publishing model

BLAST

BLAST

BLAST

BLAST

BLAST

Human centric What about all other areas of the Life Sciences? Most genes are named by sequence similarity, but are the functions the same?

Microbiome A microbiome is the totality of microbes, their genetic elements (genomes), and environmental interactions in a particular environment. microbesent.microbesent.

Fat and lean Metabolic effects of transplanting gut microbiota from lean donors to subjects with metabolic syndrome. A. Vrieze et al, EASD abstracts, 24 September The result was: Lean donor faecal infusion improves hepatic and peripheral insulin resistance as well as fasting lipid levels in obese individuals with the metabolic syndrome

Genome sizes

How many species? Several orders of magnitude: Some estimates: 3-50 million species of arthropods million species of nematodes Only a portion of bacterias have being identified, 99% of bacterias cannot be cultured. “Once the diversity of the microbial worldis catalogued, it will make astronomy to look like a pitiful science” Julian Davies, Professor Emeritus. UBC

New research strategies MicrobialLivestockPlants

Typical Sources of Metagenomics Soil samples Sea water samples Air samples Medical samples Farm animal samples Ancient bones Human microbiome

Ion Proton: "Personal Genome Machine". LIFE TECHNOLOGIES CORPORATION Real tests of transcriptome sequencing on the Proton. Using 500 ng of input poly-A RNA, it was possible to generate 50 million reads from a melanoma cancer sample. Joe Boland of the National Cancer Institute according to Genomeweb.

Oxford Nanopore

High technology everywhere!

New applications Only imagination will put the limits of what its possible to be done using Next Generation Technologies!

The big challenge: Open Access, Open source, collaborative networks Data sharing Common language Tool systems to glue all together!!

SeqAhead COST Action BM1006: Next Generation Sequencing Data Analysis Network COST Action 25 countries

ALLBIO 10 partners 8 countries FP7 project Broadening the Bioinformatics Infrastructure to unicellular, animal, and plant science

THANKS!! Como 2012