DNA sequencing, big data and health Mikael Huss Science for Life Laboratory / Stockholm Follow the Data blog:

Slides:



Advertisements
Similar presentations
Regulation of Consumer Tests in California AAAS Meeting June 1-2, 2009 Beatrice OKeefe Acting Chief, Laboratory Field Services California Department of.
Advertisements

The Australian National Data Service Ross Wilkinson Feb 26 th
CSU IDRC Next Generation Sequencing Core Genomic Sequencing Services.
Scientific themes in personal genetics Personal Genetics Education Project (pgEd) Harvard Medical School - Wu Laboratory
Sino-Danish Breast Cancer Research Centre Beijing Genomic Institute, Shenzhen University of Copenhagen University of Århus University of Southern Denmark.
Wrapup. NHGRI strategic plan What does the NIH think genomics should be for the next 10 years? [Nature, Feb. 2011]
© 2013 RNA Diagnostics Inc. All rights reserved. Confidential 1 “RDA: better chemotherapy management” Investor Presentation April 2013.
Strategy 2012 Karolinska Institutet June 2010Strategy 2012.
The data flood: We need a bigger boat James A. Foster The Initiative for Bioinformatics and Evolutionary Studies (IBEST) Biological Sciences, Bioinformatics.
Printed by Genomes, Disorders, and Databases Isabel C. Ibarra Soto Episcopal Cathedral School This project investigates all four.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Workshop in Bioinformatics 2010 What is it ? The goals of the class… How we do it… What’s in the class Why should I take the class..
Epigenetics of Celiac Disease MEDICEL Malta 2011.
Future Trends: Translational Informatics James J. Cimino Chief, Laboratory for Informatics Development Mark O. Hatfield Clinical Research Center National.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
U.S. Department of the Interior U.S. Geological Survey Development of Inferential Sensors for Real-time Quality Control of Water- level Data for the EDEN.
Stephanie St.Onge, Michelle Vance, Jonathan Wright, Alton Havey.
Joakim Dillner, M.D. Professor Department of Laboratory Medicine Karolinska Institutet Sweden Karolinska Institutet.
Research. Broad research, from science to community planning Research involves more than 3,000 staff members –More than 600 people belong to the Faculty.
Big Data.
Big Data Use Cases in the cloud Peter Sirota, GM Elastic
1 Project Description Development Tabbetha Dobbins Created for Louisiana Tech’s NSF-Funded Research Experiences in Micro/Nano Engineering Program.
University of Utah Department of Human Genetics Pharmacogenomics Louisa A. Stark, Ph.D. Director.
Sublinear time algorithms Ronitt Rubinfeld Computer Science and Artificial Intelligence Laboratory (CSAIL) Electrical Engineering and Computer Science.
Tyson Condie.
Metagenomic Analysis Using MEGAN4
DEVELOPMENT OF THE TOOLS FOR PCR-DETECTION OF HEPATITIS A AND C VIRUSES IN INTRAHOSPITAL VIRAL CONTAMINATION RESEARCH. 1 D. I. Ivanovsky Virology Institute,
Physiological Integration in Organismal Biology Hannah V. Carey, Ph.D. Department of Comparative Biosciences University of Wisconsin School of Veterinary.
1 Distributed Big Data & Analytics University of Cincinnati –Bioinformatics Project/Research Title: NIH BD2K-LINCS Perturbation Data Coordination and Integration.
The use of human biospecimens in cancer research Christopher A. Moskaluk M.D., Ph.D. University of Virginia.
Genomics and Personalized Healthcare Lecture 2 Bailee Ludwig Quality Management.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
Cancer Staging.
1 © Copyright 2012 EMC Corporation. All rights reserved. It’s An Exciting Time In The Industry Bill Schmarzo CTO, EIM&A Practice EMC Consulting
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
Strategic Research Areas “If I have seen a little further it is by standing on the shoulders of giants”
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
DNA Technology - 2.
The rise of digitized medicine disrupts current research and business models Jesper Tegnér Director of the Unit for Computational Medicine, Department.
The NIH Roadmap and the Human Microbiome Project Francis S. Collins, M.D., Ph.D. National Human Genome Research Institute April 22, 2007.
Biobanks of Cerice Center for Gene Expression Research in Cancer Epidemiology Eiliv Lund, UiTø.
Why study genetics? It has a profound effect on your life!
1 CENS is comprised mostly of Computer Science and Electrical Engineering faculty and graduate students, collaboratively developing: Technologies (hardware.
Sackler Medical School
Introduction to Business Analytics OPS 370. Business Analytics Use of: – Data – Quantitative Analysis – Predictive and Prescriptive Models Leading to:
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
The Human Genome Project Eric Lander PhD Director Whitehead Center for Genome Research Cambridge, MA Eric Topol MD Provost and Chief Academic Officer Chairman.
Scientific themes in personal genetics Personal Genetics Education Project (pgEd) Harvard Medical School - Wu Laboratory
DNA Methylation + Epigenetics
SciLifeLab Science for Life Laboratory Eva Molin, PhD, MEd Project coordinator.
Biol 456/656 Molecular Epigenetics Lecture #2 Wed August 26, 2015.
INTERPRETING GENETIC MUTATIONAL DATA FOR CLINICAL ONCOLOGY Ben Ho Park, M.D., Ph.D. Associate Professor of Oncology Johns Hopkins University May 2014.
Science for Life Laboratory Eva Molin Project coordinator.
Biopreservation Industry Share, Growth, Analysis, Statistics, Trends, Forecast Report, 2024
New research areas in personalised medicines
Big Data in Genomics, Diagnostics, and Precision Medicine
Toronto region is the only growing metro region in north America
Science for Life Laboratory
Detection of genome regulation sequences
Jeopardy Testing 1, 2, 3 She Has The Cancer Radiation or Chemo?
Human Genome Project, Gene Therapy, and Cloning
The African Soil Microbiology project
Precision Medicine Market share to see growth of 10.5% from 2017 to 2024
 The human genome contains approximately genes.  At any given moment, each of our cells has some combination of these genes turned on & others.
محاضرة عامة التقنيات الحيوية (هندسة الجينات .. مبادئ وتطبيقات)
Techniques for Analyzing DNA
Genomes and Their Evolution
VISUALIZING COMPLEX BACTERIAL POPULATIONS IN ANIMAL MODELS
Altered Caspase-8 Expression
Presentation transcript:

DNA sequencing, big data and health Mikael Huss Science for Life Laboratory / Stockholm Follow the Data blog: Stockholm Big Data Meetup #2

All* living organisms have DNA as their blueprint GTTACGTAACCGTTACGTA….. CCTTGATCGTAAC…. Etc. (2x3 billion letters for humans) *OK, some viruses have RNA ?

Science For Life Laboratory or SciLifeLab (Karolinska Institute Science Park, Solna) Joint research centre between KI, Royal Inst of Tech (KTH), Stockholm Uni., Uppsala Uni.. Presently sequencing ~3 megabases per second Corresponding to about 3 human genome sizes per hour

Mount Sinai Medical Center / Eric Schadt

Exploring the human microbiome Estimated 10x more bacterial cells than human cells in human body

Environmental samples: soil, ocean etc Identifying new viruses in human or environmental samples; <1% known so far

“Big data” in genomics: Data is often “transposed” compared to other “big data” types Genomics: Few samples, collected at great cost, information rich Example: 20 tissue samples x 30,000 features (genes); “large p, small n” Twitter, log files, purchase data etc.: Lots of samples, cheap, low information content Example: 200,000,000 tweets x 150 features (words) Analysis challenges: “large p, small n” Samples are hard to come by and expensive to collect, although you get a lot of information about each sample Hard to get enough data for statistics  extra important to share data and analysis methods globally Not enough people looking at the data that has been generated already

Analysis challenges: Dealing with the size of raw data Growth in sequencing capacity has outstripped Moore’s law Need to throw away data  Tailored streaming / approximate algorithms The Economist

Personal sequencing? Genomics apps

Predictive modelling competition for breast cancer prognosis

Community genomics & crowdsourced clinical trials

Coming challenges: ecology and lifestyle Perhaps: “genomic observatories” continuously monitoring environmental DNA  streaming, real-time analysis important Genes – Epigenetics – Lifestyle - Environment Understanding the interplay of lifestyle (including environment) and genes through the “interface layer”, epigenetics. Massive correlational analyses …

Thanks for listening!