“Comparative Human Microbiome Analysis” Remote Video Talk to CICESE Big Data, Big Network Workshop Ensenada, Mexico October 10, 2013 Dr. Larry Smarr Director,

Slides:



Advertisements
Similar presentations
A Systems Approach to Personalized Medicine Talk and Discussion NASA Ames Mountain View, CA March 28, 2013 Dr. Larry Smarr Director, California Institute.
Advertisements

Sequencing Genomics: The New Big Data Driver IntermezzoTalk SURFnet7, Part of GigaPort3 Utrecht, Netherlands December 7, 2011 Dr. Larry Smarr Director,
Reading Out the State of the Body and How it Changes Under Therapy Guest Lecture Pharmacy Informatics 2013 University of California San Diego June 7, 2013.
Large Memory High Performance Computing Enables Comparison Across Human Gut Microbiome of Patients with Autoimmune Diseases and Healthy Subjects XSEDE.
“Tracking Immune Biomarkers and the Human Gut Microbiome: Inflammation, Crohn's Disease, and Colon Cancer” USC Monthly Seminar Series Physical Sciences.
Exploring Our Inner Universe Using Supercomputers and Gene Sequencers Physics Department Colloquium UC San Diego October 24, 2013 Dr. Larry Smarr Director,
Discussion Janssen La Jolla Research and Development La Jolla, CA
Leveraging Biomedical Big Data: Quantified Self & Beyond Invited Talk FutureMed Singularity University NASA Ames Campus February 5, 2013 Dr. Larry Smarr.
“Building US/Mexico Collaborations Using Optical Networks” Opening Workshop Welcome Big Data Big Network 2 Calit2’s Qualcomm Institute February 10, 2014.
“The Systems Biology Dynamics of the Human Immune System and Gut Microbiome” Invited Talk UCI Systems Biology Seminar Series Irvine, CA October 14, 2013.
“Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each of Us” Invited Talk New Applications of Computer Analysis to Biomedical.
“Finding the Patterns in the Big Data From Human Microbiome Ecology” Invited Talk Exponential Medicine November 10, 2014 Dr. Larry Smarr Director, California.
“Personalized Medicine, Colorectal Cancer and Gut Bacteria”
The Microbiome and Metagenomics
“Quantifying Your Superorganism Body Using Big Data Supercomputing” Ken Kennedy Institute Distinguished Lecture Rice University Houston, TX November 12,
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology Metagenomics Center for Earth Observations and Applications Advisory Committee.
“The Quantified Self Movement: The Technologies That Are Revolutionizing Health and Fitness” Panel Discussion MIT Enterprise Forum San Diego UC San Diego.
“Discovering the Other 90% of our Human Superorganism” Remote Video Lecture to The eResearch Australasia Conference 2014 Melbourne, Australia October 28,
“Inflammation, Gut Microbiome, Bacteriophages, and the Initiation of Colorectal Cancer” Seminar Lecture City of Hope Pasadena, CA October 20, 2014 Dr.
My N=1 Experience Pioneer Session: "N=1: Pioneers of Self-Tracking“ Panel at the Genomes, Environment, and Traits Conference Harvard Medical School Cambridge,
“Mapping the Human Gut Microbiome in Health and Disease Using Sequencing, Supercomputing, and Data Analysis” Invited Talk Delivered by Mehrdad Yazdani,
“The Quantified Self: From Idiosyncratic Hobby to an Emerging Growth Industry” Invited Lecture Science & Technology Discovery Series Technology Alliance.
“Measuring the Human Brain-Gut Microbiome-Immune System Dynamics: a Big Data Challenge” Plenary Talk 45 th Annual Meeting of the Behavior Genetics Association.
“The Digital Transformation of Healthcare”
“Big Data and Superorganism Genomics – Microbial Metagenomics Meets Human Genomics” NGS and the Future of Medicine Illumina Headquarters La Jolla, CA February.
“Quantifying The Dynamics of Your Superorganism Body Using Big Data Supercomputing” Distinguished Lecturer Series Computer Science and Engineering.
“The Deeply Quantified Self: A Case Study” Future Technology Keynote Minimally Invasive Surgery Week 2015 Society of Laparoendoscopic Surgeons New York.
“Quantified Health and Disease” Lecture for the Osher Lifetime Learning Institute UCSD Extension Calit2’s Qualcomm Institute, UCSD La Jolla, CA February.
“Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each of Us” Invited Talk Ayasdi Menlo Park, CA December 5, 2014 Dr. Larry Smarr.
“Toward Novel Human Microbiome Surveillance Diagnostics to Support Public Health” Invited Talk Institute for Public Health University of California San.
“Tracking Large Variations in My Immune Biomarkers and My Gut Microbiome: Inflammation, Crohn's Disease, and Colon Cancer” IBD Conference Speaker Series.
“An Integrated Science Cyberinfrastructure for Data-Intensive Research” Panel CISCO Executive Symposium San Diego, CA June 9, 2015 Dr. Larry Smarr Director,
“Quantified Self- On Being a Personal Genomic Observatory” Keynote in the “Humans as Genomic Observatories” Meeting Session in the Genomics Standards Consortium.
“The Human Microbiome and the Revolution in Digital Health” The Florida Institute for Human and Machine Cognition Pensacola Evening Lecture Series Pensacola,
“Calit2: A UC Experiment for Living in the Future" Talk to UCSD Near You La Jolla, CA April 11, 2006 Dr. Larry Smarr Director, California Institute.
“Using Supercomputing & Advanced Analytic Software to Discover Radical Changes in the Human Microbiome in Health and Disease” Invited Remote Presentation.
“Creating a High Performance Cyberinfrastructure to Support Analysis of Illumina Metagenomic Data” DNA Day Department of Computer Science and Engineering.
Developing a North American Global LambdaGrid Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E.
“Individual, Consumer-Driven Care of the Future -- Taking Wellness One Step Further” Closing Keynote Address The World Congress 2 nd Annual Leadership.
Innovative Research Alliances Invited Talk IUCRP Fellows Seminar UCSD La Jolla, CA July 10, 2006 Dr. Larry Smarr Director, California Institute for Telecommunications.
“Inspired by Carl: Exploring the Microbial Dynamics Within” Invited Talk Looking in the Right Direction: Carl Woese and the New Biology University of Illinois,
“Living in a Microbial World” Global Health Program Council on Foreign Relations New York, NY April 10, 2014 Dr. Larry Smarr Director, California Institute.
“How Studying Astrophysics and Coral Reefs Enabled Me to Become an Empowered, Engaged Patient” Invited Talk FutureMed at the Hotel Del Coronado, CA November.
“Deciphering the Dynamic Coupling of the Human Immune System and the Gut Microbiome” Overview Data-Enabled Life Sciences Research (DELSA) DELSA Workshop.
“Observing the Dynamics of the Human Immune System Coupled to the Microbiome in Health and Disease” CASIS Workshop on Biomedical Research Aboard the ISS.
“Quantifying Your Superorganism Body Using Big Data Supercomputing” ACM International Workshop on Big Data in Life Sciences BigLS 2014 Newport Beach, CA.
“Assay Lab Within Your Body: Biometrics and Biomes” Invited Lecture TSensors Summit La Jolla, CA November 12, 2014 Dr. Larry Smarr Director, California.
“Discovering the Other 90% of our Human Superorganism” Remote Video Lecture to The eResearch Australasia Conference 2014 Melbourne, Australia October 28,
“Quantifying the Time Progression of the Interaction of the Human Immune System with the Gut Microbiome” Research Council Presentation UC San Diego Health.
“The Pacific Research Platform: a Science-Driven Big-Data Freeway System.” Big Data for Information and Communications Technologies Panel Presentation.
“CAMERA Goes Live!" Presentation with Craig Venter National Press Club Washington, DC March 13, 2007 Dr. Larry Smarr Director, California Institute for.
“The UCSD Big Data Freeway System” Invited Short Talk Workshop on “Enriching Human Life and Society” UC San Diego February 6, 2014 Dr. Larry Smarr Director,
Lecture Science & Entertainment Exchange National Academy of Sciences Los Angeles June 13, 2013 Dr. Larry Smarr Director, California Institute for Telecommunications.
“Know Thyself: Quantifying Your Human Body and Its One Hundred Trillion Microbes” Understanding Cultures and Addressing Disparities in Society: Degrees.
“Using Genetic Sequencing to Unravel the Dynamics of Your Superorganism Body” Weekly Bioinformatics Seminar Series UC San Diego La Jolla, CA October 17,
“Adding Consumer-Generated and Microbiome Data to the Electronic Medical Record” Using Big Data to Advance Healthcare Panel National Health Policy Conference.
“Genomics: The CAMERA Project" Invited Talk 5 th Annual ON*VECTOR International Photonics Workshop UCSD February 28, 2006 Dr. Larry Smarr Director,
High Performance Cyberinfrastructure Discovery Tools for Data Intensive Research Larry Smarr Prof. Computer Science and Engineering Director, Calit2 (UC.
“OptIPuter: From the End User Lab to Global Digital Assets" Panel UC Research Cyberinfrastructure Meeting October 10, 2005 Dr. Larry Smarr.
“ Building an Information Infrastructure to Support Microbial Metagenomic Sciences " Presentation to the NBCR Research Advisory Committee UCSD La Jolla,
“Quantifying Your Dynamic Human Body (Including Its Microbiome), Will Move Us From a Sickcare System to a Healthcare System” Invited Presentation Microbiology.
Keynote Presentation Cavendish Global Health Impact Forum
“Connecting Body Time Series to Macro Body Changes”
“Analyzing the Human Gut Microbiome Dynamics in Health and Disease Using Supercomputers and Supernetworks” Invited Presentation ESnet CrossConnects Bioinformatics.
“Linking Phenotype Changes to Internal/External Longitudinal Time Series in a Single Human” Invited Presentation at EMBC ‘16 38th International Conference.
“Machine Learning in Healthcare Diagnostics”
Briefing for Dell Analytics Team Calit2’s Qualcomm Institute
Invited Presentation Machine Learning in Healthcare
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Toward Accurate and Quantitative Comparative Metagenomics
Presentation transcript:

“Comparative Human Microbiome Analysis” Remote Video Talk to CICESE Big Data, Big Network Workshop Ensenada, Mexico October 10, 2013 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD 1

Abstract We are carrying out very deep metagenomic sequencing of human gut microbiomes from healthy subjects and from people with the autoimmune Inflammatory Bowel Disease. We compare one subject with IBD to metagenomic datasets downloaded from the NIH Human Microbiome Project repository, including 35 healthy subjects and 20 with IBD. We also analyze the changes in this one subject over multiple times, including comparing before and after drug therapy. The dataset of Illumina short reads for one person is ~10GB. The total comparison dataset contains ~0.5 trillion DNA bases. These Big Data had to be moved across the network to the San Diego Supercomputer Center where over 200,000 cpu-hours were consumed in the analysis and then back to Calit2 where a 64 megapixel wall was used for visual analysis. This approach could be extended for cross-border comparisons of human gut microbiomes to examine differences in food intake and various disease states. Larry Smarr is the Harry E. Gruber Professor in the Department of Computer Science and Engineering of the Jacobs School of Engineering at UC San Diego. He was the founding director of the California Institute for Telecommunications and Information Technology in 2000 and of the National Center for Supercomputing Applications in Weizhong Li currently leads a group of researchers funded by NIH and NSF at the Center for Research in Biological System in UC San Diego. He has more than 20 years of experience in bioinformatics, computational biology, and computational chemistry.

Your Body Has 10 Times As Many Microbe Cells As Human Cells Inclusion of the Microbiome Will Radically Change Medicine 99% of Your DNA Genes Are in Microbe Cells Not Human Cells

Gut Microbiome Metagenomic Datasets Comparing Healthy and Diseased States One “Read” = 100 DNA Bases Total of 12.5 Billion Reads! Source: Weizhong Li, CRBS, UCSD

We Created a Reference Database Of Known Gut Genomes NCBI April 2013 –2471 Complete Draft Bacteria & Archaea Genomes –2399 Complete Virus Genomes –26 Complete Fungi Genomes –309 HMP Eukaryote Reference Genomes Total 10,741 genomes, ~30 GB of sequences Now to Align Our 12.5 Billion Reads Against the Reference Database Source: Weizhong Li, Sitao Wu, CRBS, UCSD

Computational NextGen Sequencing Pipeline: From “Big Equations” to “Big Data” Computing PI: (Weizhong Li, CRBS, UCSD): NIH R01HG ( , $1.1M)

Creating a Big Data Freeway System: Coupling ‘Omics Data Generators with Supercomputers Using Optical Fiber with 1000x Shared Internet Speeds

We Used SDSC’s Gordon Data-Intensive Supercomputer to Analyze a Wide Range of Gut Microbiomes ~180,000 Core-Hrs on Gordon –KEGG function annotation: 90,000 hrs –Mapping: 36,000 hrs –Used 16 Cores/Node and up to 50 nodes –Duplicates removal: 18,000 hrs –Assembly: 18,000 hrs –Other: 18,000 hrs Gordon RAM Required –64GB RAM for Reference DB –192GB RAM for Assembly Gordon Disk Required –Ultra-Fast Disk Holds Ref DB for All Nodes –8TB for All Subjects Enabled by a Grant of Time on Gordon from SDSC Director Mike Norman

Comparing 3 LS Time Snapshots (Left) with Healthy, Crohn’s, UC (Right Top to Bottom) Calit2 VROOM-FuturePatient Expedition

Phyla Gut Microbial Abundance Without Viruses: LS, Crohn’s, UC, and Healthy Subjects Crohn’s Ulcerative Colitis Healthy LS Toward Noninvasive Microbial Ecology Diagnostics Source: Weizhong Li, Sitao Wu, CRBS, UCSD

Lessons From Ecological Science: Invasive Species Dominate After Major Species Destroyed ”In many areas following these burns invasive species are able to establish themselves, crowding out native species.” invasive species Source: Ponderosa Pine Fire Ecology

Rare Firmicutes Bloom in Colon Disappearing After Antibiotic/Immunosuppressant Therapy Firmicutes Families LS Time 1 LS Time 2 Healthy Average Parvimonas spp.

Thanks to Our Great Team! UCSD Metagenomics Team Weizhong Li Sitao Wu Future Patient Team Jerry Sheehan Tom DeFanti Kevin Patrick Jurgen Schulze Andrew Prudhomme Philip Weber Fred Raab Joe Keefe Ernesto Ramirez JCVI Team Karen Nelson Shibu Yooseph Manolito Torralba SDSC Team Michael Norman Mahidhar Tatineni Robert Sinkovits