“Using Genetic Sequencing to Unravel the Dynamics of Your Superorganism Body” Weekly Bioinformatics Seminar Series UC San Diego La Jolla, CA October 17,

Slides:



Advertisements
Similar presentations
A Systems Approach to Personalized Medicine Talk and Discussion NASA Ames Mountain View, CA March 28, 2013 Dr. Larry Smarr Director, California Institute.
Advertisements

The Quantified Self: Personal Monitoring and the Control of Health Interview by Andre de Fusco Future in Review 2011 Laguna Beach, CA May 26, 2011 Dr.
Reading Out the State of the Body and How it Changes Under Therapy Guest Lecture Pharmacy Informatics 2013 University of California San Diego June 7, 2013.
Large Memory High Performance Computing Enables Comparison Across Human Gut Microbiome of Patients with Autoimmune Diseases and Healthy Subjects XSEDE.
Deep Self - Quantifying the State of Your Body Invited Talk NextMed / MMVR20 San Diego February 21, 2013 Dr. Larry Smarr Director, California Institute.
“Tracking Immune Biomarkers and the Human Gut Microbiome: Inflammation, Crohn's Disease, and Colon Cancer” USC Monthly Seminar Series Physical Sciences.
Exploring Our Inner Universe Using Supercomputers and Gene Sequencers Physics Department Colloquium UC San Diego October 24, 2013 Dr. Larry Smarr Director,
Discussion Janssen La Jolla Research and Development La Jolla, CA
Leveraging Biomedical Big Data: Quantified Self & Beyond Invited Talk FutureMed Singularity University NASA Ames Campus February 5, 2013 Dr. Larry Smarr.
Personal Data Tracking and the Digital Transformation of Healthcare Invited Talk University of Illinois Silicon Valley Round Table Palo Alto, CA December.
The Microbiome: What’s the immune system got to do with it?
“The Systems Biology Dynamics of the Human Immune System and Gut Microbiome” Invited Talk UCI Systems Biology Seminar Series Irvine, CA October 14, 2013.
“Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each of Us” Invited Talk New Applications of Computer Analysis to Biomedical.
“Finding the Patterns in the Big Data From Human Microbiome Ecology” Invited Talk Exponential Medicine November 10, 2014 Dr. Larry Smarr Director, California.
“Personalized Medicine, Colorectal Cancer and Gut Bacteria”
The Microbiome and Metagenomics
Correlation Between Bacteria and Inflammatory Bowel Diseases
“Quantifying Your Superorganism Body Using Big Data Supercomputing” Ken Kennedy Institute Distinguished Lecture Rice University Houston, TX November 12,
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology Metagenomics Center for Earth Observations and Applications Advisory Committee.
“The Quantified Self Movement: The Technologies That Are Revolutionizing Health and Fitness” Panel Discussion MIT Enterprise Forum San Diego UC San Diego.
“Discovering the Other 90% of our Human Superorganism” Remote Video Lecture to The eResearch Australasia Conference 2014 Melbourne, Australia October 28,
“Inflammation, Gut Microbiome, Bacteriophages, and the Initiation of Colorectal Cancer” Seminar Lecture City of Hope Pasadena, CA October 20, 2014 Dr.
My N=1 Experience Pioneer Session: "N=1: Pioneers of Self-Tracking“ Panel at the Genomes, Environment, and Traits Conference Harvard Medical School Cambridge,
“Mapping the Human Gut Microbiome in Health and Disease Using Sequencing, Supercomputing, and Data Analysis” Invited Talk Delivered by Mehrdad Yazdani,
“The Quantified Self: From Idiosyncratic Hobby to an Emerging Growth Industry” Invited Lecture Science & Technology Discovery Series Technology Alliance.
"Towards Digitally Enabled Genomic Medicine" Distinguished Lecture Series Department of Computer Science and Engineering UC San Diego October 15, 2012.
“Measuring the Human Brain-Gut Microbiome-Immune System Dynamics: a Big Data Challenge” Plenary Talk 45 th Annual Meeting of the Behavior Genetics Association.
“The Digital Transformation of Healthcare”
“Big Data and Superorganism Genomics – Microbial Metagenomics Meets Human Genomics” NGS and the Future of Medicine Illumina Headquarters La Jolla, CA February.
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
“Quantifying The Dynamics of Your Superorganism Body Using Big Data Supercomputing” Distinguished Lecturer Series Computer Science and Engineering.
“The Deeply Quantified Self: A Case Study” Future Technology Keynote Minimally Invasive Surgery Week 2015 Society of Laparoendoscopic Surgeons New York.
“Quantified Health and Disease” Lecture for the Osher Lifetime Learning Institute UCSD Extension Calit2’s Qualcomm Institute, UCSD La Jolla, CA February.
“Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each of Us” Invited Talk Ayasdi Menlo Park, CA December 5, 2014 Dr. Larry Smarr.
“Toward Novel Human Microbiome Surveillance Diagnostics to Support Public Health” Invited Talk Institute for Public Health University of California San.
“Tracking Large Variations in My Immune Biomarkers and My Gut Microbiome: Inflammation, Crohn's Disease, and Colon Cancer” IBD Conference Speaker Series.
“Quantified Self- On Being a Personal Genomic Observatory” Keynote in the “Humans as Genomic Observatories” Meeting Session in the Genomics Standards Consortium.
“The Human Microbiome and the Revolution in Digital Health” The Florida Institute for Human and Machine Cognition Pensacola Evening Lecture Series Pensacola,
“Using Supercomputing & Advanced Analytic Software to Discover Radical Changes in the Human Microbiome in Health and Disease” Invited Remote Presentation.
“Creating a High Performance Cyberinfrastructure to Support Analysis of Illumina Metagenomic Data” DNA Day Department of Computer Science and Engineering.
“Comparative Human Microbiome Analysis” Remote Video Talk to CICESE Big Data, Big Network Workshop Ensenada, Mexico October 10, 2013 Dr. Larry Smarr Director,
The Human Microbiome: PSC, IBD, and the Gut-Liver Axis
“Individual, Consumer-Driven Care of the Future -- Taking Wellness One Step Further” Closing Keynote Address The World Congress 2 nd Annual Leadership.
“Inspired by Carl: Exploring the Microbial Dynamics Within” Invited Talk Looking in the Right Direction: Carl Woese and the New Biology University of Illinois,
“Living in a Microbial World” Global Health Program Council on Foreign Relations New York, NY April 10, 2014 Dr. Larry Smarr Director, California Institute.
“How Studying Astrophysics and Coral Reefs Enabled Me to Become an Empowered, Engaged Patient” Invited Talk FutureMed at the Hotel Del Coronado, CA November.
“Frontiers of Self-Tracking” Plenary Talk Quantified Self Conference 2012 Stanford University September 15, 2012 Dr. Larry Smarr Director, California Institute.
“Deciphering the Dynamic Coupling of the Human Immune System and the Gut Microbiome” Overview Data-Enabled Life Sciences Research (DELSA) DELSA Workshop.
“Observing the Dynamics of the Human Immune System Coupled to the Microbiome in Health and Disease” CASIS Workshop on Biomedical Research Aboard the ISS.
“Quantifying Your Superorganism Body Using Big Data Supercomputing” ACM International Workshop on Big Data in Life Sciences BigLS 2014 Newport Beach, CA.
“Assay Lab Within Your Body: Biometrics and Biomes” Invited Lecture TSensors Summit La Jolla, CA November 12, 2014 Dr. Larry Smarr Director, California.
“Discovering the Other 90% of our Human Superorganism” Remote Video Lecture to The eResearch Australasia Conference 2014 Melbourne, Australia October 28,
“Quantifying the Time Progression of the Interaction of the Human Immune System with the Gut Microbiome” Research Council Presentation UC San Diego Health.
Lecture Science & Entertainment Exchange National Academy of Sciences Los Angeles June 13, 2013 Dr. Larry Smarr Director, California Institute for Telecommunications.
“Know Thyself: Quantifying Your Human Body and Its One Hundred Trillion Microbes” Understanding Cultures and Addressing Disparities in Society: Degrees.
“Adding Consumer-Generated and Microbiome Data to the Electronic Medical Record” Using Big Data to Advance Healthcare Panel National Health Policy Conference.
“Quantifying Your Dynamic Human Body (Including Its Microbiome), Will Move Us From a Sickcare System to a Healthcare System” Invited Presentation Microbiology.
Keynote Presentation Cavendish Global Health Impact Forum
“Connecting Body Time Series to Macro Body Changes”
“Analyzing the Human Gut Microbiome Dynamics in Health and Disease Using Supercomputers and Supernetworks” Invited Presentation ESnet CrossConnects Bioinformatics.
“Linking Phenotype Changes to Internal/External Longitudinal Time Series in a Single Human” Invited Presentation at EMBC ‘16 38th International Conference.
“Machine Learning in Healthcare Diagnostics”
Briefing for Dell Analytics Team Calit2’s Qualcomm Institute
Invited Presentation Machine Learning in Healthcare
H = -Σpi log2 pi.
Metagenomics and Personalized Medicine
Daniel A. Peterson, Daniel N. Frank, Norman R. Pace, Jeffrey I. Gordon 
Daniel A. Peterson, Daniel N. Frank, Norman R. Pace, Jeffrey I. Gordon 
Metagenomics and Personalized Medicine
Toward Accurate and Quantitative Comparative Metagenomics
Presentation transcript:

“Using Genetic Sequencing to Unravel the Dynamics of Your Superorganism Body” Weekly Bioinformatics Seminar Series UC San Diego La Jolla, CA October 17, 2013 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD 1

Abstract The human body is host to 100 trillion microorganisms, ten times the number of cells in the human body, and these microbes contain 100 times the number of DNA genes that our human DNA does. The microbial component of this "superorganism" is comprised of hundreds of species spread over many taxonomic phyla. The human immune system is tightly coupled with this microbial ecology and in cases of autoimmune disease, both the host immune system and the microbial ecology can have excursions far from normal. I will review some of the known 163 SNPs in the human genome which pre-dispose the host to develop autoimmune inflammatory bowel disease (IBD). Motivated by a diagnosis that I have Crohn’s disease, a form of IBD, I have been collecting massive amounts of data on my own body over the last five years. Analysis and graphing of this data demonstrates the episodic evolution of this coupled immune-microbial system. To decode the details of the microbial ecology requires high resolution genome sequencing feeding Big Data parallel supercomputers coupled to scalable visualization systems. The complexities of my time-varying microbial ecology will be compared to the NIH Human Microbiome Program data on people in states of health and disease.

By Measuring the State of My Body and “Tuning” It Using Nutrition and Exercise, I Became Healthier 2000 Age Age Age I Arrived in La Jolla in 2000 After 20 Years in the Midwest and Decided to Move Against the Obesity Trend I Reversed My Body’s Decline By Quantifying and Altering Nutrition and Exercise

From One to a Billion Data Points Defining Me: The Exponential Rise in Body Data in Just One Decade! Billion: My Full DNA, MRI/CT Images Million: My DNA SNPs, Zeo, FitBit Hundred: My Blood Variables One: My Weight Weight Blood Variables SNPs Microbial Genome Improving Body Discovering Disease Each is a Personal Time Series And Compared Across Population

Visualizing Time Series of 150 LS Blood and Stool Variables, Each Over 5-10 Years Calit2 64 megapixel VROOM

I Discovered I Had Episodic Chronic Inflammation by Tracking Complex Reactive Protein In My Blood Samples Normal Range <1 mg/L Normal 27x Upper Limit Antibiotics CRP is a Generic Measure of Inflammation in the Blood

By Adding Stool Samples, I Discovered I Had High Levels of the Protein Lactoferrin Shed from Neutrophils Normal Range <7.3 µg/mL 124x Upper Limit Antibiotics Lactoferrin is a Protein Shed from Neutrophils - An Antibacterial that Sequesters Iron Typical Lactoferrin Value for Active IBD

Descending Colon Sigmoid Colon Threading Iliac Arteries Major Kink Confirming the IBD (Crohn’s) Hypothesis: Finding the “Smoking Gun” with MRI Imaging I Obtained the MRI Slices From UCSD Medical Services and Converted to Interactive 3D Working With Calit2 Staff & DeskVOX Software Transverse Colon Liver Small Intestine Diseased Sigmoid Colon Cross Section MRI Jan 2012

MRE Reveals Inflammation in 6 Inches of Sigmoid Colon Thickness 15cm – 5x Normal Thickness “Long segment wall thickening in the proximal and mid portions of the sigmoid colon, extending over a segment of approximately 16 cm, with suggestion of intramural sinus tracts. Edema in the sigmoid mesentery and engorgement of the regional vasa recta.” – MRI report Clinical MRI Slice Program DeskVOX 3D Image Crohn's disease affects the thickness of the intestinal wall. Having Crohn's disease that affects your colon increases your risk of colon cancer.

Colonoscopy Images Show Inflamed Pseudopolyps in 6 inches of Sigmoid Colon Dec 2010 May 2011

Why Did I Have an Autoimmune Disease like IBD? Despite decades of research, the etiology of Crohn's disease remains unknown. Its pathogenesis may involve a complex interplay between host genetics, immune dysfunction, and microbial or environmental factors. --The Role of Microbes in Crohn's Disease Paul B. Eckburg & David A. Relman Clin Infect Dis. 44: (2007) So I Set Out to Quantify All Three!

I Wondered if Crohn’s is an Autoimmune Disease, Did I Have a Personal Genomic Polymorphism? From SNPs Associated with CD Polymorphism in Interleukin-23 Receptor Gene — 80% Higher Risk of Pro-inflammatory Immune Response NOD2 ATG16L1 IRGM Now Comparing 163 Known IBD SNPs with 23andme SNP Chip

Variance Explained by Each of the 163 SNPs Associated with IBD The width of the bar is proportional to the variance explained by that locus Bars are connected together if they are identified as being associated with both phenotypes Loci are labelled if they explain more than 1% of the total variance explained by all loci “Host–microbe interactions have shaped the genetic architecture of inflammatory bowel disease,” Jostins, et al. Nature 491, (2012)

Crohn’s May be a Related Set of Diseases Driven by Different SNPs Me-Male CD Onset At 60-Years Old Female CD Onset At 20-Years Old NOD2 (1) rs Il-23R rs

I Had My Full Human Genome Sequenced in Million/Year by My Anonymized Human Genome is Available for Download PGP Used Complete Genomics, Inc. to Sequence my Human DNA Next Step: Compare Full Genome With IBD SNPs

Fine Time Resolution Sampling Reveals Unexpected Dynamics of Innate and Adaptive Immune System Normal Time Points of Metagenomic Sequencing of LS Stool Samples Therapy: 1 Month Antibiotics +2 Month Prednisone Innate Immune System Normal Adaptive Immune System

LS Cultured Bacterial Abundance Reveals Dynamic Microbiome Dysfunction Time Points of Metagenomic Sequencing of LS Stool Samples

Next: Analyze the Dynamics of My Microbiome Ecology- 85% of the Species Can Not Be Cultured Inclusion of the Microbiome Will Radically Change Medicine 99% of Your DNA Genes Are in Microbe Cells Not Human Cells Your Body Has 10 Times As Many Microbe Cells As Human Cells

To Map My Gut Microbes, I Sent a Stool Sample to the Venter Institute for Metagenomic Sequencing Gel Image of Extract from Smarr Sample-Next is Library Construction Manny Torralba, Project Lead - Human Genomic Medicine J Craig Venter Institute January 25, 2012 Shipped Stool Sample December 28, 2011 I Received a Disk Drive April 3, 2012 With 35 GB FASTQ Files Weizhong Li, UCSD NGS Pipeline: 230M Reads Only 0.2% Human Required 1/2 cpu-yr Per Person Analyzed! Sequencing Funding Provided by UCSD School of Health Sciences

We Created a Reference Database Of Known Gut Genomes NCBI April 2013 –2471 Complete Draft Bacteria & Archaea Genomes –2399 Complete Virus Genomes –26 Complete Fungi Genomes –309 HMP Eukaryote Reference Genomes Total 10,741 genomes, ~30 GB of sequences Now to Align Our 12.5 Billion Reads Against the Reference Database Source: Weizhong Li, Sitao Wu, CRBS, UCSD

Computational NextGen Sequencing Pipeline: From “Big Equations” to “Big Data” Computing PI: (Weizhong Li, CRBS, UCSD): NIH R01HG ( , $1.1M)

We Used SDSC’s Gordon Data-Intensive Supercomputer to Analyze a Wide Range of Gut Microbiomes ~180,000 Core-Hrs on Gordon –KEGG function annotation: 90,000 hrs –Mapping: 36,000 hrs –Used 16 Cores/Node and up to 50 nodes –Duplicates removal: 18,000 hrs –Assembly: 18,000 hrs –Other: 18,000 hrs Gordon RAM Required –64GB RAM for Reference DB –192GB RAM for Assembly Gordon Disk Required –Ultra-Fast Disk Holds Ref DB for All Nodes –8TB for All Subjects Enabled by a Grant of Time on Gordon from SDSC Director Mike Norman

A Significant Fraction of the Reads Do Not Map Onto The Reference Genome Set Source: Weizhong Li, CRBS, UCSD

Phyla Gut Microbial Abundance Without Viruses: LS, Crohn’s, UC, and Healthy Subjects Crohn’s Ulcerative Colitis Healthy LS Toward Noninvasive Microbial Ecology Diagnostics Source: Weizhong Li, Sitao Wu, CRBS, UCSD

Using Scalable Visualization Allows Comparison of the Relative Abundance of 200 Microbe Species Calit2 VROOM-FuturePatient Expedition Comparing 3 LS Time Snapshots (Left) with Healthy, Crohn’s, UC (Right Top to Bottom)

Comparison of 35 Healthy to 15 CD and 6 UC Gut Microbiomes at the Phyla Level Explosion of Proteobacteria Collapse of Bacteroidetes Expansion of Actinobacteria

Time Series Reveals Autoimmune Dynamics of Gut Microbiome by Phyla Therapy Six Metagenomic Time Samples Over 16 Months

From Taxonomy to Function: Analysis of LS Clusters of Orthologous Groups (COGs) Analysis: Weizhong Li & Sitao Wu, UCSD

The Adult Healthy Gut Microbiome Is Remarkably Stable Over Time Source: Eric Alm, MIT

Lessons from Ecological Dynamics I: Gut Microbiome Has Multiple Relatively Stable Equilibria “The Application of Ecological Theory Toward an Understanding of the Human Microbiome,” Elizabeth Costello, Keaton Stagaman, Les Dethlefsen, Brendan Bohannan, David Relman Science 336, (2012)

Lessons From Ecological Dynamics II: Invasive Species Dominate After Major Species Destroyed ”In many areas following these burns invasive species are able to establish themselves, crowding out native species.” invasive species Source: Ponderosa Pine Fire Ecology

Almost All Abundant Species (≥1%) in Healthy Subjects Are Severely Depleted in Larry’s Gut Microbiome

Top 20 Most Abundant Microbial Species In LS vs. Average Healthy Subject 152x 765x 148x 849x 483x 220x 201x 522x 169x Number Above LS Blue Bar is Multiple of LS Abundance Compared to Average Healthy Abundance Per Species Source: Sequencing JCVI; Analysis Weizhong Li, UCSD LS December 28, 2011 Stool Sample

The Dramatic Bloom of Enterobacteriaceae bacterium 9_2_54FAA 21,000x LS5  LS6 1,000x This Microbe is a Proteobacteria Targeted by the NIH HMP

Focusing in on the Dynamical Change Within Proteobacteria

Inflammation Enables Anaerobic Respiration Which Leads to Phylum-Level Shifts in the Gut Microbiome Sebastian E. Winter, Christopher A. Lopez & Andreas J. Bäumler, EMBO reports VOL 14, p (2013)

E. coli/Shigella Phylogenetic Tree Miquel, et al. PLOS ONE, v. 5, p (2010) Does Intestinal Inflammation Select for Pathogenic Strains That Can Induce Further Damage? “Adherent-invasive E. coli (AIEC) are isolated more commonly from the intestinal mucosa of individuals with Crohn’s disease than from healthy controls.” “Thus, the mechanisms leading to dysbiosis might also select for intestinal colonization with more harmful members of the Enterobacteriaceae* —such as AIEC— thereby exacerbating inflammation and interfering with its resolution.” Sebastian E. Winter, et al., EMBO reports VOL 14, p (2013) *Family Containing E. coli AIEC LF82

Chronic Inflammation Can Accumulate Cancer-Causing Bacteria in the Human Gut Escherichia coli Strain NC101

Phylogenetic Tree 778 Ecoli strains =6x our 2012 Set D A B1 B2 E S Deep Metagenomic Sequencing Enables Strain Analysis

We Divided the 778 E. coli Strains into 40 Groups, Each of Which Had 80% Identical Genes LS00 1 LS00 2 LS00 3 Median CD Median UC Median HE Group 0: D Group 2: E Group 3: A, B1 Group 4: B1 Group 5: B2 Group 7: B2 Group 9: S Group 18,19,20: S Group 26: B2 LF82 NC101

Reduction in E. coli Over Time With Major Shifts in Strain Abundance Strains >0.5% Included Therapy

Next Step: Time Series of Metagenomic Gut Microbiomes and Immune Variables in an N=100 Clinic Trial Goal: Understand The Coupled Human Immune-Microbiome Dynamics In the Presence of Human Genetic Predispositions

Thanks to Our Great Team! UCSD Metagenomics Team Weizhong Li Sitao Wu Future Patient Team Jerry Sheehan Tom DeFanti Kevin Patrick Jurgen Schulze Andrew Prudhomme Philip Weber Fred Raab Joe Keefe Ernesto Ramirez JCVI Team Karen Nelson Shibu Yooseph Manolito Torralba SDSC Team Michael Norman Mahidhar Tatineni Robert Sinkovits