Meta’omic functional profiling with HUMAnN Curtis Huttenhower 01-19-15 Harvard School of Public Health Department of Biostatistics U. Oregon.

Slides:



Advertisements
Similar presentations
MY NCBI (module 4.5). MODULE 4.5 PubMed/How to Use MY NCBI Instructions - This part of the: course is a PowerPoint demonstration intended to introduce.
Advertisements

Mouse Tutorial for Win XP In this tutorial you will learn to: Move the mouse Click and double click Make selections Manage windows Use menus Click the.
SMART WEB USER GUIDE Log-in and Password Directory Search Sending Pages Changing Pager Status Assigning Pager Coverage, Referral, and Page Block Reviewing.
Meta’omic functional profiling with HUMAnN Curtis Huttenhower Harvard School of Public Health Department of Biostatistics U. Oregon META Center.
Lab6 – Debug Assembly Language Lab
Amplicon functional profiling with PICRUSt
Computational metagenomics and the human microbiome Curtis Huttenhower Harvard School of Public Health Department of Biostatistics.
Scaffold Download free viewer:
Google Confidential and Proprietary 1 Intro to Docs Google Apps Apps.
Introduction to Word 2013 What new features does Word 2013 offer? Introducing the “Landing Page”. Pinning/Unpinning recently used documents within the.
Ogden Air Logistics Center. Purpose of Excel2FV Many agencies produce point lists of different data (target lists, force locations, etc.) in either Excel.
Downloading and Installing PAF Insight PAF Insight can be easily downloaded Or can be installed from a CD A license is needed t0 activate the program.
Customer Portal – Customer User. You will receive an indicating that your Customer Portal registration is complete. A link to the Customer Portal,
C O M P U T E R G R A P H I C S Jie chen Computer graphic -- OpenGL Howto.
The trkNavigator plugin 22/10/ CERN - DataBase meeting Simone Paoletti INFN - Sez. Firenze.
Giuseppe D'Auria Norwich September 2014 FISABIO, Valencia Introduction into the processing of raw data.
An Anecdote An artist friend subscribes to AOL for and web services. Recently AOL attempted to make an automatic on- line upgrade to her address.
Processing Lab 3 – Header issues and trace editing Bryce Hutchinson Objectives: Fixing elevation issues Define an LMO function Pick first breaks Kill traces.
Meta’omic functional profiling with ShortBRED Galeb Abu-Ali Curtis Huttenhower Harvard T.H. Chan School of Public Health Department of Biostatistics.
Meta’omic Analysis with MetaPhlAn, HUMAnN, and LEfSe Curtis Huttenhower Harvard School of Public Health Department of Biostatistics.
1 Data Manipulation (with SQL) HRP223 – 2010 October 13, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
IPlant Collaborative Discovery Environment RNA-seq Basic Analysis Log in with your iPlant ID; three orange icons.
11/25/2015Slide 1 Scripts are short programs that repeat sequences of SPSS commands. SPSS includes a computer language called Sax Basic for the creation.
Parsing BLAST output. Output of a local BLAST search “less” program Full path to the BLAST output file.
Download all complete prokaryotic genomes from the NCBI RefSeq database Extract 16S rRNA sequences from each genome. Use UCLUST algorithm to cluster 16S.
Copyright © Curt Hill Tortoise SVN A Subversion Client.
How to Set Up Your Science Fair Blog Gali Elkon Lev.
 We all have less! › Less time. › Less resources.  Skype will help you gain some of your time back. › Instead of having to move and physically ‘meet’
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
ARTSYS Tutorial Welcome! If you are a first-time user, we strongly recommend that you go through the entire tutorial. Just single click or press the page.
Testing External Survey Automatic Credit Granting Shepherd University Department of Psychology.
Meta’omic functional profiling with ShortBRED Curtis Huttenhower Harvard School of Public Health Department of Biostatistics U. Oregon.
CERN, 16/9/2003 S. Paoletti: trkNavigator 1 trkNavigator: A Java client for the Tracker DB 16/9/ CERN - mod. Prod meeting Simone Paoletti INFN -
The Treatment-Naive Microbiome in New-Onset Crohn’s Disease Dirk Gevers, Subra Kugathasan, Lee A. Denson, Yoshiki Vázquez-Baeza, Will Van Treuren, Boyu.
Mail merge I: Use mail merge for mass mailings Overview: Mailings en masse What if you need to send to each of your employees a letter containing personal.
1 Data Manipulation (with SQL) HRP223 – 2009 October 12, 2009 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
Canadian Bioinformatics Workshops
What’s changed, and why Lesson 1 By the end of this lesson you will be able to complete the following: Get a handle on the new look of Excel. Understand.
A gentle introduction to Bootstrap TDAP Jeremy Shafer Department of MIS Fox School of Business Temple University Spring
An Introduction to Meta’omic Analyses Curtis Huttenhower Galeb Abu-Ali Eric Franzosa Harvard T.H. Chan School of Public Health Department of Biostatistics.
Functional profiling with HUMAnN2
Using the bioBakery Curtis Huttenhower
Meta’omic functional profiling with ShortBRED
(PubMed) MY NCBI (Advanced Course: Module 2)
Day 1 on Google Cloud Platform
TDA Direct Certification
Strain profiling with StrainPhlAn and PanPhlAn
An Introduction to Meta’omic Analyses
Automating reproducible analyses with AnADAMA2 and bioBakery Workflows
Functional profiling with HUMAnN2
Workshop on Microbiome and Health
Collaboration with Google Docs
Taxonomic profiling with MetaPhlAn2
Identifying personal microbiomes using metagenomic codes
Installing the HI 6600 into the CompactLogix System.
(PubMed) MY NCBI (Advanced Course: Module 2)
Taxonomic profiling with MetaPhlAn2
Strain profiling with StrainPhlAn
Curtis Huttenhower Galeb Abu-Ali Eric Franzosa
(PubMed) MY NCBI (Advanced Course: Module 2)
Welcome to the Markers Database Tutorial
A Microbiome Foundation for the Study of Crohn’s Disease
This is a setup file for a Jeopardy game.
This is a setup file for a Jeopardy game.
Enter Your Presentation Title
Volume 17, Issue 3, Pages (March 2015)
A Microbiome Foundation for the Study of Crohn’s Disease
This is a setup file for a Jeopardy game.
A typical current computational meta'omic pipeline to analyze and contrast microbial communities. A typical current computational meta'omic pipeline to.
MY NCBI (module 4.5).
Presentation transcript:

Meta’omic functional profiling with HUMAnN Curtis Huttenhower Harvard School of Public Health Department of Biostatistics U. Oregon

2 The two big questions… Who is there? (taxonomic profiling) What are they doing? (functional profiling)

Setup notes reminder Slides with green titles or text include instructions not needed today, but useful for your own analyses Keep an eye out for red warnings of particular importance Command lines and program/file names appear in a monospaced font. Commands you should specifically copy/paste are in monospaced bold blue. 3

What they’re doing: HUMAnN As a broad functional profiler, you could download HUMAnN at: 4 Click here

What they’re doing: HUMAnN...but instead we’ve already downloaded it Expand HUMAnN (no install!) Cd ~/workshop_data/metagenomics tar -xzf ~/workshop_data/metagenomics/biobakery/software/humann-v0.99.tar.gz Set up a link to the KEGG reference DB: ln -s ~/workshop_data/metagenomics/biobakery/data/kegg.reduced.udb And although you would normally download USEARCH from here: – We’re going to use it preinstalled instead 5

What they’re doing: HUMAnN If we weren’t all running this, you’d need to: –Get KEGG – used to be free, now it’s not! Fortunately, we have a HUMAnN-compatible distributable version; contact me... –Index it for USEARCH: usearch -makeudb_usearch kegg.reduced.fasta -output kegg.reduced.udb This takes a minute or two, so we’ve precomputed it; thus, forge ahead... 6

What they’re doing: HUMAnN Did you notice that we didn’t QC our data at all? –MetaPhlAn is very robust to junk sequence –HUMAnN is pretty robust, but not quite as much We’ve already run a standard metagenomic QC: –Quality trim by removing bad bases (typically Q ~15) –Length filter to remove short sequences (typically <75%) 7

What they’re doing: HUMAnN Must start from FASTQ files to do this Quality trim by removing bad bases: FASTX: fastx_trimmer ea-utils: fastq-mcf Length filter by removing short sequences: –75% of original length is standard (thus 75nt from 100nt reads) FASTX: fastx_quality_filter USEARCH: fastq_filter Now convert your FASTQ to a FASTA: FASTX: fastq_to_fasta USEARCH: fastq_filter Some final caveats: –If you’re using paired end reads, match filters! –See my course homeworks at –Aren’t you glad you’re not doing this today? 8

What they’re doing: HUMAnN Enter the humann directory and fetch the data ln -s ~/workshop_data/metagenomics/biobakery/data/ SRS Stool.fasta cd humann-0.99 Run your first translated BLAST search: usearch -usearch_local../ SRS Stool.fasta -db../kegg.reduced.udb -id 0.8 -blast6out input/ SRS Stool.txt What did you just do? less input/ SRS Stool.txt –Recall BLAST’s tab-delimited output headers: qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore Rinse and repeat for the remaining samples 9

What they’re doing: HUMAnN 10

What they’re doing: HUMAnN Normally you’d need to install SCons from: – Instead, we’ll use it preinstalled as well, so... GO! ~/workshop_data/metagenomics/biobakery/software/ext/scons-2.3.2/bin/scons You should see a bunch of text scroll by –Note: you can run scons -j8 to parallelize tasks 11

What they’re doing: HUMAnN After a minute or two, you should see: 12

What they’re doing: HUMAnN This has created four main files: –Two each for pathways (big) and modules (small) –Two each for coverage and relative abundance Each is tab-delimited text with one column per sample All four are in the output directory: output/04a-hit-keg-mpt-cop-nul-nve-nve-xpe.txt Coverage (a) of pathways (t) output/04a-hit-keg-mpm-cop-nul-nve-nve-xpe.txt Coverage (a) of modules (m) output/04b-hit-keg-mpt-cop-nul-nve-nve.txt Abundance (b) of pathways (t) output/04b-hit-keg-mpm-cop-nul-nve-nve.txt Abundance (b) of modules (m) I almost always just use 04b-mpm (module abundances) 13

What they’re doing: HUMAnN Let’s take a look: less output/04b-hit-keg-mpm-cop-nul-nve-nve.txt 14

What they’re doing: HUMAnN That’s ugly; it gets much better in Excel –Note: this is very sparse since we’re using a small subset of KEGG –Note: the mock community demo data is included on the right 15

What they’re doing: HUMAnN And there’s nothing stopping us from using MeV –Or R, or QIIME, or LEfSe, or anything that’ll read tab-delimited text 16

Alex Kostic Levi Waldron Human Microbiome Project 2 Lita Procter Jon Braun Dermot McGovern Subra Kugathasan Ted Denson Janet Jansson Ramnik Xavier Dirk Gevers Jane Peterson Sarah Highlander Barbara Methe George Weingart Tim Tickle Xochitl Morgan Daniela Boernigen Emma Schwager Jim Kaminski Afrah Shafquat Eric Franzosa Boyu Ren Regina Joice Koji Yasuda Bruce Birren Chad Nusbaum Clary Clish Joe Petrosino Thad Stappenbeck Tiffany Hsu Kevin Oh Thanks! Randall Schwager Chengwei Luo Keith Bayer Moran Yassour Human Microbiome Project Karen Nelson George Weinstock Owen White Alexandra Sirota Galeb Abu-Ali Ali Rahnavard Katherine Lemon Brendan Bohannan James Meadow Andy Chan Wendy Garrett Lauren McIver Ayshwarya Subramanian Gautam Dantas Molly Gibson Nicola Segata