CuffDiff ran successfully. Output files include gene_exp.diff What are the next steps? Use Navigation bar to find files; they may be under DNA Subway if.

Slides:



Advertisements
Similar presentations
Converting CSV Files to Excel This presentation will aid and assist you in converting a downloaded CSV file from the website to a data.
Advertisements

The essentials managers need to know about Excel
Building a Sanitized UPC Notification List.
DATA ANALYTICS. NORMS Cell Phones on Vibrate Respect all opinions.
© Paradigm Publishing, Inc Excel 2013 Level 2 Unit 1Advanced Formatting, Formulas, and Data Management Chapter 1Advanced Formatting Techniques.
Microsoft Office XP Microsoft Excel
X-Media V2.0 Healthcare Training Jayex Technology Limited X-Media V2.0 March 2010 v
Review. Microsoft Office Excel 2013 provides powerful tools to organize, analyze, manage, and share information Locations where work is done are cells,
Using Excel to Understand Your Data Clayton County Public Schools Department of Research, Evaluation and Assessment Assistant Principal In-Service.
Managing Grades with Excel Viewing Help To view Help 1.Open Excel on your computer. 2.In the top right hand corner of the Excel Screen type in the.
Microsoft Excel 2010 Chapter 7
Downloading and Installing AutoCAD Architecture 2015 This is a 4 step process 1.Register with the Autodesk Student Community 2.Downloading the software.
Scaffold Download free viewer:
Inventory Throughout this slide show there will be hyperlinks (highlighted in blue) follow the hyperlinks to navigate to the specified Topic or Figure.
CTS130 Spreadsheet Lesson 13 Working with Lists. Copying Data between Workbooks  Use the [Copy ]and [Paste] Buttons  Use the CTRL+[C] and CTRL + [V]
Working with SharePoint Document Libraries. What are document libraries? Document libraries are collections of files that you can share with team members.
Google Earth How to create a Google Earth Tour and place it in your Wiki.
Integrating Microsoft Project with Other Programs
Working with the Conifer_dbMagic database: A short tutorial on mining conifer assembly data. This tutorial is designed to be used in a “follow along” fashion.
Downloading and Installing PAF Insight PAF Insight can be easily downloaded Or can be installed from a CD A license is needed t0 activate the program.
Using Backstage Lesson 2. Objectives Software Orientation: Backstage View Backstage view’s left-side navigation pane (see figure on the next slide) gives.
Create Database Tables
Differential Analysis & FDR Correction
October 2003Bent Thomsen - FIT 3-21 IT – som værktøj Bent Thomsen Institut for Datalogi Aalborg Universitet.
European Computer Driving Licence Syllabus version 5.0 Module 4 – Spreadsheets Chapter 22 – Functions Pass ECDL5 for Office 2007 Module 4 Spreadsheets.
Microsoft Excel Spreadsheet Review. Templates  Templates can be produced for the following elements:  Text and Graphics  Formatting Information – Layouts,
StAR web server tutorial for ROC Analysis. ROC Analysis ROC Analysis: This module allows the user to input data for several classifiers to be tested.
1 Data List Spreadsheets or simple databases - a different use of Spreadsheets Bent Thomsen.
Networks and Interactions Boo Virk v1.0.
Using Office Backstage Using Office Backstage Lesson 3 © 2014, John Wiley & Sons, Inc.Microsoft Official Academic Course, Microsoft Word Microsoft.
Support.ebsco.com EBSCOhost Visual Search Tutorial.
Teacher’s Assessment Assistant Worksheet Builder Starting the Program
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
Create Forms Lesson 5. Objectives Software Orientation The Forms group (below) is located on the Create tab in the Ribbon and can be used to create a.
Downloading and Installing Autodesk Revit 2016
Analysing Data with Excel Viewing Help To view Help 1.On the Start menu, point to Programs, and then click Microsoft Excel. 2.On the Help menu,
Page 1 Non-Payroll Cost Transfer Enhancements Last update January 24, 2008 What are the some of the new enhancements of the Non-Payroll Cost Transfer?
The Next Generation. Parent Access Grade History and Attendance.
For additional assistance, please call the Help Desk Searching 1. If a Search window does not appear after logging into the system, click the Search icon.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
IPlant Collaborative Discovery Environment RNA-seq Basic Analysis Log in with your iPlant ID; three orange icons.
11/25/2015Slide 1 Scripts are short programs that repeat sequences of SPSS commands. SPSS includes a computer language called Sax Basic for the creation.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
Statistical Testing with Genes Saurabh Sinha CS 466.
Refraction Statics Bryce Hutchinson Sumit Verma. 3D Statics display 1. Click this button on the right side of the statics window to open a 3D statics.
Input data for analysis Users that have expression values (dataset 1_ chicken affy_foldchane.txt. can upload that file as shown in slide 30.
McGraw-Hill/Irwin The Interactive Computing Series © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Microsoft Excel 2002 Working with Data Lists.
By: Ms. Abeer Helwa 1. CREATE A WORD DOCUMENT 2 Blank document Templates To create a new blank document: click the File tab and click Blank document.
Excel Tips to Make Your Life Easier Michael Winecoff Associate University Librarian for Technical Services November 5, 2015.
Importing Data to Excel. Suppose you have a delimited* text file and you need to bring it into Excel. Follow these steps… *Delimited means text separated.
CCA Scheduling Completing CCA Timetabling Template Integrated National Education Information System (iNEIS TM )
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
MS Excel Lesson 1. Starting Excel Excel opens to a list of templates and in most cases you choose Blank workbook or open a previous file. Think of a workbook.
Important! Be sure to connect the communication cable to your computer before opening the Aqua4Plus Lite software. (Click to continue)
Microsoft Excel Prepared by the Academic Faculty Members of IT.
Exporting & Formatting Budgets from FlexGen, NextGen & Zortec into Excel.
Customize Your View of Data Training Presentation for Supply Chain Platform: BAE Systems May 2015.
For Datatel and other applications Presented by Cheryl Sullivan.
Chapter 7 Creating Templates, Importing Data, and Working with SmartArt, Images, and Screen Shots Microsoft Excel 2013.
Enlisted Association of the National Guard of the United States Data Extract Instructional Guide.
Converting CSV Files to Excel
IUIE Reporting Basics Workshop
TDA Direct Certification
Using Excel with Google Maps
2. Double-click on file to open in Excel
Navya Thum January 30, 2013 Day 5: MICROSOFT EXCEL Navya Thum January 30, 2013.
HIBBs is a program of the Global Health Informatics Partnership Learning the Basics of Microsoft Word 2019 and Microsoft office support TFN
Community-Engaged Partnership Database: VCU’s Commitment to Community Engagement
Microsoft Excel 2007 – Level 2
Presentation transcript:

CuffDiff ran successfully. Output files include gene_exp.diff What are the next steps? Use Navigation bar to find files; they may be under DNA Subway if Green Line was used. Click on gene_exp.diff Note size of file, 9.41 MB.

Under Download tab, click on Simple Download. Save File, OK Open Downloads Folder Find gene_exp.diff; make sure size is similar as in iPlant folder. Open a blank excel workbook and then open the gene_exp.diff file

Click Delimited, My data has headers, Next Tab Delimiters, Next General, Finish Workbook opens Save As xlsx.

Need to first filter for fold-change. Select log2(fold change) column, Filter, Number Filters, Between Custom AutoFilter window comes up. Use these settings for 2-fold cutoffs, recall that values are log2. Then sort that column from highest to lowest. Z to A icon.

Select significant column, Under Data tab, sort Z to A so that yes genes are first. yes is q-value ≤ Click to expand the selection and Sort. Scroll down to yes/no junction. After filtering, many rows are no longer visible, so gene count cannot be done by looking at row number. The list of yes genes with log2 fold change≤ -1 or ≥ 1 (2-fold difference) can be copied and pasted to another sheet to get gene number and clean gene list for Gene Ontology analysis = 2769 genes UP = 2759 genes DOWN Highlight junction between up- and down-regulated genes.

Open DAVID ( Open Functional Annotation. How do you go from a list of 2000 plus genes to something that is biologically relevant? Gene Ontology can be used to determine if certain biological processes are enriched in your set of genes. The genome has a certain percentage of genes with identified biological process; are any of these biological processes observed at a higher frequency in your gene list? If so, they are “enriched”. One tool to look for enrichment is DAVID. Note if your gene list is small, it will be difficult to get significant enrichment scores.

Select up-regulated genes from excel file. Click on first gene, scroll down to highlighted spot which separates up- and down-regulated genes. Shift, then click to select all up-regulated genes. Paste list into section A: Paste a list Select Identifier. The sample gene list is from Arabidopsis thaliana. Many other options are available in DAVID, but mostly model organisms. Identify List Type Submit

The gene list will now show as genes plus some unknowns. Unknowns can be viewed (View Unmapped Ids); in this case they were nearly all mitochondrial genes, and not a big concern. To keep analysis simple, Clear All Categories, Open Gene_Ontology and select GOTERM_BP_FAT, which is all the Biological Processes. Just use this one category. Scroll down, Click on Functional Annotation Clustering.

Annotation Clusters are related groups of BP GO terms. Note that all those in Cluster 1 are related to phosphate and phosphorylation. The Count is the number of genes for each GO term. P-value and Benjamini indicate probability that enrichment is real versus spurious. Scroll down the results to see much higher p-values. The file must be downloaded (red arrow) to get the False Discovery Rate (FDR), the probability that enrichment is spurious.

File will download into a new window. Right click and Select All and Copy. Open Excel and select Paste Special. Choose Unicode Text and Click OK. Expand column B to see GO terms. The column with the FDR is highlighted. 1% (or even 5%) is an acceptable cutoff. The top annotation clusters will be likely be much lower than the cutoff IF your gene list is large.

Scroll down to see higher FDRs. In this cluster, the top two GO terms have FDRs below 1%. Some of the other categories have FDRs at 99%; one can be certain they are spurious. Go through this list carefully to find GO terms with acceptable FDRs. Hopefully, this analysis has suggested some interesting Biological Processes to pursue. All genes for each GO term are listed in the Genes Column. Highlight interesting BP GO terms. Be sure to save this excel file. Repeat for down-regulated genes.

An alternative GO analysis tool is BiNGO which runs within Cytoscape. BiNGO works with a much larger group of organisms and has a useful network display which connects similar GO Biological Process terms. To use BiNGO, first download Cytoscape, a free software tool for network analysis. Open Cytoscape, and Under App Manager, search for BiNGO, and install BiNGO. Once installed, go to App Manager, Click on Bingo and Bingo Settings will come up.

Provide a meaningful name. Copy and Paste gene list from excel file. Use default for what to access. Set FDR to 0.01 if 1% is desired cutoff for enrichment. Choose correct organism. Start BiNGO.

Output shows graphical view and BiNGO output.

Move Bingo Output behind by clicking on network image. Zoom In and use purple square below to focus on certain areas of interest.

This cluster of related Biological Processes is similar to annotation cluster 1 from DAVID, protein phosphorylation.

Clicking on a particular node will bring up a panel that describes the node. Amino acid transport is related to arginine and basic amino acid transport. Spend some time looking through the BiNGO output as was done for DAVID. Be sure to save your file under Save As, so it can be opened in the future by BiNGO. These two tools will provide information for genes worthy of future study depending on the biological questions of interest.