Using geWorkbench: Working with Sets of Data Fan Lin, Ph. D. Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT.


Using geWorkbench: Working with Sets of Data
Fan Lin, Ph.D.
Molecular Analysis Tools Knowledge Center
Columbia University and The Broad Institute of MIT and Harvard

Background
• geWorkbench makes extensive use of sets: the full set of markers or arrays/phenotypes can be divided into different subsets.
• Defining multiple subsets allows the same data to be characterized and analyzed in different ways in geWorkbench.

Different types of data grouping
geWorkbench offers two different ways to group data:
1. Individual markers or arrays can be grouped into sets:
   ► Sets can be defined by the user, or may be created as a result of an analysis.
   ► Sets of arrays can be used to distinguish between different experimental states, for example as part of a statistical analysis.
      ♦ The t-test requires that two states, represented by sets, be defined for comparison.
   ► Sets of markers are returned from various analysis routines. For example, the t-test returns a list of markers showing significant differential expression, and after hierarchical clustering, the markers in a subtree of the resulting dendrogram can be saved.
2. Sets of markers or arrays are grouped into collections. A collection named "Default" is automatically created by geWorkbench.
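The two levels of grouping described above (members in sets, sets in collections) can be pictured as a nested mapping. The following is only a minimal Python sketch of that idea, not geWorkbench's internal representation; the function `add_to_set`, the set names, and the array names are all hypothetical.

```python
# Hypothetical in-memory model of sets inside a collection.
# This is an illustration only, not geWorkbench's internal representation.
collections = {"Default": {}}  # collection name -> {set name -> member list}


def add_to_set(collection, set_name, members):
    """Add members to a named set, creating the set on first use."""
    target = collections[collection].setdefault(set_name, [])
    for member in members:
        if member not in target:  # a member appears at most once per set
            target.append(member)
    return target


# Array names below are made up for the example.
add_to_set("Default", "Treated", ["array_1", "array_2"])
add_to_set("Default", "Untreated", ["array_3", "array_4"])
add_to_set("Default", "Treated", ["array_2", "array_5"])  # array_2 is not duplicated
```

In this sketch a set is just an ordered list of names, so the same array could be placed in more than one set without copying any expression data.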

Overview
In this presentation you will learn:
► How to create a set of markers or arrays.
► How to mark a set of arrays as "Active".
► How to classify a set of arrays, e.g. as "case" vs. "control".
► How to deactivate a data set so that it is excluded from data analysis.
► How to group markers or arrays in different ways with descriptive tags.

Sets of Markers or Arrays: Overview
• Individual markers (genes) or arrays can be grouped into sets.
• A set of markers or arrays can be used to divide potentially massive expression data into more manageable chunks.

Sets of Markers or Arrays: Sample data
• To demonstrate how to create sets of markers or arrays, we will use the sample data from a congestive cardiomyopathy experiment, which can be found in the geWorkbench tutorial data section.
• We will load 10 individual Affymetrix MAS5 format files (all beginning with JB-) and merge them into a single dataset as our sample data.

Sets of Markers or Arrays: Sample data preparation
To load the sample data, follow the steps below:
1. Create a Project. All data must belong to a project. Right-click on the Workspace entry in the Project Folders window at upper left to create a new project.
2. Next, right-click on the new Project entry and select Open Files.

Sets of Markers or Arrays: Sample data preparation
3. Select file type Affymetrix MAS5/GCOS as shown.
4. Make sure to check the Merge files checkbox.
5. Select 10 MAS5 format text files from the tutorial data directory.
6. Click Open. The chip type HG_U95Av2 is recognized...

Sets of Markers or Arrays: Sample data preparation
The merged dataset is now listed in the Project folder. The data is displayed, in single array format, in the Microarray Viewer. Note that we have increased the intensity slider to maximum here.
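Conceptually, merging combines one table of values per array into a single markers-by-arrays matrix. The Python sketch below illustrates only that idea on already-parsed, made-up values. It does not read the MAS5 file format, and `merge_arrays`, the marker IDs, and the array names are hypothetical. Keeping only markers present in every array is an assumption of this sketch, not a statement about how geWorkbench merges files.

```python
def merge_arrays(per_array):
    """Merge {array_name: {marker_id: value}} into (markers, arrays, matrix).

    matrix[i][j] holds the value of markers[i] on arrays[j].
    Assumption of this sketch: only markers present in every array are kept.
    """
    if not per_array:
        return [], [], []
    arrays = list(per_array)
    shared = set.intersection(*(set(values) for values in per_array.values()))
    markers = sorted(shared)
    matrix = [[per_array[a][m] for a in arrays] for m in markers]
    return markers, arrays, matrix


# Made-up values standing in for two parsed single-array files.
parsed = {
    "JB-example_1": {"marker_a": 1.0, "marker_b": 2.0},
    "JB-example_2": {"marker_a": 3.0, "marker_b": 4.0},
}
markers, arrays, matrix = merge_arrays(parsed)
```

Files from the same chip type would normally list the same markers, in which case the intersection step keeps all of them.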

Sets of Markers or Arrays: Assigning arrays to sets
In this example, we will create two sets of array data for disease and normal states and leave them in the Default collection.
First, select and label the arrays which contain samples from the congestive cardiomyopathy disease state:
1. In the Arrays/Phenotypes component, select the six arrays beginning with JB-ccmp, which represent the samples from the congestive cardiomyopathy disease state.
2. Right-click and select Add to Set.

Sets of Markers or Arrays: Assigning arrays to sets
3. Enter "CCMP" in the input box and click OK.
4. Next, similarly label the arrays beginning with JB-n as "Normal".
The Array/Phenotype Sets component will now show the two sets added.

Sets of Markers or Arrays: Activating sets
The boxes next to the set names can be checked to indicate that a set of arrays is "Active". Various analysis and visualization components can be set to use or display only activated arrays or markers.
Note: if no array sets are explicitly activated, then all arrays are implicitly active. The same applies to markers.
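The activation rule in the note above can be written as a small function. This is a sketch of the rule as stated on the slide, not geWorkbench code; `active_arrays` and the example array names are hypothetical.

```python
def active_arrays(all_arrays, sets, active_set_names):
    """Return the arrays an analysis would use under the activation rule.

    all_arrays: every array in the dataset, in display order
    sets: {set name -> list of array names}
    active_set_names: names of the sets whose checkbox is ticked
    """
    if not active_set_names:
        # Nothing explicitly activated: every array is implicitly active.
        return list(all_arrays)
    chosen = set()
    for name in active_set_names:
        chosen.update(sets[name])
    # Preserve the dataset's original array order.
    return [a for a in all_arrays if a in chosen]


# Made-up array names for illustration.
all_arrays = ["c1", "c2", "n1", "n2"]
sets = {"CCMP": ["c1", "c2"], "Normal": ["n1", "n2"]}
only_ccmp = active_arrays(all_arrays, sets, ["CCMP"])
everything = active_arrays(all_arrays, sets, [])
```

Activating every set and activating none give the same result only when the sets together cover the whole dataset.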

Sets of Markers or Arrays: Classifying data sets for statistical tests
For statistical tests such as the t-test, Case and Control groups can be specified.
1. Left-click on the thumbtack icon in front of the phenotype name.
2. Select Case to specify the disease arrays as the "Case". The remaining "Normal" arrays are by default considered Control.

Sets of Markers or Arrays: Classifying data sets for statistical tests
3. A red thumbtack indicates that an array set has been marked as "Case".
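To show why a two-group test needs the Case/Control split, the Python sketch below computes a t statistic for one marker, treating every set not marked as Case as Control. It uses Welch's form of the statistic purely as an illustration; the slides do not say which t-test variant geWorkbench applies. The function `welch_t`, the array names, and the expression values are all hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(case, control):
    """Welch's t statistic for two independent samples (2+ values each)."""
    se = sqrt(variance(case) / len(case) + variance(control) / len(control))
    return (mean(case) - mean(control)) / se


# Made-up sets and expression values for a single marker.
sets = {"CCMP": ["c1", "c2", "c3"], "Normal": ["n1", "n2", "n3"]}
case_sets = {"CCMP"}  # sets flagged as Case; all other sets count as Control
expression = {"c1": 8.0, "c2": 9.0, "c3": 10.0, "n1": 4.0, "n2": 5.0, "n3": 6.0}

case = [expression[a] for s in sets if s in case_sets for a in sets[s]]
control = [expression[a] for s in sets if s not in case_sets for a in sets[s]]
t = welch_t(case, control)
```

A positive `t` here means the marker is more highly expressed in the Case arrays than in the Control arrays.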

Sets of Markers or Arrays: Deactivating a data set
• To deactivate a set, click on the set; the selected set will be highlighted. Then perform one of the following actions:
   • Right-click on the set and select Deactivate.
   • Uncheck the checkbox next to the set.
   • From the main menu, select Commands Panel > Deactivate Panel.

Collection of Sets: Overview
The same arrays or markers may need to be grouped in different ways in the Arrays/Phenotypes and Marker components. geWorkbench uses collections to hold sets of arrays or markers, which makes the data easier to manage.
• Different collections of sets can be made, both for markers and for arrays. They may differ in membership or in how members are named (e.g. amount of detail).
• Collections of sets give users a way to manage sets of data with descriptive tags.

Collection of Sets: Creating a new collection
Both the Marker and Arrays/Phenotypes tabs have two sections in the GUI: the upper frame lists the full data set, and the lower frame lists any user-defined groupings. geWorkbench automatically creates a default collection, "Default", to hold sets of data.
To create a new collection for the arrays, click on the New button in Array/Phenotype Sets, located at the lower left of the application (arrow labeled New). The drop-down collection list (arrow on the left) will be updated to reflect the addition.

Collection of Sets: Examples of array collections
► Here we show how several different collections are defined in the example data file "Bcell-100.exp", which can be found in geWorkbench's tutorial data (Bcell-100.zip).
► After loading this file into geWorkbench as type "Affymetrix File Matrix", four collections of sets can be seen in the Arrays/Phenotypes group pull-down menu at right.

Collection of Sets: Examples of array collections
If we choose the collection called "Class", the sets of arrays at right are displayed.

Collection of Sets: Examples of array collections
If instead we choose the collection "Source detailed", a different collection of sets of the same arrays is seen.
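The point of the two examples, that one array can carry a different label in each collection, can be sketched in Python as follows. The collection names "Class" and "Source detailed" come from the slides. The set names, the array names, and the helper `set_of` are hypothetical and do not reflect the actual contents of Bcell-100.exp.

```python
# Two collections grouping the same four (made-up) arrays in different ways.
collections = {
    "Class": {
        "group_1": ["a1", "a2"],
        "group_2": ["a3", "a4"],
    },
    "Source detailed": {
        "source_x": ["a1"],
        "source_y": ["a2", "a3"],
        "source_z": ["a4"],
    },
}


def set_of(collection, array):
    """Return the name of the set holding `array` in a collection, or None."""
    for set_name, members in collections[collection].items():
        if array in members:
            return set_name
    return None


label_by_class = set_of("Class", "a3")
label_by_source = set_of("Source detailed", "a3")
```

Switching the collection changes only the labels attached to the arrays; the underlying arrays stay the same.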

Need More Information?
NCI is developing an extensive knowledge base to support various NCI molecular analysis tools. Visit us at NCI's Molecular Analysis Tools Knowledge Center.
For more information on how to use geWorkbench, please visit the geWorkbench section of the NCI Knowledge Center at: kc.nci.nih.gov/Molecular/KC/index.php/GeWorkbench
Have a geWorkbench-related question? Find the answers in the geWorkbench FAQ section at: kc.nci.nih.gov/Molecular/KC/index.php/GeWorkbench_FAQ
Need more help? Post your question in the geWorkbench Forum at: kc.nci.nih.gov/Molecular/forums/viewforum.php?f=3