Introduction to caArray caBIG ® Molecular Analysis Tools Knowledge Center April 3, 2011.

caArray Overview
- More than a simple repository for microarray data.
- Supports data management throughout the life of an experiment.
- Allows collaborative sharing of pre-publication data with partners.
- Provides data to other biomedical/clinical tools to form a comprehensive solution for array data management, search, and analysis.

Why use caArray?
Target Users:
- Bench scientists performing microarray data collection and annotation
- Microarray core facility scientists and technicians
- Bioinformatics and data management coordinators
- Multi-institutional data coordinating center informaticians
Addressing Critical Needs:
- Manage all aspects of array data: raw data, derived data, sample annotation, experimental design
- Ensure data are private (in a local instance) until published
- Support array data sharing using a federated model
- Find what you are looking for fast: query annotated data within, and across, datasets
- Facilitate data integration: provide annotated data to other analytical caBIG ® tools

Key Functions of caArray
- Query annotated data within and across datasets with search and navigation features
- Upload array files in industry formats (e.g., Affymetrix, GenePix, Illumina, Agilent)
- Annotate data to harmonize datasets and reduce the time needed to aggregate data
- MAGE-TAB import and export functionality
- GEO-SOFT export functionality
- Security and authentication features that include group-based permissions
- Provide annotated data to other caBIG ® tools that support analysis
- Rich programmatic APIs that allow analytical tools (on and off the Grid) to pull data from caArray and visualize/analyze it

Web Interface: Find Things Fast
User-friendly web interface for browsing and searching

Platform Support: Grow Towards All Inclusive
caArray holds most available Affymetrix, Illumina, and Agilent array platforms/designs, so most native data files can be stored, parsed, and associated with samples.

Parsed Data Formats: the More, the Better for Users
- MAGE-TAB format
- Agilent raw TXT for aCGH, expression and miRNA assays
- Agilent GEML/XML array designs
- Nimblegen pair Report TXT (raw and normalized)
- Nimblegen NDF array designs
- Illumina CSV
- Illumina Sample Probe Profile TXT
- Illumina genotyping processed data matrix TXT
- Illumina BGX/TXT array designs
- Affymetrix CEL and CHP in AGCC/Calvin formats in addition to the GCOS formats
- Affymetrix CNCHP copy number data (CN4 and CN5)
- Copy Number data in a prescribed MAGE-TAB Data Matrix format

MAGE-TAB: Save Time on Sample Annotation
- IDF and SDRF files
- Excel-like format, controlled vocabulary
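
As a rough illustration of why a tabular, spreadsheet-like annotation file saves effort, the sketch below reads a small SDRF fragment with Python's standard csv module and groups data files by sample. The column headers follow common MAGE-TAB usage; the sample names, file names, and the helper files_by_source are made up for this example, and the code does not reflect how caArray itself parses MAGE-TAB.

```python
import csv
import io
from collections import defaultdict

# Hypothetical SDRF fragment: tab-delimited, one row per hybridization.
SDRF_TEXT = (
    "Source Name\tCharacteristics[OrganismPart]\tHybridization Name\tArray Data File\n"
    "patient_01\tliver\thyb_01\tpatient_01.CEL\n"
    "patient_02\tkidney\thyb_02\tpatient_02.CEL\n"
    "patient_02\tkidney\thyb_03\tpatient_02_rep.CEL\n"
)


def files_by_source(sdrf_text):
    """Map each Source Name to the Array Data File values found on its rows."""
    reader = csv.DictReader(io.StringIO(sdrf_text), delimiter="\t")
    grouped = defaultdict(list)
    for row in reader:
        grouped[row["Source Name"]].append(row["Array Data File"])
    return dict(grouped)


if __name__ == "__main__":
    for source, files in files_by_source(SDRF_TEXT).items():
        print(source, files)
```

Because every row carries both the sample annotation and the file name, a single pass over the table is enough to associate files with samples.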

Data Management: Loading Data

Data Management: Sample Annotation and Datasets

Data Export: Zip, MAGE-TAB, or GEO Soft

Collaboration and Data Sharing
- Investigators define collaboration groups for sharing of pre-publication data with a set of partners.
- Access control at the experiment level or at individual samples.
- Data is private until made public by the Data Owner.
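
The sharing rules on this slide can be sketched as a small access check. This is only an illustrative model under stated assumptions: the Experiment class, its fields, and can_read are hypothetical names invented here, not caArray's data model or API.

```python
from dataclasses import dataclass, field


@dataclass
class Experiment:
    """Hypothetical model of an experiment with group-based sharing."""
    owner: str
    public: bool = False  # private until the data owner publishes it
    group_access: set = field(default_factory=set)    # groups that may read the whole experiment
    sample_access: dict = field(default_factory=dict)  # sample name -> groups that may read it

    def can_read(self, user, user_groups, sample=None):
        """Return True if the user may read the experiment (or the given sample)."""
        if self.public or user == self.owner:
            return True
        groups = set(user_groups)
        if groups & self.group_access:
            return True  # experiment-level grant covers every sample
        if sample is not None:
            return bool(groups & self.sample_access.get(sample, set()))
        return False


if __name__ == "__main__":
    exp = Experiment(owner="alice", group_access={"lab_a"},
                     sample_access={"s1": {"lab_b"}})
    print(exp.can_read("bob", ["lab_a"]))                # experiment-level access
    print(exp.can_read("carol", ["lab_b"], sample="s1"))  # sample-level access
    print(exp.can_read("carol", ["lab_b"], sample="s2"))  # no grant for this sample
```

Checking the experiment-level grant before the sample-level one keeps the common case cheap and lets sample-level grants act as narrow exceptions.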

Data Analysis: Tool Integration
- Gene expression data
- Gene expression data and SNP data
- Cross-query over many caArray instances
- Gene expression data and copy number data

A Glance at the Technology
- Tool Platform: Enterprise web-based system that works within a Firefox or Internet Explorer browser
- CBIIT-Hosted Installation of caArray: Limited computer skills are required to use the application; directed at laboratory researchers
- Local Installation of caArray: Moderate technical expertise is required to install the tool
- Upgrade Availability: To make upgrades as seamless as possible, an upgrade installer, available in both GUI and command-line formats, upgrades an installed caArray instance while maintaining data integrity.

The Next Step: Accessing Online Resources for caArray
- Molecular Analysis Tools Knowledge Center
- caArray User Forum: https://cabig-kc.nci.nih.gov/Molecular/forums/viewforum.php?f=6
- Tool Landing Page: https://cabig.nci.nih.gov/tools/caArray
- Access to Demo caArray Instance (Register from that site for a training account)
- Application Support Phone: Toll-free: