Chinook: A collaborative system for bioinformatics analysis. RIP The Genome Sciences Centre Stephen Montgomery.

Slides:



Advertisements
Similar presentations
Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Experiences with GridWay on CRO NGI infrastructure Emir Imamagic, Srce EGEE User.
Advertisements

CPSCG: Constructive Platform for Specialized Computing Grid Institute of High Performance Computing Department of Computer Science Tsinghua University.
Striving for Alignment: One Funder's Lessons in Supporting Advocacy.
High Performance Computing Course Notes Grid Computing.
Chinook: A collaborative system for bioinformatics analysis. UBIC/CMMT Talk October 2004 Stephen Montgomery.
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Extreme Bioinformatics Stephen Montgomery Genetics Graduate Program, UBC Supervisor: Steven Jones.
Bioinformatics of mammaliain gene expression (BoMGE) 07 June 2005 Gene Regulation Informatics.
Chinook: A collaborative system for bioinformatics analysis. VanBUG October 2004 Stephen Montgomery.
Flying to the Top, One Tweet at a Time: Using Social Media to Rank Online Search Results Robyn B. Reed, MA, MLIS Co-authors: Carrie L. Iwema, PhD, MLS.
Many genes have unknown function 30% have unknown function only 9% are experimentally verified The Arabidopsis Genome Initiative, Nature 2000 of the 25,498.
Different views of bioinformatics CS view: what are the algorithms, data structures and other concepts needed to develop tools to solve biological problems.
CS335 Networking & Network Administration Tuesday, April 20, 2010.
Structural Bioinformatics Dr. Avraham Samson Course no.: Credit points: 1.5 Final grade is based on 10 assignments Course homepage:
ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.
Presented by Liu Qi An introduction to Bioinformatics Algorithms Qi Liu
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
CECS 474 Computer Network Interoperability Notes for Douglas E. Comer, Computer Networks and Internets (5 th Edition) Tracy Bradley Maples, Ph.D. Computer.
Institute of Systems Biology (INBIOSIS)/ School of Biosciences & Biotechnology (Faculty of Science & Technology), Bioinformatics Development in Malaysia.
The GeoConnections Discovery Portal Michael Robson MacDonald Dettwiler and Associates Brian McLeod, Michael Adair Natural Resources Canada.
Client Server Technologies Middleware Technologies Ganesh Panchanathan Alex Verstak.
DynamicBLAST on SURAgrid: Overview, Update, and Demo John-Paul Robinson Enis Afgan and Purushotham Bangalore University of Alabama at Birmingham SURAgrid.
Presentation Outline (hidden slide) Technical Level: 100 Intended Audience: TDMs, ITPros, ITDMs, BI specialists Objectives (what do you want the audience.
Modeling Complex Interactions of Overlapping River and Road Networks in a Changing Landscape Programmatic overview Hypothesis Preliminary findings.
CoG Kit Overview Gregor von Laszewski Keith Jackson.
Building Biodiversity Information Education: Next Generation Bioinformaticians P. Bryan Heidorn Carole Palmer Dan Wright Graduate School of Library and.
Authors Project Database Handler The project database handler dbCCP4i is a small server program that handles interactions between the job database and.
A human-aided genome annotation pipeline Francis Ouellette Director, UBC Bioinformatics Centre UBC Bioinformatics Centre.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
The Environmental Genomics Thematic Programme Data Centre Dawn Field, Director.

Adam Siepel Assistant Professor Biological Statistics & Computational Biology Cornell University.
Cloud infrastructure for training in Life Sciences Manuel Corpas The Genome Analysis Centre.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Interoperability Grids, Clouds and Collaboratories Ruth Pordes Executive Director Open Science Grid, Fermilab.
Bioinformatics Core Facility Guglielmo Roma January 2011.
Open Health Tools Strategic Plan. Mission “to significantly contribute to the health and well-being of individuals and communities by improving their.
BRUDNO LAB: A WHIRLWIND TOUR Marc Fiume Department of Computer Science University of Toronto.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Presented By:- Sudipta Dhara Roll Table of Content Table of Content 1.Introduction 2.How it evolved 3.Need of Middleware 4.Middleware Basic 5.Categories.
Bio-Linux 3.0 An integrated bioinformatics solution for the EG community ClustalX showing DNA polymerase alignment GeneSpring showing yeast transcriptome.
Jonathan Crabtree Assistant Director of Computing and Archival Research UNC Chapel Hill, Odum Institute Vision for Sociometric Analysis of Long-tail Science.
Jini Architecture Introduction System Overview An Example.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Connecting to the Network Introduction to Networking Concepts.
Pathogenomics How this project began: Ann Rose - take advantage of DNA sequence information - genomics Julian Davies - use the information to understand.
Lightweight construction of rich scientific applications Daniel Harężlak(1), Marek Kasztelnik(1), Maciej Pawlik(1), Bartosz Wilk(1) and Marian Bubak(1,
XML-Based Grid Data System for Bioinformatics Development Noppadon Khiripet, Ph.D Wasinee Rungsarityotin, MS Chularat Tanprasert, Ph.D Royol Chitradon.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
B i o i n f o r m a t i c s / B i o m e d i c a l A p p l i c a t i o n s i n E E L A Mexico, D.F., october 22 – 26, e – s c i e n c e M e x i c.
| nectar.org.au NECTAR TRAINING Module 2 Virtual Laboratories and eResearch Tools.
Internet2 Applications Group: Renater Group Presentation T. Charles Yun Internet2 Program Manager, Applications Group 30 October 2001.
Globus: A Report. Introduction What is Globus? Need for Globus. Goal of Globus Approach used by Globus: –Develop High level tools and basic technologies.
Parag Mhashilkar Computing Division, Fermi National Accelerator Laboratory.
High throughput biology data management and data intensive computing drivers George Michaels.
Mohssen Mohammed Sakib Pathan Building Customer Trust in Cloud Computing with an ICT-Enabled Global Regulatory Body Mohssen Mohammed Sakib Pathan.
Parasoft : Improving Productivity in IT Organizations David McCaw.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Galaxy for analyzing genome data Hardison October 05, 2010
The LIBI Federated database
BME435 BIOINFORMATICS.
CyVerse Discovery Environment
BBMRI Competence Centre Status Report
Grid Portal Services IeSE (the Integrated e-Science Environment)
Large Scale Distributed Computing
Programmatic overview Hypothesis Preliminary findings
Presentation transcript:

Chinook: A collaborative system for bioinformatics analysis. RIP The Genome Sciences Centre Stephen Montgomery

How is bioinformatics done? Data and Innovation Innovation Requirements Organization and Interpretation

Each community has its own culture

An interaction map of biologists and bioinformaticians

Things get more complicated…  Organizational boundaries (EnsEMBL vs. UCSC)  Access to different resources (computational, monetary)  Time (where to start, what to use…)  Different tools and user interfaces  Overlap in integrative efforts (suite building)  Communication efficacy (who you know…)  Installation requirements  Reliability and repeatability  Attribution issues

An improved interaction map of biologists and bioinformaticians Improved Communication Access to communities Access to resources Retain sub-organization

A community-based approach to bioinformatics analysis

Chinook overview Use the principles of peer-to-peer technology  Allow biologists to easily discover and run bioinformatics tools  Create a dynamic, reliable network for analysis  Reduce overlapping integration efforts  Address problems relevant to bioinformatics Attribution Data management Resource distribution

Discovering and Running jobs

Monitoring jobs

Server Information Panel Static Service Entry Panel Filter Pane

Algorithms integrated into Chinook ClustalW Genscan Conreal Sim4 DIALIGN MSCAN LAGAN ANN-Spec Mauve Recursive Gibbs Motif Sampler ORCA MEME Shuffle-LAGAN Motifsampler T-Coffee RSAT oligo analysis Promoterwise STUBB Primer3 Teiresias Eponine wConsensus ELPH

Projects involving Chinook OrthoSEQ provides analysis through the Chinook Perl API. Sockeye uses Chinook to deliver state-of-the-art alignment, PCR prediction, and regulatory analysis Pegasys plans to provide pipeline management to services advertised by Chinook. Bio-Linux plans to integrate a subset of their algorithms.

Use cases of Chinook Grid/Cluster computing Internally connect teams / individuals Collaborate with remote individuals Provide an API layer to your algorithms Insert bioinformatics analysis into applications

Future Plans

Acknowledgements GENOME SCIENCES CENTRE Steve Jones Tony Fu Jun Guan Keven Lin Asim Siddiqui Genereg GSC Mark Mayo Bernard Li CENTRE FOR MOLECULAR MEDICINE AND THERAPEUTICS Wyeth Wasserman Jonathan Lim UBC BIOINFORMATICS CENTRE Francis Ouellette Graham McVicker Sohrab Shah Funding: MSFHR, Genome Canada