Biopackages.net Operating System Packages for Bioinformatics Allen Day 2005.05.17.

Slides:



Advertisements
Similar presentations
05/11/2001 CPT week Natalia Ratnikova, FNAL 1 Software Distribution in CMS Distribution unitFormContent Version of SCRAM managed project.
Advertisements

May 16, 2005Scott Cain, CSHL. May 16, 2005Scott Cain, CSHL gmod update Gmod RC2 last week New for 0.003: –Generic triggers for Apollo –Greatly enhanced.
Generic model/many/my organism database toolkit Dec 2007 Don Gilbert Genome Informatics Lab, Biology Dept., Indiana University GMOD.
System Center Configuration Manager Push Software By, Teresa Behm.
Software Factory Assembling Applications with Models, Patterns, Frameworks and Tools Anna Liu Senior Architect Advisor Microsoft Australia.
Microsoft® Desktop Deployment Assistance Program 4: SMS OS Deployment Feature Pack Thomas Lee Chief Technologist QA plc
Packaging of EPICS-basedControl System Software
Julie McEnery1 Installing the ScienceTools The release manager automatically compiles each release of the Science Tools, it creates a set of wrapper scripts.
Installing Linux softwares Sirak Kaewjamnong. 2 Software packets  When Linux developers create their software they typically bundle all the executable.
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
Downloading & Installing Software Chapter 13. Maintaining the System Yum Pirut BitTiorrent Rpm Keeping Software Up To Date Up2date Red Hat Network Wget.
Module 10 – Linux Installations. Wikipedia: Red Hat Package Manager or RPM Package Manager (RPM) is a package management system. The name RPM variously.
Rocks cluster : a cluster oriented linux distribution or how to install a computer cluster in a day.
WFleaBase Daphnia Genome Database from Common Components Daphnia Genomic Consortium Meeting, Sept Don Gilbert,
Migration XenDesktop 7. © 2013 Citrix | Confidential – Do Not Distribute Migration prerequisites Set up a XenDesktop 7 Site, including the site database.
Linux Operations and Administration
Dustin Harman VM Workshop 2015 RPM Basics. What is RPM? Red Hat Package Manager Native package manager on RHEL, Fedora, SUSE, some Mandriva RPM/SRPM files.
Framework for Automated Builds Natalia Ratnikova CHEP’03.
By: Paul Hill Technology Coordinator Gwinn Area Community Schools.
The GMOD Project: Creating Reusable Software Components for Genome Data Scott Cain GMOD Project Coordinator Cold Spring Harbor Laboratory.
October, Scientific Linux INFN/Trieste B.Gobbo – Compass R.Gomezel - T.Macorini - L.Strizzolo INFN - Trieste.
Workstation Imaging Process Overview Thomson Reuters –Manoj Shah –Mike Bowers –Curt Ricard –Sangkhone Stoltz –Joe Ness March 26 th, 2009.
Copyright OpenHelix. No use or reproduction without express written consent 2 Overview of Genome Browsers Materials prepared by Warren C. Lathe, Ph.D.
Generic model/many/my organism database Oct 2007 Don Gilbert Genome Informatics Lab, Biology Dept., Indiana University GMOD.
Taverna and my Grid Open Workflow for Life Sciences Tom Oinn
CERN Manual Installation of a UI – Oxford July - 1 LCG2 Administrator’s Course Oxford University, 19 th – 21 st July Developed.
Install Software. UNIX Shell The UNIX/LINUX shell is a program important part of a Unix system. interface between the user & UNIX kernel starts running.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
Creating Manageable Setups With The Windows ® Installer (Part 1 Of 2) Rob Collie, Program Manager Windows Installer Microsoft Corporation Jim Masson, Program.
GMOD: Managing Genomic Data from Emerging Model Organisms Dave Clements 1, Hilmar Lapp 1, Brian Osborne 2, Todd J. Vision 1 1 National Evolutionary Synthesis.
SUSE Linux Enterprise Desktop Administration Chapter 6 Manage Software.
1Offline Weekly Meeting May May 2006 AliRoot Build Integration and (Testing) System Peter Hristov Vagner Morais.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
Got genom e? Community Meetings GMOD.org The GMOD community meets semi- annually to discuss GMOD components, best practices,
1 / 22 AliRoot and AliEn Build Integration and Testing System.
05/29/2002Flavia Donno, INFN-Pisa1 Packaging and distribution issues Flavia Donno, INFN-Pisa EDG/WP8 EDT/WP4 joint meeting, 29 May 2002.
University of Illinois at Urbana-Champaign BeeSpace Navigator v4.0 and Gene Summarizer beespace.uiuc.edu `
Digesting the Genome Glut Promoting the Use and Extension of GMOD To Emerging Model Organisms David Clements 1 Brian Osborne 2 Hilmar Lapp 1 Xianhua Liu.
20-May-2003HEPiX Amsterdam EDG Fabric Management on Solaris G. Cancio Melia, L. Cons, Ph. Defert, I. Reguero, J. Pelegrin, P. Poznanski, C. Ungil Presented.
GMODWeb, Biopackages, & Virtual Machines Brian O'Connor Nelson Lab, UCLA 1/16/2009.
Rational ClearCase and Rational ClearQuest IBM VA TPF User Conference Terry Durkin ClearCase Product Manager October 2000 Terry Durkin ClearCase Product.
Building WormBase database(s). SAB 2008 Wellcome Trust Sanger Insitute Cold Spring Harbor Laboratory California Institute of Technology ● RNAi ● Microarray.
Microsoft Management Seminar Series SMS 2003 Change Management.
GLite build and integration system Building and Packaging Robert HARAKALY
ARGOS (A Replicable Genome InfOrmation System) for FlyBase and wFleaBase Don Gilbert, Hardik Sheth, Vasanth Singan { gilbertd, hsheth, vsingan
What's new with GMOD Scott Cain GMOD Coordinator
Maite Barroso - 10/05/01 - n° 1 WP4 PM9 Deliverable Presentation: Interim Installation System Configuration Management Prototype
Yannick Patois - Datagrid Software Repository Presentation - March, n° 1 Datagrid Software Repository Presentation CVS, packages and automatic.
WLCG Software Lifecycle First ideas for a post EMI approach 0.
RedHat Package Management RPM and YUM in RedHat Enterprise, Fedora, Suse and Centos.
GLite build and integration system Building and Packaging Robert HARAKALY
DataGrid is a project funded by the European Commission under contract IST EDG Baseline API Document Document build description and current.
The Bovine Genome Database Abstract The Bovine Genome Database (BGD, facilitates the integration of bovine genomic data. BGD is.
Aaron Corso COSC Spring What is LAMP?  A ‘solution stack’, or package of an OS and software consisting of:  Linux  Apache  MySQL  PHP.
Scientific Linux Inventory Project (SLIP) Troy Dawson Connie Sieh.
Getting Started in RPM Packaging Izhar Firdaus / KageSenshi Contributor Fedora Project
Into Fedora RPM packages Lubomir Rintel Presented by Fedora Package Maintainer Creative Commons Attribution-ShareAlike license applies. Packaging Gems.
MirrorManager: The Fedora Mirror System Matt Domsch Fedora Mirror Wrangler Linux Technology Strategist Office of the CTO Dell, Inc.
1 April 2, Software Packaging and Releasing Best Practices William Cohen NCSU CSC 591W April 2, 2008.
RPM101: A gentle intro to creating your own packages Richard Keech Red Hat Asia-Pacific.
System Center 2012 Configuration Manager
Install external command line softwares
Behavior and Phenotype in GMOD Natural Diversity in GMOD
COP 4343 Unix System Administration
CompTIA Server+ Certification (Exam SK0-004)
Overview – SOE PatchTT November 2015.
BIND 10 Packaging & Distribution
More Scripting & Chapter 11
RedHat Package Management
University of Texas Rio Grande Valley Systems Administration CSCI 6175
Presentation transcript:

Biopackages.net Operating System Packages for Bioinformatics Allen Day

What is a package?  Software, config files, documentation, and/or data encapsulated in a single file  Metadata describing: Version, license, package “category” Dependencies What the package provides

 GMOD target audience Small MODs

Package Dependency Graph Dependencies What the package provides chado chado-Hsa genome-Hsa-nibucsc-blat genome-Hsa-annotation-affymetrix genome-Hsa-annotation-gene postgresql-AffxSeq postgresql-server perl-bioperl obo-core perl-go-perl

Dependencies  Build Dependency  Installation Dependency

What is a Package Manager?  Tools to manage installation, upgrade, uninstallation of packages Verify package integrity (checksums) Maintain system integrity  Transactional  Allow rollbacks Dependency checking Dependency graph recursion Allow software customization (patches)

Current Generation of PMs  RPM  Dpkg  Apt  Yum  Emerge  tgz/bz2  Windows Installer

Why bioinformatics packages?  Consistency of installation process Bioinfo. package installs vary wildly, and commonly lack documentation  Automatic dependency installation Perl modules especially bad – bioperl has 60+ modules in its dependency tree  Integrity/Auditing of system state Know an installed package works, which version, how to replicate system setup  Tighter integration with operating system Daemons, config & log file locations, etc.

What’s available?  RPM packages only right now Primary focus on Fedora Core 2  Some RPMs also available for Fedora Core 3 RedHat 9 Cygwin

What’s available?  Three primary foci Applications Libraries Data sets

Applications  Gbrowse  Textpresso  BLAT daemon  NCBI Toolkit (BLAST, etc)  HMMer

What’s available?  Libraries Bioperl R & Bioconductor Squid EMBOSS

What’s available?  Data sets Genome & protein sequence Sequence features Ontologies All installed using a common directory structure

What’s available?  UCSC tools (utilities, BLAT system service, CGI scripts)  Bioperl  R / Bioconductor  GMOD apps (Gbrowse, Textpresso, …)  Data packages Genome sequence (fa, nib, blastdb) Genome features (Affy probeset alignments, mRNA, etc)

GMOD Components Available chado-Hsagbrowsetextpresso gmod-web-Hsa turnkey chado das2-Hsa apollo-Hsa cmap-Hsa  ‘Hsa’ can be substituted for your organism  Currently built for ‘Cel’, ‘Hsa’, ‘Sce’ ucsc-BLATgenome-Hsa-nib

More details… chado chado-Hsa genome-Hsa-nibucsc-blat perl-go-perl genome-Hsa-annotation-affymetrix genome-Hsa-annotation-gene postgresql-AffxSeq postgresql-server perl-bioperl ……………

Gene Expression Components chado-HsaBioconductorR Quant/Norm Pipeline chado-GEC DAS/2 for Genotyping, GeneChip

Resources  ~1000 RPMs for Fedora Core 2, 3 Available via yum  See site for a configuration example.

TODO  Support more architectures Build for Cygwin & OS X. RPM has been ported to both  Automate package build process Build farm of multiple architectures, controllable via scheduler (GridEngine)  Automate (if possible) inclusion of new software / data releases

TODO  Build community interest and involvement Keep adding more packages! Keep existing packages current!

Acknowledgements  Patrick Alger  Jared Fox  Brian O’Connor  Todd Harris  Lincoln Stein  Stanley Nelson

Anatomy of a specfile  Metadata Name Depends Provides Changelog  Build & install script hooks %prep %build %install %post %preun