Spring Cleaning the Software Repositories Matthias Schröder


Topics:
- Introduction
- The Issue
- What can be done
- Conclusion

27-March-2008

Introduction

- Software repositories hold all packages used on any quattor-managed node
  - Organised by ARCH_OS
- We keep all versions of packages ever put into the repositories
  - To be sure that we can reinstall a machine if needed
- The number of packages grows with time…

ARCH_OS        Number of packages
i386_slc3      22271
x86_64_slc3    5389
i386_slc4      15389
x86_64_slc4    16369

The Issue

- The number of packages has reached a critical level
- Going through the listings to find packages takes longer and longer
  - Panc compile times are directly affected
- Impossible to upload the repository listings to CDBs with little memory
  - OK, 1 GB is not a lot, but still…
- Impossible to compile templates for enclosures with many nodes
  - Running out of memory even on CDBs with lots of installed memory…
- We have to clean up the repositories now!

Possible Ways to Clean Up

- Prepare a positive list for the most frequent packages, keep all others
  - List of versions to keep is difficult to obtain
  - We can't be sure that we will keep all versions required
  - … and we cannot count on panc to warn us
  - We might only discover that packages are missing when we have to reinstall a node
- Prepare a list of packages to remove
  - Might be difficult to obtain for all packages
  - Same risk as above
  - Tedious
  - We would probably keep many versions we don't use any more => too little gain
- Make a list of versions still in use from the XML profiles of all nodes
  - Safest
  - But we also have to keep newer versions not yet in use
  - Highest gain
  - Needs a new tool to extract the list of packages and versions from XML profiles
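The third option needs a tool that collects every (package, version, architecture) triple referenced by the node profiles. A minimal Python sketch of such a tool follows. The profile layout used here (nested `nlist` elements under a `packages` node, with an `arch` string) is a simplified assumption for illustration, as are the sample package names and versions; real profiles may escape names and nest differently, so the traversal would need adapting.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified profile (layout, names and versions are assumptions).
SAMPLE_PROFILE = """
<nlist name="profile">
  <nlist name="software">
    <nlist name="packages">
      <nlist name="castor-lib">
        <nlist name="2.1.6-10">
          <string name="arch">i386</string>
        </nlist>
      </nlist>
      <nlist name="kernel">
        <nlist name="2.4.21-47">
          <string name="arch">i686</string>
        </nlist>
      </nlist>
    </nlist>
  </nlist>
</nlist>
"""


def packages_in_profile(xml_text):
    """Return the set of (name, version, arch) triples found in one profile."""
    root = ET.fromstring(xml_text)
    found = set()
    for node in root.iter("nlist"):
        if node.get("name") != "packages":
            continue
        # First level below "packages": package names; second level: versions.
        for pkg in node.findall("nlist"):
            for ver in pkg.findall("nlist"):
                arch_el = ver.find("string[@name='arch']")
                arch = arch_el.text if arch_el is not None else None
                found.add((pkg.get("name"), ver.get("name"), arch))
    return found


def versions_in_use(profile_texts):
    """Union of the package triples over all node profiles."""
    in_use = set()
    for text in profile_texts:
        in_use |= packages_in_profile(text)
    return in_use


in_use = versions_in_use([SAMPLE_PROFILE])
```

The resulting set is only the list of versions currently referenced; newer versions not yet in use still have to be kept on top of it.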

Tests Done

- With a positive list for castor and CAs for i386_slc3
  - But the castor list is now known to be too small…
- Results very encouraging
  - Have to keep more castor versions
  - Can gain more on other packages

                 Full repo listing   Svelte repo listing
# packages       22271               14868
Compile time     54 s                35 s
Memory usage     3.8 GB              1.7 GB
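To put the test results in relative terms, the short Python calculation below derives the reduction for each row of the table. The dictionary keys are illustrative names; the numbers are those from the i386_slc3 test.

```python
# Figures from the i386_slc3 test: full listing vs. svelte listing.
full = {"packages": 22271, "compile_time_s": 54, "memory_gb": 3.8}
svelte = {"packages": 14868, "compile_time_s": 35, "memory_gb": 1.7}


def relative_reduction(before, after):
    """Fraction by which a quantity shrank, e.g. 0.25 for a 25% reduction."""
    return (before - after) / before


reductions = {key: relative_reduction(full[key], svelte[key]) for key in full}

for key, fraction in reductions.items():
    print(f"{key}: {fraction:.1%}")
```

This works out to roughly a third fewer packages, about 35% less compile time and about 55% less memory, so the memory saving is proportionally larger than the reduction in the number of packages.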

How to Continue

- We have to be very careful with removal of versions
- We have to reduce the repo listings now
  - Some profiles do not compile any more!
- My proposal: two-step approach
  - Step 1: Remove known obsolete castor and CA versions, keep all the rest
    - These are the packages for which the biggest gain can be achieved with little work
  - Step 2: Make a listing of all used versions from XML profiles
    - Maximum gain, safe method, but takes longer to prepare
- Have to rethink how we handle kernels (i386_slc3: 1122 “kernel” packages!)
  - Only keep those kernels that are configured on nodes?
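Step 2 can be sketched as a set operation: everything in the repository listing that is neither in use nor newer than the newest version in use becomes a removal candidate. The Python sketch below makes two assumptions that are not part of the proposal itself: versions are ordered with a naive digit-aware comparison (real RPM version comparison is more involved), and packages that no profile references at all are left for manual review rather than removed automatically. The package names and versions are made up for illustration.

```python
import re
from collections import defaultdict


def version_key(version):
    """Naive ordering key (an assumption): numeric runs compare as integers."""
    key = []
    for number, text in re.findall(r"([0-9]+)|([^0-9]+)", version):
        key.append((0, int(number)) if number else (1, text))
    return key


def removal_candidates(repo_listing, in_use):
    """Both arguments are sets of (name, version) tuples."""
    used_by_name = defaultdict(list)
    for name, version in in_use:
        used_by_name[name].append(version)

    candidates = set()
    for name, version in repo_listing:
        if (name, version) in in_use:
            continue  # still configured on some node
        used = used_by_name.get(name)
        if not used:
            continue  # never referenced: left for manual review (assumption)
        newest_used = max(used, key=version_key)
        if version_key(version) > version_key(newest_used):
            continue  # newer than anything in use: keep for future upgrades
        candidates.add((name, version))
    return candidates


# Hypothetical listing: three kernel versions, one of them in use.
repo = {
    ("kernel", "2.4.21-40"),
    ("kernel", "2.4.21-47"),
    ("kernel", "2.4.21-50"),
    ("foo", "1.0"),
}
used = {("kernel", "2.4.21-47")}
candidates = removal_candidates(repo, used)
```

With this rule only the older, unused kernel ends up in `candidates`; the version in use and the newer one are both kept.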

What to do With Obsolete Versions

- We should not delete them
  - Disk space is not an issue
- Copy them to another place on SWREP
- Then remove them via swrep-soap-client
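The copy-first, remove-second order can be sketched in Python as below. The directory arguments and the function name are hypothetical, and the final removal through swrep-soap-client is deliberately left out because its invocation is not described here; the function only returns the files that were copied successfully and are therefore safe to hand to the removal step.

```python
import shutil
from pathlib import Path


def archive_packages(filenames, repo_dir, archive_dir):
    """Copy obsolete package files aside; return the names safe to remove.

    repo_dir and archive_dir are hypothetical paths chosen by the caller.
    Nothing is deleted here.
    """
    repo_dir = Path(repo_dir)
    archive_dir = Path(archive_dir)
    archive_dir.mkdir(parents=True, exist_ok=True)

    safe_to_remove = []
    for name in filenames:
        source = repo_dir / name
        target = archive_dir / name
        if not source.is_file():
            continue  # nothing to archive for this entry
        shutil.copy2(source, target)  # copy contents and metadata
        # Simple sanity check before the file may be removed from the repo.
        if target.stat().st_size == source.stat().st_size:
            safe_to_remove.append(name)
    return safe_to_remove
```

Only the names returned by `archive_packages` would then be passed on for removal from the repository, so a failed copy never leads to a lost package.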

Conclusion

- We have to reduce the size of the repository listings
- This is a delicate operation; we have to ensure that all used versions are kept
- Can we agree on the two-step approach?