James N. Bellinger University of Wisconsin at Madison 28 July 2010

Slides:



Advertisements
Similar presentations
Lectures on File Management
Advertisements

Online Magazine Bryan Ng. Goal of the Project Product Dynamic Content Easy Administration Development Layered Architecture Object Oriented Adaptive to.
16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.
Introduction To C++ Programming 1.0 Basic C++ Program Structure 2.0 Program Control 3.0 Array And Structures 4.0 Function 5.0 Pointer 6.0 Secure Programming.
S. Veseli - SAM Project Status SAMGrid Developments – Part I Siniša Veseli CD/D0CA.
An introduction for Data Reporters. College Credit Plus Replaces PSEO Replaces dual enrollment.
…using Git/Tortoise Git
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Code Management James N. Bellinger University of Wisconsin at Madison 16 March
Optimistic Design 1. Guarded Methods Do something based on the fact that one or more objects have particular states  Make a set of purchases assuming.
SONIC-3: Creating Large Scale Installations & Deployments Andrew S. Neumann Principal Engineer, Progress Sonic.
SONIC-3: Creating Large Scale Installations & Deployments Andrew S. Neumann Principal Engineer Progress Sonic.
Preparing for the OCR Functional Skills Maths Assessment
University of Macau Faculty of Science and Technology Programming Languages Architecture SFTW 241 spring 2004 Class B Group 3.
Code Management James N. Bellinger University of Wisconsin at Madison 8 December
1 Update at RAL and in the Quattor community Ian Collier - RAL Tier1 HEPiX FAll 2010, Cornell.
Migration to SL5 James Bellinger University of Wisconsin at Madison 7-Apr
Migration to SL5 James Bellinger University of Wisconsin at Madison 21-Apr
Code Migration James N. Bellinger University of Wisconsin at Madison 7 July
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
10 QuickBooks issues and how to resolve them. Updating the data file fails This can be very tricky. When you are updating QuickBooks from one version.
Data Import Export Framework: Performance and Troubleshooting André Arnaud de Calavon, Kaya Consulting.
UNICORE and Argus integration Krzysztof Benedyczak ICM / UNICORE Security PT.
Lesson 9: SOFTWARE ICT Fundamentals 2nd Semester SY
EAuthentication What is it? What is needed for FFAVORS?
DXL to PST Converter presents
Course Contents KIIT UNIVERSITY Sr # Major and Detailed Coverage Area
Chapter 1: Introduction
High School Application Process School Year
IW2D migration to HTCondor
Required Data Files Review
RCM Turbo SQL Version.
LCGAA nightlies infrastructure
James N. Bellinger University of Wisconsin at Madison 3-August-2011
Building Grids with Condor
Application Development Theory
Quicken Contact Support Number You need to have a good printing machine so that you can print the data files or reports that have been prepared.
Quicken Configuration Issues Number On your Quicken dashboard, you should be able to find each details of the transactions. In some cases,
University of Wisconsin at Madison
Chapter 1: Introduction
Introduction to Configuration Management
University of Wisconsin at Madison
University of Wisconsin at Madison
Weird Stuff I Saw While ... Supporting a Java Team
Diskless network security
James N. Bellinger University of Wisconsin at Madison 10 November 2010
Completing the tasks for A452 with….
University of Wisconsin at Madison
James N. Bellinger University of Wisconsin at Madison 13 October 2010
University of Wisconsin-Madison
CSCI 2141 – Intro to Database Systems
File Handling Programming Guides.
Conditions and Ifs BIS1523 – Lecture 8.
University of Wisconsin at Madison
University of Wisconsin at Madison
James N. Bellinger University of Wisconsin at Madison 3 November 2010
James N. Bellinger University of Wisconsin at Madison 21 July 2010
Lesson Objectives Aims Understand how machine code is generated
James N. Bellinger University of Wisconsin at Madison 6 October 2010
University of Wisconsin at Madison BMU Alignment Corrections
University of Wisconsin at Madison
University of Wisconsin at Madison
* 2000/08/1307/16/96 This presentation will probably involve audience discussion, which will create action items. Use PowerPoint to keep track of these.
The Problem ~6,000 PCs Another ~1,000 boxes But! Affected by:
University of Wisconsin at Madison
James N. Bellinger University of Wisconsin at Madison 18-January-2012
James N. Bellinger University of Wisconsin at Madison 11-January-2012
James N. Bellinger University of Wisconsin at Madison 4 August 2010
University of Wisconsin at Madison
Running & Testing Programs :: Translators
Presentation transcript:

James N. Bellinger University of Wisconsin at Madison 28 July 2010 Code James N. Bellinger University of Wisconsin at Madison 28 July 2010 28 July 2010

Migration News First stage of migration is complete Problems as of 9:26 28-July Build of development on SL5 was delayed no machine where we can update it directly Cruft remained from merely moving the old development instead of deleting it setup python v2_1 complains CafSubmit generates errors but apparently works anyway except on SL4 where there were kerberos problems setup cdfsoft2 development does not yet work on ILP SL3 Not sure about remote sites: probably not there either 28 July 2010

Postmortem: Subgroup to discuss later What went wrong? CafSubmit failure on SL4 development not ready on Wednesday morning What should have been done better? I’ve got a little list. Make it longer. What loose ends are there? 28 July 2010 3

Code Speedup valgrind/callgrind works w/ maxopt ProductionExe 14% of time spent in new/malloc/free/delete/strings 9% of time spent in Chebyshev polynomial calculations for silicon: Looks like 2% at least saved if save results instead of recalc each time String usage is driven by DBCorrector::correctLadderSegment 3.4% ?? CdfTrackHits::class_name 0.5% 28 July 2010 4

Where used? Largest usage in a single routine: chebFit chebyshev: could call once and save results 28 July 2010 5

For Subgroup Meeting 28 July 2010 6

What went wrong? Glitches in final move and build of development versions CafSubmit failed on SL4 What else? 28 July 2010 7

What should have been done better? Earlier coordination with infrastructure Timeline for conversion, plan tests as it progressed Did not start early for testing batch operation I should have been on this earlier Needed to shake down as many “tensor elements” of submission as possible; farm location: originating site: file access method Find CafSubmit expert early on to get it repaired Spell out all the details People’s memories differed. My fault 28 July 2010 8

What should have been done better: 2? Physics group tests might have been easier to arrange without ICHEP deadlines 28 July 2010 9

Loose Ends We are carrying around python v2_1 Why? CafSubmit code still needs rework large task, populate cafutils Can we simply move the code as is? (modulo logical names) Atom cluster should use central code server? Revisit xrootd server issues Ancient versions are still out there Non-frontierized remote site submissions 28 July 2010 10