CD FY10 Budget and Tactical Plan Review FY10 Tactical Plan for SCF / System Administration DocDB #3389 Jason Allen 10/06/2009.

Slides:



Advertisements
Similar presentations
Managing Hardware and Software Assets
Advertisements

Digital Edge Solutions Overview Services – Application Support.
IT Asset Management Status Update 02/15/ Agenda What is Asset Management and What It Is Not Scope of Asset Management Status of Key Efforts Associated.
Team 1: Aaron, Austin, Dan, Don, Glenn, Mike, Patrick.
Yale University Information Technology Services Administrative Systems Art Hunt 3/22/04 Software Service Level Agreement with Finance, Procurement and.
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
NOT FOR PUBLIC DISTRIBUTION State of Minnesota Technology Summary February 24, 2011.
Managing the Information Technology Resource Jerry N. Luftman
CD FY08 Tactical Plan Status FY08 Tactical Plan Status Report for Network Operations & Administration Rick Finnegan April 22, 2008.
CD FY08 Tactical Plan Status FY08 Tactical Plan Status Report for Network Infrastructure Upgrades Rick Finnegan April 22, 2008.
Basel Accord IITRANSITIONSERVICES Business Integration Support FCM Management Limited Paris New York Toronto.
Cloud Computing. 2 A division of Konica Minolta Business Solutions USA Inc. What is Cloud Computing? A model for enabling convenient, on-demand network.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Computer Security Ron Cudzewicz October 8, 2009 Tactical plan names listed here…DocDB#
Section 11.1 Identify customer requirements Recommend appropriate network topologies Gather data about existing equipment and software Section 11.2 Demonstrate.
Introduction Optimizing Application Performance with Pinpoint Accuracy What every IT Executive, Administrator & Developer Needs to Know.
Natick Public Schools Technology Presentation February 6, 2006 Dennis Roche, CISA Director of Technology.
SOLUTIONS FOR THE EFFICIENT ENTERPRISE Sameer Garde Country GM,India.
Asset Record Does Not Equal CI: The confusion between Asset and Configuration Management Christine M. Russo Manager, IT Asset Management and Property.
NCSX Management Overview Hutch Neilson, NCSX Project Manager NCSX Conceptual Design Review Princeton, NJ May 23, 2002.
Jack Schmidt Fermilab NLIT  The 1 st Year  Staffing Issues  Internal Tool Audit  Problem Management  Change management  Process Improvements.
Installation and Maintenance of Health IT Systems
The Solution To Help You Take Control of Printing.
CD FY09 Tactical Plan Status FY09 Tactical Plan Status Report for Site Networking Anna Jordan April 28, 2009.
CD FY09 Tactical Plan Review FY09 Tactical Plans for Database Services J.Trumbo Sept. 24, 2008.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Financial Management Valena Sibley October 8, 2009 Tactical plan nameDocDB# FY10 Tactical.
SCF/FEF Virtualization Strategy Jason Allen August 12, 2009.
Using Virtual Servers for the CERN Windows infrastructure Emmanuel Ormancey, Alberto Pace CERN, Information Technology Department.
CD FY08 Tactical Plan Status FY08 Tactical Plan Status Report for Information Management Matt Arena June 16, 2008.
CD FY09 Tactical Plan Review FY10 Tactical Plans for Administrative Support (DAS) Griselda Lopez September 26, 2008.
1 Evolution and Revolution: Windows 7 and Desktop Virtualization How to Accelerate Migration to Windows 7 Miguel Sian, Sr. Enterprise Solutions Consultant.
CD FY08 Tactical Plan Status FY08 Tactical Plan Status Report for Network Infrastructure Upgrades Rick Finnegan April 22, 2008.
Workshop on Computing for Neutrino Experiments - Summary April 24, 2009 Lee Lueking, Heidi Schellman NOvA Collaboration Meeting.
CERN Physics Database Services and Plans Maria Girone, CERN-IT
Telecom Management: Six Questions You Need to Ask Liz Carroll Manager – Legal Solutions
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Database Services Nelly Stanfield October 7, 2009 Database Services3425-v1.
CD FY08 Tactical Plan Status FY08 Tactical Plan Status Report for Videoconf Support Sheila Cisko 6/17/2008.
Commodity Node Procurement Process Task Force: Status Stephen Wolbers Run 2 Computing Review September 13, 2005.
Lecture 4. IS Planning & Acquisition To be covered: To be covered: – IS planning and its importance Cost-benefit analysis Cost-benefit analysis Funding.
US ATLAS Tier 1 Facility Rich Baker Brookhaven National Laboratory Review of U.S. LHC Software and Computing Projects Fermi National Laboratory November.
FEF Puppet Implementation Project Jason Allen 8/18/2010.
CD FY09 Tactical Plan Review FY09 Tactical Plans for ES&H Amy Pavnica September 26, 2008.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Database Services [Presenter’s Name] [Date] Database Services3425-v1.
CD FY09 Tactical Plan Status FY09 Tactical Plan Status Report for Neutrino Program (MINOS, MINERvA, General) Margaret Votava April 21, 2009 Tactical plan.
CD FY08 Tactical Plan Status FY08 Tactical Plan Status Report for LSCS-DBI-APP Dennis Box June
Bay Ridge Security Consulting (BRSC) Cloud Computing.
Computing Division FY03 Budget and budget outlook for FY04 + CDF International Finance Committee April 4, 2003 Vicky White Head, Computing Division.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
CD FY09 Tactical Plan Review FY09 Tactical Plans for Scientific Database Applications Igor Mandrichenko Sept. 24, 2008.
Introduction to ITIL and ITIS. CONFIDENTIAL Agenda ITIL Introduction  What is ITIL?  ITIL History  ITIL Phases  ITIL Certification Introduction to.
Staff Assessment Technology Services Department Palmyra Area School District.
R. Krempaska, October, 2013 Wir schaffen Wissen – heute für morgen Controls Security at PSI Current Status R. Krempaska, A. Bertrand, C. Higgs, R. Kapeller,
Projects, Tools and Engineering Patricia McBride Computing Division Fermilab March 17, 2004.
Practical IT Research that Drives Measurable Results Get Started Bringing Order to Help Desk Request Chaos.
Status: Central Storage Services CD/LSC/CSI/CSG June 26, 2007.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Computing Infrastructure(CD Business Apps & CD Infrastructure Apps) Jim Fromm/Scott Nolan.
Town of Watertown Staffing and Operational Assessment of the Public Works Department September 10, 2013 EDWARD J. COLLINS CENTER FOR PUBLIC MANAGEMENT.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Scientific Computing Facilities / General Physics Computing Facility (GPCF) Stu Fuess 06-Oct-2009.
GPCF* Update Present status as a series of questions / answers related to decisions made / yet to be made * General Physics Computing Facility (GPCF) is.
CD FY09 Tactical Plan Review FY09 Tactical Plans for Computing Infrastructure Igor Mandrichenko 9/24/2008.
Computing Infrastructure Arthur Kreymer 1 ● Power status in FCC (UPS1) ● Bluearc disk purchase – coming soon ● Planned downtimes – none ! ● Minos.
Minos Computing Infrastructure Arthur Kreymer 1 ● Grid – Going to SLF 5, doubled capacity in GPFarm ● Bluearc - performance good, expanding.
Chapter 6: Securing the Cloud
PSS Plans for Improved Reliability and Availability
Building a Virtual Infrastructure
FY09 Tactical Plan Status Report for Site Networking
Public Employees Retirement Association Infrastructure Upgrade
FY10 Tactical Plans for Enterprise Information Systems
Public Employees Retirement Association Infrastructure Upgrade
FY10 Tactical Plans for Enterprise Information Systems
UpgradeX and CloudSuite
Presentation transcript:

CD FY10 Budget and Tactical Plan Review FY10 Tactical Plan for SCF / System Administration DocDB #3389 Jason Allen 10/06/2009

CD FY10 Budget and Tactical Plan Review 2 FY10 Tactical Plan for SCF / System Administration FEF Members Jason Allen Glenn Cooper Ed Simmonds LaDerrick Honeycutt Ling Ho Jason Harrington Etta Burns Seth Graham Mark Schmitz Rennie Scott Current Customers D0 Offline D0 Online CDF Offline CDF Online EAG Minerva MiniBoone Minos SciBoone GP Farm MIPP SCF / System Administration plan is executed by the SCF/FEF Department.

CD FY10 Budget and Tactical Plan Review 3 FY10 Tactical Plan for SCF / System Administration Tactical Plan Leader: Jason Allen Service Activity List Online Systems Management Compute Node Management Server Management Storage Management Batch System Management Event and Incident Management Problem Management Operational Planning and Consulting Support Procurement Support Professional Development Project Activity List Short Term Projects

CD FY10 Budget and Tactical Plan Review 4 Service Activity: Online Systems Management Goals Related to this Activity –Common goal for all services: Support scientific computing at Fermilab by providing server, compute node, and storage management. Constantly strive to improve operational efficiency while maintaining a high level of customer satisfaction. Key Metrics –Tickets per month –Number of systems Service Documentation : Issues and Risks (specific to this activity, includes allocation impact) 1.Aging hardware with no plans to replace. 2.Greater demands being put on CD staff to keep equipment running. 3.Little control over operational decisions.

CD FY10 Budget and Tactical Plan Review 5 Service Activity: Compute Node Management Goals Related to this Activity –Common goal for all services: Support scientific computing at Fermilab by providing server, compute node, and storage management. Constantly strive to improve operational efficiency while maintaining a high level of customer satisfaction. Key Metrics –Number of compute nodes managed, upgraded, and installed Service Documentation : Issues and Risks (specific to this activity, includes allocation impact) 1.Endemic issue when purchasing new hardware.

CD FY10 Budget and Tactical Plan Review 6 Service Activity: Server Management Goals Related to this Activity –Common goal for all services: Support scientific computing at Fermilab by providing server, compute node, and storage management. Constantly strive to improve operational efficiency while maintaining a high level of customer satisfaction. Key Metrics –Number of servers managed, upgraded, and installed. Service Documentation : Issues and Risks (specific to this activity, includes allocation impact) 1.Moving equipment to from one facility to another consumes a huge amount of effort. 2.Quality control of SLF and Fermi specific packages. Proper testing procedures must be followed to prevent deployment of bad RPMs. 3.Poor hardware support from vendors. 4.VM sprawl.

CD FY10 Budget and Tactical Plan Review 7 Service Activity: Batch System Management Goals Related to this Activity –Common goal for all services: Support scientific computing at Fermilab by providing server, compute node, and storage management. Constantly strive to improve operational efficiency while maintaining a high level of customer satisfaction. Key Metrics –Number of reported batch system related Service Desk tickets. Number of job slots on batch system. –Current and historical status: Service Documentation : Issues and Risks (specific to this activity, includes allocation impact) 1.Reliant on Torque which is a community supported Open Source batch system. Poor quality control is a concern.

CD FY10 Budget and Tactical Plan Review 8 Service Activity: Procurement Support Goals Related to this Activity –Common goal for all services: Support scientific computing at Fermilab by providing server, compute node, and storage management. Constantly strive to improve operational efficiency while maintaining a high level of customer satisfaction. Key Metrics –Total dollar amount of approved requisitions. Number of reqs approved. Service Documentation : Issues and Risks (specific to this activity, includes allocation impact) 1.Purchasing hardware with endemic problems. 2.Vendor lock-in.

CD FY10 Budget and Tactical Plan Review 9 Project Activity: Short Term Projects: Configuration Management Goals Related to this Activity –1. Reassess current tools and methods used for configuration management, identify strengths and weaknesses. –2. Evaluate potential replacement configuration management tools, Puppet etc. –3. Tighter coupling of configuration management, provisioning, and monitoring tools. –4. Better reporting and auditing of configuration changes. Key Milestones –Start: Fourth quarter CY 2009 –End: Second quarter CY 2010 Project Documentation : Issues and Risks (specific to this activity, includes allocation impact) 1.Possible service interruption while migrating to a new tool. 2.New tools could be less reliable than the old.

CD FY10 Budget and Tactical Plan Review 10 Project Activity: Short Term Projects: Business Intelligence Goals Related to this Activity –1. Construct a data store from various sources containing asset and operational data. –2. Deploy tools which allow historical and current operational views using ad-hoc or canned reports. Key Milestones –Start: First quarter CY 2010 –End: Second quarter CY 2010 Project Documentation : Issues and Risks (specific to this activity, includes allocation impact) 1.Reporting incorrect information.

CD FY10 Budget and Tactical Plan Review 11 Project Activity: Short Term Projects: GPCF Deployment To be discussed in the GPCF presentation.

CD FY10 Budget and Tactical Plan Review 12 FY10 FTE: Request vs. Allocation Level 0/1 Activity: SCF / System Administration Activity Level 2FTEs Operational Planning and Consulting Support0.15 Procurement Support0.45 Professional Development0.5 Batch System Management0.5 Compute Server Management1.0 Event and Incident Management1.5 Online Systems Management1.0 Problem Management0.5 Storage Management0.6 System Administration Management1.5 Server Management2.0 Total10.2

CD FY10 Budget and Tactical Plan Review 13 FY10 M&S: Request vs. Allocation Level 0/1 Activity: SCF / System Administration Activity Level 2Project or Service Project PriorityM&S RequestedM&S Allocated Professional DevelopmentService ---$15, Compute Server ManagementService ---$20, GP GridService --- $434, Server ManagementService ---$136, Total $605,000.00

CD FY10 Budget and Tactical Plan Review 14 Ripple Effect on Shared IT Services Activity Level 2Network Connectivity: Expanded Service  GP Grid140 switch portsNew, Steady- State service drives

CD FY10 Budget and Tactical Plan Review 15 M&S Requests Level 0/1 Activity: SCF/ System Administration GP Farm core count was determined by FermiGrid Services based on projected experiment need and node retirements. RequestDescriptionRisk of Reduced Allocation 434k1120 cores for GP FarmReduced analysis CPU 40kRacks and related infrastructure.Must reuse old hardware 41kAdmin and spare machines.Less operational reliability 40kVirtualization management software.Could postpone VM rollouts 20kReplace Lantronix and Opengear console servers. Less reliable console servers could mean more downtime

CD FY10 Budget and Tactical Plan Review 16 Summary of Past Action Items CDACTIONITEM-211: Need plan to conform D0 CAB cluster to OSE baseline. State: Open D0 OSE Taskforce lead by Mike Diesburg is examining how to bring CAB in line with OSE requirements. CDACTIONITEM-174: Batch system management (Torque /PBS) group? State: Closed It was determined after several meetings that FermiGrid Services doesn’t currently have sufficient effort to support the D0 CAB batch system. CDACTIONITEM-173: Desktop Computer Management should it be in a different group? State: Closed Support for CDF Desktops was migrated to Central Services at the beginning of 2009.

CD FY09 Tactical Plan Status 17 Tactical Plan Summary: FY09 Accomplishments Completed “Take over management of EAG servers” project. Completed “Upgrade and migrate CAB status web pages” project. Completed “Revamp system console and remote power-cycling infrastructure” project. Setup a new “interactive/batch cluster” for the Minerva experiment. Deployed virtualized clusters supporting high availability for D0 Offline and CDF Online.

CD FY09 Tactical Plan Status 18 Tactical Plan Summary: Objectives for FY10 Maintain existing scientific computing infrastructure for running Fermilab experiments. Scope includes system management, procurement of new systems, retiring old equipment, and troubleshooting technical issues. Improve system administration efficiency by streamlining procedures and refining existing system management infrastructure. Implement virtualization technologies in an effort to consolidate physical systems and increase operational reliability. (not virtualization, just for the sake of virtualization) Improve system monitoring (see business intelligence project).

CD FY09 Tactical Plan Status 19 Tactical Plan Summary: Objectives for FY10 (cont) Improve operational reporting. Document processes and procedures related to system management with the goal of improving service quality. Increase technical proficiency of department members. Improve technical proficiency of system administrators. Share technical expertise and standardize system administration tools/procedures with other CD departments. Support the division’s effort to implement ITIL Promote a safe and harmonious work environment

CD FY09 Tactical Plan Status 20 Tactical Plan Summary: Risk Assessment Reduction in available effort due to resignations, budget shortfalls, or reassignments. Increased number of requests from customers because of reduced support from scientific staff. This could be a particular problem with D0/CDF as RunII starts to wind down. Maintain high quality SLF and Fermi RPM releases. Proper testing procedures must be followed to prevent deployment of bad packages. Newer hardware may only run SLF 5.x or newer, experiment code must be compatible with current OS. Endemic problems with hardware purchases, especially disk vibration issues, are always a concern.

CD FY09 Tactical Plan Status 21 Tactical Plan Summary: Summary Significant challenges in FY10, particularly in regard to effort. Our focus this year to maintain stable operations. Lots of interesting things happening with virtualization. Hope to make significant improvements in configuration management and reporting. Significant challenges in FY10, particularly in regard to effort. Our focus this year to maintain stable operations. Lots of interesting things happening with virtualization. Hope to make significant improvements in configuration management and reporting.