Subtitle of Presentation

Slides:



Advertisements
Similar presentations
How To: Insert Headers and Footers
Advertisements

Report of Liverpool HEP Computing during 2007 Executive Summary. Substantial and significant improvements in the local computing facilities during the.
Duke Atlas Tier 3 Site Doug Benjamin (Duke University)
Mainframe Replication and Disaster Recovery Services.
Technology Steering Group January 31, 2007 Academic Affairs Technology Steering Group February 13, 2008.
Secure Off Site Backup at CERN Katrine Aam Svendsen.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Efficiently Sharing Common Data HTCondor Week 2015 Zach Miller Center for High Throughput Computing Department of Computer Sciences.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 8: Implementing and Managing Printers.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
A+ Guide to Hardware: Managing, Maintaining, and Troubleshooting, Sixth Edition Chapter 9, Part 11 Satisfying Customer Needs.
UCL Site Report Ben Waugh HepSysMan, 22 May 2007.
Introduction Optimizing Application Performance with Pinpoint Accuracy What every IT Executive, Administrator & Developer Needs to Know.
Status of WLCG Tier-0 Maite Barroso, CERN-IT With input from T0 service managers Grid Deployment Board 9 April Apr-2014 Maite Barroso Lopez (at)
Speaker Notes for employee discussions on how Canada Post is getting greener – August 2008 The Issues Canada Post needs to upgrade its current printer.
OSG Public Storage and iRODS
ISG We build general capability Introduction to Olympus Shawn T. Brown, PhD ISG MISSION 2.0 Lead Director of Public Health Applications Pittsburgh Supercomputing.
© CCI Learning Solutions Inc. 1 Lesson 5: Basic Troubleshooting Techniques Computer performance Care of the computer Working with hardware Basic maintenance.
Paul Scherrer Institut 5232 Villigen PSI HEPIX_AMST / / BJ95 PAUL SCHERRER INSTITUT THE PAUL SCHERRER INSTITUTE Swiss Light Source (SLS) Particle accelerator.
ATLAS DC2 seen from Prague Tier2 center - some remarks Atlas sw workshop September 2004.
System Development Lifecycle Verification and Validation.
TV/Movie Blog Instructions 1.Open a word document. Set the document to double space. Type your name and the date at the top.
Test and Review chapter State the differences between archive and back-up data. Answer: Archive data is a copy of data which is no longer in regular.
Infrastructure for QA and automatic trending F. Bellini, M. Germain ALICE Offline Week, 19 th November 2014.
1 User Analysis Workgroup Discussion  Understand and document analysis models  Best in a way that allows to compare them easily.
DoC Private IaaS Cloud Thomas Joseph Cloud Manager
2-Dec Offline Report Matthias Schröder Topics: Scientific Linux Fatmen Monte Carlo Production.
Tier 3 Status at Panjab V. Bhatnagar, S. Gautam India-CMS Meeting, July 20-21, 2007 BARC, Mumbai Centre of Advanced Study in Physics, Panjab University,
VO Box Issues Summary of concerns expressed following publication of Jeff’s slides Ian Bird GDB, Bologna, 12 Oct 2005 (not necessarily the opinion of)
Status of India CMS Grid Computing Facility (T2-IN-TIFR) Rajesh Babu Muda TIFR, Mumbai On behalf of IndiaCMS T2 Team July 28, 20111Status of India CMS.
ISG We build general capability Introduction to Olympus Shawn T. Brown, PhD ISG MISSION 2.0 Lead Director of Public Health Applications Pittsburgh Supercomputing.
Microsoft ® Excel 2010 Core Skills Lesson 5 Viewing and Printing Workbooks Courseware #: 3243 Microsoft ® Office Excel 2010.
1 NATO UNCLASSIFIED Joe Delorie DSPO NATO Standardization Tasking Review and Analysis Process (STRAP) DoD Standardization Conference March 16-18, 2004.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES AI Images, flavours and partitions Vítor Gouveia,
The RAL PPD Tier 2/3 Current Status and Future Plans or “Are we ready for next year?” Chris Brew PPD Christmas Lectures th December 2007.
II EGEE conference Den Haag November, ROC-CIC status in Italy
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Agile Infrastructure Project Overview : Status and.
INFN/IGI contributions Federated Clouds Task Force F2F meeting November 24, 2011, Amsterdam.
Job offer IT System & Software Specialist We are currently looking for an IT database administrator in order to respond to one key-account customer demand.
The CMS Beijing Tier 2: Status and Application Xiaomei Zhang CMS IHEP Group Meeting December 28, 2007.
Computing Infrastructure Arthur Kreymer 1 ● Power status in FCC (UPS1) ● Bluearc disk purchase – coming soon ● Planned downtimes – none ! ● Minos.
Computing Infrastructure Arthur Kreymer 1 ● Power status in FCC (UPS1) ● Bluearc disk purchase – still coming soon ● Planned downtimes – none.
Valencia Cluster status Valencia Cluster status —— Gang Qin Nov
HPC In The Cloud Case Study: Proteomics Workflow
WLCG IPv6 deployment strategy
DCS Status and Amanda News
The Beijing Tier 2: status and plans
Installation 1. Installation Sources
Examples Example: UW-Madison CHTC Example: Global CMS Pool
Diskpool and cloud storage benchmarks used in IT-DSS
UBUNTU INSTALLATION
Luca dell’Agnello INFN-CNAF
Oxford Site Report HEPSYSMAN
Welcome! Thank you for joining us. We’ll get started in a few minutes.
Статус ГРИД-кластера ИЯФ СО РАН.
PES Lessons learned from large scale LSF scalability tests
Status of Full Simulation for Muon Trigger at SLHC
Debunking the Top 10 Myths of Small Business Server: Using Windows SBS in Larger Environments Abstract: This session will debunk some of the common myths.
Chapter III, Desktop Imaging Systems and Issues: Lesson II Storing Image Data
The Scheduling Strategy and Experience of IHEP HTCondor Cluster
THE BASICS.
Experience with an IT Asset Management System
Exploring the Power of EPDM Tasks - Working with and Developing Tasks in EPDM By: Marc Young XLM Solutions
The Problem ~6,000 PCs Another ~1,000 boxes But! Affected by:
Specifications Clean & Match
Trillo Apparel Company Revised Delivery Plan
PerformanceBridge Application Suite and Practice 2.0 IT Specifications
Drupal user guide Evashni Jansen Web Office.
CS-Status Results from workshop 2008 Statistics Miscellaneous
Credential Management in HTCondor
Presentation transcript:

Subtitle of Presentation NAF Status Report Subtitle of Presentation Yves Kemp DESY 11.10.2017

DUST

DUST – Doubling capacity Purchase Exactly doubling the capacity Data and Metadata Current capacity: Proposal Doubling capacity of the active groups … not exactly doubling: Reduce overcommitment factor (ideally from 1.25 to 1) x2 | Presentation Title | Name Surname, Date (Edit by "Insert > Header and Footer")

DUST – Upgrade Plan Last steps First step: Done New quotas: Do be done Need to upgrade current DUST from 4.0 -> 5.0 During operation Unfortunately, HW trouble occurred (also inducing temporary performance issues), delaying upgrade Second step: Done Current DUST from 5.0 -> 5.2 During Operation, no troubles seen Third step: Done Install new components with 5.2 Enlarge Systems Last steps New quotas: Do be done | Presentation Title | Name Surname, Date (Edit by "Insert > Header and Footer")

DUST – Metadata Problem Another solution Metadata is located on special server with SSD Space is tight … posing problems some weeks ago: People could no longer write into DUST despite quota available We contacted experts and users, ehm, little response If response, turned out to be garbage In future, we have to resort to more drastic approaches to ensuring functionality E.g. setting quota of problematic users to zero We do not want to set a Metadata Quota With current ratio Metadata/Data, ~700 TB of enlarged DUST cannot be written as not enough metatada available Another solution Purchase another MetaData server With this server, we will be just safe to write the ~700 TB data with current ratio FYI: This solution costs ~100k EUR FYI: User file count available via AMFORA quota management for admins (not for users currently) | Presentation Title | Name Surname, Date (Edit by "Insert > Header and Footer")

BIRD: SGE and HTCondor

BIRD: SGE Status Well used Some problems, usually because SGE AFS volume troubled Reason unclear. We suspect user induced troubles, but cannot pinpoint this further Solution: Switch to another batch system | Presentation Title | Name Surname, Date (Edit by "Insert > Header and Footer")

BIRD: Hardware Status Currently 495 WNs (in SGE, plus 25 in HTCondor pilot) ~60 are rather old, will be decommissioned end 2017 ~60 new WNs will be purchased: 20 cores with 128 GB RAM each Would like to put them into operation with HTCondor And use some for WGS server | Presentation Title | Name Surname, Date (Edit by "Insert > Header and Footer")

BIRD: HTCondor Status Pilot is there: Working AFS and Kerberos Initial documentation for migration http://bird.desy.de/htc/ In progress Dedicated WGS for experiments (small VMs, would be transformed into more powerful physical machines when pilot is advancing) Pilot user mailing list What we lack are test users that actually do tests Contact e.g. sge.service@desy.de for becoming a pilot user The funny patterns in the occupation come from YK to present at least something for the PRC next week | Presentation Title | Name Surname, Date (Edit by "Insert > Header and Footer")

BIRD: HTCondor … next steps Next steps: Short term future User testing and reporting Fixing things like missing DUST mounts etc. (DONE) User input: SSH-to-job and DAGman is considered useful, will be implemented Growing #WNs: Move additional 33 WNs from SGE to HTCondor Setting up powerful WGS as needed Adding IPv6 (almost done) Next steps: Longer future Monitoring and accounting Migration: Add even more nodes still 2017 (e.g. new purchases) -> 50/50 SGE/HTCondor on 1.1.2018 2018Q1: Push all users to migrate 1.4.2018: Shut down BIRD/SGE 2018Q2: Work on containers to help SL6 -> EL7 migration Currently only IT managed containers envisionned Users might be offered Singularity 2018Q3: Integration Grid/HTCondor with BIRD/HTCondor | Presentation Title | Name Surname, Date (Edit by "Insert > Header and Footer")

Meetings

Meetings PRC next week Regular NUC meetings NAF Users Meeting @TeraScale meeting | Presentation Title | Name Surname, Date (Edit by "Insert > Header and Footer")