Joël Surget / Pierre-Francois Honoré

Slides:



Advertisements
Similar presentations
IHEP Site Status Jingyan Shi, Computing Center, IHEP 2015 Spring HEPiX Workshop.
Advertisements

S. Gadomski, "ATLAS computing in Geneva", journee de reflexion, 14 Sept ATLAS computing in Geneva Szymon Gadomski description of the hardware the.
Technology Steering Group January 31, 2007 Academic Affairs Technology Steering Group February 13, 2008.
Technology Steering Group January 31, 2007 Academic Affairs Technology Steering Group February 13, 2008.
Project Cysera Hardware Configuration Drafted by Zoebir Bong.
© Hitachi Data Systems Corporation All rights reserved. 1 1 Det går pænt stærkt! Tony Franck Senior Solution Manager.
Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.
Verify Hardware Requirements Install Windows Server 2008 R2 Configure Active Directory Install SQL Server 2008 Install SharePoint Server 2010 Configure.
CC - IN2P3 Site Report Hepix Fall meeting 2009 – Berkeley
University of Washington Windows and Unix Servers IEEAF – RENU Network Design Workshop Seattle - 30 Nov 2007 Lori Stevens, Director, Distributed Systems.
Datacenters of the Past StorageNetworkCompute Today’s datacenter.
Oxford Update HEPix Pete Gronbech GridPP Project Manager October 2014.
Configuration Management with Cobbler and Puppet Kashif Mohammad University of Oxford.
RAL PPD Computing A tier 2, a tier 3 and a load of other stuff Rob Harper, June 2011.
JLab Scientific Computing: Theory HPC & Experimental Physics Thomas Jefferson National Accelerator Facility Newport News, VA Sandy Philpott.
Tim Bell 24/09/2015 2Tim Bell - RDA.
CEA DSM Irfu IRFU site report. CEA DSM Irfu HEPiX Fall 0927/10/ Computing centers used by IRFU people IRFU local computing IRFU GRIF sub site Windows.
HEP Computing Status Sheffield University Matt Robinson Paul Hodgson Andrew Beresford.
Gareth Smith RAL PPD RAL PPD Site Report. Gareth Smith RAL PPD RAL Particle Physics Department Overview About 90 staff (plus ~25 visitors) Desktops mainly.
FROM QUATTOR TO PUPPET A T2 point of view. BACKGROUND GRIF distributed T2 site  6 sub-sites  Used quattor for  GRIF-LAL is the home of 2 well known.
RAL PPD Tier 2 (and stuff) Site Report Rob Harper HEP SysMan 30 th June
INTERFACES MANAGEMENT CRYOMODULES Vincent HENNION SYSTEM ENGINEERING ACTIVITIES.
R. Krempaska, October, 2013 Wir schaffen Wissen – heute für morgen Controls Security at PSI Current Status R. Krempaska, A. Bertrand, C. Higgs, R. Kapeller,
IRFU SITE REPORT Pierrick Micout CEA/DSM/IRFU/SEDI.
U N C L A S S I F I E D LA-UR Leveraging VMware to implement Disaster Recovery at LANL Anil Karmel Technical Staff Member
Extreme Scale Infrastructure
Australia Site Report Lucien Boland Goncalo Borges Sean Crosby
Dev and Test Solution reference architecture.
Happy Endings: Reengineering Wesleyan’s Software Deployment to Labs and Classrooms Kyle Tousignant 03/22/2016.
Pete Gronbech GridPP Project Manager April 2016
IT Services Katarzyna Dziedziniewicz-Wojcik IT-DB.
Dev and Test Solution reference architecture.
Dev and Test Solution reference architecture.
Operations and plans - Polish sites
Marc Dobson On behalf of CMS DAQ Team
Working With Azure Batch AI
Task T HTS Dipole Magnet Design and Construction
HEPiX Spring 2014 Annecy-le Vieux May Martin Bly, STFC-RAL
CC - IN2P3 Site Report Hepix Spring meeting 2011 Darmstadt May 3rd
Dev and Test Solution reference architecture.
Update on Plan for KISTI-GSDC
Welcome! Thank you for joining us. We’ll get started in a few minutes.
Dev and Test Solution reference architecture.
VceTests VCE Test Dumps
PK-CIIT Grid Operations in Pakistan
CernVM Status Report Predrag Buncic (CERN/PH-SFT).
XFEL Cold mass & vacuum vessel non-conformities Detected At saclay
Azure.
NTC 324 RANK Education Your Life - ntc324rank.com.
NTC 324Competitive Success/tutorialrank.com
,Dell PowerEdge 13 gen servers rental.
NTC 324 Education for Service-- tutorialrank.com.
NTC 324 RANK Perfect Education/ ntc324rank.com.
NTC 324 RANK Education for Service-- ntc324rank.com.
Zero Clients and Virtual Desktops in Academic Environments
The Infrastructure of the CDS Group
Microsoft Virtual Academy
Dev and Test Solution reference architecture.
Managing Services with VMM and App Controller
MFCF’s Mac Management Methods
MMG: from proof-of-concept to production services at scale
Critical design review #1 for medium beta cavity cryomodules 3-4 APRIL main evolutions between M-ECCTD and serie - Bonjour, Franck Peauger, Je.
Identification and marking of ESSI deliverables
Cryomodule Product Breakdown Structure and configuration traceability
Day 2, Session 2 Connecting System Center to the Public Cloud
PerformanceBridge Application Suite and Practice 2.0 IT Specifications
Pete Gronbech, Kashif Mohammad and Vipul Davda
Containers on Azure Peter Lasne Sr. Software Development Engineer
10th srf collaboration LASA JUNE CAVITY interfaces - Vincent Hennion
John Taylor, Deputy CISO Martin Myers, IT Architect
Presentation transcript:

Joël Surget / Pierre-Francois Honoré CMS Double Chooz HESS Edelweiss Herschel ALICE Interpreting radiations from the Universe. Site report 2016 IRFU Joël Surget / Pierre-Francois Honoré

New orgAnisAtion (1/1/2016) CEA ( French Alternative Energies and Atomic Energy Commission ) Materials Sciences Division Life Sciences Division Nuclear Energy Division Technologies Division Defense Division Basic Research Division 6200 people 15 Institutes/Labs IRFU 800 people Site report 2016 IRFU

IRFU full member of « University Paris Saclay »

SUMMARY Unix GRID Infrastructure Windows Team Site report 2016 IRFU|

Puppet / Foreman for configuration management Unix evolutions Cobbler for Linux installation supported: CentOS, Ubuntu laptop encryption with LUKS Puppet / Foreman for configuration management git + r10k security update, software deployment Evaluating Mac OSX integration with Munki OpenLDAP & Active Directory File Server : GPFS to Ceph ? testing mid range multisite Ceph cluster Site report 2016 IRFU

OpenLDAP : mechanism OpenLDAP runs on Hyper-V cluster 4 NIS = 1500 accounts => conflicts : login and uid home directory migration Site report 2016 IRFU

Ceph infrastructure 7 servers : Dell PowerEdge 730XD  2x Intel E5-2620 v3 12 cores, 64GB  16x 4TB HDD Site report 2016 IRFU

GRID IPV6 New computing resources New storage facilities Network Finally “un-white-listed” on 12/04/2016. Suffering major performance issues : 200mbits max. Anyone with IPv6 XP on Cisco Catalyst 6500 routing ? IPv4 : IPv6 : New computing resources 8 Dell C6320 : Intel Xeon E5-2650 v3 with 128 GB memory Still not production since 11/2015 : the 10G cards cannot be plugged took 3 months to get the cards but still missing 1 metallic piece for the chassis New storage facilities 11 DELL PowerEdge R730xd 16 x 4TB HDD / server Try Ceph (without SSD). Doesn’t scale well: Read 8.5GB/s Write 3.3GB/s Network Deployed a 80G (2x40 LACP) « backbone » between computing rooms Site report 2016 IRFU

CEPH scaling : 64Threads/4M blocks/jemalloc/15 Clients/write perf Added 6 R510 ceph.com : « petabyte scaling »....... ?

Suffered many power cuts since last time GRID Suffered many power cuts since last time And we now think we understood why: failsafe - 0 log, 0 doc automation - 0 problem Site report 2016 IRFU

GRID - OPS puppet OS Monitoring https://prometheus.io/ reinstalled on CentOS7 : huge performance boost (ruby 2) started fixing all modules for puppet4 : again, huge boost foreseen OS moved all SL6 to SL6.7 because of epel/rhel policy, moving rpms to « latest only ». Monitoring Installed collectd with graphite exporter And this destroyed our graphite. (10k+ IOPS just for a few collectd) a cooling rack just died Installed prometheus as a « collectd replacement » keeping graphite for now, (long term low IOPS graphs) 10000+ => 300 IOPS (VM + Ceph +ssd pool) (~350 servers, 24K metrics/s) NOT influxdb : clustering just went closed source. https://prometheus.io/

Microsoft Windows 10 under study Create a secure official CEA image SCCM 2016 (System Center Configuration Manager): this summer Sharepoint 2013: Web site and collaborative site CEA (IRFU) Education-Research Identity Federation (Renater): this summer Site report 2016 IRFU

Infrastructure 3 rooms mostly finished and up to date 126 square meters 31 water cooled racks (20 and 40 kW) Cooling installation : 500 kW (4 groups) New 600A low voltage panel installed 7 years of work Site report 2016 IRFU

IT Team 2014/2015 retired personnel : Pierrick Micout, Joseph Le Foll 2 new engineers in 2015 (one Windows, one Linux) 2 job vacancies one permanent position for support activities one 1-year position (from June) for the Indigo DataCloud project Contact : http://moorea.cea.fr or joel.surget@cea.fr Site report 2016 IRFU

Interpreting radiations from the Universe. CMS Double Chooz HESS Edelweiss Herschel ALICE Interpreting radiations from the Universe. Commissariat à l’énergie atomique et aux énergies alternatives Centre de Saclay | 91191 Gif-sur-Yvette Cedex Etablissement public à caractère industriel et commercial | RCS Paris B 775 685 019 Direction de la Recherche Fondamentale Institut de Recherche sur les lois Fondamentales de l’Univers