CERN IT Department CH-1211 Geneva 23 Switzerland www.cern.ch/i t Data & Storage Services Technical student CERN IT-DSS-FDO University of Vigo WCSFSS 2014.

Slides:



Advertisements
Similar presentations
UBIQUITY V3 An extensible platform for creating dynamic, customized, and geocentric native mobile applications.
Advertisements

Manchester HEP Desktop/ Laptop 30 Desktop running RH Laptop Windows XP & RH Home server AFS using openafs 3 DB servers. Web server AFS Mail Server.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
Novell Server Linux vs. windows server 2008 By: Gabe Miller.
Seafile - Scalable Cloud Storage System
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
Upgrading the Platform - How to Get There!
BNL Oracle database services status and future plans Carlos Fernando Gamboa RACF Facility Brookhaven National Laboratory, US Distributed Database Operations.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CERN Business Continuity Overview Wayne Salter HEPiX April 2012.
CT NIKHEF June File server CT system support.
Mass RHIC Computing Facility Razvan Popescu - Brookhaven National Laboratory.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
CERN IT Department CH-1211 Genève 23 Switzerland t Integrating Lemon Monitoring and Alarming System with the new CERN Agile Infrastructure.
The Era of the Cloud OS: Transform the Datacentre
UCL Site Report Ben Waugh HepSysMan, 22 May 2007.
CSC 456 Operating Systems Seminar Presentation (11/13/2012) Leon Weingard, Liang Xin The Google File System.
US ATLAS Western Tier 2 Status and Plan Wei Yang ATLAS Physics Analysis Retreat SLAC March 5, 2007.
CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.
Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.
Data & Storage Services CERN IT Department CH-1211 Genève 23 Switzerland t DSS From data management to storage services to the next challenges.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES P. Saiz (IT-ES) AliEn job agents.
Win202 Database Administration. Introduction Welcome to OpenEdge. Type 2 Storage Areas. One of the big selling points for the OpenEdge platform and Win202.
Introduction to U.S. ATLAS Facilities Rich Baker Brookhaven National Lab.
Site Report: Tokyo Tomoaki Nakamura ICEPP, The University of Tokyo 2013/12/13Tomoaki Nakamura ICEPP, UTokyo1.
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
Manchester HEP Desktop/ Laptop 30 Desktop running RH Laptop Windows XP & RH OS X Home server AFS using openafs 3 DB servers Kerberos 4 we will move.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
Architecture and ATLAS Western Tier 2 Wei Yang ATLAS Western Tier 2 User Forum meeting SLAC April
Operating Systems & Information Services CERN IT Department CH-1211 Geneva 23 Switzerland t OIS Update on Windows 7 at CERN & Remote Desktop.
Securely Synchronize and Share Enterprise Files across Desktops, Web, and Mobile with EasiShare on the Powerful Microsoft Azure Cloud Platform MICROSOFT.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.
Consulting Services JobScheduler Architecture Decision Template Information for Consulting Parties Information for Consulting Parties.
CERN-IT Oracle Database Physics Services Maria Girone, IT-DB 13 December 2004.
CERN - IT Department CH-1211 Genève 23 Switzerland t OIS Deployment of Exchange 2010 mail platform Pawel Grzywaczewski, CERN IT/OIS HEPIX.
Future home directories at CERN
SYNC & SHARE FOR THE DUTCH RESEARCH & HIGHER EDUCATION SURFdrive
CERN IT Department CH-1211 Genève 23 Switzerland t Load Testing Dennis Waldron, CERN IT/DM/DA CASTOR Face-to-Face Meeting, Feb 19 th 2009.
1 D0 Taking Stock By Anil Kumar CD/LSCS/DBI/DBA June 11, 2007.
Scientific Storage at FNAL Gerard Bernabeu Altayo Dmitry Litvintsev Gene Oleynik 14/10/2015.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
 Introduction  Architecture NameNode, DataNodes, HDFS Client, CheckpointNode, BackupNode, Snapshots  File I/O Operations and Replica Management File.
Consulting Services JobScheduler Architecture Decision Template Information for Consulting Parties Information for Consulting Parties.
Service ETH Zurich > Status:Prod since June 2013||Beta||Test||Planned Number of users (current, target):7400 Default and Maximum quota:50GB Linux/Mac/Win.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Upcoming Features and Roadmap Ricardo Rocha ( on behalf of the.
Operating Systems & Information Services CERN IT Department CH-1211 Geneva 23 Switzerland t OIS Drupal at CERN Juraj Sucik Jarosław Polok.
CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFMs to Agile Infrastructure CERN, IT Department.
CERN - European Organization for Nuclear Research FOCUS March 2 nd, 2000 Frédéric Hemmer - IT Division.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Overview of DMLite Ricardo Rocha ( on behalf of the LCGDM team.
1 5/4/05 Fermilab Mass Storage Enstore, dCache and SRM Michael Zalokar Fermilab.
Next Generation of Apache Hadoop MapReduce Owen
The RAL PPD Tier 2/3 Current Status and Future Plans or “Are we ready for next year?” Chris Brew PPD Christmas Lectures th December 2007.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Agile Infrastructure Project Overview : Status and.
CDF SAM Deployment Status Doug Benjamin Duke University (for the CDF Data Handling Group)
BIG DATA/ Hadoop Interview Questions.
Sql Server Architecture for World Domination Tristan Wilson.
CERN IT Department CH-1211 Genève 23 Switzerland t DPM status and plans David Smith CERN, IT-DM-SGT Pre-GDB, Grid Storage Services 11 November.
CERN IT Department CH-1211 Geneva 23 Switzerland t OIS Operating Systems & Information Services CERN IT Department CH-1211 Geneva 23 Switzerland.
CERN IT-Storage Strategy Outlook Alberto Pace, Luca Mascetti, Julien Leduc
Compute and Storage For the Farm at Jlab
BNL Box Hironori Ito Brookhaven National Laboratory
Jenny Pange University of Ioannina
Introduction to PHP FdSc Module 109 Server side scripting and
GSIAF & Anar Manafov, Victor Penso, Carsten Preuss, and Kilian Schwarz, GSI Darmstadt, ALICE Offline week, v. 0.8.
Simple Storage Service
Microsoft Azure Fundamentals: Data Understanding Microsoft Azure SQL
SharePoint 2019 Changes Point of View.
Transarc AFS Client for NT
Presentation transcript:

CERN IT Department CH-1211 Geneva 23 Switzerland t Data & Storage Services Technical student CERN IT-DSS-FDO University of Vigo WCSFSS 2014

CERNBOX Service Owncloud 5 with NFS storage (currently deployed) Desktop sync clients iOS, Android branded mobile apps Owncloud 7 with EOS storage backend (currently migrating) Desktop sync clients iOS, Android official mobile apps 2

CERNBOX/NFS deployment Setup 100% RH6 on “standard” hardware Guaranteed failover (redundant nodes) 3 NFS servers, async, RAID JBODs Initial space: 20 TB MySQL server 48GB RAM Apache, PHP 5.4 (SCL1.0) mod_proxy_balancer 64 core, 64GB RAM diagram source: owncloud.com

Usage of CERNBOX/NFS CERNBox Beta 2014 MarchAprilMayJune…OctoberNovember (until 15th) users190 (*) files191K907K1.6M2.7M7.6M9.6M size480GB1TB1.5TB1.9TB3.8TB4.3TB 4 Avg ~5GB Avg ~10K files

File access patterns GET/PUT ratio: 2/1 File type distribution: 1200 different file extensions! 30%.c.h.C 30%.jpg.png 15% no extension (UNIX world!) 25% other:.pdf,.txt,.ppt,.docx,.root,.py,.eps,.tex ~1104 shares 833 via public link (only 48 password protected!) 271 via internal share 5

CERNBox - J.T.Moscicki, M.Lamanna - TNC2014 Dublin6

Platform ratio 7

Clients ratio 8 Mobile access Desktop access (sync clients) Web access

It´s time to change 9

10

CERNBOX/EOS Storage Deployed capacity in EOSUSER repo: 650TB raw disk => 2 replica on top of raid-1 (same as 4 replicas, hyper safe…) 8 machines (disk servers) Average file footprint 250B (in memory) 2 head nodes (1 master plus 1 slave with 48 GB RAM) Migrating to 1 master and 1 slave with 256GB 11

What’s new in CERNBOX/EOS ? 12 Awesome PERFORMANCE UNLIMITED user quota INTEGRATION with PETABYTE-SIZE scientific repos COLLABORATION between different communities NO METADATA IN SQL database (oc_filecache) => All metadata in EOS xattrs => No rescan of file systems triggered USER-PERMISSIONS not OC permissions (apache:apache) DIRECT MAPPING between OC permissions and EOS acls

What’s new in CERNBOX/EOS ? 13

What’s new in CERNBOX/EOS ? 14

User feedback “I start using the cernbox since I'm a heavy user of Dropbox and I recently reached the limit of free disk space (5Gb). For work it will be great to have at least 50Gb of personal space “ “I would like to have is a free client for Android, which should be much more stable. “ “I find the service perfect to be able to get always the files/sources/documents I need independently of the place and connection. ”

User feedback “On my Macbook Air I noticed that the battery was draining much faster than usual. I checked on the activity monitor and CERNBOX was consuming 80-95% of the total energy. “ “What I would like to do in the future is to combine my private data like my photos for example on my home owncloud server, and my work data on the Cern owncloud server.” “I'm very glad that CERN has launched the service using the OwnCloud platform. “ “I hope that you will be supporting this service officially soon!! ” 16

Risk factors Stability of the sync clients across diff versions Resiliency of the system with “exotic” failure modes Risk to lose files!!! Risk to corrupt data!!! Product evolution Stability of the sync protocol (controlled evolution) Bulletproof core functionality Market evolution 17

Status:Beta Number of users (current, target):Current 1000 users, targeting CERN users Default and Maximum quota:No quota Linux/Mac/Win user ratio:20/60/20 Desktop clients/Mobile Clients/Web access ratio:4/70/26 Technology:Owncloud 7 + EOS + No DB for metadata (EOS xattrs) Target communities:CERN staff (Engineers, physicists and administrative staff) Integration in your current environment (examples):Synchronizing batch data submitted to GRID to your laptop Risk factors:Previous slide Most important functionality:Robustness of the sync clients Missing functionality (if any):Shibboleth integration, end-to-end checksums, client journaling Service summary

Questions ? 19

BACKUP SLIDES CERNBox - J.T.Moscicki, M.Lamanna - TNC2014 Dublin20

Example of integration 21 GRID (~ cores) CERNBOX/EOS 1.User submit job to grid FUSE MOUNTED User laptop 2. Data results synced automatically