Presentation is loading. Please wait.

Presentation is loading. Please wait.

J. Templon Nikhef Amsterdam Physics Data Processing Group Storage federations Why & How : site perspective Jeff Templon Pre-GBD october 2012 1.CDN vision.

Similar presentations


Presentation on theme: "J. Templon Nikhef Amsterdam Physics Data Processing Group Storage federations Why & How : site perspective Jeff Templon Pre-GBD october 2012 1.CDN vision."— Presentation transcript:

1 J. Templon Nikhef Amsterdam Physics Data Processing Group Storage federations Why & How : site perspective Jeff Templon Pre-GBD october 2012 1.CDN vision for wLCG 2.Relation to storage federation 3.“what would we need to do to make this ready for sites to deploy?”

2 J. Templon Nikhef Amsterdam Physics Data Processing Group “Menial Labor” Inefficient use of storage Jobs failing due to ◦ Disk server overloaded (NOT AN ERROR) ◦ Actual problem with disk server ◦ Experiment catalog out of sync Protocol Zoo What problems need solving? Jeff Templon, Storage Federations WG225 october 2012

3 J. Templon Nikhef Amsterdam Physics Data Processing Group Archiving: Storage of data. “Home” for a specific file. “Master” copy. “origin server” Like tape at T1, what else? Data delivery: focus on performance. Permanence not expected. “proxy/cache” Such servers “everywhere”? Esp. at T2s, T3s, local analysis clusters … CDN division of labor Jeff Templon, Storage Federations WG325 october 2012

4 J. Templon Nikhef Amsterdam Physics Data Processing Group Clients never access origin servers; they become less complex (do one thing well) Clients may not even notice origin server in maintenance (served directly from delivery layer) Popular files rapidly everywhere, boring files only on origin server Caches are read-only from user perspective: simpler. Focus on delivery. Consequences of model Jeff Templon, Storage Federations WG425 october 2012

5 J. Templon Nikhef Amsterdam Physics Data Processing Group Jeff Templon, Storage Federations WG525 october 2012 Amsterdam DM Workshop, June 2010

6 J. Templon Nikhef Amsterdam Physics Data Processing Group Jeff Tem plon, Stor age Fed erat ions WG 6 T1 100 Gbits/s T2 T3 jobs data cache The vision (René’s version) This is us … 25 october 2012

7 J. Templon Nikhef Amsterdam Physics Data Processing Group Some CDNs overlay on P2P network “Catalogue search” is a query operation on a distributed hash tabledistributed hash table ◦ No centralized catalog to fail ◦ DHT is resilient: not a “nothing works unless everything works” system. Delivery nodes can collaborate – client redirection or inter-node retrieval, avoids traffic to origin servers CDN + P2P Jeff Templon, Storage Federations WG725 october 2012

8 J. Templon Nikhef Amsterdam Physics Data Processing Group SF seems to be a milder version of CDN Origin servers are explicitly part of the federation. SF & CDN Jeff Templon, Storage Federations WG825 october 2012

9 J. Templon Nikhef Amsterdam Physics Data Processing Group Time for questions before next part Jeff Templon, Storage Federations WG925 october 2012

10 J. Templon Nikhef Amsterdam Physics Data Processing Group Focus on operational aspects: normally ignored by our middleware developers How much work is it to install? How much “crap” gets dragged in, how “specific” are packages? How much work is it to configure? How easy to automate? How much is redundant (pref. none). How stable is config? Site Perspective : how Jeff Templon, Storage Federations WG1025 october 2012

11 J. Templon Nikhef Amsterdam Physics Data Processing Group How easy is it to tell that the service is correctly working? How easy is it to tell that the service is not correctly working? How easy is it to tell why the service is not correctly working? How easy is it to start from a user- reported error and trace back to a cause? More site how Jeff Templon, Storage Federations WG1125 october 2012

12 J. Templon Nikhef Amsterdam Physics Data Processing Group Observation : developers focus on a different goal for logging: that is, to verify that the program follows the expected sequence of events / steps. Unfortunately this is operationally … the least interesting case. More more site how Jeff Templon, Storage Federations WG1225 october 2012

13 J. Templon Nikhef Amsterdam Physics Data Processing Group Clear time stamps in std format Make log file strings easy to parse For multithreaded or multi-machine actions, provide a uniqueID that will be present in all messages Follow conventions for log levels. Std level should for success produce the minimal output needed for auditability, but all errors and serious warnings should appear Example : logging EMI RT #1202 Jeff Templon, Storage Federations WG1325 october 2012

14 J. Templon Nikhef Amsterdam Physics Data Processing Group NO!!! Sites *respond* in the same way to queries. How site implements the responding mechanism is unimportant as long as it works and performs well enough. Are all sites configured in same way? Jeff Templon, Storage Federations WG1425 october 2012


Download ppt "J. Templon Nikhef Amsterdam Physics Data Processing Group Storage federations Why & How : site perspective Jeff Templon Pre-GBD october 2012 1.CDN vision."

Similar presentations


Ads by Google