Presentation is loading. Please wait.

Presentation is loading. Please wait.

Creating a Sustainable Cycle of Innovation Harvey B Newman, Caltech WSIS Pan European Regional Ministerial Conference Bucharest, November 7-9 2002 WSIS.

Similar presentations


Presentation on theme: "Creating a Sustainable Cycle of Innovation Harvey B Newman, Caltech WSIS Pan European Regional Ministerial Conference Bucharest, November 7-9 2002 WSIS."— Presentation transcript:

1 Creating a Sustainable Cycle of Innovation Harvey B Newman, Caltech WSIS Pan European Regional Ministerial Conference Bucharest, November 7-9 2002 WSIS Pan European Regional Ministerial Conference Bucharest, November 7-9 2002 Global Virtual Organizations for Data Intensive Science

2 Challenges of Data Intensive Science and Global VOs è Geographical dispersion: of people and resources è Scale: Tens of Petabytes per year of data è Complexity: Scientic Instruments and information 5000+ Physicists 250+ Institutes 60+ Countries Major challenges associated with: Communication and collaboration at a distance Managing globally distributed computing & data resources Cooperative software development and physics analysis New Forms of Distributed Systems: Data Grids

3 Emerging Data Grid User Communities Grid Physics Projects (GriPhyN/iVDGL/EDG) ATLAS, CMS, LIGO, SDSS; BaBar/D0/CDF NSF Network for Earthquake Engineering Simulation (NEES) Integrated instrumentation, collaboration, simulation Access Grid; VRVS: supporting new modes of group-based collaboration And Genomics, Proteomics,... The Earth System Grid and EOSDIS Federating Brain Data Computed MicroTomography … Virtual Observatories Grids are Having a Global Impact on Research in Science & Engineering

4 Global Networks for HENP and Data Intensive Science uNational and International Networks with sufficient capacity and capability, are essential today for è The daily conduct of collaborative work in both experiment and theory èData analysis by physicists from all world regions èThe conception, design and implementation of next generation facilities, as “global (Grid) networks” u“Collaborations on this scale would never have been attempted, if they could not rely on excellent networks” – L. Price, ANL u Grids Require Seamless Network Systems with Known, High Performance

5 Data volume Moore’s law High Speed Bulk Throughput BaBar Example [and LHC] u Driven by: è HENP data rates, e.g. BaBar ~500TB/year, Data rate from experiment >20 MBytes/s; [5-75 Times More at LHC] è Grid of Multiple regional computer centers (e.g. Lyon-FR, RAL-UK, INFN-IT, CA: LBNL, LLNL, Caltech) need copies of data u Need high-speed networks and the ability to utilize them fully è High speed Today = 1 TB/day (~100 Mbps Full Time) è Develop 10-100 TB/day Capability (Several Gbps Full Time) within the next 1-2 years Data Volumes More than Doubling Each Yr; Driving Grid, Network Needs

6 HENP Major Links: Bandwidth Roadmap (Scenario) in Gbps Continuing the Trend: ~1000 Times Bandwidth Growth Per Decade; We are Rapidly Learning to Use and Share Multi-Gbps Networks

7 AMS-IX Internet Exchange Thruput Accelerating Growth in Europe (NL) Monthly Traffic 4X Growth In 14 Months 8/01 – 10/02 ↓ 0 5 Gbps 10 Gbps Hourly Traffic 11/02/02 2 Gbps 8 Gbps 6 Gbps 4 Gbps HENP & World BW Growth: 3-4 Times Per Year; 2 to 3 Times Moore’s Law

8 National Light Rail Footprint 15808 Terminal, Regen or OADM site Fiber route PIT POR FRE RAL WAL NAS PHO OLG ATL CHI CLE KAN OGD SAC BOS NYC WDC STR DAL DEN LAX SVL SEA SDG NLR  Buildout Starts November 2002  Initially 4 10 Gb Wavelengths  To 40 10Gb Waves in Future NREN Backbones reached 2.5-10 Gbps in 2002 in Europe, Japan and US; US: Transition now to optical, dark fiber, multi-wavelength R&E network

9 Distributed System Services Architecture (DSSA): CIT/Romania/Pakistan u Agents: Autonomous, Auto- discovering, self-organizing, collaborative u “Station Servers” (static) host mobile “Dynamic Services” u Servers interconnect dynamically; form a robust fabric in which mobile agents travel, with a payload of (analysis) tasks u Adaptable to Web services: OGSA; and many platforms u Adaptable to Ubiquitous, mobile working environments StationServer StationServer StationServer LookupService LookupService Proxy Exchange Registration Service Listener Lookup Discovery Service Remote Notification Managing Global Systems of Increasing Scope and Complexity, In the Service of Science and Society, Requires A New Generation of Scalable, Autonomous, Artificially Intelligent Software Systems

10 By I. Legrand (Caltech)  Deployed on US CMS Grid  Agent-based Dynamic information / resource discovery mechanism  Implemented in  Java/Jini; SNMP  WDSL / SOAP with UDDI  Part of a Global “ Grid Control Room ” Service  http://cil.cern.ch:8080/MONALISA/ MonaLisa: A Globally Scalable Grid Monitoring System

11 History - Throughput Quality Improvements from US to World Bandwidth of TCP < MSS/(RTT*Sqrt(Loss)) (1) 80% annual improvement Factor ~100/8 yr Progress, but the Digital Divide is Maintained: Action is Required

12 NREN Core Network Size (Mbps-km): http://www.terena.nl/compendium/2002 Logarithmic Scale 1k 100k 100 100M 10M 1M 10k Ro It Pl Gr Ir Ukr Hu Cz Es Nl Fi Ch Lagging In Transition Leading Advanced Perspectives on the Digital Divide: Int’l, Local, Regional, Political

13 Building Petascale Global Grids: Implications for Society u Meeting the challenges of Petabyte-to-Exabyte Grids, and Gigabit-to-Terabit Networks, will transform research in science and engineering u These developments could create the first truly global virtual organizations (GVO) u If these developments are successful, and deployed widely as standards, this could lead to profound advances in industry, commerce and society at large è By changing the relationship between people and “persistent” information in their daily lives è Within the next five to ten years u Realizing the benefits of these developments for society, and creating a sustainable cycle of innovation compels us è TO CLOSE the DIGITAL DIVIDE

14 Recommendations uTo realize the Vision of Global Grids, governments, international institutions and funding agencies should: èDefine IT international policies (for instance AAA) èSupport establishment of international standards èProvide adequate funding to continue R&D in Grid and Network technologies èDeploy international production Grid and Advanced Network testbeds on a global scale èSupport education and training in Grid & Network technologies for new communities of users èCreate open policies, and encourage joint development programs, to help Close the Digital Divide uThe WSIS RO meeting, starting today, is an important step in the right direction

15 Some Extra Slides Follow

16 IEEAF: Internet Educational Equal Access Foundation; Bandwidth Donations for Research and Education

17 Next Generation Requirements for Physics Experiments u Rapid access to event samples and analyzed results drawn from massive data stores è From Petabytes in 2002, ~100 Petabytes by 2007, to ~1 Exabyte by ~2012. u Coordinating and managing the large but LIMITED computing, data and network resources effectively u Persistent access to physicists throughout the world, for collaborative work Grid Reliance on Networks u Advanced applications such as Data Grids rely on seamless operation of Local and Wide Area Networks è With reliable, quantifiable high performance

18 Networks, Grids and HENP u Grids are changing the way we do science and engineering u Next generation 10 Gbps network backbones are here: in the US, Europe and Japan; across oceans è Optical Nets with many 10 Gbps wavelengths will follow u Removing regional, last mile bottlenecks and compromises in network quality are now All on the critical path u Network improvements are especially needed in SE Europe, So. America; and many other regions: è Romania; India, Pakistan, China; Brazil, Chile; Africa u Realizing the promise of Network & Grid technologies means è Building a new generation of high performance network tools; artificially intelligent scalable software systems è Strong regional and inter-regional funding initiatives to support these ground breaking developments

19 Closing the Digital Divide What HENP and the World Community Can Do uSpread the message: ICFA SCIC, IEEAF et al. can help uHelp identify and highlight specific needs (to Work On) èPolicy problems; Last Mile problems; etc. uEncourage Joint programs [Virtual Silk Road project; Japanese links to SE Asia and China; AMPATH to So. America] è NSF & LIS Proposals: US and EU to South America uMake direct contacts, arrange discussions with gov’t officials è ICFA SCIC is prepared to participate where appropriate uHelp Start, Get Support for Workshops on Networks & Grids è Encourage, help form funded programs uHelp form Regional support & training groups [Requires Funding]

20 LHC Data Grid Hierarchy Tier 1 Tier2 Center Online System CERN 700k SI95 ~1 PB Disk; Tape Robot FNAL: 200k SI95; 600 TB IN2P3 Center INFN Center RAL Center Institute Institute ~0.25TIPS Workstations ~100-400 MBytes/sec 2.5-10 Gbps 0.1–10 Gbps Physicists work on analysis “channels” Each institute has ~10 physicists working on one or more channels Physics data cache ~PByte/sec ~2.5-10 Gbps Tier2 Center ~2.5 Gbps Tier 0 +1 Tier 3 Tier 4 Tier2 Center Tier 2 Experiment CERN/Outside Resource Ratio ~1:2 Tier0/(  Tier1)/(  Tier2) ~1:1:1

21 Two centers are trying to work as one: -Data not duplicated -Internationalization -transparent access, etc Tier A "Physicists have indeed foreseen to test the GRID principles starting first from the Computing Centres in Lyon and Stanford (California). A first step towards the ubiquity of the GRID." Le Monde 12 april 2001 CERN-US Line + Abilene Renater + ESnet 3/2002 LHC Grid Wkshop 3/02; 2003: to 1 Gbps range 0.5 PB and UP; LHC 10 to 100 Times Greater

22 foster@mcs.anl.gov ARGONNE  CHICAGO Why Grids? l 1,000 physicists worldwide pool resources for petaop analyses of petabytes of data l A biochemist exploits 10,000 computers to screen 100,000 compounds in an hour l Civil engineers collaborate to design, execute, & analyze shake table experiments l Climate scientists visualize, annotate, & analyze terabyte simulation datasets l An emergency response team couples real time data, weather model, population data

23 foster@mcs.anl.gov ARGONNE  CHICAGO Why Grids? (contd) l Scientists at a multinational company collaborate on the design of a new product l A multidisciplinary analysis in aerospace couples code and data in four companies l An HMO mines data from its member hospitals for fraud detection l An application service provider offloads excess load to a compute cycle provider l An enterprise configures internal & external resources to support e-business workload

24 foster@mcs.anl.gov ARGONNE  CHICAGO Grids: Why Now?  Moore’s law improvements in computing produce highly functional endsystems  The Internet and burgeoning wired and wireless provide universal connectivity  Changing modes of working and problem solving emphasize teamwork, computation  Network exponentials produce dramatic changes in geometry and geography  9-month doubling: double Moore’s law!  1986-2001: x340,000; 2001-2010: x4000?

25 A Short List: Revolutions in Information Technology (2002-7) u Scalable Data-Intensive Metro and Long Haul Network Technologies è DWDM: 10 Gbps then 40 Gbps per ; 1 to 10 Terabits/sec per fiber è 10 Gigabit Ethernet (See www.10gea.org) 10GbE / 10 Gbps LAN/WAN integration è Metro Buildout and Optical Cross Connects è Dynamic Provisioning  Dynamic Path Building ­ “Lambda Grids” u Defeating the “Last Mile” Problem (Wireless; or Ethernet in the First Mile) è 3G and 4G Wireless Broadband (from ca. 2003); and/or Fixed Wireless “Hotspots” è Fiber to the Home è Community-Owned Networks

26 foster@mcs.anl.gov ARGONNE  CHICAGO Grid Architecture “Talking to things”: Communication (Internet protocols) & security “Sharing single resources”: Negotiating access, controlling use “Coordinating multiple resources”: ubiquitous infrastructure services, app- specific distributed services “Controlling things locally”: Access to, & control of resources Connectivity Resource Collective Application Fabric Internet Transport Appli- cation Link Internet Protocol Architecture More info: www.globus.org/research/papers/anatomy.pdf

27 u Grid projects have been a step forward for HEP and LHC: a path to meet the “LHC Computing” challenges  But: the differences between HENP Grids and classical Grids are not yet fully appreciated u The original Computational and Data Grid concepts are largely stateless, open systems: known to be scalable  Analogous to the Web u The classical Grid architecture has a number of implicit assumptions  The ability to locate and schedule suitable resources, within a tolerably short time (i.e. resource richness)  Short transactions; Relatively simple failure modes u HEP Grids are data-intensive and resource constrained  Long transactions; some long queues  Schedule conflicts; [policy decisions]; task redirection  A Lot of global system state to be monitored+tracked LHC Distributed CM: HENP Data Grids Versus Classical Grids

28 Upcoming Grid Challenges: Building a Globally Managed Distributed System uMaintaining a Global View of Resources and System State èEnd-to-end System Monitoring èAdaptive Learning: new paradigms for execution optimization (eventually automated) uWorkflow Management, Balancing Policy Versus Moment-to-moment Capability to Complete Tasks èBalance High Levels of Usage of Limited Resources Against Better Turnaround Times for Priority Jobs èGoal-Oriented; Steering Requests According to (Yet to be Developed) Metrics uRobust Grid Transactions In a Multi-User Environment uRealtime Error Detection, Recovery èHandling User-Grid Interactions: Guidelines; Agents uBuilding Higher Level Services, and an Integrated User Environment for the Above

29  ( Physicists’) Application Codes  Experiments’ Software Framework Layer  Needs to be Modular and Grid-aware: Architecture able to interact effectively with the Grid layers  Grid Applications Layer (Parameters and algorithms that govern system operations)  Policy and priority metrics  Workflow evaluation metrics  Task-Site Coupling proximity metrics  Global End-to-End System Services Layer  Monitoring and Tracking Component performance  Workflow monitoring and evaluation mechanisms  Error recovery and redirection mechanisms  System self-monitoring, evaluation and optimization mechanisms Interfacing to the Grid: Above the Collective Layer

30 NL SURFnet GENEVA UK SuperJANET4 ABILENE ESNET CALREN It GARR-B GEANT NewYork Fr Renater STAR-TAP STARLIGHT DataTAG Project u EU-Solicited Project. CERN, PPARC (UK), Amsterdam (NL), and INFN (IT); and US (DOE/NSF: UIC, NWU and Caltech) partners u Main Aims: è Ensure maximum interoperability between US and EU Grid Projects è Transatlantic Testbed for advanced network research u 2.5 Gbps Wavelength Triangle 7/02 (10 Gbps Triangle in 2003) Wave Triangle

31 TeraGrid (www.teragrid.org) NCSA, ANL, SDSC, Caltech NCSA/UIUC ANL UIC Multiple Carrier Hubs Starlight / NW Univ Ill Inst of Tech Univ of Chicago Indianapolis (Abilene NOC) I-WIRE Caltech San Diego DTF Backplane: 4 X 10 Gbps Abilene Chicago Indianapolis Urbana OC-48 (2.5 Gb/s, Abilene) Multiple 10 GbE (Qwest) Multiple 10 GbE (I-WIRE Dark Fiber) Source: Charlie Catlett, Argonne A Preview of the Grid Hierarchy and Networks of the LHC Era

32 Baseline BW for the US-CERN Link: HENP Transatlantic WG (DOE+NSF )  DataTAG 2.5 Gbps Research Link in Summer 2002  10 Gbps Research Link by Approx. Mid-2003 Transoceanic Networking Integrated with the Abilene, TeraGrid, Regional Nets and Continental Network Infrastructures in US, Europe, Asia, South America Baseline evolution typical of major HENP links 2001-2006

33 HENP As a Driver of Networks: Petascale Grids with TB Transactions u Problem: Extract “Small” Data Subsets of 1 to 100 Terabytes from 1 to 1000 Petabyte Data Stores u Survivability of the HENP Global Grid System, with hundreds of such transactions per day (circa 2007) requires that each transaction be completed in a relatively short time. u Example: Take 800 secs to complete the transaction. Then u Transaction Size (TB) Net Throughput (Gbps) u 1 10 u 10 100 u 100 1000 (Capacity of Fiber Today) u Summary: Providing Switching of 10 Gbps wavelengths within ~3 years; and Terabit Switching within 5-8 years would enable “Petascale Grids with Terabyte transactions”, as required to fully realize the discovery potential of major HENP programs, as well as other data-intensive fields.

34 National Research Networks in Japan u SuperSINET è Started operation January 4, 2002 è Support for 5 important areas: HEP, Genetics, Nano-Technology, Space/Astronomy, GRIDs è Provides 10 ’s: r 10 Gbps IP connection r 7 Direct intersite GbE links r Some connections to 10 GbE in JFY2002 u HEPnet-J Will be re-constructed with MPLS-VPN in SuperSINET Will be re-constructed with MPLS-VPN in SuperSINET Proposal: Two TransPacific 2.5 Gbps Wavelengths, and Japan-CERN Grid Testbed by ~2003 Proposal: Two TransPacific 2.5 Gbps Wavelengths, and Japan-CERN Grid Testbed by ~2003 Tokyo Osaka Nagoya Internet Osaka U Kyoto U ICR Kyoto-U Nagoya U NIFS NIG KEK Tohoku U IMS U-Tokyo NAO U Tokyo NII Hitot. NII Chiba IP WDM path IP router OXC ISAS

35 National R&E Network Example Germany: DFN TransAtlantic Connectivity Q1 2002 STM 4 STM 16 u 2 X 2.5G Now: NY-Hamburg and NY-Frankfurt u ESNet peering at 34 Mbps u Direct Peering to Abilene and Canarie expected u UCAID will add another 2 OC48’s; Proposing a Global Terabit Research Network (GTRN) u FSU Connections via satellite: Yerevan, Minsk, Almaty, Baikal è Speeds of 32 - 512 kbps u SILK Project (2002): NATO funding è Links to Caucasus and Central Asia (8 Countries) è Currently 64-512 kbps è Propose VSAT for 10-50 X BW: NATO + State Funding

36 RNP Brazil (to 20 Mbps) FIU Miami/So. America (to 80 Mbps)

37 Models Of Networked Analysis At Regional Centers)  The simulation program developed within MONARC (Models Of Networked Analysis At Regional Centers) uses a process- oriented approach for discrete event simulation, and provides a realistic modelling tool for large scale distributed systems. Modeling and Simulation: MONARC System SIMULATION of Complex Distributed Systems for LHC

38 Farm Monitor Client (other service) Lookup Service Lookup Service Farm Monitor Discovery Proxy uComponent Factory uGUI marshaling uCode Transport uRMI data access Push & Pull rsh & ssh scripts; snmp Globally Scalable Monitoring Service I. Legrand RC Monitor Service Registration

39 MONARC SONN: 3 Regional Centres Learning to Export Jobs NUST 20 CPUs CERN 30 CPUs CALTECH 25 CPUs 1MB/s ; 150 ms RTT 1.2 MB/s 150 ms RTT 0.8 MB/s 200 ms RTT Day = 9 = 0.73 = 0.66 = 0.83 By I. Legrand

40 COJAC: CMS ORCA Java Analysis Component: Java3D Objectivity JNI Web Services Demonstrated Caltech-Rio de Janeiro (Feb.) and Chile

41 Internet2 HENP WG [*] u Mission: To help ensure that the required è National and international network infrastructures (end-to-end) è Standardized tools and facilities for high performance and end-to-end monitoring and tracking [Gridftp; bbcp…] è Collaborative systems u are developed and deployed in a timely manner, and used effectively to meet the needs of the US LHC and other major HENP Programs, as well as the at-large scientific community. è To carry out these developments in a way that is broadly applicable across many fields u Formed an Internet2 WG as a suitable framework: October 2001 u [*] Co-Chairs: S. McKee (Michigan), H. Newman (Caltech); Sec’y J. Williams (Indiana) u Website: http://www.internet2.edu/henp; also see the Internet2 End-to-end Initiative: http://www.internet2.edu/e2e http://www.internet2.edu/henphttp://www.internet2.edu/e2ehttp://www.internet2.edu/henphttp://www.internet2.edu/e2e

42 A Short List: Coming Revolutions in Information Technology u Storage “Virtualization” [ A Single Logical Resource] è Grid-enabled Storage Resource Middleware (SRM) è iSCSI (Internet Small Computer Storage Interface); Integrated with 10 GbE  Global File Systems u Internet Information Software Technologies è Global Information “Broadcast” Architecture ­ E.g the Multipoint Information Distribution Protocol (Tie.Liao@inria.fr) è Programmable Coordinated Agent Architectures ­ E.g. Mobile Agent Reactive Spaces (MARS) by Cabri et al., University of Modena u The “Data Grid” - Human Interface è Interactive monitoring and control of Grid resources ­ By authorized groups and individuals ­ By Autonomous Agents

43 Palat Telefoane 1G link 1G backup link Romana Victoriei Gara de Nord Eroilor Izvor Universitate Unirii NOC Cat3550-24-L3 C7206 w Gigabit C7513 w Gigabit Cat4000 L3 Sw Bucharest MAN for Ro-Grid ICI IFIN 100Mbps 10/100/1000Mbps

44 RoEdu Network


Download ppt "Creating a Sustainable Cycle of Innovation Harvey B Newman, Caltech WSIS Pan European Regional Ministerial Conference Bucharest, November 7-9 2002 WSIS."

Similar presentations


Ads by Google