DiDaS: Distributed Data Storage
Ludek Matyska, Masaryk University, Institute of Comp. Sci., and CESNET, z.s.p.o.
APAN, Logistical Networking WS, August 28, 2003

Presentation transcript:

Outline
Motivation
Infrastructure
Applications
Future extensions

Motivation
Increased need for network storage
– Computational Grids
– Data Grids
– Temporary data deposits
– Transient caches
– Video deposits
National Library requirements
– Distribution of digitized content

Requirements
Transparent
Location independent
– Good geographical distribution
Providing support for
– Access quality (e.g. streaming)
– Reliability (no single point of failure)

Infrastructure
Data depots
– Control: personal computer with Linux
– Storage: RAID of IDE disks
– Capacity: ≈ 1.5 TB each
– Number: 7 (total capacity ≈ 10 TB)
Connectivity
– Directly to the backbone
– 100 Mb/s or 1 Gb/s
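As a rough check of the figures on this slide, the sketch below computes the aggregate capacity of the seven depots and the time needed to fill one depot at each link speed. It assumes decimal units (1 TB = 10^12 bytes) and a fully utilised link with no protocol overhead; both are simplifying assumptions, not figures from the talk, and the function names are illustrative.

```python
# Back-of-envelope check of the depot figures.
# Assumptions (not from the talk): decimal units, fully utilised link,
# no protocol overhead.

DEPOTS = 7                       # number of depots, from the slide
CAPACITY_TB = 1.5                # approximate capacity per depot, from the slide
LINK_SPEEDS_MBPS = (100, 1000)   # 100 Mb/s and 1 Gb/s


def total_capacity_tb(depots: int, per_depot_tb: float) -> float:
    """Aggregate raw capacity across all depots, in TB."""
    return depots * per_depot_tb


def fill_time_hours(capacity_tb: float, link_mbps: float) -> float:
    """Hours needed to move capacity_tb over a link running at link_mbps."""
    bits = capacity_tb * 1e12 * 8
    seconds = bits / (link_mbps * 1e6)
    return seconds / 3600


if __name__ == "__main__":
    print(f"total: {total_capacity_tb(DEPOTS, CAPACITY_TB):.1f} TB")
    for speed in LINK_SPEEDS_MBPS:
        hours = fill_time_hours(CAPACITY_TB, speed)
        print(f"{speed} Mb/s: {hours:.1f} h to fill one depot")
```

Seven depots at roughly 1.5 TB each give 10.5 TB raw, consistent with the "≈ 10 TB" total. Under the stated assumptions, filling one depot takes about 33 hours at 100 Mb/s and about 3.3 hours at 1 Gb/s.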

Data Layer
IBP (70% capacity)
– General use
GridFTP servers (30% capacity)
– Grid support
– Computer-independent temporary data storage
– Comparison with IBP-based solution

Traffic optimisation
Network traffic cost function
Inter-depot topology known
Instrumented clients
– Measurement from depot to client
– Simultaneous data transfer and measurements
Real-time transfer rate prediction
– Choose depot
– Decision between point and multipoint transfers
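The selection steps above can be sketched as follows. The talk does not specify the predictor or the cost function, so this example assumes an exponentially weighted moving average (EWMA) over depot-to-client throughput samples and uses expected transfer time as the cost. The `Depot` class, all function names, and the rule for choosing between point and multipoint transfers are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Depot:
    name: str
    # Recent depot-to-client throughput samples in Mb/s, oldest first
    # (hypothetical measurements gathered by an instrumented client).
    samples: list[float] = field(default_factory=list)


def predict_rate(samples: list[float], alpha: float = 0.5) -> float:
    """EWMA of throughput samples; an assumed predictor, not the talk's."""
    if not samples:
        return 0.0
    estimate = samples[0]
    for sample in samples[1:]:
        estimate = alpha * sample + (1 - alpha) * estimate
    return estimate


def transfer_cost(size_mbit: float, rate_mbps: float) -> float:
    """Assumed cost function: expected transfer time in seconds."""
    return float("inf") if rate_mbps <= 0 else size_mbit / rate_mbps


def choose_depot(depots: list[Depot], size_mbit: float) -> Depot:
    """Pick the depot with the lowest predicted transfer cost."""
    return min(
        depots,
        key=lambda d: transfer_cost(size_mbit, predict_rate(d.samples)),
    )


def plan_transfer(depots: list[Depot], client_capacity_mbps: float) -> list[Depot]:
    """Return the depots to read from: one (point) or several (multipoint).

    Depots are added in order of predicted rate until their combined
    predicted rate reaches the client's link capacity.
    """
    ranked = sorted(depots, key=lambda d: predict_rate(d.samples), reverse=True)
    chosen: list[Depot] = []
    total = 0.0
    for depot in ranked:
        rate = predict_rate(depot.samples)
        if rate <= 0 or total >= client_capacity_mbps:
            break
        chosen.append(depot)
        total += rate
    return chosen
```

If the best depot alone is predicted to saturate the client's link, `plan_transfer` returns a single depot (a point transfer); otherwise it returns several depots to read from in parallel.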

Applications
National Technical Library
Video streaming
Nonspecific users

National Technical Library
Requirements
– Program of content digitisation
– Data stored on the central tape robot
– Not optimised for distribution: danger of overload
– Model data: old cartographic maps

National Technical Library
DiDaS role
– Cache-like storage
– Load balancing optimisation
– Data transfer reliability (multistreaming)

Video Streaming
Permanent storage
Specific clients
QoS requirements (pre-caching)
Replica management
– Not yet implemented

Nonspecific Users
Temporary data deposits
Provide data for load balancing
– Transfer outside of DiDaS core
Access reliability
– Automatic replica generation
– Transparent multi-access
– Ability to react to connectivity loss
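The access-reliability items above amount to trying another replica when one depot becomes unreachable. The sketch below illustrates that idea only. It does not use any real IBP or GridFTP client API: `fetch` is a hypothetical callable supplied by the caller, and the depot names are made up.

```python
from typing import Callable, Iterable


class AllReplicasFailed(Exception):
    """Raised when no replica could deliver the data."""


def fetch_with_failover(
    replicas: Iterable[str],
    fetch: Callable[[str], bytes],
) -> tuple[str, bytes]:
    """Try each replica in order; return (replica, data) from the first that works.

    `fetch` is a hypothetical callable that reads the object from one depot
    and raises ConnectionError (or another OSError) if the depot is unreachable.
    """
    errors: dict[str, Exception] = {}
    for replica in replicas:
        try:
            return replica, fetch(replica)
        except OSError as exc:  # ConnectionError is a subclass of OSError
            errors[replica] = exc
    raise AllReplicasFailed(f"no replica reachable: {sorted(errors)}")


if __name__ == "__main__":
    def fake_fetch(depot: str) -> bytes:
        # Simulate one depot that has lost connectivity.
        if depot == "depot-1":
            raise ConnectionError("link down")
        return b"payload from " + depot.encode()

    print(fetch_with_failover(["depot-1", "depot-2"], fake_fetch))
```

Because the caller sees only the returned data, the switch from a failed depot to a working one stays transparent, which is the behaviour the slide asks for.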

Future work
New client development
– Support for new application areas
Extended and transparent replica management
Full instrumentation
– Data for load balancing, replica creation/deletion, and user access optimisation

Thank you for your interest