István Csabai Eötvös University Budapest. observation theoryreality Science: early times.

Slides:



Advertisements
Similar presentations
Trying to Use Databases for Science Jim Gray Microsoft Research
Advertisements

World Wide Telescope mining the Sky using Web Services Information At Your Fingertips for astronomers Jim Gray Microsoft Research Alex Szalay Johns Hopkins.
1 Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research
Common Instrument Middleware Architecture and Federation of Instrument Resources for X-ray Crystallography Rick McMullen Indiana University.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
© 2006 Open Grid Forum Network Services Interface Introduction to NSI Guy Roberts.
Remote Visualisation System (RVS) By: Anil Chandra.
Building the Measurement Infrastructure and Virtual Data Observatory in the European Future Internet Experimental Facility Péter Hága Eötvös Loránd University.
Welcome to Middleware Joseph Amrithraj
Eötvös University Budapest in the Network.  Seniors: István Csabai (node coordinator): »Photometric redshift estimation, virtual observatories, science.
A Prototype Implementation of a Framework for Organising Virtual Exhibitions over the Web Ali Elbekai, Nick Rossiter School of Computing, Engineering and.
Components of GIS.
VO-DAS Chenzhou CUI Chao LIU, Haijun TIAN, Yang YANG, etc National Astronomical Observatories, CAS.
Development of China-VO ZHAO Yongheng NAOC, Beijing Nov
The Big Picture Scientific disciplines have developed a computational branch Models without closed form solutions solved numerically This has lead to.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
ETOMIC measurements in EVERGROW Gábor Vattay Eötvös University/Collegium Budapest.
1 In VINI Veritas: Realistic and Controlled Network Experimentation Jennifer Rexford with Andy Bavier, Nick Feamster, Mark Huang, and Larry Peterson
CPSC 695 Future of GIS Marina L. Gavrilova. The future of GIS.
Measuring the Spatial Structure of Traffic Congestion in the Internet Gábor Vattay Center for Communication Networks Data Analysis, Collegium Budapest.
A Detailed Path-latency Model for Router Geolocation Sándor Laki *, Péter Mátray, Péter Hága, István Csabai and Gábor Vattay Department of Physics of Complex.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
Winter Retreat Connecting the Dots: Using Runtime Paths for Macro Analysis Mike Chen, Emre Kıcıman, Anthony Accardi, Armando Fox, Eric Brewer
EGEMS A Dedicated Web Based System for Ground Water Data Processing Analysis and Storage.
Passage Three Introduction to Microsoft SQL Server 2000.
Evolved from ARPANET (Advanced Research Projects Agency of the U.S. Department of Defense) Was the first operational packet-switching network Began.
Section 6.1 Explain the development of operating systems Differentiate between operating systems Section 6.2 Demonstrate knowledge of basic GUI components.
Discussion and conclusion The OGC SOS describes a global standard for storing and recalling sensor data and the associated metadata. The standard covers.
MAKING BUSINESS INTELLIGENT
Supported by the National Science Foundation’s Information Technology Research Program under Cooperative Agreement AST with The Johns Hopkins University.
Internet GIS (and its applications to transportation) Keivan Khoshons GEOG 516 March 9, 2004.
Geographic Information System GIS This project is implemented through the CENTRAL EUROPE Programme co-financed by the ERDF GIS Geographic Inf o rmation.
material assembled from the web pages at
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Science with the Virtual Observatory Brian R. Kent NRAO.
Etomic: a pan-European testbed Péter Hága Collegium Budapest Institute for Advanced Study.
OEI’s Services Portfolio December 13, 2007 Draft / Working Concepts.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
EÖTVÖS UNIVERSITY BUDAPEST Department of Physics of Complex Systems VO Spectroscopy Workshop, ESAC Spectrum Services 2007 László Dobos (ELTE)
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Frontiers in Massive Data Analysis Chapter 3.  Difficult to include data from multiple sources  Each organization develops a unique way of representing.
2Object-Oriented Analysis and Design with the Unified Process Objectives  Describe the differences and similarities between relational and object-oriented.
DataTAG Research and Technological Development for a Transatlantic Grid Abstract Several major international Grid development projects are underway at.
1 - GEC8, San Diego, July 20-22, 2010 Measurement Tools in PlanetLab Europe Tanja Zseby (Fraunhofer FOKUS, Berlin, Germany) (some slides from other OneLab.
Neural Network Based Available Bandwidth Estimation in the ETOMIC Infrastructure Péter Hága, Sándor Laki, Ferenc Tóth, István Csabai, József Stéger, Gábor.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Metadata Management of Terabyte Datasets from an IP Backbone Network: Experience and Challenges Sue B. Moon and Timothy Roscoe.
Research Networks and Astronomy Richard Schilizzi Joint Institute for VLBI in Europe
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Review for the Final Exam. Data Communication Device (Modem) Computers present within a building or a campus can be directly connected by cables. Steps.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Web based spectrum databases and utilities László Dobos Tamás Budavári István Csabai MAGPOP kick-off meeting, January Cassis.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
A Detailed Path-latency Model for Router Geolocation* Internetes hosztok mérés alapú geolokalizációja Sándor Laki, Péter Mátray, Péter Hága, István Csabai.
Precision Measurements with the EVERGROW Traffic Observatory Péter Hága István Csabai.
Introduction Computer networks: – definition – computer networks from the perspectives of users and designers – Evaluation criteria – Some concepts: –
Distributed Archives Interoperability Cynthia Y. Cheung NASA Goddard Space Flight Center IAU 2000 Commission 5 Manchester, UK August 12, 2000.
IVOA Small Projects Meeting Application to the science S. Honda, Y. Shirasaki, M. Tanaka and JVO team National Astronomical Observatory of Japan.
István Csabai Dept. of Physics of Complex Systems, Eotvos University, Budapest Big Data Wigner, September 12, 2013.
Network Topologies for Scalable Multi-User Virtual Environments Lingrui Liang.
Managing Data Resources File Organization and databases for business information systems.
Grid Optical Burst Switched Networks
MATLAB Distributed, and Other Toolboxes
CHAPTER 3 Architectures for Distributed Systems
Instantiation of the Concept in GAMMA Prototypes
University of Technology
#01 Client/Server Computing
Staying afloat in the sensor data deluge
Google Sky.
#01 Client/Server Computing
Presentation transcript:

István Csabai Eötvös University Budapest

observation theoryreality Science: early times

observation theoryreality models experiment instruments test predictions Science: past

observation theoryreality models experiment instruments virtual reality predictions test Science: present

observation theoryreality models experiment instruments virtual reality predictions test Other disciplines are similar: whole genomes, satellite maps, sensor networks, etc.

To understand the complex reality, we need complex models To verify complex models we need a lot of data (large statistical sample)

Virtual Observatories: few key elements  Modern web and database technologies to standardize and share data  Advanced indexing techniques for multi- terabyte, multidimensional archives  Distributed computing for analysis and modeling – move computation to data  Visualization  Same challenges for different disciplines: we can share the tools and solutions

Internet  Although man-made but there is no “blueprint”  “Astronomical” number of components, complex non-linear interactions  We need similar methods as in natural sciences to understand its behavior  We need to  observe / experiment  model  Future internet: self- aware, self-managing, self-healing …

Measurement/experiment facilities  Planetlab / Onelab  A network of open computers distributed across the world and available for the development of new network services.  Realistic platform available for trial deployment and experimentation, with services such as distributed storage, network mapping, peer-to-peer systems, distributed hash tables and query processing.  OneLab2 EU Integrated Project ,29 participants Europe wide + 1 from Japan, budget 7.5 M€

Measurement/experiment facilities  Etomic  An open m easurement infrastructure in Europe to carry out high temporal resolution (~10 nano second), globally synchronized, active measurements. Provides high resolution, spatially extended dynamic picture of fast changes in the network traffic.

ETOMIC hardware  Server PC architecture, Linux  Endace DAG 3.6 GE card or ARGOS FPGA measurement card  with packet sending capability (packet offset ~60 ns)  GPS antenna for global time synchronization  New low cost version  based on Blackfin microcontroller  dedicated IP packet time stamping module (< μsec) ‏  Can be called as SOAP web services / from stored procedures  low cost (300 €)

Web interface: experiment bundle

Scheduling Experiments Time slot reservation in a calendar system for the experiments

User experiments

Science archive – based on CasJobs CasJobs: database + batch + web services (Developed for SDSS; Alex Szalay, William O’Mullane, Nolan Li, María Nieto-Santisteban, Ani Thakar, Jim Gray) CasJobs: database + batch + web services (Developed for SDSS; Alex Szalay, William O’Mullane, Nolan Li, María Nieto-Santisteban, Ani Thakar, Jim Gray)

 most network measurement projects: use a single dedicated infrastructure scan only narrow sub-segments analyze a limited set of network characteristics centralized and separated from each other  key idea: try to interconnect separate measurement efforts large-scale behavior long-term evolution Network Measurement Virtual Observatory

 The challenge: seamless integration of multiple tools and data sources  We need easy to use standards for data access/exchange  Searchable metadata, organized access  Working prototypes, use cases  Unified interface:  standardized data model (NetXML)  wrapper each platform (to convert data into NetXML)  standardized way of reaching services (Web services)  Mediator:  metadata registry  service registry (e.g. time slice service)  joint queries: query broker Many ideas, concepts from astronomical VO

database Processing functions Stored procedures Search tools Visualization tools Web service interface graphical user interface traceroute archive SQL data subset XML, SOAP WSDL, processed data database Processing functions Stored procedures Search tools Visualization tools Web service interface graphical user interface delay archive SQL data subset XML, SOAP WSDL, processed data database Processing functions Stored procedures Search tools Visualization tools Web service interface graphical user interface BGP archive SQL data subset XML, SOAP WSDL, processed data Mediator/query broker GUI WS

Use cases: what do we measure  one way delay (60 ns resolution)  tracking topology changes  available bandwidth meter  transport protocol testing  queuing delay tomography  geolocalization  …

Geolocalization A B

 Useful for: content distribution, user identification, network diagnostics, language selection,…  WhoIS and DNS databases are not reliable  Method: probabilistic triangulation from landmark nodes  More landmarks – more precise localization – more computation

Hierarchical Triangular Mesh Hierarchical subdivision of spherical triangles, represented as quad tree. Developed for indexing the sky (SDSS; A. Szalay, G. Fekete) Hierarchical subdivision of spherical triangles, represented as quad tree. Developed for indexing the sky (SDSS; A. Szalay, G. Fekete) Standalone library + SQL Server integration Quick routines for intersecting spherical polygons: speedup geolocalization calculations Standalone library + SQL Server integration Quick routines for intersecting spherical polygons: speedup geolocalization calculations

observation theoryreality models experiment instruments virtual reality predictions test Conclusion

observation theoryreality models experiment instruments virtual reality predictions test Conclusion

observation theoryreality models experiment instruments virtual reality predictions test Conclusion

 P. Mátray, P. Hága, S. Laki, J. Stéger B. Hullár, G. Vattay (Eötvös University)  collaborations  Universidad Autonoma de Madrid  Universidad Publica de Navarra  Ericsson Research  Tel Aviv University This work was partially supported by the National Office for Research and Technology (NAP 2005/ KCKHA005) and the EU ICT Moment (#215225) and OneLab2 Integrated Project (#224263). Collaborators, acknowledgements

Sok boldog születésnapot, Sanyi!

Sándor Laki (C) IP Geolocation 31 Plans  Probabilistic geolocation  Using the distribution of the velocity of signal propagation  Pre-clustering delay measurements

New kind of science: mega-surveys  New technology has allowed new kind of surveys  All-sky surveys  GALEX,SDSS, IRAS,WMAP, NVSS, …  Multi-wavelength  Radio, infrared, optical, UV, X-ray, Gamma-ray  Gravitational waves, neutrinos & cosmic particles  Time domain  Pan-STARRS, LSST, GAIA, …  Collecting data in (public) digital archives  Instead of targeting special objects: observe everything  New challenge: store, organize, analyze huge amount of data  Virtual Observatories / E-science

 main goals: create large archives to:  share available measurement data & analysis results  provide easy-to-use “online” network data analysis tools integrate existing measurement and monitoring infrastructures toward a common and open platform:  standardisation of network measurement data  design and implement a system to share network data

 store & share raw data joint analysis of different types of measurement data reanalysis (with new statistical methods) reference data (historical comparison)  share analysis tools server side processing simplifies client applicatons no need to transfer bulk data packages: online processing  exchange data in a standardized way existing efforts (OGF, IETF, KML) Network Measurement Virtual Observatory (nmVO) Database Web services netXML

Science archive – based on CasJobs database Web service interface Stored procedure

Open repository