Network of Excellence in Internet Science (EINS), 2nd Review, Brussels, 4-5 February 2014, FP7-ICT-2011.1.6-288021


Network of Excellence in Internet Science (EINS), 2nd Review, Brussels, 4-5 February 2014, FP7-ICT
EINS JRA3: Evidence and Experimentation
Federico Morando (NEXA), Thanassis Tiropanis (SOTON)

2nd EINS Review, Brussels, 4-5 February 2014
WP Vision
• Develop a knowledge community that identifies, assesses and provides a repository of the tools and methodologies used to measure and adequately represent:
  - Internet data traffic (Metrology),
  - information traffic (Mediametry), and
  - the existing available platforms (Experimentation), including social-beds.
• Broaden involvement and outreach beyond the original Computer Science core.

Achievements
• Initiated a knowledge base of Internet Science methods and tools.
• Liaison with the Web Science community on synergies on the evidence base (ACM WebSci13 workshop).
• Calibration and refocusing of the JRA3 effort given recent developments in the area, aiming for:
  - tangible outcomes
  - online resources
  - community building
• An online community actively engaging in the identification and collection of datasets, methods and tools for Internet Science.
• Initial version of an online resource for an Internet Science evidence base.

Achievements - Evidence base portal


Achievements - Network Traffic Repository
• Goal: create a traffic repository where researchers can use and share network traces.
• Different approach: distributed storage.
  - Research institutions provide storage resources.
  - Cooperation: increases the available storage space.
  - Reliability: data is replicated among contributors.
• Move from a classic centralized approach to a cloud approach.
• A further step is to provide computing resources, not just storage: Hadoop apps.

Achievements - Network Traffic Repository
• UAM organizes and administers the cloud.
• Users register through a webpage.
• The repository looks like a UNIX filesystem.
• User/group permissions allow access to traces to be restricted.
• All users have access to a public area (example traces, etc.).
• Access via:
  - Web (WebDAV)
  - HDFS API
  - Locally mounting the filesystem (HDFS-Fuse)
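The user/group restriction described on this slide follows the familiar UNIX owner/group/other model. The sketch below shows how such a read check could work in principle. It is an illustration only, not the repository's code: the `TraceFile` class, the `can_read` function, and all paths, user names and group names are hypothetical.

```python
import stat
from dataclasses import dataclass


@dataclass(frozen=True)
class TraceFile:
    """Hypothetical record for a stored trace with UNIX-style metadata."""
    path: str
    owner: str
    group: str
    mode: int  # permission bits, e.g. 0o640


def can_read(user, user_groups, trace):
    """Return True if `user` may read `trace` under owner/group/other rules.

    As in POSIX, exactly one permission class applies: owner first,
    then group, then everyone else.
    """
    if user == trace.owner:
        return bool(trace.mode & stat.S_IRUSR)
    if trace.group in user_groups:
        return bool(trace.mode & stat.S_IRGRP)
    return bool(trace.mode & stat.S_IROTH)


# Made-up examples: a trace restricted to one group, and a public sample.
private_trace = TraceFile("/users/alice/trace.pcap", "alice", "lab-a", 0o640)
public_trace = TraceFile("/public/sample.pcap", "admin", "admins", 0o644)
```

With these made-up entries, a member of `lab-a` can read the private trace, while a user outside that group can read only the file in the public area.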

Achievements - Network Traffic Repository
• Distributed file system: HDFS.
• Each research institution provides datanodes.
Diagram labels: Institution 1, Institution 2
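When each institution contributes datanodes, spreading copies of a trace across institutions is what lets the data survive the loss of one contributor. The toy sketch below illustrates that idea only. It is not HDFS's actual block placement policy, and the function name, node names and replication values are assumptions made for the example.

```python
from itertools import zip_longest


def place_replicas(datanodes_by_institution, replication=2):
    """Pick `replication` datanodes, spreading them across institutions.

    `datanodes_by_institution` maps an institution name to its list of
    datanode names. Nodes are interleaved institution by institution, so
    the first replicas land at different institutions whenever possible.
    """
    per_institution = [
        [(institution, node) for node in nodes]
        for institution, nodes in sorted(datanodes_by_institution.items())
    ]
    interleaved = [
        slot
        for column in zip_longest(*per_institution)
        for slot in column
        if slot is not None
    ]
    if len(interleaved) < replication:
        raise ValueError("not enough datanodes for the requested replication")
    return interleaved[:replication]


# Hypothetical cluster with two contributing institutions.
cluster = {"Institution 1": ["dn1", "dn2"], "Institution 2": ["dn3"]}
```

With two replicas, the sketch places one copy at each institution. A third replica falls back to a second node at Institution 1.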

Achievements - Schemas
• There are no standards for publishing datasets.
  - Everyone uses a proprietary description format, e.g. snap.stanford.edu/data, konect.uni-koblenz.de/downloads/#full_datasets
  - In domains such as bioinformatics and medical reports there are more widely accepted ontologies, which are used as schemas.
  - There are many links in the LOD cloud, but for a particular application context you have to build the mappings you need; this is challenging.
• Our approach
  - Use Microdata markup and vocabularies.
  - When we lack a taxonomy, or to inform taxonomies, we use DBpedia and other taxonomies (e.g. the ACM taxonomy).

Achievements - Schemas
Microdata basics
• Microdata is a simple semantic markup scheme that is an alternative to RDFa.
• Developed by WHATWG and supported by major search companies (Google, Microsoft, Yahoo).
• Like RDFa, it uses HTML tag attributes to host metadata.
• Vocabularies are controlled and hosted at schema.org.
Using Microdata
• The Microdata effort has two parts: markup and a set of vocabularies.
• The markup is similar to RDFa in that it provides a way to identify subjects, types, properties and objects.
• The sanctioned vocabularies are found at schema.org and include a small number of very useful ones: people, movies, etc.

Achievements - Schemas
Example content: Avatar. Director: James Cameron (born 1954). Science fiction. Trailer (link to avatar-trailer.html).
• An itemscope attribute identifies a content subtree that is the subject about which we want to say something.
• The itemtype attribute specifies the subject's type.
• An itemprop attribute gives a property of that type.
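The roles of itemscope, itemtype and itemprop are easier to see on marked-up HTML. The slide's own tags did not survive in this transcript, so the snippet below is an assumed reconstruction that follows the pattern of schema.org's introductory Movie example. The small parser class is hypothetical and handles only a single, non-nested item. It is a sketch of the idea, not a full Microdata implementation.

```python
from html.parser import HTMLParser

# Assumed markup: the type (schema.org/Movie) and the property names
# (name, director, genre, trailer) are not preserved in the transcript.
MOVIE_HTML = """
<div itemscope itemtype="http://schema.org/Movie">
  <h1 itemprop="name">Avatar</h1>
  <span>Director: <span itemprop="director">James Cameron</span> (born 1954)</span>
  <span itemprop="genre">Science fiction</span>
  <a href="avatar-trailer.html" itemprop="trailer">Trailer</a>
</div>
"""


class FlatMicrodataParser(HTMLParser):
    """Collects the itemtype and itemprop values of one non-nested item."""

    def __init__(self):
        super().__init__()
        self.itemtype = None
        self.properties = {}
        self._open_prop = None  # itemprop whose text is currently being read

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if "itemscope" in attrs:
            self.itemtype = attrs.get("itemtype")
        prop = attrs.get("itemprop")
        if prop is None:
            return
        if tag == "a" and "href" in attrs:
            # For links, the property value is the URL, not the link text.
            self.properties[prop] = attrs["href"]
        else:
            self._open_prop = prop
            self.properties[prop] = ""

    def handle_data(self, data):
        if self._open_prop is not None:
            self.properties[self._open_prop] += data

    def handle_endtag(self, tag):
        # Simplification: any closing tag ends the current property.
        self._open_prop = None


def extract_item(html):
    """Return (itemtype, {itemprop: value}) for the item in `html`."""
    parser = FlatMicrodataParser()
    parser.feed(html)
    props = {name: value.strip() for name, value in parser.properties.items()}
    return parser.itemtype, props
```

Calling `extract_item(MOVIE_HTML)` yields the type URL and a dictionary with the four properties. Unannotated text such as "(born 1954)" is ignored because it sits outside any itemprop element.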

Achievements - DBpedia-based taxonomy
• Classification of the collected Internet Science resources using DBpedia categories.
• DBpedia categories are mined with the extraction/classification software TellMeFirst, using the textual description of each tool/dataset/infrastructure:
  - an online demo is available
  - the process may be further automated

Achievements - DBpedia-based taxonomy
• Examples of mined categories:
  - Category:Network_analyzers
  - Category:Data_management
  - Category:Bots
  - Category:Web_application_frameworks
  - Category:Computer_benchmarks
  - Category:Wireless_sensor_network
  - Category:Virtualization_software

Achievements - Schemas
schema.org vocabularies: Thing::CreativeWork::Dataset
• Properties from Thing
  - additionalType (typeof DBpedia categories, etc.)
  - alternateName
  - description
  - image
  - name
  - sameAs (original site or Wikipedia entry)
  - url
• Properties from Dataset
  - catalog
  - distribution
  - spatial
  - temporal
• Properties from CreativeWork (subset)
  - about
  - audience (or scientific community)
  - author
  - award
  - awards
  - citation
  - contributor
  - copyrightHolder
  - copyrightYear
  - creator
  - dateCreated
  - dateModified
  - datePublished
  - keywords (e.g. from ACM taxonomy)
  - version
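A catalogue entry that uses a few of these properties could be turned into Microdata markup along the following lines. This is a minimal sketch under stated assumptions: the `render_dataset` function is hypothetical, the example record is invented, and using `<link>` elements for URL-valued properties is one possible convention, not necessarily what the evidence base portal does.

```python
from html import escape

# Properties whose values are URLs; rendered as <link href="..."> elements.
URL_PROPERTIES = {"url", "sameAs", "additionalType"}


def render_dataset(record):
    """Render a dict of schema.org Dataset properties as a Microdata block.

    A list value produces one element per entry, so a property such as
    `keywords` can be repeated.
    """
    lines = ['<div itemscope itemtype="http://schema.org/Dataset">']
    for prop, value in record.items():
        values = value if isinstance(value, list) else [value]
        for item in values:
            if prop in URL_PROPERTIES:
                lines.append(
                    f'  <link itemprop="{escape(prop)}" href="{escape(item)}">'
                )
            else:
                lines.append(
                    f'  <span itemprop="{escape(prop)}">{escape(item)}</span>'
                )
    lines.append("</div>")
    return "\n".join(lines)


# Hypothetical catalogue entry; every value below is made up for illustration.
example = {
    "name": "Example network trace collection",
    "description": "Packet traces captured on a campus network.",
    "url": "http://example.org/traces",
    "additionalType": "http://dbpedia.org/resource/Category:Network_analyzers",
    "keywords": ["network measurement", "traffic analysis"],
}
print(render_dataset(example))
```

Here `additionalType` carries the mined DBpedia category and `keywords` carries terms from a controlled taxonomy, matching the annotations in the property list.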

Achievements
• These achievements are in line with activities R3.2 and R3.3 under the revised plan:
  - Online experimental and empirical evidence base (e.g. portal and repository)
  - Setting up a multidisciplinary dialogue (e.g. ACM WebSci13 workshop)
• The workshop co-located with the Web Science community advanced interdisciplinary dialogue and agenda setting for the experimentation base.
• We already have a vibrant community engaged in providing an online evidence base.

Links with other activities
• Working with JRA1 and JRA2 as the third link in the process: theory, methodology, experimentation.
• Collaboration with JRA2 on sharing experiences for online catalogues and repositories, with possible integration in the future.
• Collaboration with JRA1, JRA2 and JRA4 on net neutrality, and with JRA5 and JRA6 on open data, as cases for examining network measurement methods and tools from an interdisciplinary viewpoint.
• Collaboration with JRA8 on an evidence base for the sustainability of the Future Internet.

JRA Challenges: Input to Internet Science Roadmap
• The datasets, tools and methods for Internet Science are scattered across repositories used within individual disciplines.
• Providing harmonized access to those resources is essential for Internet Scientists.
• A scalable and sustainable online resource for Internet Science research is needed.
• Bootstrapping this activity involves:
  - Online catalogues and repositories.
  - Interdisciplinary dialogue forums.
  - Dialogue with other interdisciplinary areas.

Future Steps
• Enhancing the online evidence base and providing community engagement mechanisms.
• Repository for network traffic datasets contributed and shared by the community.
• Multi/inter-disciplinary dialogue forum on:
  - Network measurement methods and tools (with application to net neutrality)
  - Data quality assessment (with application to open data repositories)

Conclusions
• The JRA met its objectives and performed significant work in terms of community building:
  - Joint workshops with other interdisciplinary areas
  - Foundation of the evidence base
• In light of recent developments in e-Infrastructures and repositories, it has re-scoped its activity.
• It has already delivered significant tangible outcomes in terms of an online evidence base.