Mário J. Silva Universidade de Lisboa, Faculdade de Ciências, Departamento de Informática WP3 – Information Platform.

Slides:



Advertisements
Similar presentations
Usage Statistics in Context: related standards and tools Oliver Pesch Chief Strategist, E-Resources EBSCO Information Services Usage Statistics and Publishers:
Advertisements

The DRIVER Infrastructure (Digital Repository Infrastructure Vision for European Research) Paolo Manghi ISTI - National Research Council, Italy.
Configuration management
Welcome to Middleware Joseph Amrithraj
5/30/2012. Provides a method for finding services/data on the Exchange Network – discover data. Supports User Friendly Tools Can automatically collect.
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. The Future is Now JeromeDL A Digital Library on Social Semantic.
Basic Searching Engineering Village. Agenda What is Engineering Village? Setting up a personal account Searching Engineering Village How to.
WP3 – Information Platform
Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
Demonstrators: Mudasir Nazir(08-CS-41).  I am highly addicted to this field.  Working with W3C in research program(building CSS for creating web site.
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
Advanced Searching Engineering Village.
Engineering Village ™ Basic Searching.
Validata Release Coordinator Accelerated application delivery through automated end-to-end release management.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
Engineering Village ™ ® Basic Searching On Compendex ®
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Drupal Workshop Introduction to Drupal Part 1: Web Content Management, Advantages/Disadvantages of Drupal, Drupal terminology, Drupal technology, directories.
Building Library Web Site Using Drupal
Automated Tracking of Online Service Policies J. Trent Adams 1 Kevin Bauer 2 Asa Hardcastle 3 Dirk Grunwald 2 Douglas Sicker 2 1 The Internet Society 2.
Project Proposal: Academic Job Market and Application Tracker Website Project designed by: Cengiz Gunay Client: Cengiz Gunay Audience: PhD candidates and.
Www. ScoutsOnline.co.uk On-Brand Websites for Scout Groups.
These materials are prepared only for the students enrolled in the course Distributed Software Development (DSD) at the Department of Computer.
Content Strategy.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
BARCELONA January 2011 European Commission Information Society and Media GaLA Game and Learning Alliance The European Network of Excellence on Serious.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Creating and Operating a Digital Library for Information and Learning– the GROW Project Muniram Budhu Department of Civil Engineering & Engineering Mechanics.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
A DΙgital Library Infrastructure on Grid EΝabled Technology ETICS Usage in DILIGENT Pedro Andrade
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
EdReNe – 2nd Strategic Seminar, June 2008 Spanish Educational Repository agrega+
Towards Low Overhead Provenance Tracking in Near Real-Time Stream Filtering Nithya N. Vijayakumar, Beth Plale DDE Lab, Indiana University {nvijayak,
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Aude Dufresne and Mohamed Rouatbi University of Montreal LICEF – CIRTA – MATI CANADA Learning Object Repositories Network (CRSNG) Ontologies, Applications.
Seattle Drupal Clinic Introduction to Drupal Part 1: Web Content Management, Advantages/Disadvantages of Drupal, Drupal terminology.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
@ 2008 Copyright NIC I Do not distribute without permission E-Services for Transforming to the Next Generation Government “A Case Study of India” Suchitra.
Any data..! Any where..! Any time..! Linking Process and Content in a Distributed Spatial Production System Pierre Lafond HydraSpace Solutions Inc
SOAP-based Web Services Telerik Software Academy Software Quality Assurance.
Automatic Metadata Discovery from Non-cooperative Digital Libraries By Ron Shi, Kurt Maly, Mohammad Zubair IADIS International Conference May 2003.
An answer to your common XACML dilemmas Asela Pathberiya Senior Software Engineer.
WP3 Information and Monitoring Rob Byrom / WP3
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
OOI Cyberinfrastructure and Semantics OOI CI Architecture & Design Team UCSD/Calit2 Ocean Observing Systems Semantic Interoperability Workshop, November.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
Download Manager software Training Workshop Ostend, Belgium, 20 th May 2014 D.M.A. Schaap - Technical Coordinator.
Workflow based Knowledge Management Solutions 1 InfoAxon Technologies Limited Confidential.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
Building Library Web Site Using Drupal
Cloud based linked data platform for Structural Engineering Experiment
Moving on : Repository Services after the RAE
SMART GROUND platform overview
Service-centric Software Engineering
eCulture Science Gateway – reloaded
AMGA Web Interface Vincenzo Milazzo
Metadata Framework as the basis for Metadata-driven Architecture
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
JISC Information Environment Service Registry (IESR)
Is a Content Management System in Your Future?
Mark Quirk Head of Technology Developer & Platform Group
Presentation transcript:

Mário J. Silva Universidade de Lisboa, Faculdade de Ciências, Departamento de Informática WP3 – Information Platform

What will be necessary to predict epidemics precisely? 15 Mar nd Epiwork Review Brussels 2 Data of many different types and many unrelated sources. Improved accuracy makes required data a never- ending story We all want to see realistic and timely plots of epidemics propagation. Available, but hard to find, collect and maintain!

15 Mar nd Epiwork Review Brussels 3 Epiwork

15 Mar nd Epiwork Review Brussels 4

Other Internet Monitoring Sources 15 Mar nd Epiwork Review Brussels 5

Social Media Sources 15 Mar nd Epiwork Review Brussels 6

Data.gov.uk, keyword=epidemiology 15 Mar nd Epiwork Review Brussels 7

data.gov, epidemiology 15 Mar nd Epiwork Review Brussels 8

Linked Data 15 Mar nd Epiwork Review Brussels 9

Data in Epiwork Classic SourcesModern Sources 15 Mar nd Epiwork Review Brussels 10 [National Bureau of Statistics] demographics, transportation data,.. [Public Health authorities] surveillance data (maybe?) [Internet Monitoring Sources] [Social Media] behavioural data To be shared by epidemic modellers in a digital library, dubbed the Epidemic Marketplace

Epiwork Mar nd Epiwork Review Brussels

Outline 15 Mar nd Epiwork Review Brussels The need for an Epidemic Marketplace 2. Epidemic Marketplace D3.3 Public Release of the Epidemic Marketplace Platform 4. Where we stand and plans for work ahead

Steps for Creating the EM 15 Mar nd Epiwork Review Brussels Elaborate meta-model for describing datasets used by epidemic modellers. 2. Provide query services over the meta-data to discover resources. 3. Select ontologies for characterizing data and develop an ontology of epidemic concepts. 4. Ingest, harmonize and cross-link data. 5. Provide query services to select epidemic data using the EM meta-data and ontologies.

Common Reference Model 15 Mar nd Epiwork Review Brussels 14 Open domain: detailed description of the datasets used in the models of all sorts of epidemics would require describing virtually every kind of information, given the diversity of factors and the interdisciplinary of epidemiologic studies. Data model needs to support interlinked data.

Meta-data and Ontologies 15 Mar nd Epiwork Review Brussels 15 The information model of the EM is directly defined as metadata and ontologies. Ontology and Meta-data standards, the Pros and Cons of using them, annotation and deployment strategies, and the steps for creating an metamodel for epidemic data were the subject of D3.1 reviewed last year.

EM: Main Components 15 Mar nd Epiwork Review Brussels 16

EM 1.0 Software Components Fedora Commons 2.X for the implementation of the main features of the repository. Access control in the platform XACML (OASIS 2010), LDAP (Tuttle et al. 2004) Shibolleth (identity management). Front-end based in Muradora Forum based on phpBB (+ Muradora) Mar nd Epiwork Review Brussels

Outline 15 Mar nd Epiwork Review Brussels The need for an Epidemic Marketplace 2. Epidemic Marketplace D3.3 Public Release of the Epidemic Marketplace Platform 4. Where we stand and plans for work ahead

What is new since Mar 2010? 15 Mar nd Epiwork Review Brussels Improved reliability 2. MEDCollector – automatic data collector 3. Meta-data policies and editor 4. Web services API + Simple EM Client 5. Improved user interface 6. Public: anyone can browse and register (required for upload)

Improved Reliability Reorganizations and back-end Services Before Public Deployment Virtualized environment: every major component running on two separate virtual machines - production + development environments (Xen+CentOS) Monitoring and alerts for all services (Nagios) Logging and Analysis (Google Analytics) 15 Mar nd Epiwork Review Brussels 20

MEDCollector Web Services Workflow Processes Local Storage Dashboard for Workflow Design Mar nd Epiwork Review Brussels

Geonames.org: All Countries and Capitals UMLS: twitter searched ion subset of 89 terms related to “Disease or Syndrome” MEDCollector Data Model Mar nd Epiwork Review Brussels

MEDCollector Services Data Collection Services Query Selection Services Data Harvesting Services XML Transformation Services Database Loading Service Data Packaging Services To CSV Mar nd Epiwork Review Brussels

MEDCollector - BPEL Language to define how Web-Services Communicate Standard graphical notation – BPMN → Complex! Mar nd Epiwork Review Brussels

MEDCollector: Dashboard WireIt! Mar nd Epiwork Review Brussels

MEDCollector: Dashboard Watch the Demo! Mar nd Epiwork Review Brussels

Automatically Collected Data Twitter: 89 diseases, world-coverage ProMed-mail Google Flu Trends CDC RSS Feeds Flu updates Travel Notices... Periodically packed and uploaded to the EM repository 15 Mar nd Epiwork Review Brussels 27

What is new since Mar 2010? 15 Mar nd Epiwork Review Brussels Improved reliability 2. MEDCollector – automatic data collector 3. Meta-data policies and editor 4. Web services API + Simple EM Client 5. Improved user interface 6. Public: anyone can browse and register (for upload)

Meta-data Policies and Editor Meta-data introduction simplified Editor that pops-up on upload now fills most of the entries with appropriate defaults. EM Repository Meta-data Vocabulary Generic DCTERMS adopted for datasets characterisation Epidemics-specific DCTERMS defined for epidemic datasets characterisation 15 Mar nd Epiwork Review Brussels 29

DC Term Example: RightsHolder 15 Mar nd Epiwork Review Brussels 30

EM Term Example: HostGroup 15 Mar nd Epiwork Review Brussels 31

Mediator Web Services 15 Mar nd Epiwork Review Brussels 32 OpenLDAP Mediator Client Fedora Commons Repository OAI-PMH RESTful Interface OAI-ORE Fetch/Search Upload

Simple EM Client Mapping of client filenames to EM resources (FC data streams and Collections) Operations: Check-out, check-in 15 Mar nd Epiwork Review Brussels 33 Watch the Demo! Download from

EM 15 Mar nd Epiwork Review Brussels 34 Old Graphic Style

15 Mar nd Epiwork Review Brussels 35 Try it at:

Outline 15 Mar nd Epiwork Review Brussels The need for an Epidemic Marketplace 2. The Epidemic Marketplace 3. D3.3 Public Release of the Epidemic Marketplace Platform 4. Where we stand and plans for work ahead

WP3: status (what we have done) 15 Mar nd Epiwork Review Brussels 37 Deliverable D3.1 (meta-model) released Deliverable D3.2 (prototype) released Hardware and base software deployed; Initial prototype of EM with initial set of characterized datasets Deliverable D3.3 (public version) released Data-collector EM DCAP and meta-data handling Web Services

Events 2nd year London, Delhi, Bilbao, ERCIM News 15 Mar nd Epiwork Review Brussels 38

Publications in the 1st year 15 Mar nd Epiwork Review Brussels Mário J. Silva, Fabrício A.B. Silva, Luís Filipe Lopes, Francisco M. Couto, Building a Digital Library for Epidemic Modelling. Proceedings of ICDL The International Conference on Digital Libraries 1, p , New Delhi, India, February, TERI Press -- New Delhi, India. Invited Paper. 2. Luis Filipe Lopes, João Zamite, Bruno Tavares, Francisco Couto, Fabrício A.B. Silva, Mário J. Silva, Automated Social Network Epidemic Data Collector. INForum - Simpósio de Informática September, 2009.

EM-related Publications (2nd year) 1. Mário J. Silva, Fabrício A.B. Silva, Luís Filipe Lopes, Francisco M Couto, Building a Digital Library for Epidemic Modelling. Proceedings of ICDL The International Conference on Digital Libraries 1, p. 447–459, New Delhi, India, 23–27 February, TERI Press–New Delhi, India. Invited Paper. 2. Fabrício A.B. Silva, Mário J. Silva, Francisco M Couto, Epidemic Marketplace: an e-Science Platform for Epidemic Modelling and Analysis. ERCIM News 82 – Special Theme: Computational Biology. July, Luis Filipe Lopes, Fabrício A.B. Silva, Francisco M Couto, João Zamite, Hugo Ferreira, Carla Sousa, Mário J. Silva, Epidemic Marketplace: An Information Management System for Epidemiological Data. Proceedings of ITBAM'10 - 1st International Conference on Information Technology in Bio- and Medical Informatics - DEXA 2010 August, João Zamite, Fabrício A.B. Silva, Francisco M Couto, Mário J. Silva, MEDCollector: Multisource Epidemic Data Collector. Proceedings of ITBAM'10 - 1st International Conference on Information Technology in Bio- and Medical Informatics - DEXA 2010 August, João Zamite, Multisource Epidemic Data Collector, Master Dissertation, University of Lisbon, Faculty of Sciences, September Luis Filipe Lopes, A Metadata Model for the Annotation of Epidemiological Data, Master Dissertation, University of Lisbon, Faculty of Sciences, September Hugo Ferreira, O Mediador do Epidemic Marketplace. Master Dissertation, University of Lisbon, Faculty of Sciences, September, 2010; (in Portuguese). 15 Mar nd Epiwork Review Brussels 40

WP3: status (what we will do) Overcoming the initial difficulties in hiring the planned resources Refreshed team with competencies required for the 2nd and 3rd year; Hiring 1 sw eng for push in release of EM 2.0 Working on Epidemic Marketplace 2.0 D3.4 and D3.5 due Feb 2012 site analytics interlinking Peeking on how to address challenges for the 4th year negotiating access to content 15 Mar nd Epiwork Review Brussels 41

Changes in UL WP3 Team Out Fabrício Silva Luis F. Lopes (meta-data) Hugo Ferreira (mediator) In Dulce Domingos (access control) Juliana Duque (information architecture, graphics) João Ferreira (ontologies) + (always in) Mário Francisco João Zamite 15 Mar nd Epiwork Review Brussels 42

Scheduled Deliverables 15 Mar nd Epiwork Review Brussels 43

Todo List and Planning (Brussels, Mar 2011) 1. Evolve Simple EM Client and GleamViz to become showcase for tight integration with Computational Platform 2. Refine and populate the catalogue of epidemic resources: enrichment, interlinking and semantification of epidemic data 3. Release second version of the EM. Re-implemented Web Services (no more Muradora) New information architecture, new front-end design New social network access control Mar nd Epiwork Review Brussels

On the nature of Soc Intelligent Systems Who should learn behaviours about individuals from the network? No Silver Bullet “Classic” Engineering approaches too slow for 21st century pace We are now all part of a huge Living Lab How much longer will the fact that your cat sneezed be relevant?...we might have to ask again. Are we still under control? We may need more flexible ways to control access to sensitive data Aug Assyst, London

Classical Approaches Role Based Access Control (RBAC): Advantages:  Roles are intuitive concepts in organizations  Users can easily be reassigned from one role to another Disadvantages:  Central Administration has to manage roles  Does not take into account collaborative/social dynamics

Access Control Based on Social Networks Objects have owners (or publishers) Owners are part of a social network and define access policies based on the network information

EM 2.0 Software Components Fedora Commons main features of the repository. Mediator services reimplemented. Webservices provided by FC invoked directly. Access control in the platform XACML + LDAP (Tuttle et al. 2004) Shibolleth (identity management). Access Control Based on Social Networks Front-end based in the Drupal CMS Integrated forum Mar nd Epiwork Review Brussels

EM 2.0 Mock-up interface 15 Mar nd Epiwork Review Brussels 49

WP3 SWOT Analysis Strengths Epiwork-driven EM Standards-based Open Source modules Supported (until 2012) Weaknesses Unpopulated EM Looking for the right policies What are the incentives? Interfaces to WP4 and WP5? 15 Mar nd Epiwork Review Brussels 50 Unchanged !

WP3 SWOT Analysis Opportunities Epiwork testbed Creation of a baseline for epidemic modelling Showcase for partners’ outputs Threats Consortium enters “everyone for himself” mode. “Somebody will take care of that” attitude EM perceived as a very expensive, complex and useless cache 15 Mar nd Epiwork Review Brussels 51 Unchanged !

15 Mar nd Epiwork Review Brussels 52

Current Focus – EM Mar nd Epiwork Review Brussels 53 Designing and implementing the new user interface. Must be useful to the expert and occasional user. New Access Control mechanisms addressing data privacy in socially intelligent environment. Refining and populating, enriching the catalogue of epidemic resources using initial prototype. The method of scanning published epidemic modelling studies and then inferring the metadata descriptions has shown to be very useful.

Todo list and planning (Torino, Nov 2009) 15 Mar nd Epiwork Review Brussels Populate Repository 2. Linked Epidemic Data 3. Ethics, Privacy and Anonimization 4. Access control policies 5. Dataset selection generation 6. Distributed Authentication 7. Replicate EM node

Todo list and planning (Brussels, Mar 2010) 1. Populate Repository 2. Linked Epidemic Data 3. Ethics, Privacy and Anonimization 4. Access control policies 5. Dataset selection generation 6. Distributed Authentication 7. Replicate EM node 15 Mar nd Epiwork Review Brussels 55

Todo list and planning (Torino, Dec 2010) 1. Populate Repository 2. New Front-end and updated components (prepare market) 3. Access control policies & distributed authentication 4. Linked Epidemic Data 5. Ethics, Privacy and Anonimization 6. Replicate EM node 15 Mar nd Epiwork Review Brussels 56

The falacies of free-text 15 Mar nd Epiwork Review Brussels 57 Initial “proof-of-concept” prototype showed the limitations spanning from annotating the datasets using free text in the meta-data description fields. A much simpler model, inspired on web2.0 “tags.” EM users will be able to freely annotate their datasets using their own terminologies (also dubbed as “folksonomies”).

Kdnuggets, march Mar nd Epiwork Review Brussels 58