Download presentation
Presentation is loading. Please wait.
Published byEvelyn Edrington Modified over 9 years ago
1
Mário J. Silva Universidade de Lisboa, Faculdade de Ciências, Departamento de Informática mjs@di.fc.ul.pt WP3 – Information Platform
2
What will be necessary to predict epidemics precisely? 15 Mar 2011 - 2nd Epiwork Review Brussels 2 Data of many different types and many unrelated sources. Improved accuracy makes required data a never- ending story We all want to see realistic and timely plots of epidemics propagation. Available, but hard to find, collect and maintain!
3
15 Mar 2011 - 2nd Epiwork Review Brussels 3 Epiwork
4
http://www.gripenet.pt/ 15 Mar 2011 - 2nd Epiwork Review Brussels 4
5
Other Internet Monitoring Sources 15 Mar 2011 - 2nd Epiwork Review Brussels 5
6
Social Media Sources 15 Mar 2011 - 2nd Epiwork Review Brussels 6
7
Data.gov.uk, keyword=epidemiology 15 Mar 2011 - 2nd Epiwork Review Brussels 7
8
data.gov, epidemiology 15 Mar 2011 - 2nd Epiwork Review Brussels 8
9
Linked Data 15 Mar 2011 - 2nd Epiwork Review Brussels 9 http://linkeddata.org/
10
Data in Epiwork Classic SourcesModern Sources 15 Mar 2011 - 2nd Epiwork Review Brussels 10 [National Bureau of Statistics] demographics, transportation data,.. [Public Health authorities] surveillance data (maybe?) [Internet Monitoring Sources] [Social Media] behavioural data To be shared by epidemic modellers in a digital library, dubbed the Epidemic Marketplace
11
Epiwork 11 15 Mar 2011 - 2nd Epiwork Review Brussels
12
Outline 15 Mar 2011 - 2nd Epiwork Review Brussels 12 1. The need for an Epidemic Marketplace 2. Epidemic Marketplace 1.0 3. D3.3 Public Release of the Epidemic Marketplace Platform 4. Where we stand and plans for work ahead
13
Steps for Creating the EM 15 Mar 2011 - 2nd Epiwork Review Brussels 13 1. Elaborate meta-model for describing datasets used by epidemic modellers. 2. Provide query services over the meta-data to discover resources. 3. Select ontologies for characterizing data and develop an ontology of epidemic concepts. 4. Ingest, harmonize and cross-link data. 5. Provide query services to select epidemic data using the EM meta-data and ontologies.
14
Common Reference Model 15 Mar 2011 - 2nd Epiwork Review Brussels 14 Open domain: detailed description of the datasets used in the models of all sorts of epidemics would require describing virtually every kind of information, given the diversity of factors and the interdisciplinary of epidemiologic studies. Data model needs to support interlinked data.
15
Meta-data and Ontologies 15 Mar 2011 - 2nd Epiwork Review Brussels 15 The information model of the EM is directly defined as metadata and ontologies. Ontology and Meta-data standards, the Pros and Cons of using them, annotation and deployment strategies, and the steps for creating an metamodel for epidemic data were the subject of D3.1 reviewed last year.
16
EM: Main Components 15 Mar 2011 - 2nd Epiwork Review Brussels 16
17
EM 1.0 Software Components Fedora Commons 2.X for the implementation of the main features of the repository. Access control in the platform XACML (OASIS 2010), LDAP (Tuttle et al. 2004) Shibolleth (identity management). Front-end based in Muradora Forum based on phpBB (+ Muradora) 17 15 Mar 2011 - 2nd Epiwork Review Brussels
18
Outline 15 Mar 2011 - 2nd Epiwork Review Brussels 18 1. The need for an Epidemic Marketplace 2. Epidemic Marketplace 1.0 3. D3.3 Public Release of the Epidemic Marketplace Platform 4. Where we stand and plans for work ahead
19
What is new since Mar 2010? 15 Mar 2011 - 2nd Epiwork Review Brussels 19 1. Improved reliability 2. MEDCollector – automatic data collector 3. Meta-data policies and editor 4. Web services API + Simple EM Client 5. Improved user interface 6. Public: anyone can browse and register (required for upload)
20
Improved Reliability Reorganizations and back-end Services Before Public Deployment Virtualized environment: every major component running on two separate virtual machines - production + development environments (Xen+CentOS) Monitoring and alerts for all services (Nagios) Logging and Analysis (Google Analytics) 15 Mar 2011 - 2nd Epiwork Review Brussels 20
21
MEDCollector Web Services Workflow Processes Local Storage Dashboard for Workflow Design 21 15 Mar 2011 - 2nd Epiwork Review Brussels
22
Geonames.org: All Countries and Capitals UMLS: twitter searched ion subset of 89 terms related to “Disease or Syndrome” MEDCollector Data Model 22 15 Mar 2011 - 2nd Epiwork Review Brussels
23
MEDCollector Services Data Collection Services Query Selection Services Data Harvesting Services XML Transformation Services Database Loading Service Data Packaging Services To CSV 23 15 Mar 2011 - 2nd Epiwork Review Brussels
24
MEDCollector - BPEL Language to define how Web-Services Communicate Standard graphical notation – BPMN → Complex! 24 15 Mar 2011 - 2nd Epiwork Review Brussels
25
MEDCollector: Dashboard WireIt! - http://javascript.neyric.com/wireit/ 25 15 Mar 2011 - 2nd Epiwork Review Brussels
26
MEDCollector: Dashboard Watch the Demo! 26 15 Mar 2011 - 2nd Epiwork Review Brussels
27
Automatically Collected Data Twitter: 89 diseases, world-coverage ProMed-mail Google Flu Trends CDC RSS Feeds Flu updates Travel Notices... Periodically packed and uploaded to the EM repository 15 Mar 2011 - 2nd Epiwork Review Brussels 27
28
What is new since Mar 2010? 15 Mar 2011 - 2nd Epiwork Review Brussels 28 1. Improved reliability 2. MEDCollector – automatic data collector 3. Meta-data policies and editor 4. Web services API + Simple EM Client 5. Improved user interface 6. Public: anyone can browse and register (for upload)
29
Meta-data Policies and Editor Meta-data introduction simplified Editor that pops-up on upload now fills most of the entries with appropriate defaults. EM Repository Meta-data Vocabulary Generic DCTERMS adopted for datasets characterisation Epidemics-specific DCTERMS defined for epidemic datasets characterisation 15 Mar 2011 - 2nd Epiwork Review Brussels 29
30
DC Term Example: RightsHolder 15 Mar 2011 - 2nd Epiwork Review Brussels 30
31
EM Term Example: HostGroup 15 Mar 2011 - 2nd Epiwork Review Brussels 31
32
Mediator Web Services 15 Mar 2011 - 2nd Epiwork Review Brussels 32 OpenLDAP Mediator Client Fedora Commons Repository OAI-PMH RESTful Interface OAI-ORE Fetch/Search Upload
33
Simple EM Client Mapping of client filenames to EM resources (FC data streams and Collections) Operations: Check-out, check-in 15 Mar 2011 - 2nd Epiwork Review Brussels 33 Watch the Demo! Download from http://epimarketplace.net/mediator/
34
EM 15 Mar 2011 - 2nd Epiwork Review Brussels 34 Old Graphic Style
35
15 Mar 2011 - 2nd Epiwork Review Brussels 35 Try it at: http://epimarketplace.net
36
Outline 15 Mar 2011 - 2nd Epiwork Review Brussels 36 1. The need for an Epidemic Marketplace 2. The Epidemic Marketplace 3. D3.3 Public Release of the Epidemic Marketplace Platform 4. Where we stand and plans for work ahead
37
WP3: status (what we have done) 15 Mar 2011 - 2nd Epiwork Review Brussels 37 Deliverable D3.1 (meta-model) released Deliverable D3.2 (prototype) released Hardware and base software deployed; Initial prototype of EM with initial set of characterized datasets Deliverable D3.3 (public version) released Data-collector EM DCAP and meta-data handling Web Services
38
Events 2nd year London, Delhi, Bilbao, ERCIM News 15 Mar 2011 - 2nd Epiwork Review Brussels 38
39
Publications in the 1st year 15 Mar 2011 - 2nd Epiwork Review Brussels 39 1. Mário J. Silva, Fabrício A.B. Silva, Luís Filipe Lopes, Francisco M. Couto, Building a Digital Library for Epidemic Modelling. Proceedings of ICDL 2010 - The International Conference on Digital Libraries 1, p. 447--459, New Delhi, India, 23--27 February, 2010. TERI Press -- New Delhi, India. Invited Paper. 2. Luis Filipe Lopes, João Zamite, Bruno Tavares, Francisco Couto, Fabrício A.B. Silva, Mário J. Silva, Automated Social Network Epidemic Data Collector. INForum - Simpósio de Informática September, 2009.
40
EM-related Publications (2nd year) 1. Mário J. Silva, Fabrício A.B. Silva, Luís Filipe Lopes, Francisco M Couto, Building a Digital Library for Epidemic Modelling. Proceedings of ICDL 2010 - The International Conference on Digital Libraries 1, p. 447–459, New Delhi, India, 23–27 February, 2010. TERI Press–New Delhi, India. Invited Paper. 2. Fabrício A.B. Silva, Mário J. Silva, Francisco M Couto, Epidemic Marketplace: an e-Science Platform for Epidemic Modelling and Analysis. ERCIM News 82 – Special Theme: Computational Biology. July, 2010. 3. Luis Filipe Lopes, Fabrício A.B. Silva, Francisco M Couto, João Zamite, Hugo Ferreira, Carla Sousa, Mário J. Silva, Epidemic Marketplace: An Information Management System for Epidemiological Data. Proceedings of ITBAM'10 - 1st International Conference on Information Technology in Bio- and Medical Informatics - DEXA 2010 August, 2010. 4. João Zamite, Fabrício A.B. Silva, Francisco M Couto, Mário J. Silva, MEDCollector: Multisource Epidemic Data Collector. Proceedings of ITBAM'10 - 1st International Conference on Information Technology in Bio- and Medical Informatics - DEXA 2010 August, 2010. 5. João Zamite, Multisource Epidemic Data Collector, Master Dissertation, University of Lisbon, Faculty of Sciences, September 2010. 6. Luis Filipe Lopes, A Metadata Model for the Annotation of Epidemiological Data, Master Dissertation, University of Lisbon, Faculty of Sciences, September 2010. 7. Hugo Ferreira, O Mediador do Epidemic Marketplace. Master Dissertation, University of Lisbon, Faculty of Sciences, September, 2010; (in Portuguese). 15 Mar 2011 - 2nd Epiwork Review Brussels 40
41
WP3: status (what we will do) Overcoming the initial difficulties in hiring the planned resources Refreshed team with competencies required for the 2nd and 3rd year; Hiring 1 sw eng for push in release of EM 2.0 Working on Epidemic Marketplace 2.0 D3.4 and D3.5 due Feb 2012 site analytics interlinking Peeking on how to address challenges for the 4th year negotiating access to content 15 Mar 2011 - 2nd Epiwork Review Brussels 41
42
Changes in UL WP3 Team Out Fabrício Silva Luis F. Lopes (meta-data) Hugo Ferreira (mediator) In Dulce Domingos (access control) Juliana Duque (information architecture, graphics) João Ferreira (ontologies) + (always in) Mário Francisco João Zamite 15 Mar 2011 - 2nd Epiwork Review Brussels 42
43
Scheduled Deliverables 15 Mar 2011 - 2nd Epiwork Review Brussels 43
44
Todo List and Planning (Brussels, Mar 2011) 1. Evolve Simple EM Client and GleamViz to become showcase for tight integration with Computational Platform 2. Refine and populate the catalogue of epidemic resources: enrichment, interlinking and semantification of epidemic data 3. Release second version of the EM. Re-implemented Web Services (no more Muradora) New information architecture, new front-end design New social network access control 44 15 Mar 2011 - 2nd Epiwork Review Brussels
45
On the nature of Soc Intelligent Systems Who should learn behaviours about individuals from the network? No Silver Bullet “Classic” Engineering approaches too slow for 21st century pace We are now all part of a huge Living Lab How much longer will the fact that your cat sneezed be relevant?...we might have to ask again. Are we still under control? We may need more flexible ways to control access to sensitive data.. 45 2 Aug 2010 - Assyst, London
46
Classical Approaches Role Based Access Control (RBAC): Advantages: Roles are intuitive concepts in organizations Users can easily be reassigned from one role to another Disadvantages: Central Administration has to manage roles Does not take into account collaborative/social dynamics
47
Access Control Based on Social Networks Objects have owners (or publishers) Owners are part of a social network and define access policies based on the network information
48
EM 2.0 Software Components Fedora Commons 3.4 - main features of the repository. Mediator services reimplemented. Webservices provided by FC invoked directly. Access control in the platform XACML + LDAP (Tuttle et al. 2004) Shibolleth (identity management). Access Control Based on Social Networks Front-end based in the Drupal CMS Integrated forum 48 15 Mar 2011 - 2nd Epiwork Review Brussels
49
EM 2.0 Mock-up interface 15 Mar 2011 - 2nd Epiwork Review Brussels 49 http://v2.epimarketplace.net/mockup
50
WP3 SWOT Analysis Strengths Epiwork-driven EM Standards-based Open Source modules Supported (until 2012) Weaknesses Unpopulated EM Looking for the right policies What are the incentives? Interfaces to WP4 and WP5? 15 Mar 2011 - 2nd Epiwork Review Brussels 50 Unchanged !
51
WP3 SWOT Analysis Opportunities Epiwork testbed Creation of a baseline for epidemic modelling Showcase for partners’ outputs Threats Consortium enters “everyone for himself” mode. “Somebody will take care of that” attitude EM perceived as a very expensive, complex and useless cache 15 Mar 2011 - 2nd Epiwork Review Brussels 51 Unchanged !
52
15 Mar 2011 - 2nd Epiwork Review Brussels 52
53
Current Focus – EM 2.0 15 Mar 2011 - 2nd Epiwork Review Brussels 53 Designing and implementing the new user interface. Must be useful to the expert and occasional user. New Access Control mechanisms addressing data privacy in socially intelligent environment. Refining and populating, enriching the catalogue of epidemic resources using initial prototype. The method of scanning published epidemic modelling studies and then inferring the metadata descriptions has shown to be very useful.
54
Todo list and planning (Torino, Nov 2009) 15 Mar 2011 - 2nd Epiwork Review Brussels 54 1. Populate Repository 2. Linked Epidemic Data 3. Ethics, Privacy and Anonimization 4. Access control policies 5. Dataset selection generation 6. Distributed Authentication 7. Replicate EM node
55
Todo list and planning (Brussels, Mar 2010) 1. Populate Repository 2. Linked Epidemic Data 3. Ethics, Privacy and Anonimization 4. Access control policies 5. Dataset selection generation 6. Distributed Authentication 7. Replicate EM node 15 Mar 2011 - 2nd Epiwork Review Brussels 55
56
Todo list and planning (Torino, Dec 2010) 1. Populate Repository 2. New Front-end and updated components (prepare market) 3. Access control policies & distributed authentication 4. Linked Epidemic Data 5. Ethics, Privacy and Anonimization 6. Replicate EM node 15 Mar 2011 - 2nd Epiwork Review Brussels 56
57
The falacies of free-text 15 Mar 2011 - 2nd Epiwork Review Brussels 57 Initial “proof-of-concept” prototype showed the limitations spanning from annotating the datasets using free text in the meta-data description fields. A much simpler model, inspired on web2.0 “tags.” EM users will be able to freely annotate their datasets using their own terminologies (also dubbed as “folksonomies”).
58
Kdnuggets, march 2010 15 Mar 2011 - 2nd Epiwork Review Brussels 58
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.