Open Data – reflections from behind the Big Firewall Or, may you be cursed to live in interesting times.

Slides:



Advertisements
Similar presentations
Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
Advertisements

DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Maines Sustainability Solutions Initiative (SSI) Focuses on research of the coupled dynamics of social- ecological systems (SES) and the translation of.
Sheldon Brown, UCSD, Site Director Milton Halem, UMBC Director Yelena Yesha, UMBC Site Director Tom Conte, Georgia Tech Site Director Fundamental Research.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Virtual SharePoint Summit 2010 hosted by Rackspace Overcoming Collaboration Challenges with SharePoint Chris Samson Leslie Sistla Virtual SharePoint Summit.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Sheldon Brown, UCSD, Site Director Milton Halem, UMBC Director Yelena Yesha, UMBC Site Director Tom Conte, Georgia Tech Site Director Fundamental Research.
Unlock Your Data Rich connectivity Robust data integration Enterprise-class manageability Deliver Relevant Information Intuitive design environment.
Nick Wainwright HP Labs / Effectsplus project. The report of a consultation of the Future Internet Assembly – a cross disciplinary assembly of researchers.
© 2007 IBM Corporation © 2009 IBM Corporation 1 Tran Viet Huan, PhD CTO, IBM Vietnam IBM Research Global Technology Outlook.
Web 3.0 or The Semantic Web By: Konrad Sit CCT355 November 21 st 2011.
Presented to: By: Date: Federal Aviation Administration Enterprise Information Management SOA Brown Bag #2 Sam Ceccola – SOA Architect November 17, 2010.
Getting Smarter with Information An Information Agenda Approach
Welcome Thank you for joining us today. Please stand by while we wait for more attendees to join in. The webcast will begin momentarily.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
Findly Leads the World in Talent Innovation with Its Enterprise-Cloud for Global Talent Acquisition COMPANY PROFILE: FINDLY Findly is a SaaS ISV founded.
Understanding Data Warehousing
Hosted on the Powerful Microsoft Azure Platform, Advent Countdown Lets Companies Run Reliable and Scalable Holiday Marketing Campaigns MICROSOFT AZURE.
August 27, 2008 Platform Market, Business & Strategy.
TECHNOLOGY GUIDE THREE
material assembled from the web pages at
Alert Logic Security and Compliance Solutions for vCloud Air High-level Overview.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
© 2011 IBM Corporation 1 (ENSUREing we can) Ride the Wave (on a Cloud) Presenter: Michael Factor, Ph.D. IBM Research – Haifa
1. Process Gather Input – Today Form Coherent Consensus – Next two months.
Data Warehousing Data Mining Privacy. Reading Bhavani Thuraisingham, Murat Kantarcioglu, and Srinivasan Iyer Extended RBAC-design and implementation.
Alert Logic Security and Compliance Solutions for vCloud Air High-level Overview.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
IntelliSense.io Beyond the hype - Real World Applications / Solutions of Internet of Things.
Alert Logic Provides a Fully Managed Security and Compliance Solution Based in the Cloud, Powered by the Robust Microsoft Azure Platform MICROSOFT AZURE.
© 2010 IBM Corporation Business Analytics software Business Analytics Editable Text Editable Text Editable Text.
Actualog Social PIM Helps Companies to Manage and Share Product Information Using Secure, Scalable Ease of Microsoft Azure MICROSOFT AZURE ISV PROFILE:
Collaboration in eRegion- ICT for Growth and Empowerment Bror Salmelin Head of Unit, New working environments European Commission, DG Information Society.
Built on Azure, Moodle Helps Educators Create Proprietary Private Web Sites Filled with Dynamic Courses that Extend Learning Anytime, Anywhere MICROSOFT.
Combining Cloud Power with Mobile Technology, Fielding Systems Is Delivering the Digital Oilfield to Modern Oil and Gas Production Companies COMPANY PROFILE:
OpenField Consolidates Stadium Data, Provides CRM and Analysis Functions for an Intelligent, End-to-End Solution COMPANY PROFILE : OPENFIELD Founded by.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Built on the Microsoft Azure Platform, Prudena Provides Users with the Ideas and Confidence to Make Sound Investment Decisions MICROSOFT AZURE APP BUILDER.
MidVision Enables Clients to Rent IBM WebSphere for Development, Test, and Peak Production Workloads in the Cloud on Microsoft Azure MICROSOFT AZURE ISV.
IoT Meets Big Data Standardization Considerations
+ Logentries Is a Real-Time Log Analytics Service for Aggregating, Analyzing, and Alerting on Log Data from Microsoft Azure Apps and Systems MICROSOFT.
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
Microsoft Azure Powers the Convenios e Obras Module for the Connected Government Solution, Which Can Integrate, Speed Up Decision-Making MICROSOFT AZURE.
MAR Capability Overview Deck Protean Analytics.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
Zentera Guardia Fabric ™ Securely Connects Client-Server Apps between Microsoft Azure, Enterprise Datacenters & Other Public Clouds MICROSOFT AZURE ISV.
DenyAll Delivering Next-Generation Application Security to the Microsoft Azure Platform to Secure Cloud-Based and Hybrid Application Deployments MICROSOFT.
Microsoft Azure and ServiceNow: Extending IT Best Practices to the Microsoft Cloud to Give Enterprises Total Control of Their Infrastructure MICROSOFT.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
Internet of Things. Creating Our Future Together.
1© 2015 IBM Corporation Unlocking the power of the API economy Client Briefing Nov.
Big Data Quality Challenges for the Internet of Things (IoT) Vassilis Christophides INRIA Paris (MUSE team)
SAP BI – The Solution at a Glance : SAP Business Intelligence is an enterprise-class, complete, open and integrated solution.
CSE 5810 Biomedical Informatics and Cloud Computing Zhitong Fei Computer Science & Engineering Department The University of Connecticut CSE5810: Introduction.
© 2007 IBM Corporation IBM Software Strategy Group IBM Google Announcement on Internet-Scale Computing (“Cloud Computing Model”) Oct 8, 2007 IBM Confidential.
Dell Software Unified Communications Command Suite (UCCS) Provides Flexible, Cross-Platform Management, Reporting and Data Diagnostics MICROSOFT AZURE.
Organizations Are Embracing New Opportunities
Smart Building Solution
Partner Logo Veropath Offers a Next-Gen Expense Management SaaS Technology Solution, Built Specifically to Harness Big Data Analytics Capabilities in Azure.
Smart Building Solution
H3 Solutions and the Azure Government Cloud Team Up to Power Contextual Intelligence Platform – Where Big Data Meets Business Productivity MICROSOFT AZURE.
Intelledox Infiniti Helps Organizations Digitally Transform Paper and Manual Business Processes into Intuitive, Guided User Experiences on Azure MICROSOFT.
Logsign All-In-One Security Information and Event Management (SIEM) Solution Built on Azure Improves Security & Business Continuity MICROSOFT AZURE APP.
Global Enterprise Search
Accelerate Your Self-Service Data Analytics
(VIP-EDC) Point 6 of the agenda
Data Warehousing and Data Mining
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
One-Stop Shop Manages All Technical Vendor Data and Documentation and is Globally Deployed Using Microsoft Azure to Support Asset Owners/Operators MICROSOFT.
Presentation transcript:

Open Data – reflections from behind the Big Firewall Or, may you be cursed to live in interesting times

Open Data …. Why bother? Open Contributed Content will become a core, strategic, economic resource – and the most accessible & scalable resource we possess. Mobility, Openness & Connection will matter more than Presence & Rigid Structures In 2013 expect generation of >850 Exabytes of Internet data. Mostly user contributed content (versus traditional enterprise sources). In 2013 expect generation of >850 Exabytes of Internet data. Mostly user contributed content (versus traditional enterprise sources). Global access to technology is already driving trends like ‘virtual citizenship’, ‘virtual employment’ & ‘social innovation’ On-demand interaction will increasingly be the norm for a global community of virtual innovators … who expect their user experience to be as simple as ‘using an appliance’

Open Data and Economics or …. ‘Greater Fool Investing’ …..!!’ Open data is a potential new 'raw material' for economic growth. It requires effort to produce and maintain. Unlike traditional raw materials like oil, gas and minerals, its value increases fastest when it is open and shareable. Bubble … "trade in high volumes at prices that are considerably at variance with intrinsic values". Open Data alone does not generate direct economic benefit sufficient to offset production & operational costs … the question is … can it generate sufficient ‘value’ to be sustainable? Incentives must be in place to sustain “economically significant” amounts of Open Data Some bright lights … but we need answers before we run out of steam!!

How Private is Private? Privacy is not absolute, it is a balance between Risk and Utility Open Data usage is inherently contradictory Social media usage -> Maximize Utility + (Largely) Ignore Risk Enterprise usage -> Maximize Utility + Minimize Risk Who carries liability in case of dispute? Uncertainty in usage policies is a substantial form of business risk Recognize in policy and legislation that privacy is mutable - based on context ✔ Available Open Data useful to identify & characterize group behaviors ✖ Negative usage for ‘nuisance’ providers to identify high-value targets { ∃ (high value residences)} ∩ { ∃ (long emergency response time)} ∩ { ∃ (many local area crimes)}  {area where people might buy home security products} (all available on open data sites near you ) { ∃ (high value residences)} ∩ { ∃ (long emergency response time)} ∩ { ∃ (many local area crimes)}  {area where people might buy home security products} (all available on open data sites near you )

A Fun Use Case

Challenges for Privacy in an Open Data World And I haven’t even mentioned Trust, Provenance, Security, ……

Data – 100’s of datasets, 1000’s of files – Very open domain(s) – Very expensive to normalize – Scaling complexity from high dimensionality Approach – Pay-as-you go approach, only process what you need – Do not stick to a common model, use any you can find – Generate interesting views and feed them to “analytics” Lessons learned – Multiple models, depending on context – Need to do things incrementally – Lightweight generally better than heavyweight Selected research results: -Live deployment in Dublin -Won prize in Semantic Web Challenge -Paper at ISWC -Paper at Hypertext -Invited paper at Journal of Web Semantics Selected research results: -Live deployment in Dublin -Won prize in Semantic Web Challenge -Paper at ISWC -Paper at Hypertext -Invited paper at Journal of Web Semantics Research impact: what we have learned so far There are plenty of interesting challenges!! Documents + Metadata StructureEntities LinksViews Insight …. Pay-as-you-go, Gain-as-you-go

Dublinked - Towards a robust test-bed for Open Data Research IBM Connections Social Media & Collaboration IBM Connections Social Media & Collaboration IBM IOC Interaction with Industry Solutions IBM IOC Interaction with Industry Solutions Dublin City Enterprise Platform IBM Enterprise Cloud Scalable compute, storage & network infrastructure IBM Enterprise Cloud Scalable compute, storage & network infrastructure Provider 1…N Open REST Web Services API Catalog & Navigation Search & Query Privacy & Security Knowledge Representation & Reasoning Publication & Annotation Visualization & Analytics Enterprise Citizen IBM Products & Services Robust models to organize and represent resources and their context Scalable privacy and security of resources Automated assimilation and sharing of resources Scalable privacy and security of resources Automated assimilation and sharing of resources Compose resources for development, mash-up & visualization Challenges include.. IBM Research Partners & People Key Represent knowledge efficiently for continuous machine reasoning and diagnosis Research Testbed

What we do: Learning Systems to Help Diagnose the City Problem How can we provide City decision makers with explanations and diagnoses for events by applying machine reasoning techniques to a fusion of massive, rich, complex and dynamic data? How can we move from explanation to prediction? Challenges Identifying relevant data and information Capturing and representing anomalies Correlating knowledge on heterogeneous data sources Advanced fusion of heterogeneous data from multiple sources Goals Identification of the nature and cause of changes Explaining logical connection of knowledge across space and time Move from explanation to prediction Anomaly Detected: Delayed buses, congested roads Anomaly Detected: Delayed buses, congested roads Detection to Diagnosis?

Outline Research Roadmap Use Cases Technology Provenance Privacy High-volume distributed querying Wide-scale distributed querying Distributed Entity Linking Fine-grain Access Control Streaming Analytics Distributed Reasoning Context Mining Lightweight Distributed Information Access Contextual Access Basic Access Control Distributed Entity Consolidation Graph Access Linked Data Cloud Context Retrieval Cross-agency Context Retrieval Cross-agency Analytics Cross Web-Enterprise Analytics Many-agency Analytics Public Safety Integrator Life analytics (social/health/public safety) High-risk/time-critical alerting Cross-agency Alerting Data Warehouse Dynamic Distributed Information Analytics