Axel Polleres web: http://polleres.net twitter: @AxelPolleres Open Data on the Web as the fuel for Cognitive Computing Das Potenzial von Open Data für.

Slides:



Advertisements
Similar presentations
A question of German
Advertisements

1 San Francisco Innovation and Cloud Journey Jon Walton Chief Information Officer
Spatial Data Infrastructure: Concepts and Components Geog 458: Map Sources and Errors March 6, 2006.
Turning statistics into knowledge Per Nymand-Andersen European Central Bank CCSA session on International Statistics Ankara, 5 S EPTEMBER 2013.
Web Standards and Technical Challenges for Publishing and Processing Data on the Web Axel Polleres web:
IBM TJ Watson Research Center © 2010 IBM Corporation – All Rights Reserved AFRL 2010 Anand Ranganathan Role of Stream Processing in Ad-Hoc Networks Where.
ASIDIC Spring Conference ‘Smart Content’ Uncovering the Value and Benefits of Semantic Technology Richard C. Fusco Director, Content Strategy – McGraw-Hill.
How can you use Open Data? ... And why you should!
Michalis Vafopoulos NTUA, GFOSS & The transformers GREEN CITY HACKATHON.
Web Standards and Technical Challenges for Publishing and Processing Open Data Axel Polleres web:
Web 2.0 Tools Used in the Finance/Investment Management Industry.
Modernisation of ESS infrastructure: The ESS instruments - a review E. di Meglio – P. Jacques – J.M. Museux.
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
BG 5+6 How do we get to the Ideal World? Tuesday afternoon What gaps, challenges, obstacles prevent us from attaining the vision now? What new research.
@nmoneypenny Innovating New Products & Services with Enterprise Social Graphing: Naomi Moneypenny.
© 2007 IBM Corporation IBM Software Strategy Group IBM Google Announcement on Internet-Scale Computing (“Cloud Computing Model”) Oct 8, 2007 IBM Confidential.
AP CSP: What is Big Data?.
Activities in a nutshell
Sebastian Neumaier Advisor: Univ.Prof. Dr. Axel Polleres Co-Advisor:
Corinne Reisert Sarah Zelt Joerg Wacker
Kostas Seferis, i2S Data science and e-infrastructures can help aquaculture to improve performance and sustainability!
IBM Marketplace: Business Partner Overview
Primal and Microsoft Azure Deliver Personalized Content, Intelligence, and Analytics That Match Your Content to the Interests of Your Audience MICROSOFT.
Library Web Portals: Reinventing Libraries for the Future
From Data to Capabilities
Your customer base is at risk!
Getting Innovative with OER
A Web-enabled Approach for generating data processors
City of Toronto Open Data
Dr Paul Lewis Chief Technology Officer
Predictions For The Future of Cloud Computing. Cloud Services In India.
Lifting Data Portals to the Web of Data
Metadata Quality: Learning from Open Data Portalwatch
Digital transformation of democracy?
Financial Transparency System (FTS)
Welcome: How to use this presentation
Marek Šturc European Commission - Eurostat
Semantic Web: Commercial Opportunities and Prospects
File Manager for Microsoft Office 365, SharePoint, and OneDrive: Extensible Via Custom Connectors in Enterprise Deployments, Ideal for End Users OFFICE.
SDMX: Enabling World Bank to automate data ingestion
Global Enterprise Search
11. The future of SDMX Introducing the SDMX Roadmap 2020
Your gateway to cloud innovation
:Let’s Bring Schools to XXI Century!
Syllabus – what will we cover?
Automating Profitable Growth™
The Open Fiscal Data Package
Climate Change Unit Lesson 2
Big Data Young Lee BUS 550.
Metcalfe’s Law : Why is the Web so Big?
Information technologies/NBIC and Big data
Making official statistics open by default
How to build the bridge: Budget Open Data Policy
JOINED AT THE HIP: DEVSECOPS AND CLOUD-BASED ASSETS
Statistical Information Technology
Big Data ESSNet WP 1: Web scraping / Job Vacancies Pilot
International Statistics
Introduction to Dataflows in Power BI
Computer Services Business challenge
TOOLS & Projects overview
Low-bandwidth Semantic Web
Helsinki – Accelerating Open Innovation
Automating Profitable Growth
Let’s Talk About Open Data
Vancouver Public Library
APE EAD3 introduction - DARIAH - Brussels
Australian and New Zealand Metadata Working Group
Socio-Economic Impact of ESS
The Intelligent Enterprise and SAP Business One
The Key Pillars of Success in Becoming a Customer Centered Intelligent Enterprise in the Experience Economy Wonderful to be here today Work in Customer.
Presentation transcript:

Axel Polleres web: http://polleres.net twitter: @AxelPolleres Open Data on the Web as the fuel for Cognitive Computing Das Potenzial von Open Data für Cognitive Computing Axel Polleres web: http://polleres.net twitter: @AxelPolleres

Why are we here today? künstliche Intelligenz Cognitive Computing Danke für Die Einladung! Lassen Sie mich beginnen mit der Ankündigung zum heutigen event…. Hier stechen einige Begriffe ins Auge: Es geht um “Cognitive Computing” ... Was ist das überhaupt? Es hat offensichtlich etwas mit künstlicher Intelligenz zu tun... Was ist das überhaupt? Wie wir feststellen ist Artificial Intelligence im Rahmen der “:Digitalen Transformation“ selbst im Wandel und hat in den letzen Jahren einige Veränderungen durchlebt... Eine Frage die gestellt wird ist die welche Lösungen bereits im Einsatz sind worauf ich versuchen werde zumindest aus meiner Forschungsperspektive – der Erforschung von Daten am und aus dem Web – ein paar Antworten zu geben... Last, but not least, ist eine Frage in der Ankündigung, mit welchen Themen wir uns befassen sollten, und ich werde versuchen, Sie bis zum Ende dieses Vortrages davon zu überzeugen, dass Open Data, also offene Daten ein solches Thema sind! Mit welchen Themen sollten wir uns befassen? Welche Lösungen sind im Einsatz?

1. What is Cognitive Computing?

1. What is Cognitive Computing? Dr. Chris Welty, now Google, formerly IBM Research (Watson team) “Help computers perform tasks we can do really quickly, at a scale that humans can’t” “Computational tasks that require inexact solutions that combine multiple methods in unpredictable ways.” “Machine intelligence is not human intelligence, the difference lies in the errors it makes.” https://www.wu.ac.at/wutv/show/clip/20141024-welty/

2. What is Artificial Intelligence? John Launchbury, the Director of DARPA's Information Innovation Office Video, published on Feb 15, 2017 “demistify” Artificial Intelligence

2. What is Artificial Intelligence? John Launchbury, the Director of DARPA's Information Innovation Office Video, published on Feb 15, 2017

2. What is Artificial Intelligence? John Launchbury, the Director of DARPA's Information Innovation Office Video, published on Feb 15, 2017 “learn from data, like spreadsheets on steroids”

2. What is Artificial Intelligence? John Launchbury, the Director of DARPA's Information Innovation Office Video, published on Feb 15, 2017 Recall: “Machine intelligence is not human intelligence, the difference lies in the errors it makes.” is it?

2. What is Artificial Intelligence? John Launchbury, the Director of DARPA's Information Innovation Office Video, published on Feb 15, 2017 needs – if you want a combination of first and second wave AI – rules/explanations to model common sense and act in new contexts Again, data is at it’s heart!

Which solutions exist now on the Web? Welche Lösungen sind im Einsatz? Example 1: Google’s Knowledge Graph

Which solutions exist now on the Web? Welche Lösungen sind im Einsatz? Example 2: FB’s Social Graph & News Recommendations Also uses a knowledge graph...

Which solutions exist now on the Web? Welche Lösungen sind im Einsatz? Example 3: IBM Watson! Also uses a knowledge graph... https://youtu.be/P0Obm0DBvwI?t=951

This is the knowledge Graph that IBM Watson used: Open Data from the Web! Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/

3. What is Open Data? Availability and Access: data must also be available in a convenient and modifiable form, at no more than a reasonable reproduction cost. Reuse and Redistribution: data must be provided under terms that permit reuse and redistribution and must be machine-readable. Universal Participation: everyone must be able to use, reuse and redistribute it (no discrimination) See more at: http://opendefinition.org/okd/ Open Knowledge Foundation

Open Data is a global trend: Cities, International Organizations, National and European Portals, Int'l. Conferences: 15

Open Data Portals CKAN ... http://ckan.org/ almost „de facto“ standard for Open Data Portals facilitates search, metadata (publisher, format, publication date, license, etc.) for datasets http://opendataportal.at/ http://data.gv.at/ machine-processable? ... ... partially

Our research: data.wu.ac.at What is the status of Open Data and what are the challenges using Open Data? OpenData PortalWatch – a project at WU How can Open Data be used by enterprises? Open City Data Pipeline – a joint project with Siemens on using Open Data in an Enterprise context! What's next? Improving Open Data Quality and Access: ADEQUATE (FFG - project) Making Open Data Searchable Building an Open Data Knowledge Graph! A striving Data Economy needs no silos… re-democratise the Web by Congitive Intelligence based on Open Data?

OPEN DATA PORTAL WATCH http://data.wu.ac.at/portalwatch/ Periodically monitoring a list of Open Data Portals 260 CKAN powered Open Data Portals worldwide Quality assessment Evolution tracking Meta data Data Formats, growth

What have we learnt? Open Data also has the "Vs" Volume: It's growing! (we currently monitor 90 CKAN portals, 512543 resources/ 160069 datasets, at the moment (statically) ~1TB only CSV files... Variety: different datasets (from different cities, countries, etc.), only partially comparable, partially not. Different metadata Different data formats Velocity: Open Data changes regularly (fast and slow) New datasets appear, old ones disappear

Portalwatch Example:

Our research: data.wu.ac.at What is the status of Open Data and what are the challenges using Open Data? OpenData PortalWatch – a project at WU How can Open Data be used by enterprises? Open City Data Pipeline – a joint project with Siemens on using Open Data in an Enterprise context! What's next? Improving Open Data Quality and Access: ADEQUATE (FFG - project) Making Open Data Searchable Building an Open Data Knowledge Graph! A striving Data Economy needs no silos… re-democratise the Web by Congitive Intelligence based on Open Data?

Use Case: City Data – Important for Infrastructure Providers & for City Decision Makers City Assessment and Sustainability reports Tailored offerings by Infrastructure Providers … however, these are often outdated before even published! Goal (short term): Leverage Open Data for calculating a city’ performance from public sources on the Web automatically Needs up-to-date City Data and calculates City KPIs in a way that allows to display the current state and run scenarios of different product applications. e.g. towards a “Dynamic” Green City Index: Goal (long term): Define and Refine KPI models to assess specific impact of infrastructural investments and gather/check input automatically 22

City Data Pipeline (started 2012) http://citydata.wu.ac.at/

City Data Pipeline: could be viewed as a cognitive computing use case Indicators Goal: understanding, combining, enriching, different open datasets using both first-wave and second wave AI methods Cities Note that so far we have considered the whole thing as an integrated single dataset. Now lets focus on these white spots specifially The question is: can we predict the indicators of eurostat for cities that are not in europoe, the green arrow, or the un indicators for cities which are only in eurostat the blue arrow We again used aproach 2 to fill in the missing values of a single dataset and use that as training set for the indicatosr of the other dataset.

Open Data: The more, the merrier! City Data Pipeline citydata.wu.ac.at Search for indicators & cities obtain results incl. sources Integrated data served as Linked Open Data Predicted values AND estimated error rates for missing data... Open Data: The more, the merrier! ...it‘s not finished, but: assumption: Predictions get better, the more Open data we integrate...

Our research: data.wu.ac.at What is the status of Open Data and what are the challenges using Open Data? OpenData PortalWatch – a project at WU How can Open Data be used by enterprises? Open City Data Pipeline – a joint project with Siemens on using Open Data in an Enterprise context! What's next? Improving Open Data Quality and Access: ADEQUATE.at Making Open Data Searchable Building an Open Data Knowledge Graph! A striving Data Economy needs no silos… re-democratise the Web by Congitive Intelligence based on Open Data?

Why is Search in Open Data a problem? https://www.youtube.com/watch?v=kCAymmbYIvc Structured Data in Web Search by Alon Halevy vs. Open Data Search is hard... No natural language „cues“ like in Web tables... Existing knowledge graphs don‘t cover the domain of "Open Data“ Open Data is not properly geo-referenced

Some starting points: First baby steps on building an Open Data Knowledge Graph: Ongoing work to make Open Data geo-searchable e.g. in our project communidata.at: International Semantic Web conference 2016:

Ongoing Projects (data.wu.ac.at)

Our research: data.wu.ac.at What is the status of Open Data and what are the challenges using Open Data? OpenData PortalWatch – a project at WU How can Open Data be used by enterprises? Open City Data Pipeline – a joint project with Siemens on using Open Data in an Enterprise context! What's next? Improving Open Data Quality and Access: ADEQUATE.at Making Open Data Searchable Building an Open Data Knowledge Graph! A striving Data Economy needs no silos… re-democratise the Web by Congitive Intelligence based on Open Data?

This is the knowledge Graph that IBM Watson used: “Both our knowldege graph and google’s habe roots in Wikipedia and freebase” – but none of Google and FB make their knowledge graphs freely and openly available again as Open Data! Open Data from the Web! Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/

This is a fundamental threat to the Web itself: https://www.theguardian.com/technology/2017/mar/11/tim-berners-lee-web-inventor-save-internet 1) We’ve lost control of our personal data 2) It’s too easy for misinformation to spread on the web 3) Political advertising online needs transparency and understanding

Open-Data-Fueled Cognitive Computing to the rescue! http://www.communidata.at/austrian-open- data-day-students-co-create-open-data-ideas/

Open-Data-Fueled Cognitive Computing to the rescue!

Bottomline: Let’s build Open Knowledge Graphs as a basis for Cognitive Computing for all businesses! #OpenData künstliche Intelligenz Cognitive Computing Mit welchen Themen sollten wir uns befassen? Welche Lösungen sind im Einsatz? Danke!

Eine Einladung… Wenn Sie mehr zu den Themen Semantic Web, Cognitive Computing, Linked Open Data erfahren wollen: https://iswc2017.semanticweb.org/ Wir veranstalten von 21.-25.10. an der WU die 16. International Semantic Web Conference (ISWC2017)

Things I did NOT have time to talk about: Open Data and licences  DALICC Open Data adoption barriers  see our recent paper to be presented at CEDEM2017 Privacy and data on the Web  http://privacylab.at