DG CONNECT Unit G.3 - Data Value Chain

Slides:



Advertisements
Similar presentations
CEF and Business Modelling
Advertisements

ICT PSP Infoday Luxembourg Call 2011 – 2.4 eLearning ICT-PSP Call Objective eLearning Marc Röder Infso E6/eContent and Safer Internet Luxembourg,
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Information Society Technologies Third Call for Proposals Norbert Brinkhoff-Button DG Information Society European Commission Key action III: Multmedia.
CEF Building Blocks Joao RODRIGUES FRADE
1 Regional Policy contributing to smart growth in Europe 2020 Standard presentation Brussels, November 2010 Pierre GODIN Policy Analyst, DG Regional policy.
EU SME policy The “Small Business Act” for Europe and its Review
ICT work programme ICT 22 Multimodal and natural computer interaction Aleksandra Wesolowska (Unit G.3 - Data Value Chain) Juan Pelegrin (Unit.
Research and Innovation Research and Innovation Research and Innovation Research and Innovation Research Infrastructures and Horizon 2020 The EU Framework.
Towards EU big data economy Kimmo Rossi European Commission
WP5 Digital Business Ecosystem Alessandra Benvenuti, INSIEL SpA (Friuli Venezia Giulia Region) ADC Final Conference Venice, March 13 th 2012.
ICT work programme ICT 17 Cracking the language barrier Aleksandra Wesolowska Unit G.3 - Data Value Chain.
Kimmo Rossi Head of R&I sector European Commission DG CONNECT Unit G.3 - Data Value Chain KITES Symposium 31/10/2013.
The European Agenda for Culture The OMC and the Structured dialogue with civil society.
Roberto Cencioni European Commission DG CONNECT Communications Networks, Content & Technology Media & Data Zagreb, 30 November 2012.
European Commission Preparation of the Innovation Union Flagship Initiative European Commission Presentation to ERAC 11 June 2010.
Towards a European network for digital preservation Ideas for a proposal Mariella Guercio, University of Urbino.
European Commission - DG Research - Directorate B – “Structuring the European Research Area” Jean-David MALO – Bucharest, February 12-13, NOT LEGALLY.
1 Innovation in Services Business Service Design and Innovation Fostering the Economic and Legal Framework for Innovation Performance and Development of.
The EU framework programme for research and innovation.
Global policy framework and standards on ICT accessibility UNDESA/DSPD FPRUM DISABILITY INCLUSION AND ACCESSIBLE URBAN DEVELOPMENT Nairobi, Kenya 28 October.
Yves Paindaveine DG CONNECT, European Commission
Technology-enhanced Learning: EU research and its role in current and future ICT based learning environments Pat Manson Head of Unit Technology Enhanced.
19-20 October 2010 IT Directors’ Group meeting 1 Item 6 of the agenda ISA programme Pascal JACQUES Unit B2 - Methodology/Research Local Informatics Security.
EU-China: : Demonstrating Smart Cities achievements Dr Shaun Topham EU eForum.
Digital Single Market From Open Data to the Free Flow of Data in the Digital Single Market W3C Day in Spain – 26 May 2016 Szymon Lewandowski, Data Value.
H2020 ICT WP ICT 17 Cracking the language barrier Aleksandra Wesolowska Unit G.3 - Data Value Chain H2020-LEIT-ICT WP pending Commission.
1 Open Discovery Space Overview Argiris Tzikopoulos, Ellinogermaniki Agogi Open Discovery Space [CIP-ICT-PSP ][elearning] A socially-powered and.
Work Package 2 Eva Heckl 1/10/2014 Feasibility study on an internet-based e-platform for women entrepreneurs.
EUB Brazil: IoT Pilots HORIZON 2020 WP EUB Brazil: IoT Pilots DG CONNECT European Commission.
Name - Date Technology-enhanced Learning: tomorrow’s school and beyond Pat Manson Head of Unit Technology Enhanced Learning Directorate General.
CEF.AT and its Role for the Multilingual Digital Single Market Spyridon Pilos Head of Language applications sector Directorate-General for Translation.
Green ICT Development.  General objective:  The project’s general objective is to build up strategic cluster partnership in the field of green smart.
Accelerating progress towards the Green Economy
Mikolt Csap (Unit G.2 – Creativity)
eContentplus 2008 Work Programme
Public procurement of innovation
Horizon 2020 Health, Demographic Change and Well-being Open Info Day 12 May 2016, Bruxelles NCP training ICT for Health, demographic change and well-being.
Digitising European Industry
ICT22 – 2016: Technologies for Learning and Skills ICT24 – 2016: Gaming and gamification Francesca Borrelli DG CONNECT, European Commission BRUXELLES.
“Self-Sustaining Innovation Ecosystems for European Leadership”
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Information Day on “Search Engines for Audio-Visual Content”
The ESS vision, ESSnets and SDMX
ICT PSP 2011, 5th call, Pilot Type B, Objective: 2.4 eLearning
Horizon 2020 Health, Demographic Change and Well-being Open Info Day 12 May 2016, Bruxelles NCP training ICT for Health, demographic change and well-being.
The ATTRACT initiative
HOSTED BY IN PARTNERSHIP WITH SUPPORTED BY Barcelona iCapital 2015.
A Better Internet for Kids through Industry collaboration Pat Manson –Inclusion, Skills & Youth Directorate General Communications Networks, Content.
Presentation of the ICT4Water cluster
Goal of Action Plan Implementation
HOLISDER Integrating Real-Intelligence in Energy Management Systems enabling Holistic Demand Response Optimization in Buildings and Districts Project presentation.
European data policy: Data Economy and Data for Policy
European Commission – DG CONNECT
ICTPSP Call 2007 ICT for ageing well
Presentation for information days Units involved:
The role of the ECCP (1) The involvement of all relevant stakeholders – public authorities, economic and social partners and civil society bodies – at.
Boosting Social Enterprises in Europe December 3-4, 2015
The partnership principle in the implementation of the CSF funds ___ Elements for a European Code of Conduct.
eContentplus Programme (2005 – 2008)
Culture Statistics: policy needs
European Innovation Partnership on Water
Ex ante conditionalities in cohesion policy:
GISELA & CHAIN Workshop Digital Cultural Heritage Network
The e-government Conference main issues
Juan Gonzalez eGovernment & CIP operations
eEurope 2005 What’s new? What’s still important?
DIME / ITDG Meeting Luxemburg, 14 Feb 2017
Reviewing RIS3 in Catalonia
eContentplus 2007 Work Programme
Presentation transcript:

DG CONNECT Unit G.3 - Data Value Chain Kimmo Rossi Head of R&I sector European Commission DG CONNECT Unit G.3 - Data Value Chain KITES Symposium 31/10/2013

Directorate G - Media and Data EUROPEAN COMMISSION The Communication Networks, Content & Technology Directorate General (DG CONNECT) Directorate G - Media and Data G1: Converging Media and Content G2: Creativity G3: Data Value Chain G4: Inclusion, Skills and Youth  G5: Administration and Finance

Who are we? Unit CNECT.G3 – Data Value Chain Resulted from merging three units in July 2012: Language technologies (language data, analytics) Information management (big data, linked data, analytics) Public sector information (open data) We manage a portfolio of 130 R&I projects with 300+ MEUR EU funding, 1500 FTE Responsible for: European Data value chain strategy, open data strategy and the PSI directive

Definitions What is Open Data? A useful definition: http://opendefinition.org/ free of charge (or almost) free to use, re-use, distribute ...for any purpose (also commercially) can be public OD (government) or private OD (user- generated, corporate) must not violate privacy www.open-data.europa.eu (EU OD portal) http://publicdata.eu (the LOD2 project)

Definitions What is Linked Data? Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

Definitions What is Linked Data? Allows access to Web content as a database – very useful! But there are problems: What merits to be linked? How to establish links? How to ensure quality of (automatically established) links? Currently very scarce and English-dominated For details, see LOD2 project: http://lod2.eu

Definitions What is Big Data? Very difficult to define... data is "big" if it defies traditional processing & storage paradigms – bigness becomes part of the problem the "3Vs": volume (size), velocity (bytes/s), variety (database, jpeg, video, numbers, text in language X...) ...to which we add the 4th V to denote creation of Value (by linking, aggregating, analysing, visualizing...) "linguistic" big data: all tweets, all news broadcasts, YouTube contents... statistical MT is powered (trained) by big data – but can it handle big data?

European Data value chain strategy Objective: to put in place the "systemic" prerequisites for effective use, exchange, re-use, trading... of data assets capacities and skills: build and multiply the "new" competences, curricula, prize/recognition schemes infrastructures: Open Data Portal, language resource infrastructure, evaluation platforms, incubators legal and regulatory framework: PSI directive (Public Open Data), data protection and privacy, copyright technology, tools, methods: supporting the above pilots, demonstrators: demonstrating the above in real- world problem settings, market validation

Instruments Making it happen Research and innovation: Horizon 2020 – the Research & Innovation funding programme for 2014-2020 Big Data analytics, prediction, MT: cross-language and cross-sectoral (industrial) validation, capacity-building Deployment and service infrastructures: Connecting Europe Facility (CEF), addressing, for example: Open Data Portal, Automated translation, language resources & tools repository Policy and regulation: PSI directive, privacy, copyright Support creation of Data market, remove obstacles from re-use of valuable data

Projects Some ongoing and emerging initiatives META-SHARE: language resources repository and sharing infrastructure www.meta-share.eu MT training and deployment pilots: MT@EC, iTranslate4.eu, LetsMT!, Bologna... MosesCore: making most out of MOSES www.mosescore.eu MultilingualWeb: standards and workflows for language processing www.multilingualweb.eu FALCON: Linked (Open) Data for localisation, MT and term extraction, LIDER: Linked Data as enabler of cross-media and multilingual content analytics LT Innovate: networking the language industry www.lt-innovate.eu

MT@EC MT@EC The internal Machine Translation project of Commission's DG Translation Aim: to provide functional MT service for EU institutions and (later), MS administrations, pan-EU online services Currently based on MOSES open source technology (but open to other solutions as plug-in) In pilot use since summer 2011 (EC) In production since June 2013 (EU institutions) Already 8 million pages translated Very good quality for some language pairs

Connecting Europe Facility Coming up 2014-2020 On the next slides, I will present topics of interest from the current state of: Horizon 2020 Connecting Europe Facility Disclaimer: the programmes described in the next slides are at the level of proposal/draft and subject to change following a multi-party adoption process. Only the versions formally adopted by the legislative bodies can be considered final and binding.

Connecting Europe Facility (CEF)

grants for "generic services" building on and linking to core platform CEF – Overview What is CEF? A funding programme for infrastructures and deployment of digital services procurement and deployment of mature technology to build a "core platform" grants for "generic services" building on and linking to core platform Includes a small number of "building blocks" (e.g. document delivery, eInvoicing, eID, Automated translation) What CEF is not? research or innovation (Horizon 2020 is for that) How much? 1 B€ in total for the "Telecommunications" title (Broadband & Digital services)

Automated Translation (AT) is a "building block" CEF – Automated Translation (AT) Rationale Automated Translation (AT) is a "building block" AT will serve the other Digital Services Infrastructures in CEF AT = whatever it takes to make DSIs actually multilingual Features Adaptable machine translation and relevant Language Resources are central Other likely key areas: CAT, CMS, terminology, semantic interoperability, interfaces to various systems and data types Human element is essential: service provision, quality control, validation, post-editing, on-demand response... Instruments: mostly procurement (calls for tender) National dimension: CEF implies and encourages partnerships with member states and regions, e.g. use of structural funds for language resources and, technology and translation on a "local" basis

providers of language technology, especially MT CEF – stakeholders, contributors Language industry providers of language technology, especially MT language service providers Language competence centres: provision of language resources, tools, validation and evaluation Member states/regions: coordinating local/national/regional programs and initiatives with CEF to reach critical mass

Horizon 2020 – the Research and Innovation programme

Big Data & Open data - Innovation – 50 M€ Big Data – Research – 39 M€ H2020 Work Programme 2014-15 DRAFT! Some topics: Big Data & Open data - Innovation – 50 M€ Big Data – Research – 39 M€ Cracking the language barrier – 15 M€ Multimodal & natural interaction – 7.5 M€

European Data landscape is fragmented Big Data - Rationale DRAFT! (Big/Smart/Open) Data represents significant potential for job creation and wealth (revenue) generation European Data landscape is fragmented data producers: public sector, private sector, sensors, devices, transactions... data industry: a few large companies, high number of SMEs language barriers, legal & institutional obstacles data users: practically all industry sectors We need to create European Data ecosystem, where data flows without unnecessary obstacles (open data, interoperability) data creates value (by analysing, linking, visualizing...) data and the added value is accessible to all actors we have appropriate capacities for value creation (skills, infrastructures, markets, technology)

Big Data – How DRAFT! Improve EU capacities, develop generic technology and address entire data value chains and markets - cross-sector, cross-border, cross-language 1. Innovation actions: promote open data value chains and data markets, involve SMEs, technology transfer and data exchange/reuse in cross-sector/cross-border settings, network of European skills centres for big data analytics technologies and business development 2. Research actions: novel data structures, algorithms, architectures, optimization and language understanding technologies etc. for analysis and visualisation tasks on extremely large and diverse data streams; competitions/prize schemes around specific big data challenges (e.g. prediction, deep analysis) arising from key industrial domains (e.g. geo-spatial, energy, finance, health, skills/employment, agriculture, climate/weather, product recommendations...) Implies: strong attention to application areas: building the (sub)communities and links to sector-specific Societal Challenges

Cross-program coordination on Big Data Big Data – What DRAFT! Innovation (50 M€, Call 1) 1. Open Data reuse incubator for SMEs: spin off mini projects building experiments and proofs-of-concept for business models and value-adding chains based on reuse of Open Data. 2. Collaborative projects on cross-sectoral, cross-border, cross-lingual analytics solutions/services, technology transfer, market validation, clear business cases. 3. Horizontal actions: Cross-program coordination on Big Data Network of skills centres, curricula, training and education Networking, clustering, legal issues etc. Research (39 M€, Call 2) 1. Collaborative projects addressing analysis, prediction, visualization on extremely large, diverse (multilingual, structured/unstructured) data 2. Collaborative projects to set up benchmarking and evaluation settings for big data analysis and prediction 3. Prize schemes to stimulate excellence in deep analysis and prediction on Big Data

Cracking the language barrier DRAFT! Problem: European Digital Single Market is fragmented by language barriers. Current (e.g. Google) machine translation solutions fall short in quality and coverage (languages, text types, topics) and are not customizable. Lack of cross-lingual technology equally hampers progress in multi- and cross-lingual analytics Solution: explore new avenues, methods, approaches to achieve significant improvement in translation quality in fully automatic MT. Special emphasis on: all (difficult, small) EU languages as target language. Self-learning/self-improving systems, making best use of available data and language resources. Special focus on the EU languages "facing digital extinction". Implications: close collaboration and clustering with other actions supporting language resources infrastructure (META, Connecting Europe Facility, national programs, structural funds)

Cracking the language barrier – How DRAFT! Approach: explore new avenues, methods, approaches to achieve significant improvement in translation quality in MT. Self-learning/self-improving autonomous, fully automatic systems, making best use of available data and language resources. Emphasis on: all (difficult, small) EU languages as target language. Special focus on the EU languages "facing digital extinction". Implications: close collaboration and clustering among all actions supporting a language infrastructure (META, CLARIN-ERIC, Connecting Europe Facility, national programs, structural funds...)

Cracking the language barrier (15M€, Call 1) Cracking the language barrier – What DRAFT! Cracking the language barrier (15M€, Call 1) One deep and broad research project kick off a multidisciplinary research action focus on points where current systems fail (adaptation, quality, need of large corpora...) break the glass ceiling of quality improvement A few advanced pilots – the "intertwined innovation" element test, validate, evaluate quality improvement in realistic use situations, e.g. online services address "poorly served" languages connect, contribute & make use of platforms and infrastructures for language resources, open data... One Coordination Action: promote a common language resources infrastructure for MT, benchmarking, best practices evaluation, interoperability, metadata harmonisation...

Multimodal & Natural human-computer interaction DRAFT! Rationale: while systems and devices are becoming more and more powerful, the interface to humans is lagging behind Objective: achieve transparency and invisibility of technology – effortless, effective human-machine dialogue, easy use of complex and powerful systems, easy access to information Links to: creative industries, communication technologies, language (especially: speech) processing, cognitive & behavioural analysis Uses: search, information retrieval, elderly, people with special needs, designers/artists...

ICT 2013 Conference in Vilnius 6-8 Nov 2013 Events ICT 2013 Conference in Vilnius 6-8 Nov 2013 http://ec.europa.eu/digital-agenda/en/ict-2013 In-depth presentations of Horizon 2020 Work Programme 2014-15 Networking sessions Plenary session on Big Data with VP Neelie Kroes Info Day in Luxembourg 15-16 January 2014 Proposal clinics Networking sessions – partner matching – present your proposal idea Will be announced soon

Thank you. kimmo. rossi@ec. europa. eu http://cordis. europa