Open Source SUMMA Platform

Slides:



Advertisements
Similar presentations
Multilinguality & Semantic Search Eelco Mossel (University of Hamburg) Review Meeting, January 2008, Zürich.
Advertisements

TECHNOLOGIES & CONCEPTS IN BIG DATA QUANTIFIED SELF, INTERNET OF THINGS, TELEMATICS, AND VIDEO SEARCH Amer Aljarallah IDS 594 Selected Topics in Big Data.
A Human-Centered Computing Framework to Enable Personalized News Video Recommendation (Oh Jun-hyuk)
Exploring the news | Always multi- source, multimodal and personalized.
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
Rob Marchand Genesys Telecommunications
From Web Archiving services to Web scale data processing platform Internet Memory Research GA IIPC, Paris, May 19th 2014.
Improving Machine Translation Quality via Hybrid Systems and Refined Evaluation Methods Andreas Eisele DFKI GmbH and Saarland University Helsinki, November.
Languages & The Media, 4 Nov 2004, Berlin 1 Multimodal multilingual information processing for automatic subtitle generation: Resources, Methods and System.
1 Texmex – November 15 th, 2005 Strategy for the future Global goal “Understand” (= structure…) TV and other MM documents Prepare these documents for applications.
Mining the web to improve semantic-based multimedia search and digital libraries
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
1 SAFIRE Project DHS Update – July 15, 2009 Introductions  Update since last teleconference Demo Video - Fire Incident Command Board (FICB) SAFIRE Streams.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Jumping Off Points Ideas of possible tasks Examples of possible tasks Categories of possible tasks.
Microsoft Operations Manager Presented by: Alen Plicanic.
Introductory Remarks Robust Intelligence Solicitation Edwina Rissland Daniel DeMenthon, George Lee, Tanya Korelsky, Ken Whang (The Robust Intelligence.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
New Task Group CRIS Architecture & Development Maximilian Stempfhuber RWTH Aachen University Library
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
DFKI GmbH, , R. Karger Indo-German Workshop on Language Technologies Reinhard Karger, M.A. Deutsches Forschungszentrum für Künstliche Intelligenz.
Wikis are websites where pages can be edited using an online document editor. Users can easily edit and share content. Enterprise wikis are platforms.
University of Sheffield, NLP Entity Linking Kalina Bontcheva © The University of Sheffield, This work is licensed under the Creative Commons.
1 Peter Fox Xinformatics 4400/6400 Week 11, April 16, 2013 Information Audit and dealing with Unstructured Information.
© Copyright 2013 ABBYY NLP PLATFORM FOR EU-LINGUAL DIGITAL SINGLE MARKET Alexander Rylov LTi Summit 2013 Confidential.
The LC-3 – Chapter 7 COMP 2620 Dr. James Money COMP
Prof. Thomas Sikora Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Integration Activities in “Tools for Tag Generation“
The Natural Language Processing Research Group u Professor Yorick Wilks u Dr. Rob Gaizauskas u Dr. Louise Guthrie u Dr. Mark Hepple.
Research Topics/Areas. Adapting search to Users Advertising and ad targeting Aggregation of Results Community and Context Aware Search Community-based.
Natural Language Processing Menu Based Natural Language Interfaces -Kyle Neumeier.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Friday Finish chapter 24 No written homework.
The Mint Mapping tool The MoRe aggregator Vassilis Tzouvaras, Dimitris Gavrilis National Technical University of Athens Digital Curation Unit - IMIS, Athena.
LREC 2004, 26 May 2004, Lisbon 1 Multimodal Multilingual Resources in the Subtitling Process S.Piperidis, I.Demiros, P.Prokopidis, P.Vanroose, A. Hoethker,
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
NATURAL LANGUAGE PROCESSING Zachary McNellis. Overview  Background  Areas of NLP  How it works?  Future of NLP  References.
NA-MIC National Alliance for Medical Image Computing Core 1b – Engineering Computational Platform Jim Miller GE Research.
Pascal Kelm Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Video Key Frame Extraction for image-based Applications.
IR&NLP Coursework P1 Text Analysis Within The Fields Of Information Retrieval and Natural Language Processing By Ben Addley Academic Year 2004.
Chapter – 8 Software Tools.
A method to restrict the blow-up of hypotheses... A method to restrict the blow-up of hypotheses of a non-disambiguated shallow machine translation system.
WEB MONITORING E6125 Web enHanced Information Management Presentation on Design of Web Monitoring applications. By Satyajeet Shaligram Columbia University.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Software, IEE Proceedings, Vol.152, Num.3, June 2005,Page(s): Prasanthi.S March, Java-based component framework for dynamic reconfiguration.
INHA UNIVERSITY, KOREA Rainer Simon Austrian Institute of Technology.
TV Broadcasting What to look for Architecture TV Broadcasting Solution
4/19/ :02 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
KRISTINA Consortium Presented by: Mónica Domínguez (UPF-TALN)
Siemens Enables Digitalization: Data Analytics & Artificial Intelligence Dr. Mike Roshchin, CT RDA BAM.
Proposal for Term Project
Utilizing AI & GPUs to Build Cloud-based Real-Time Video Event Detection Solutions Zvika Ashani CTO.
LACONEC A Large-scale Multilingual Semantics-based Dictionary
The OpenMOOC project A free software platform for an open education
11/23/2018 8:30 AM BRK3037 BRK3037: Dive deep on building apps and services with the Office 365 Communications Platform David Newman Senior Program Manager.
REVEAL Total cost: EUR EU contribution: EUR
Speech Capture, Transcription and Analysis App
Alexa Programming.
DIVE into the Event-Based Browsing of Linked Historical Media
ITS 2.0 Enriched Terminology Annotation Showcase
Searching and browsing through fragments of TED Talks
Pilar Orero, Spain Yoshikazu SEKI, Japan 2018
CSE 635 Multimedia Information Retrieval
Content Augmentation for Mixed-Mode News Broadcasts Mike Dowman
Guided Research: Intelligent Contextual Task Support for Mails
FashionBrain: Understanding Europe’s Fashion Data Universe
Scalable Understanding of Multilingual Media
Idiap Research Institute University of Edinburgh
Presentation transcript:

Open Source SUMMA Platform https://github.com/summa-platform/summa-oss Guntis Barzdins (LETA) User Group Meeting 3 20 November 2018 The SUMMA project is funded by the EU H2020 ICT Programme under Grant Agreement 688139

Code and Installation https://github.com/summa-platform/summa-oss SUMMA User Interface Scales to 400 live TV channels: AWS m5.24xlarge instance per 25 live TV channels Code and Installation https://github.com/summa-platform/summa-oss

SUMMA Platform TRL 5-8 TRL 3-7 MediaItem Ingestion 250 TV/radio chanels Text & Social media Speech Recognition (ASR) 9 languages Machine Translation to English Segmentation and Punctuation Natural Language Understanding (NLU) Clustering in Storylines Summarisation Storylines MediaItems Topic Detection Named Entity Recognition (NER) and Linking (NEL) Persons Organizations GPE Events a Knowledge Base (KB) population (Facts about Named Entities) User eXperience (UX) Interface Trending view 24h Named Entities view (KB) Dynamic Storylines FreeText search Scalable to 400 live channels original original EN DE AR SP PT RU IR UA LV EN EN text annotations + text

SUMMA Platform Advantages Completely self-contained No dependence on external (cloud) services All components developed within SUMMA Scalable for BigData All NLP modules are Docker containers Scalable to 400 live streams on 800 servers (e.g. AWS) No external licencing

Multilingual technologies Open Source SUMMA Platform supports EN, DE, LV

Integration Architecture & Scalability for Big Data All components are Docker containers Scalability is achieved by launching as many Docker container instances per task as required Scales to 400 live TV channels

Final Scalability Test Sources and Resources

Final Scalability Test Conclusions Shallow stream processing for live video streams (ASR, punctuation, MT, topic detection) is useful for video content monitoring Natural Language Understanding components (storyline clustering, summarization, NER, NEL, relation extraction, geo-location) are useful only for written text input, but are mostly useless for live video input Language understanding needs to be grounded in video. LETA submited an ERC grant application: «High Dimensional Representation and Computing: Pixels, Objects, Language»

DEMO