Exploring Cognitive Services

Presentation transcript:

Exploring Cognitive Services
[speaker name] [speaker contact methods]
Notes: Put your details above. Each slide contains an example script in the notes. Once you are confident with the content, you can use the headings in the notes as cues to ad-lib the content in your own words.

What are Cognitive Services?
Cognitive Services are a set of APIs designed to democratize artificial intelligence by enabling systems to see, hear, speak, understand and interpret our needs using natural methods of communication. What these services generally do is bring structured semantic data to human knowledge I/O, with a degree of confidence.
Notes: CLICK. Read or summarize the description. Read the simplified explanation. We want the machine to draw out those semantics and tell us with what degree of confidence those semantic interpretations are correct. If we can also get structured data values or entities in and out, even better.

The Cognitive Services
Vision: Computer Vision, Content Moderator, Emotion, Face, Video
Speech: Bing Speech, Custom Speech Service, Speaker Recognition
Language: Bing Spell Check, Language Understanding, Linguistic Analysis, Text Analytics, Translator, WebLM
Knowledge: Academic, Entity Linking, Knowledge Exploration, QnA Maker, Recommendations
Search: Bing Autosuggest, Bing Image Search, Bing News Search, Bing Video Search, Bing Web Search
Notes: Here’s the far-ranging list of Cognitive Services. In this session we’ll delve into a sample of these and see how they are used, including some that are part of our researcher scenario, which uses 5 services.

Services Demonstrated
Speech – Speech To Text & Text To Speech
Search – Bing Image Search
Language – Linguistic Analysis
Vision – Facial Emotion
Knowledge – QnA (bot) Maker
Language – LUIS
Notes: So let’s look at which service APIs we are going to cover over 6 demos. List them. We’ll end with a look at LUIS, one of the most powerful services.

Researcher Solution
Pipeline: STT -> LUIS -> Image Search -> Emotion -> TTS
STT: Request a picture of a thing or person (with a specific emotion) by voice. Output: “A picture of Satya Nadella happy”
LUIS: Determine intent for thing or person. Output: This is x% about a thing/person
Image Search: Get image to display. Output: Image of person
Emotion: Find emotions of person in picture. Output: Emotional readings
TTS: Be told the highest emotion and confidence. Output: “x% sure that this person is [highest emotion]”
Notes: Let’s recap the researcher scenario.
CLICK – Bing Speech Speech To Text (aka Speech Recognition). This gets us the text of what was spoken, which we then pass to…
CLICK – Language Understanding Intelligent Service (LUIS), to get which intent was present and the degree of confidence.
CLICK – Bing Image Search. We search for matching images and take the first one. If the predominant intent was for a person (vs. a thing), we also pass the image to…
CLICK – Emotion API, which gives us all the emotion ratings.
CLICK – Bing Speech Text to Speech. We find the highest rated emotion and speak about that.
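The chain described above can be sketched roughly as follows. This is a minimal illustration, not the demo's actual code: the five service calls are passed in as plain functions, and every name here (`run_researcher_pipeline`, `ResearchResult`, the parameter names, the "person" intent label) is hypothetical. It also assumes the recognized text is used directly as the image query and that nothing is spoken when the intent is a thing, neither of which the slides specify.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class ResearchResult:
    text: str                  # what the user said, as recognized text
    intent: str                # hypothetical labels: "person" or "thing"
    intent_confidence: float   # 0.0 to 1.0
    image_url: Optional[str]   # first matching image, if any
    spoken: Optional[bytes]    # synthesized audio, only for a person


def run_researcher_pipeline(
    audio: bytes,
    speech_to_text: Callable[[bytes], str],
    get_intent: Callable[[str], Tuple[str, float]],
    search_images: Callable[[str], List[str]],
    get_emotions: Callable[[str], Dict[str, float]],
    text_to_speech: Callable[[str], bytes],
) -> ResearchResult:
    # 1. STT: turn the spoken request into text.
    text = speech_to_text(audio)
    # 2. LUIS: which intent is present, and with what confidence.
    intent, confidence = get_intent(text)
    # 3. Image Search: take the first matching image (assumption: the
    #    recognized text is used as the query unchanged).
    urls = search_images(text)
    image_url = urls[0] if urls else None
    spoken = None
    # 4 and 5. Only for a person: rate emotions, then speak the highest one.
    if intent == "person" and image_url is not None:
        scores = get_emotions(image_url)
        if scores:
            top = max(scores, key=scores.get)
            spoken = text_to_speech(
                f"{scores[top]:.0%} sure that this person is {top}"
            )
    return ResearchResult(text, intent, confidence, image_url, spoken)
```

Passing the services in as functions keeps the sketch independent of any particular SDK, so each step can be replaced with a stub while rehearsing the demo flow.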

Speech – Speech To Text
(Researcher pipeline diagram repeated: STT -> LUIS -> Image Search -> Emotion -> TTS.)
Notes: Let’s get started. First, let’s look at how to do speech to text (aka speech recognition) and text to speech.

Demo 2.1: Speech to text and back again

Search – Bing Image Search
(Researcher pipeline diagram repeated: STT -> LUIS -> Image Search -> Emotion -> TTS.)
Notes: So we are now able to listen to the user, convert what they say to text, do something with that text, and output text results as speech. Moving on, we’ll skip LUIS for now and come back to it at the end. Now, let’s see how to use Bing Image Search.

Demo 2.2: Bing Image Search

Language – Linguistic Analysis
Notes: So it’s easy for us to get URLs of images matching the search text. We can get several, or just use one as in our scenario. While not related to the researcher scenario, linguistic analysis is a powerful tool for deconstructing text. It breaks down a sentence and can be used to aid natural language interpretation and processing.

Demo 2.3: Linguistic Analysis

Vision – Facial Emotion
(Researcher pipeline diagram repeated: STT -> LUIS -> Image Search -> Emotion -> TTS.)
Notes: Back to the researcher scenario, for the case where we want to search for an image of a person. Once we get an image, we want to assess the emotions present.
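Picking the highest emotion from the readings and phrasing it as in the scenario ("x% sure that this person is [highest emotion]") might look like the sketch below. It assumes the emotion readings arrive as a mapping from emotion name to a score between 0 and 1; the function names and the sample values are hypothetical and not taken from the demo.

```python
from typing import Dict, Tuple


def highest_emotion(scores: Dict[str, float]) -> Tuple[str, float]:
    """Return the emotion with the largest score, plus that score."""
    if not scores:
        raise ValueError("no emotion scores to compare")
    name = max(scores, key=scores.get)
    return name, scores[name]


def describe_emotion(scores: Dict[str, float]) -> str:
    """Build the sentence that would be handed to text to speech."""
    name, score = highest_emotion(scores)
    return f"{score:.0%} sure that this person is {name}"


# Hypothetical readings for one face.
readings = {"happiness": 0.92, "neutral": 0.05, "surprise": 0.03}
print(describe_emotion(readings))
```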

Demo 2.4: Facial Emotion

Knowledge – QnA (bot) Maker
Notes: So that’s emotion in place. Before ending with LUIS, let’s take a look at one of the really commercially useful Knowledge-related APIs: the QnA Maker.

Demo 2.5: QnA Maker

Language – LUIS
(Researcher pipeline diagram repeated: STT -> LUIS -> Image Search -> Emotion -> TTS.)
Notes: Now let’s see how to get started with LUIS, the Language Understanding Intelligent Service. In this case we want to determine whether the words spoken, once converted to text, convey a request about a person or a thing.
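The person-or-thing decision can be illustrated with a small sketch. It assumes the language understanding step has already been reduced to a mapping from intent name to confidence score; the intent labels "person" and "thing", the function name, and the sample scores are hypothetical, and the real service response carries more detail than this.

```python
from typing import Dict, Tuple


def summarize_intent(intent_scores: Dict[str, float]) -> Tuple[str, float, str]:
    """Pick the top-scoring intent and phrase it as in the scenario."""
    if not intent_scores:
        raise ValueError("no intents returned")
    intent = max(intent_scores, key=intent_scores.get)
    score = intent_scores[intent]
    message = f"This is {score:.0%} about a {intent}"
    return intent, score, message


# Hypothetical scores for "A picture of Satya Nadella happy".
intent, score, message = summarize_intent({"person": 0.87, "thing": 0.13})
print(message)

# Only a "person" intent sends the image on to the emotion step.
needs_emotion = intent == "person"
```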

Demo 2.6: LUIS in the researcher app

Resources
Learn and try at - https://www.microsoft.com/cognitive-services
Docs & getting started - https://docs.microsoft.com/en-us/azure/cognitive-services/
Cognitive Services example code - https://github.com/Microsoft/Cognitive-Samples-IntelligentKiosk
Notes: That concludes our demos, completing the chain of 5 services, used together and selectively depending on the intent of the user. Discuss the resources shown.