YALE LAW SCHOOL POLICY SCIENCES CENTER ANNUAL INSTITUTE Using a New Method of Natural Language Intelligence for Performing Wiretap Analysis Amy Neustein,

Presentation transcript:

YALE LAW SCHOOL POLICY SCIENCES CENTER ANNUAL INSTITUTE Using a New Method of Natural Language Intelligence for Performing Wiretap Analysis Amy Neustein, Ph.D. Linguistic Technology Systems

WHY DO WE NEED A NEW NATURAL LANGUAGE INTELLIGENCE METHOD FOR MINING WIRETAP RECORDINGS?
1) The volume of terrorism-related government wiretap recordings far exceeds what human intelligence agents can mine on their own; and
2) Most automated audio data mining programs have a low rate of return when searching for “keywords” in wiretap recordings, because terror suspects deliberately avoid key words that can identify names, places, dates, etc.

Sequence Package Analysis: A New Method of Natural Language Intelligence
HOW DOES SPA WORK?
1) Adds rather than replaces: SPA adds a layer of intelligence to standard dialog systems.
2) Mines audio data: SPA goes beyond a conventional search for words and word strings. It identifies a series of related speaking turns and turn construction units (parts of turns) that are discretely packaged as a sequence of conversational interaction.

WHAT IS THE METHODOLOGICAL BASIS OF SPA?
SPA is a new natural language understanding method that has been successfully peer reviewed and cited by other researchers as an important data mining method for captioning text. It draws mainly from the field of conversation analysis: the study of the orderly properties of interactive dialog that revolve around the turn-taking system and other sequentially based features that are part of that system.
Conversation analysis has been called by some a subfield of A.I. because it can detect the detailed structural organization of dialog, which is a necessary precondition for the design of dialog systems that simulate and understand human dialog.

WHAT DOES SPA DO?
1) SPA permits the discovery of “key” words (e.g., the name of a location where a crucial meeting among terrorists will take place) that are not contained in the speech application’s vocabulary.
2) SPA permits rapid and efficient data mining of large volumes of audio text by spotting sequence packages in the dialog.

MINING THE DATA FOR SEQUENCE PACKAGES
A sudden increase in the speakers’ use of pronouns in place of noun referents may indicate that the speakers are going over familiar or well-rehearsed subject matter.
An unexpected increase in the use of adjectival descriptors in place of nouns, serving as a kind of privately shared “shorthand” label for a person or enemy target, can flag terrorist plans and activities.
SPA, by looking for sequence patterns, can locate these descriptors even when they are outside of the speech application’s vocabulary.
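The pronoun cue described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the SPA implementation: the `PRONOUNS` word list, the three-turn baseline window, the `factor` multiplier, and the `floor` value are all hypothetical choices, and a real system would need part-of-speech tagging to separate pronouns from noun referents reliably.

```python
import re

# Hypothetical, hand-picked pronoun list (assumption; not from the slides).
PRONOUNS = {"i", "me", "we", "us", "you", "he", "him", "she", "her",
            "it", "they", "them", "this", "that", "these", "those"}


def pronoun_ratio(text):
    """Share of the words in one speaking turn that are pronouns."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    return sum(w in PRONOUNS for w in words) / len(words)


def flag_pronoun_spikes(turns, baseline_window=3, factor=2.0, floor=0.05):
    """Return indices of turns whose pronoun ratio jumps above the recent baseline.

    baseline_window, factor and floor are illustrative assumptions.
    """
    flagged = []
    for i in range(baseline_window, len(turns)):
        previous = [pronoun_ratio(t) for t in turns[i - baseline_window:i]]
        baseline = sum(previous) / len(previous)
        current = pronoun_ratio(turns[i])
        # The floor keeps a zero baseline from flagging every stray pronoun.
        if current >= factor * max(baseline, floor):
            flagged.append(i)
    return flagged


turns = [
    "Bring the documents to the office",
    "The driver leaves the garage at noon",
    "Park the truck behind the warehouse",
    "He gave it to them when they met him there",
]
print(flag_pronoun_spikes(turns))
```

Comparing each turn with a short rolling baseline, rather than a fixed cutoff, reflects the slide's emphasis on a sudden increase relative to how the speakers were talking a moment earlier.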

ADVANTAGES OF SPA
1) SPA captures the predictable patterns of human dialog, while all other methods depend on spotting isolated key words or phrases, which can vary from speaker to speaker;
2) SPA can be applied to different languages because it works by identifying conversational sequence patterns, which cut to the heart of the social architecture of language, rather than by identifying a preset glossary of words; and
3) SPA has the potential to perform data mining in real time, allowing a human analyst to act on the spot when hearing high-alarm content.

DEMONSTRATION
The following example shows how applying an SPA approach to wiretapped dialog can flag important security information that is cleverly disguised by the suspects:

Speaker “A” is trying to educate Speaker “B” about a new meeting place right at the tip of the Brooklyn Bridge. Any confusion or misunderstanding about this meeting place could spoil the plans. But Speaker “A” is very clever: First, he stays away from buzz words (such as naming a bridge, a tunnel or a street). Second, he refrains from making any prefatory remarks or comments to the other speaker about how vital it is to get these instructions right.

Dialog Example
Speaker “A”: Come to the intersection near Juniors? (the question mark shows an upward intonation; the speaker then pauses briefly)
Speaker “B”: 1.2 second pause
Speaker “A”: You know the thoroughfare with the big traffic light?
Speaker “B”: Juniors, yeah.

THE SEQUENCE PACKAGE
Speaker “A”: Come to the intersection near Juniors?
Speaker “B”: 1.2 seconds of silence
1) A noun referent (“Juniors”) with an upward intonation.
2) A brief pause, giving the listener the chance to show recognition or ask for clarification.
3) Silence by the listener, which indicates lack of understanding or confusion.

Speaker “A”: You know the thoroughfare with the big traffic light?
Speaker “B”: Juniors, yeah.
1) Speaker “A” produces a clarification of the noun referent “Juniors” (“You know the thoroughfare with...”).
2) Speaker “B” produces a repeat of the noun referent (“Juniors”), the source of the recognition trouble, followed by a recognitional marker (“Yeah”), which demonstrates to Speaker “A” that he has corrected the misunderstanding. Had he simply produced a recognitional marker (“yeah”) without mentioning the source of the trouble (“Juniors”), there would be no indication to the other speaker that he now recognizes the importance of the meeting place.

Finding the Sequence Package in the Dialog Example
Look for a concatenation of these utterance components:
1) noun referent with upward intonation
2) brief pause
3) silence
4) clarification of noun referent
5) repeat of noun referent that was initial source of the recognition trouble
6) recognitional marker
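The concatenation listed above can be sketched as a small pattern matcher in Python. This is a minimal illustration under several assumptions, not the SPA implementation: the `Event` record, the `RECOGNITION_MARKERS` set, and the `MIN_SILENCE` threshold are hypothetical; the transcript is assumed to be already annotated with intonation and silence durations; the noun referent is approximated as the last word of the rising-intonation turn; and the speaker's brief pause is folded into the listener's silence.

```python
from dataclasses import dataclass


@dataclass
class Event:
    speaker: str          # e.g. "A" or "B"
    text: str             # transcribed words; "" when the turn is pure silence
    rising: bool = False  # True if the turn ends with upward intonation
    silence: float = 0.0  # seconds of silence when text is ""


RECOGNITION_MARKERS = {"yeah", "yes", "right", "okay"}  # hypothetical list
MIN_SILENCE = 1.0  # hypothetical threshold, in seconds


def words_of(text):
    """Lower-cased words with surrounding punctuation stripped."""
    return [w.strip(".,?!").lower() for w in text.split() if w.strip(".,?!")]


def find_sequence_package(events):
    """Return (start index, referent) of the first matching package, else None."""
    for i in range(len(events) - 3):
        first, gap, clarification, reply = events[i:i + 4]
        first_words = words_of(first.text)
        # 1) noun referent with upward intonation (approximated by the last word)
        if not (first.rising and first_words):
            continue
        referent = first_words[-1]
        # 2-3) pause, then silence from the listener
        if gap.speaker == first.speaker or gap.text or gap.silence < MIN_SILENCE:
            continue
        # 4) clarification by the original speaker (any non-empty turn here)
        if clarification.speaker != first.speaker or not clarification.text:
            continue
        # 5-6) listener repeats the referent, then gives a recognitional marker
        if reply.speaker == first.speaker:
            continue
        reply_words = words_of(reply.text)
        if referent in reply_words:
            after = reply_words[reply_words.index(referent) + 1:]
            if RECOGNITION_MARKERS & set(after):
                return i, referent
    return None


dialog = [
    Event("A", "Come to the intersection near Juniors?", rising=True),
    Event("B", "", silence=1.2),
    Event("A", "You know the thoroughfare with the big traffic light?", rising=True),
    Event("B", "Juniors, yeah."),
]
print(find_sequence_package(dialog))
```

Note that the matcher never checks “Juniors” against a vocabulary: it recovers the word from its position in the sequence, which is how a referent outside the speech application's vocabulary could still be flagged.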

CODA
The next step is the validation of SPA as a necessary tool for performing wiretap analysis.
Research Question: Do mining programs have a higher rate of accuracy in spotting terrorists when adding Sequence Package Analysis as a new method of natural language intelligence for performing wiretap analysis?