Semantic Search An Active Approach to Searching. Why Semantic Search? A better question is – why not? We are interested in what the document means, not.

Slides:



Advertisements
Similar presentations
Programming with Alice Computing Institute for K-12 Teachers Summer 2011 Workshop.
Advertisements

A Medical Note Automated Parsing And Semantic Extraction.
Stacks & Their Applications COP Stacks  A stack is a data structure that stores information arranged like a stack.  We have seen stacks before.
Paragraph & Sentence Construction. Paragraph Beginnings/Middles/Endings Begin with the topic (topic sentence). End by emphasizing the topic or a consequence.
Documenting Information Systems
Complex Client Interactions using Semantic Search.
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Cognitive Biases 2 Incomplete and Unrepresentative Data.
From Words to Knowledge ORION Active Structure. ORION Active Structure Two Approaches We could separate the process of turning words into knowledge into.
Copyright 2006 Interactive Engineering Pty Ltd ORION General Legal The Sarbanes Oxley Act Turning legal text into active structure.
Modeling Entry Task, Reading Notes, and Reading Strategies for Lyddie
Microsoft ® Office Excel ® 2007 Training Get started with PivotTable ® reports [Your company name] presents:
Searching The Web Search Engines are computer programs (variously called robots, crawlers, spiders, worms) that automatically visit Web sites and, starting.
Microsoft ® Office Excel ® 2007 Training Get started with PivotTable ® reports Guangzhou Newelink Technology Co,. Ltd.
Writing tips Based on Michael Kremer’s “Checklist”,
Assignment 1 Pointers ● Be sure to use all tags properly – Don't use a tag for something it wasn't designed for – Ex. Do not use heading tags... for regular.
Basic Scientific Writing in English Lecture 3 Professor Ralph Kirby Faculty of Life Sciences Extension 7323 Room B322.
Preparing a Presentation
Gmail Tutorial This tutorial aims to quickly cover some of the basic elements of web based using Gmail - a free service Use the Index on the.
1 Topic Mathematical Proofs. 2 Topic Mathematical Proofs California Standards: 24.2 Students identify the hypothesis and conclusion in logical.
HBar OR Reader Documentation A copy of the PowerPoint Viewer is shipped with the HBar OR Reader on the HBar Official Records [OR] CD. The PowerPoint Viewer.
The California Writing Exam Grades 4 and 7
1.1 1 Introduction Foundations of Computer Science  Cengage Learning.
Array.
Semantic Internet Searching Using Active Structure Interactive Engineering Hit and miss is not the way to do it.
Confirmation Bias. Critical Thinking Among our critical thinking questions were: Does the evidence really support the claim? Is there other evidence that.
FILL- IN-THE-BLANK Majority related to terminology STRATEGIES Study from back of flash cards Use immediate response, delayed, assisted Use synonym.
Process of Science The Scientific Method.
§ 4.3 Equations and Inequalities Involving Absolute Value.
The Concept Convert free text into a structure that captures objects, relations, existential, propositional and temporal logic – the whole thing, no excuses.
Notes from Instructor Regarding Term Papers. Howard Godfrey UNC Charlotte Fall, 2007.
Paul Mundy Editing step by step How an expert does it.
The Use of Active Structure in Biotechnology. We won’t bother to tell you how hard, how complex, how dynamic, is knowledge in the field of biotechnology.
Copyrighted material John Tullis 10/17/2015 page 1 04/15/00 XML Part 3 John Tullis DePaul Instructor
© Copyright 2011 John Wiley & Sons, Inc.
The World Wide Web is a great place to find more information about a topic. But there are a lot of sites out there—some are good and some are not so good.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
Lecture 5: Writing the Project Documentation Part III.
M1G Introduction to Database Development 5. Doing more with queries.
XP Chapter 3 Succeeding in Business with Microsoft Office Access 2003: A Problem-Solving Approach 1 Analyzing Data For Effective Decision Making Chapter.
The World Wide Web is a great place to find more information about a topic. But there are a lot of sites out there—some are good and some are not so good.
Entity Theories of Meaning. Meaning Talk Theory should make sense of meaning talk Theory should make sense of meaning talk What sorts of things do we.
Structure of IR Systems INST 734 Module 1 Doug Oard.
Grade Book Database Presentation Jeanne Winstead CINS 137.
Facebook Ad’s Reloaded Keep it Organized. To use Facebook Ad’s effectively in today’s market you must have Funnel Over the last week we have covered the.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Software Engineering Requirements + Specifications.
{ What is a Number? Philosophy of Mathematics.  In philosophy and maths we like our definitions to give necessary and sufficient conditions.  This means.
Structured Text Retrieval Models. Str. Text Retrieval Text Retrieval retrieves documents based on index terms. Observation: Documents have implicit structure.
COLD READING UNIT. WHAT DO YOU THINK ABOUT WHEN YOU HEAR “COLD READING?”
8.1 8 Algorithms Foundations of Computer Science  Cengage Learning.
Why IR test collections are so bad Mark Sanderson University of Sheffield.
+ Publishing Your First Post USING WORDPRESS. + A CMS (content management system) is an application that allows you to publish, edit, modify, organize,
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
CS Introduction to XML - What is XML? Extensible Markup Language, or XML for short, is a new technology for web applications. XML is a World Wide.
EXAMINERS’ COMMENTS RAPHAEL’S LONG TURN GRAMMAR Accurate use of simple grammatical structures and also of some complex sentences: ‘they could also be preparing.
Database Planning Database Design Normalization.
Powerpoint Templates Data Communication Muhammad Waseem Iqbal Lecture # 07 Spring-2016.
Session 5: How Search Engines Work. Focusing Questions How do search engines work? Is one search engine better than another?
Complex Client Interactions using Semantic Search
HCT: The Library Catalogue
Using Functions
Information Retrieval
Software Development Techniques
Invitation to Computer Science 5th Edition
Presentation transcript:

Semantic Search An Active Approach to Searching

Why Semantic Search? A better question is – why not? We are interested in what the document means, not whether it contains some key words. There are many ways of saying the same thing, so searching using key words is unreliable – it is more reliable to search for a meaning, so the searching is not confounded by negations or multiple topics, or the inability to say exactly what you require. We have tolerated crummy methods for a long time, because the computer resources were inadequate to the task – that has changed.

Why Structure Search? The search requirement is turned into a structure – this structure realises all the semantics – the relational, existential, propositional, locational and temporal logic connecting objects - allowing complex searching at many levels simultaneously

Why Not Static Search 1 Let’s say you want to find something on tsunami wave theory, but not Airy waves (wind waves) You could search for “tsunami, theory, NOT Airy” But then you would miss “We looked at John’s work on tsunamis, but rejected it because of its use of Airy wave theory – instead we developed our own wave theory” Static search makes several very strong assumptions – the document is short, it is only directed to the topic at hand, and it doesn’t use a lot of NOTs. In other words, it is extremely dumb. If a few hits are expected, and the person is willing to sort among them, fine. If the hits run into the tens of thousands, and it is tedious to sort among them, not fine. Worse, you have to figure out whether your search strategy will exclude the very documents you are searching for.

Why Not Static Search 2 Today it is Enron, and we want to find out if any information was omitted from reports to FERC. We search for “report omission FERC” A document says “We prepared the report for FERC. We also prepared a report to the board, leaving out some information, as it had been covered in previous board meetings.” This seems to have the basic elements – “report”, “FERC”, and “leaving out” (a synonym for omission), but it clearly doesn’t qualify. “We followed John’s suggestion in his of 15 th inst in preparing the report” Someone or something has to read it (and potentially other documents) to get the sense of what it is saying – its semantics.

Freedom of Expression Semantic search allows you to say exactly what you want – if some results come back that aren’t what you want, you change how you express your wishes. You don’t need to worry about knowing the exact way something is described in the document – the meaning is searched for, not the words. This can be done because a structure to represent the meaning of the document has already been built – the structure was built to allow any sort of search on the document because the structure is the exact structural analogue of the meaning expressed in free text. The search structure is matched against the document structure – the matching becomes active where that is necessary – the generation of multiple possibilities and their pruning.

We Get from Words to Structure Words are turned into objects on a parse chain – the parse structure is built on the chain

More Parsing The tumour extends to the visceral pleura but unequivocal pleural invasion is not seen in the routinely stained sections.

Definitions of Words The structural definition is limited to only cancer. There are two meanings – the process of spreading or the resultant tumor. (the FROMTO is used for all movement relations)

Building Semantic Structure Head of sentence – True state from discourse Connection to parse chain Meaning flows down

Searching Sets propagate through the structure – the structure “works” A search structure is created using free text, and it is then matched to the structure built from the document. These aren’t two different structures – the query structure is anchored in the structure built from the text to be searched, so common points are found during building of the query, and defined terms are used.

Structural Synonyms John omitted the data from the report The report omitted the data John didn’t include the data in the report John left the data out of the report John forgot to put the data in the report John failed to add the data to the report We were instructed not to insert the data John excluded the data from the report The report did not contain the data The contents of the report were incomplete

No Real Limit There is no real limit on the extent of Semantic Search. Currently the queries are separate entities, but they can easily be made to create objects or lists which are used in additional queries - the queries become part of an interactive dialogue with the machine, with both the machine and the user changing their internal states and connections. Interactive Engineering Pty Ltd Australia Active Structure Inc. USA