Language Recognition… Searching with Precision Santa Clara, CA October 31, 2001 Julian Henkin Vice President, Worldwide Customer Services LexiQuest, Inc.

Slides:



Advertisements
Similar presentations
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Advertisements

Semantics Rule, Keywords Drool J. Brooke Aker CEO Expert System USA February 2010.
Laurie E Damianos, MITRE September 2008 Approved for Public Release; Distribution Unlimited. MITRE Case # ©2008 The MITRE Corporation. All rights.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
1.Data categorization 2.Information 3.Knowledge 4.Wisdom 5.Social understanding Which of the following requires a firm to expend resources to organize.
1 Knowledge Management Session 4. 2 Objectives 1.What is knowledge management? Why do businesses today need knowledge management programs and systems.
Managing Data Resources
Taxonomies in Electronic Records Management Systems May 21, 2002.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
Mastering the Internet, XHTML, and JavaScript Chapter 7 Searching the Internet.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Automating Keyphrase Extraction with Multi-Objective Genetic Algorithms (MOGA) Jia-Long Wu Alice M. Agogino Berkeley Expert System Laboratory U.C. Berkeley.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Driving Productivity with Microsoft Dynamics CRM Presenter Name Presenter Title Presenter Date.
Chapter 14 The Second Component: The Database.
The Wharton School of the University of Pennsylvania OPIM 101 2/16/19981 The Information Retrieval Problem n The IR problem is very hard n Why? Many reasons,
University of Kansas Data Discovery on the Information Highway Susan Gauch University of Kansas.
Libraries and Institutional Content Management Systems
IBE312: Ch15 Building an IA Team & Ch16 Tools & Software 2013.
Yuri de Lugt Collexis Karin Clavel TU Delft Library.
ROI & Impact: Quantitative & Qualitative Measures for Taxonomies Wednesday, 11 February :00 – 12:30 PM MST Presented by Jay Ven Eman, Ph.D., CEO.
Information Literacy, Search Strategies & Catalog Instruction Frederic Murray Assistant Professor MLIS, University of British Columbia BA, Political Science,
1 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
© Paradigm Publishing Inc. 9-1 Chapter 9 Database and Information Management.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Human Resource Management Lecture 27 MGT 350. Last Lecture What is change. why do we require change. You have to be comfortable with the change before.
Knowledge Management By Dr.S.Sridhar,Ph.D., RACI(Paris),RZFM(Germany),RMR(USA),RIEEEProc. web-site :
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
LIS618 lecture 1 Thomas Krichel economic rational for traditional model In olden days the cost of telecommunication was high. database use.
© Paradigm Publishing Inc. 9-1 Chapter 9 Database and Information Management.
Stuart Card PARC (since ’74) Area Manager of the User Interface Research Center Ph.D. in Psychology from Carnegie Mellon Co-authored “The Psychology of.
Knowledge Management with a Purpose Velocity & Variability aka Lean Six Sigma Gary Jones Dir, Service Operations.
The PATENTSCOPE search system: CLIR February 2013 Sandrine Ammann Marketing & Communications Officer.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
C11- Managing Knowledge.
Never-ending Search: (What you REALLY need to know about online searching) Ms. Emili school year.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
Chapter 6: Information Retrieval and Web Search
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
ITGS Databases.
ECM and Shared Services Overview AITR Meeting April 23, 2009.
©2003 Paula Matuszek CSC 9010: Text Mining Applications Dr. Paula Matuszek (610)
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Information Retrieval
CHAPTER 11 MANAGING KNOWLEDGE
Advancing Science: OSTI’s Current and Future Search Strategies Jeff Given IT Operations Manager Computer Protection Program Manager Office of Scientific.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Information Literacy, Search Strategies & Catalog Instruction Frederic Murray Assistant Professor MLIS, University of British Columbia BA, Political Science,
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
Foundations of Information Systems in Business
Smart Web Search Agents Data Search Engines >> Information Search Agents - Traditional searching on the Web is done using one of the following three: -
CHAPTER ELEVEN MANAGING KNOWLEGE. Objectives We have been through some of this already Role of knowledge management Types of knowledge management systems.
1 2. Knowledge Management. 2  Structuring of knowledge enables effective and efficient problem solving dynamic learning strategic planning decision making.
Text Analytics World San Francisco – March 31, :15-4:45pm Speaker: Bryan Bell, Executive Vice President, Expert System USA  What is in Your Business.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Organization and Knowledge Management
User Interface HEP Summit, DESY, May 2008
Federated & Meta Search
Taxonomies, Lexicons and Organizing Knowledge
Information Retrieval
MANAGING KNOWLEDGE FOR THE DIGITAL FIRM
CSE 635 Multimedia Information Retrieval
Web Mining Department of Computer Science and Engg.
Introduction to Information Retrieval
AGMLAB Information Technologies
PolyAnalyst™ text mining tool Allstate Insurance example
Presentation transcript:

Language Recognition… Searching with Precision Santa Clara, CA October 31, 2001 Julian Henkin Vice President, Worldwide Customer Services LexiQuest, Inc. Booth # 523

Topics for Discussion  Critical Nature of Search  Importance of Linguistics  Language Recognition  Case Studies

Critical Nature of Search  “At least one-third of your visitors are going to use the search function as soon as they enter your site.” - Improving Your Site’s Search, The Information Standard, August 11, 2000  “On average, professional users spend 11 hours per week looking for information. 71% said they could not find what they were looking for.” - “Information Management Software,” Lazard Freres & Co. LLC, February 2001  “Ultimately, the return on investment (ROI) of corporate information systems cannot be solely derived from the cost of building populating and maintaining these systems. True ROI also reflects the ability of all classes of users to effectively use the information.” - “Looking for a Lifesaver?”, KM Magazine, August 1999

Challenges with Today’s Search  Traditional and advanced methods (key word, Boolean searches, statistical and probability algorithms, concept agents, neural networks and pattern recognition) are limited in their ability to retrieve accurate results:  Not intuitive for typical user so full breadth of capability is rarely utilized  Do not provide any level of “understanding” of the text or of the concepts represented by the queries.  Search is based solely or largely on the comparison of the character strings in both queries and text.  Results often include a lot of “noise” (irrelevant results) and “silence” (accurate results are not found).  What if you don’t know what you are looking for?  What if you don’t know what you are looking for?

Importance of Linguistics  Linguistic-based systems are knowledge-sensitive: the more information there is in their “dictionaries”, the better the quality :  Natural Language interface is very intuitive for users, lets the system do the work  Up to a 400% improvement in performance over traditional search engines (greater relevance, and precision)  Can deliver multilingual and cross-lingual access

How Does Language Recognition Work? CONCEPTS Organizes concepts regardless of their language i.e., Table (Fr), Table (Eng), Mesa (Sp), Tavola (It) SEMANTIC Understands the meanings of words i.e., book=to register for a future activity vs. book= set of bound sheets of paper SYNTAX Understands a sentence’s or phrase’s structure and the “roles” of words i.e., subjects, verbs, objects; “to book” vs. “a book” MORPHOLOGY Word structure. Recognizes words (simple and compound) i.e., “to buy”, “bought” The Ladder of Language

1. Personalization (Sharing) 2. Codification (Capture, Structured Storage) 3. Discovery (Search, Retrieval) 4. Creation Innovation LexiQuest Mine LexiQuest Categorize LexiQuest Guide LexiQuest Respond 5. Capture Monitor “Knowledge Management is the collection of processes that govern the creation, dissemination, and utilization of knowledge.” “Knowledge is one, if not THE, principal factor that makes personal, organizational, and societal intelligent behavior possible.” “Organizations that have adopted this position (Chief Knowledge Officer) include Hoffman-LaRoche, GE Lighting, Xerox PARC, and several consultancies, including Ernst &Young, Gemini, and McKinsey” Five KM Activities

Enterprise Document Databases, Web sites or Repositories Domain 1 Limited amount of content Domain 3 Significant amount and depth of content Users who browse via a directory structure/taxonomy. Many Search Engines now leverage a taxonomy: improved accuracy Users who know what they are looking for and prefer using a search engine. Users who collectively ask the same narrow set of questions over and over again LexiQuest Mine LexiQuest Respond LexiQuest Categorize LexiQuest Guide Users who don’t know what they are looking for and need concepts illuminated. (Research) Domain 2 Limited amount of content Suite of Capabilities

User Experience “Who are the main ISP’s in the Far East?” Linguistic Analysis Accurate Results: Taiwanese Access Service Provider

Mine: A Research Tool Electronic Commerce NEAR Fraud

Mine’s Native Search 89 Documents

Guide’s Linguistic Expansion “consumers’ fraud protection online” – 21 documents “Swindle” returns this relevant document

Pharmaceutical Example “Cipro” expands to ciprofloxacin hydrochloride

Pharmaceutical Example “What antibiotic treats anthrax” Antibiotic expansions include ciprofloxicin, Cipro, ciprofloxicin hydrochloride

Quantitative Results 60% 50% 40% 30% 20% 10% 0% 400% more Accurate than Current Solutions number of answers retrieved Custom LexiQuest Guide Search Engine % of correct answers of all answers retrieved  Ensures all relevant information is retrieved  Reduces “noise” from irrelevant results

Julian Henkin Vice President, Worldwide Customer Services 641 Lexington Ave, 30 th Floor New York, NY x19 Booth # 523