Text Analytics Workshop: Introduction

Slides:



Advertisements
Similar presentations
Taxonomy Development An Infrastructure Model Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Advertisements

Top Tips Enterprise Content Management Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Metadata Strategies Alternatives for creating value from metadata Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Improving Navigation and Findability Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Beyond Sentiment New Dimensions for Social Media A Panel Discussion of Trends and Ideas Dave Hills, Twelvefold Media Mike Lazarus, Atigeo, LLC Moderator:
Copyright © 2012, SAS Institute Inc. All rights reserved. #analytics2012 Quick Start for Text Analytics Tom Reamy Chief Knowledge Architect KAPS Group.
Enterprise Information Architecture A Platform for Integrating Your Organization’s Information and Knowledge Activities Tom Reamy Chief Knowledge Architect.
Faceted Navigation: Search and Browse Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Taxonomy Development Case Studies
Innovation in Search? Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Model of Taxonomy Development Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Knowledge Architecture Process & Case Studies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Semantic Infrastructure Workshop Development Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Semantic Infrastructure Workshop Development Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Taxonomy Boot Camp Panel Text Analytics Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Improving Search for Discovery Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture Professional.
Automatic Facets: Faceted Navigation and Entity Extraction Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Copyright © 2011, SAS Institute Inc. All rights reserved. #analytics2011 Text Analytics Evaluation A Case Study: Amdocs Tom Reamy Chief Knowledge Architect.
Beyond Sentiment Mining Social Media A Panel Discussion of Trends and Ideas Marie Wallace, IBM Marcello Pellacani, Expert System Fabio Lazzarini, CRIBIS.
Enterprise Semantic Infrastructure Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Facets and Faceted Navigation Development Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Expanding Enterprise Roles for Librarians Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics Workshop Development Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Best of Both Worlds Text Analytics and Text Mining Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Selecting Taxonomy Software Who, Why, How Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Taxonomy and Knowledge Organization Taxonomy in Context Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Building a Foundation for Info Apps Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture Professional.
Enterprise Search/ Text Analytics Evaluation Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics And Text Mining Best of Text and Data
Best of All Worlds Text Analytics and Text Mining and Taxonomy Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
New Directions in Social Media Tom Reamy Chief Knowledge Architect KAPS Group
SemTech Text Analytics Evaluation Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group
Smart Text How to Turn Big Text into Big Data Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World.
Integrating an Enterprise Taxonomy with Local Variations Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge.
Applying Semantics to Search Text Analytics Tom Reamy Chief Knowledge Architect KAPS Group Enterprise Search Summit New York.
Taxonomy and Social Media Social Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture.
Content Categorization Tools Taxonomies & Technologies for Infrastructure Solutions Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture.
Text Analytics Summit Text Analytics Evaluation Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics Software Choosing the Right Fit Tom Reamy Chief Knowledge Architect KAPS Group Text Analytics World October 20.
New Directions in Social Media Tom Reamy Chief Knowledge Architect KAPS Group
Metadata and Taxonomies The Best of Both Worlds Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Integrating an Enterprise Taxonomy with Local Variations Tom Reamy Chief Knowledge Architect KAPS Group Taxonomy Boot Camp.
Text Analytics Mini-Workshop Quick Start Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture Professional.
Enterprise Semantic Infrastructure Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Folksonomy Folktales Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Selecting Taxonomy Software Who, Why, How Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Text Analytics for Search Applications Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics A Tool for Taxonomy Development Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture.
Text Analytics Workshop Applications Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics Workshop Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture Professional Services.
Taxonomy and Text Analytics Case Studies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Taxonomy Development An Infrastructure Model Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Deep Text New Approaches in Text Analytics and Knowledge Organization Tom Reamy Chief Knowledge Architect KAPS Group Author: Deep.
Text Analytics World Future Directions of Text Analytics: Smarter, Bigger, and Better Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text.
Text Analytics Webinar
Tom Reamy Chief Knowledge Architect KAPS Group
Text Analytics Tutorial
Deep Text Social Media Analysis A Text Analytics Foundation
Tom Reamy Chief Knowledge Architect KAPS Group
Combining Taxonomy, Ontology, Text, and Data A Deep Text Approach
Enterprise Social Networks A New Semantic Foundation
Program Chair: Tom Reamy Chief Knowledge Architect
Text Analytics Workshop
Using Text Analytics to Spot Fake News
Text Analytics Workshop
Program Chair: Tom Reamy Chief Knowledge Architect
Expertise Location Basic Level Categories
Presentation transcript:

Text Analytics Workshop: Introduction Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture Professional Services http://www.kapsgroup.com

Agenda Getting Started with Text Analytics 11:00-12:30 Introduction – State of Text Analytics 9:00-10:30 Break – 10:30-11:00 Getting Started with Text Analytics 11:00-12:30 Lunch – 12:30-1:15 Development – 1:15 – 2:30 Break – 2:30 - 3:00 Text Analytics Applications 3:00-4:15 Questions / Discussions 4:15 – 4:30

Introduction: KAPS Group Knowledge Architecture Professional Services – Network of Consultants Applied Theory – Faceted & emotion taxonomies, natural categories Services: Strategy – IM & KM - Text Analytics, Social Media, Integration Taxonomy/Text Analytics, Social Media development, consulting Text Analytics Quick Start – Audit, Evaluation, Pilot Partners – Smart Logic, Expert Systems, SAS, SAP, IBM, FAST, Concept Searching, Attensity, Clarabridge, Lexalytics Clients: Genentech, Novartis, Northwestern Mutual Life, Financial Times, Hyatt, Home Depot, Harvard Business Library, British Parliament, Battelle, Amdocs, FDA, GAO, World Bank, Dept. of Transportation, etc. Program Chair – Text Analytics World – March 29-April 1 - SF Presentations, Articles, White Papers – www.kapsgroup.com Current – Book – Text Analytics: How to Conquer Information Overload and Get Real Value from Social Media

Introduction: Elements of Text Analytics Text Mining – NLP, statistical, predictive, machine learning Different skills, mind set, Math not language Semantic Technology – ontology, fact extraction Extraction – entities – known and unknown, concepts, events Catalogs with variants, rule based Sentiment Analysis Objects and phrases – statistics & rules – Positive and Negative Summarization Dynamic – based on a search query term Generic – based on primary topics, position in document

Introduction: Elements of Text Analytics Auto-categorization Training sets – Bayesian, Vector space Terms – literal strings, stemming, dictionary of related terms Rules – simple – position in text (Title, body, url) Semantic Network – Predefined relationships, sets of rules Boolean– Full search syntax – AND, OR, NOT Advanced – DIST(#), ORDDIST#, PARAGRAPH, SENTENCE Platform for multiple features – Sentiment, Extraction Disambiguation - Identification of objects, events, context Distinguish Major-Minor mentions Model more subtle sentiment

Case Study – Categorization & Sentiment

Case Study – Categorization & Sentiment

Case Study – Taxonomy Development

Text Analytics Workshop Introduction: Text Analytics History – academic research, focus on NLP Inxight –out of Zerox Parc Moved TA from academic and NLP to auto-categorization, entity extraction, and Search-Meta Data Explosion of companies – many based on Inxight extraction with some analytical-visualization front ends Half from 2008 are gone - Lucky ones got bought Initial Focus on enterprise text analytics Shift to sentiment analysis - easier to do, obvious pay off (customers, not employees) Backlash – Real business value? Current – Multiple Applications Text Analytics is growing – time for a jump?

Text Analytics Workshop Current State of Text Analytics Current Market: 2013 – exceed $1 Bil for text analytics (10% of total Analytics) Growing 20% a year Search is 33% of total market Other major areas: Sentiment and Social Media Analysis, Customer Intelligence Business Intelligence, Range of text based applications Fragmented market place – full platform, low level, specialty Embedded in content management, search, No clear leader.

Interviews with Leading Vendors, Analysts: Current Trends From Mundane to Advanced – reducing manual labor to “Cognitive Computing” Enterprise – Shift from Information to Business – cost cutting rather than productivity gains Integration – data and text, text analytics and analytics Social Media – explosion of wild text, combine with data – customer browsing behavior, web analytics Big Data – more focus on extraction (where it began) but categorization adds depth and sophistication Shift away from IT – compliance, legal, advertising, CRM US market different than Europe/Asia – project oriented

Text Analytics Workshop Current State of Text Analytics: Vendor Space Taxonomy Management – SchemaLogic, Pool Party From Taxonomy to Text Analytics Data Harmony, Multi-Tes Extraction and Analytics Linguamatics (Pharma), Temis, whole range of companies Business Intelligence – Clear Forest, Inxight Sentiment Analysis – Attensity, Lexalytics, Clarabridge Open Source – GATE Stand alone text analytics platforms – IBM, SAS, SAP, Smart Logic, Expert System, Basis, Open Text, Megaputer, Temis, Concept Searching Embedded in Content Management, Search Autonomy, FAST, Endeca, Exalead, etc.

Text Analytics Workshop Future Directions: Survey Results Important Areas: Predictive Analytics & text mining – 90% Search & Search-based Apps – 86% Business Intelligence – 84% Voice of the Customer – 82%, Social Media – 75% Decision Support, KM – 81% Big Data- other – 70%, Finance – 61% Call Center, Tech Support – 63% Risk, Compliance, Governance – 61% Security, Fraud Detection-54%

Future of Text Analytics Obstacles - Survey Results What factors are holding back adoption of TA? Lack of clarity about TA and business value - 47% Lack of senior management buy-in - 8.5% Need articulated strategic vision and immediate practical win Issue – TA is strategic, US wants short term projects Sneak Project in, then build infrastructure – difficulty of speaking enterprise Integration Issue – who owns infrastructure? IT, Library, ? IT understands infrastructure, but not text Need interdisciplinary collaboration – Stanford is offering English-Computer Science Degree – close, but really need a library-computer science degree

Future of Text Analytics Primary Obstacle: Complexity Usability of software is one element More important is difficulty of conceptual-document models Language is easy to learn , hard to understand and model Need to add more intelligence (semantic networks) and ways for the system to learn – social feedback Customization – Text Analytics– heavily context dependent Content, Questions, Taxonomy-Ontology Level of specificity – Telecommunications Specialized vocabularies, acronyms

Text Analytics Workshop Benefits of Text Analytics Why Text Analytics? Enterprise search has failed to live up to its potential Enterprise Content management has failed to live up to its potential Taxonomy has failed to live up to its potential Adding metadata, especially keywords has not worked Social Media/Sentiment is superficial (too much bandwagon) What is missing? Intelligence – human level categorization, conceptualization Infrastructure – Integrated solutions not technology, software Text Analytics can be the foundation that (finally) drives success – search, content management, Social Media, and much more

Text Analytics Workshop Costs and Benefits IDC study – quantify cost of bad search Three areas: Time spent searching Recreation of documents Bad decisions / poor quality work Costs 50% search time is bad search = $2,500 year per person Recreation of documents = $5,000 year per person Bad quality (harder) = $15,000 year per person Per 1,000 people = $ 22.5 million a year 30% improvement = $6.75 million a year Add own stories – especially cost of bad information Search as Platform for Applications – Direct profits & lower costs

Text Analytics Workshop Costs and Benefits Social Media

Questions? Tom Reamy tomr@kapsgroup.com KAPS Group Knowledge Architecture Professional Services http://www.kapsgroup.com