A presentation by W H Inmon BRIDGING THE GAP BETWEEN UNSTRUCTURED DATA AND STRUCTURED DATA.

Slides:



Advertisements
Similar presentations
Wincite Knowledge Warehousing and Networking Sophisticated Simplicity.
Advertisements

Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
MediTract Contract Management Software
Chapter 1 Business Driven Technology
© 2007 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice HP TRIM HP Information Management.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Business Intelligence Accurate Information, Accurate Decisions June 2012 Presented by: Scott Lea Government Services Infogroup Government Division.
ICS (072)Database Systems: A Review1 Database Systems: A Review Dr. Muhammad Shafique.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
DECISION SUPPORT SYSTEMS AND BUSINESS INTELLIGENCE
MIKE2.0 Methodology Presentation to Wiki Wednesday community, London 6 June 2007
Enterprise Business Processes and Reporting (IS 6214) MBS MIMAS 20 th Jan 2010 Fergal Carton Business Information Systems.
Business Intelligence Andrew Davis Andria Zippler Jana Krinsky Tiffany Ferris.
Enterprise Resource Planning ERP Systems
Business Intelligence Business intelligence (BI) refers to all of the applications and technologies used to, provide access to, and information to efforts.
Libraries and Institutional Content Management Systems
Module 1: Overview of Information System in Organizations Chapter 2: How Organizations use IS.
CIT 858: Data Mining and Data Warehousing Course Instructor: Bajuna Salehe Web:
Effective Communications for Success Phillip Rosebrook JR, CR.
The Benefits of ISO 9001…. Copyright ©2008 The 9000 Store.
Chapter 2: Business Intelligence Capabilities
Governance, Risk, and Compliance Bill Greene Senior Industry Director.
Actionable Intelligence via Speech Analytics
Customer Service and Team Building Mrs. Flowers Finance & Business Technology.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Almaden Services Research © 2008 IBM Corporation Intellectual Property Analytics Turning Unstructured Information Into Value Jeffrey T. Kreulen, Ph.D.
AdWords Instructor: Dawn Rauscher. Quality Score in Action 0a2PVhPQhttp:// 0a2PVhPQ.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Managing Data Resources
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
Pricing DiscriminationDynamicPersonalization Real Time September 2013.
Customer Service and Team Building Mrs. Flowers Finance & Business Technology.
What You Need before You Deploy Master Data Management Presented by Malcolm Chisholm Ph.D. Telephone – Fax
Case Study – Venture Portfolio Tracking and Competitive Intelligence Sam Knox - Director of Analyst Services Christopher Cho - Consulting Analyst.
A presentation by W H Inmon TEXTUAL ETL – OPENING UP NEW WORLDS OF OPPORTUNITY.
Presentation Path  Introduction to Ved Consultancy and OpenText  Current Challenges  The Valued Customers and Sectors  Our Solutions  Demo. Together,
Office Business Applications Unlocking the Business Value of IT Gurprit Singh Director, Emerging Technologies Microsoft Corporation.
© 2007 by Prentice Hall 1 Introduction to databases.
CALENDAR MANAGEMENT Calendar Management makes sharing calendars with teammates easy. You can divide calendars into sub-calendars (e.g., speaking engagements,
Enterprise Resource Planning ERP Systems
Succeeding with Technology Database Systems Basic Data Management Concepts Organizing Data in a Database Database Management Systems Using Database Systems.
A Talkument Overview. Why Record Business Calls? Public safety Financial services firms Consumer telesales Public utilities Compliance Quality Management.
ICS (072)Database Systems: An Introduction & Review 1 ICS 424 Advanced Database Systems Dr. Muhammad Shafique.
Technology In Action Chapter 11 1 Databases and… Databases and their uses Database components Types of databases Database management systems Relational.
INFO1408 Database Design Concepts Week 15: Introduction to Database Management Systems.
A presentation by W H Inmon ANALYZING CALL CENTER TEXT.
© 2007 IBM Corporation IBM Information Management Accelerate information on demand with dynamic warehousing April 2007.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
1Erdal Nebol PART 3 CUSTOMER ACCOMMODATION & MARKET DISTRIBUTION.
MICROSOFT SEMANTIC ENGINE Unified Search, Discovery and Insight.
A presentation by W H Inmon ANALYZING CALL CENTER TEXT – VERIZON.
Project Management May 30th, Team Members Name Project Role Gint of Communications Sai
Chapter 6 Database Management and Business Intelligence Introduction to Business Information Systems by James Norrie, Mark Huber, Craig Piercy, And Patrick.
NSU Website Structure By: Debbie Jones, NSU Webmaster 1 NSU Web Services Publication - Author: NSU Webmaster Norfolk State University.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
11 Database The ultimate in data organization. 2 Database Management Systems (DBMS)  Application software designed to capture and analyze data  Four.
OLAP Theory-English version On-Line Analytical processing (Buisness Intelligence) Ing.Skorkovský,CSc Department of Corporate Economy Faculty of Economics.
Module 1: Overview of Information System in Organizations
Accounting Information Systems: An Overview
ANALYZING REAL ESTATE TRANSACTIONS
Future Salon Palo Alto, CA January 16, 2004 Dirk Wenzel, VP, InfoTame
ANALYZING CALL CENTER TEXT
Governance, Risk, and Compliance Bill Greene Senior Industry Director
Creating New Business Value with Big Data
Business Drivers and Requirements
PolyAnalyst Data and Text Mining tool
Data Analysis.
How businesses use information systems (Part 2)
PolyAnalyst™ text mining tool Allstate Insurance example
Presentation transcript:

A presentation by W H Inmon BRIDGING THE GAP BETWEEN UNSTRUCTURED DATA AND STRUCTURED DATA

- unstructured data -.doc files -.txt files -.xls files - - transcripted telephone The informal systems of the corporation: .Txt.Doc - structured systems - structured data - corporate transactions - corporate reports - corporate databases -customer files - audit reports The formal systems of a corporation: Program

It is estimated that less than 20% of corporate systems are structured. 80 % .Txt.Doc 20% Program

.Txt.Doc search engines legal discovery archive taxonomy ontology document mgmt web content Program dbms business intelligence applications transactions OLTP ERP compliance imagine what would happen if the two worlds could be integrated……. the world of dbms, analytics, and other processing opens up.

.Txt.Doc search engines legal discovery archive taxonomy ontology document mgmt web content Program dbms business intelligence applications transactions OLTP ERP compliance .Txt.Doc tight integration between the two types of data.

There is a gulf between the two worlds: - technology - business practice - organizational - historical .Txt.Doc Program

Think of the possibilities! .Txt.Doc Program

Imagine this - Reports and visualization show a lot. have you ever wondered why you can’t hook up your Business Objects to ? or telephone conversations?

.Txt.Doc text numbers There is a fundamental disconnect between unstructured data and business intelligence. So what would happen if we had powerful visualization for text? Business Intelligence

liver cancer skin cancer thirst diabetes blood pressure correlative information becomes very easy to spot

for the general population for women for women who smoke over the age to 50 doing analysis on sub populations of women

for the general population for women who smoke over the age to 50 the contrast between the different correlations of different populations leads to great insight

service delivery late broken installation salesman attitude wait too long did not fit what about looking at customer feedback – complaints? now you can see the broader picture of what is happening

but there are plenty of other places where the technology applies – - manufacturing warranties – (what patterns of defects are there?) - Weblogs (marketing – who is saying what?) - customer complaints – (what are the problem products?) - general – (What’s the buzz? what is on people’s minds?) - insurance claims (what are the circumstances of accidents?)

.Txt.Doc another possibility is the monitoring of and the transport of to the structured environment

Monitoring s and other corporate conversations - .Txt.Doc Sarbanes Oxley HIPAA BASEL II compliance – making sure that is being used properly - compliance - corporate standard for language

Jan 3 - vp to vp “This is going to be a real barn burner of a quarter….” Jan 5 – finance to vp “It looks like we are going to do $9,000,000 this quarter…” Jan 5 – president to analyst “This quarter looks like we are going to break new records…” Feb 1 – employee to employee “Did you see the stock market? Everything is going down…” Feb 3 – president to vp “What is happening to sales in the midwest? We didn’t expect this…” Feb 4 – sales manager to vp Feb 3 – vp to vp “The sales cycle looks like it is extending. The economy is tanking…” “It looks like we are going to be a little short this quarter…” Feb 6 – president to vp “What are we going to do to get sales up? Do we need to do some discounting?” Mar 2 – sales person to vp “Demand has dried up. We aren’t going to close as many sales this quarter as we thought…” A bunch of s and conversations: What do you do with them?

Jan 3 - vp to vp “This is going to be a real barn burner of a quarter….” Jan 5 – finance to vp “It looks like we are going to do $9,000,000 this quarter…” Jan 5 – president to analyst “This quarter looks like we are going to break new records…” Feb 1 – employee to employee “Did you see the stock market? Everything is going down…” Feb 3 – president to vp “What is happening to sales in the midwest? We didn’t expect this…” Feb 4 – sales manager to vp Feb 3 – vp to vp “The sales cycle looks like it is extending. The economy is tanking…” “It looks like we are going to be a little short this quarter…” Feb 6 – president to vp “What are we going to do to get sales up? Do we need to do some discounting?” Mar 2 – sales person to vp “Demand has dried up. We aren’t going to close as many sales this quarter as we thought…” Examining s (“combing” them) for important corporate information: Sarbanes Oxley quarter stock sales discount demand sales cycle external categories

sales – Feb 2 – Mar 5 phone – Mar 8 ……………… quarter – Jan 2 – Jan 4 – Feb 5 ……………… discount phone conversation – Jan 6 – Jan 12 – Jan 14 ………………………….. sales cycle – Feb 24 phone conversation – Mar 14 meeting notes – Mar 18 ……………………………. Structured Environment The “combed” information is brought over to the structured environment. Now you can use standard tools, such as Cognos, Business Objects, Crystal Reports, MicroStrategy to do analysis.

customer data probabilistic match s and telephone conversations can be linked to CDI/CRM data. But there are other ways that communications can be used

A true 360 degree view of the customer can be formed. “I placed an order last week and when it arrived it was the wrong size. And then your company would not take it back. I’m mad.” how easy is it going to be to engage Mrs Jones until she has satisfaction about her order

A true 360 degree view of the customer can be formed. communications demographics delivering on the promise of CDI

.Txt.Doc Program can’t I just use a search engine to link the two worlds? integration search engines do not integrate textual information

.Txt.Doc Program integration text doesn’t need to be searched, it needs to be integrated

.Txt.Doc Program integration “ha” “head ache” “heart attack” “Hepatitis A”

.Txt.Doc Program integration “oblique fractured ulna” “oblique fractured tibia” “obliq fractured tarsi” “broken bone”

.Txt.Doc Program 1 – stop word editing 2 – stemming 3 – synonym replacement 4 – synonym concatenation 5 – homograph resolution 6 – alternate spelling resolution 7 – external category classification 8 – theming 9 – probabilistic matching 10 – negation exclusion 11 – concept clustering 12 – mid process editing 13 – change sensitivity What is meant by editing, integrating text? integration

.Txt.Doc Program For a detailed description of how the unstructured environment should be linked to the structured environment, go to - and look for DW 2.0 TM or go to -

Unstructured Data Structured Environment Query Business Objects, Cognos, MicroStrategy, Crystal Reports DB2 probabilistic match visualization