Data Sources, Use Cases and Capabilities

Slides:



Advertisements
Similar presentations
An Introduction to Data Mining
Advertisements

PolyAnalyst Data and Text Mining tool Your Knowledge Partner TM www
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Extract from various presentations: Bing Liu, Aditya Joshi, Aster Data … Sentiment Analysis January 2012.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
WHT/ HPCC Systems Flavio Villanustre VP, Products and Infrastructure HPCC Systems Risk Solutions.
From Words to Meaning to Insight Julia Cretchley & Mike Neal.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
More than words: Social networks’ text mining for consumer brand sentiments A Case on Text Mining Key words: Sentiment analysis, SNS Mining Opinion Mining,
Opinion mining in social networks Student: Aleksandar Ponjavić 3244/2014 Mentor: Profesor dr Veljko Milutinović.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Data Mining GyuHyeon Choi. ‘80s  When the term began to be used  Within the research community.
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
Using Text Mining and Natural Language Processing for Health Care Claims Processing Cihan ÜNAL
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
Use of Hierarchical Keywords for Easy Data Management on HUBzero HUBbub Conference 2013 September 6 th, 2013 Gaurav Nanda, Jonathan Tan, Peter Auyeung,
Unit 2, Lesson 4 Using Auto Features in Word. Objectives Check and correct spelling. Check and correct spelling. Check and correct grammar. Check and.
Text Based Information Retrieval Text Based Information Retrieval H02C8A H02C8B Marie-Francine Moens Karl Gyllstrom Katholieke Universiteit Leuven.
Data Mining Algorithms for Large-Scale Distributed Systems Presenter: Ran Wolff Joint work with Assaf Schuster 2003.
Decision Support Systems
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Huge Universal Language Keyboard Team 9 Dhruv Lamba Luke Johnston Ethan Byler Andy Robison.
CSC 594 Topics in AI – Text Mining and Analytics
Text Based Information Retrieval H02C8A Marie-Francine Moens Karl Gyllstrom Katholieke Universiteit Leuven Study points: 4 Language: English Periodicity:
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
NATURAL LANGUAGE PROCESSING Zachary McNellis. Overview  Background  Areas of NLP  How it works?  Future of NLP  References.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 TIARA: A Visual Exploratory Text Analytic System Presenter.
(1) Organize information processing centers environment, the various functions and details Electronic Data Processing (EDP): can refer to the use of automated.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
An Effective Statistical Approach to Blog Post Opinion Retrieval Ben He, Craig Macdonald, Jiyin He, Iadh Ounis (CIKM 2008)
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Editing Editing – the process of updating a word processing document to: make changes correct errors make it visually appealing.
Machine Learning Best Practices with Alfresco & Activiti
Plan for Today’s Lecture(s)
Data Mining.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Guangbing Yang Presentation for Xerox Docushare Symposium in 2011
IB Assessments CRITERION!!!.
Chapter 7: Text and Web Mining
Personalized Social Image Recommendation
IBM Content and Predictive Analytics for Healthcare How it works
MID-SEM REVIEW.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Topic 3: Cluster Analysis
PolyAnalyst Data and Text Mining tool
Predictive Delinquency Leading Financial Services Company
Unit4 Partner Portal for Case Creator
TDM=Text Mining “automated processing of large amounts of structured digital textual content for purposes of information retrieval, extraction, interpretation.
Text Categorization Rong Jin.
Document Clustering Matt Hughes.
Enterprise Infrastructure
CSE 635 Multimedia Information Retrieval
INNOvation in TRAINING BUSINESS ANALYSTS HAO HElEN Zhang UniVERSITY of ARIZONA
Data Mining Concepts and Techniques
Web Mining Department of Computer Science and Engg.
Content Analysis of Text
Text Mining & Natural Language Processing
Unit4 Customer Portal Knowledge User Access.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Introduction to Text Analysis
e-Discovery through Text Mining
Business Intelligence
Topic 5: Cluster Analysis
AI Discovery Template IBM Cloud Architecture Center
Text Mining Application Programming Chapter 9 Text Categorization
PolyAnalyst™ text mining tool Allstate Insurance example
Presentation transcript:

Data Sources, Use Cases and Capabilities Text Mining: Data Sources, Use Cases and Capabilities Kayvis Damptey Jie Zhang Dec 8, 2016

What is Text Mining?

Text Mining Text Mining uses documents to identify insightful patterns within the text. Thus allowing managers to summarize/organize huge collections of documents and automate detection based on useful linguistic patterns. Detecting these patterns can be done via the following methods: Keyword/Event Detection Word Clouds Visualizations Natural Language Processing Topic Discovery/Category Assignment Sentiment Analysis Financial Management Services November 13, 2018

What skills are needed to do Text Mining? Understand Linguistic Nuances Domain Knowledge Probability and Linear Algebra Computer Programming Financial Management Services November 13, 2018

Past Use Case: Business Problem: Remedy Topic Discovery With many tickets coming into Financial Support Center, there was a need to understand what general topics were being discussed in many of the tickets to better understand which issue need to be resolved faster or had the most tickets. Financial Management Services November 13, 2018

Past Use Cases: Remedy Topic Discovery Approach: Data Cleansing: Curate data for modeling, including relevant customer data, and keywords in the text, spell checking etc. Filter data based on things like excessive symbols (and thus maybe just a long email exchange) Standardize slang/abbreviations and clean the text from spelling errors Data Transformation: By creating a “ticket by term” matrix the tickets can be scored based on their term-frequency combinations (which is an approximation of the antonyms and synonyms). Modeling and Scoring: The tickets are then grouped based on the proportion of the co-occurrence of their term-frequency combination score. Financial Management Services November 13, 2018

Past Use Cases: Remedy Topic Discovery from 1,168 tickets Solution: Financial Management Services November 13, 2018

Everyday Textmining

How might text mining be useful in everyday situations? Automatic Categorization and response to incoming documents Detect strong customer sentiment towards a product or service Discovering Common Topics of many documents Summarizing documents Detecting comments about actions taken by key partners Extract location information from descriptions Financial Management Services November 13, 2018

Jie Zhang (jzhang28@Stanford.edu) Kayvis Damptey (kayvis@Stanford.edu)