Text Mining with JMP Pro 13: A Case Study

Text Mining with JMP Pro 13: A Case Study
Mia Stephens – mia.stephens@jmp.com
Who is familiar with JMP?
JMP 12 release
Who am I
Copyright © 2010, SAS Institute Inc. All rights reserved.

What is JMP (and JMP Pro)?
Statistical Discovery Software from SAS
Developed in 1989
For teaching and doing
Comprehensive
  Basic
  Advanced
Extendible
  Powerful scripting language
  Application and add-in builders
  Excel, R, MATLAB, and SAS
Visual, dynamic and interactive
JMP Pro – Advanced tools for analytics and modeling
Runs natively

By its very nature, JMP is visual, interactive and dynamic, which makes it an ideal tool for teaching and learning statistical concepts. JMP has a programming language, which is not required to use JMP, but allows users to expand applications of JMP.

Agenda
Introduction to Text Mining
General Workflow
Simple Example – Pet Survey
Case Study: Toronto Casino Survey

Reference: A practical guide to text mining with topic extraction, Karl, Wisnowski, and Rushing, 2015
Citation: WIREs Comput Stat 2015, 7:326–340. doi: 10.1002/wics.1361

Text Mining: General Workflow
Define the problem
Collect/compile the data (unstructured text, and other relevant information)
Process/prepare the text
  Correct spelling/capitalization (Recode)
  Combine words with same root (Stemming)
  Remove common words, symbols, numbers,… (Stopwords)
Transform text (Document Term Matrix)
Explore clusters and topics to identify themes
  Group similar documents and words
*Create new variables for predictive modeling
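
The "process/prepare the text" steps can be sketched outside JMP in a few lines of Python. This is only an illustration of the idea, not how Text Explorer does it: the recode map, the stopword list, and the suffix-stripping rule are all hypothetical stand-ins chosen for this example, and a real analysis would use a proper stemmer.

```python
import re

# Hypothetical recode map and stopword list, for illustration only.
RECODE = {"lowe's": "lowes"}
STOPWORDS = {"the", "at", "are", "in", "is", "a"}


def crude_stem(token):
    """Strip a few common suffixes; a rough stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "s"):
        # Keep at least three characters so short words are left alone.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def prepare(text):
    """Lowercase, tokenize, recode, drop stopwords, then stem."""
    # Keeping only letters and apostrophes also drops symbols and numbers.
    tokens = re.findall(r"[a-z']+", text.lower())
    tokens = [RECODE.get(t, t) for t in tokens]          # Recode
    tokens = [t for t in tokens if t not in STOPWORDS]   # Stopwords
    return [crude_stem(t) for t in tokens]               # Stemming


print(prepare("The prices at Lowes are amazing"))
print(prepare("Finding help in Lowe's is a problem"))
```

The order matters: recoding runs before stopword removal and stemming so that spelling variants are merged before anything is dropped or truncated.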

What is a Document Term Matrix?
Take the following “documents”:
  The prices at Lowes are amazing
  Finding help in Lowes is a problem
Find the number of unique terms
Create indicator variables for each term
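
A minimal Python sketch of the same construction, using the two example documents from the slide. It assumes the simplest possible tokenization (lowercase, split on whitespace) and keeps every word, with no stemming or stopword removal; the function name is made up for this example.

```python
docs = [
    "The prices at Lowes are amazing",
    "Finding help in Lowes is a problem",
]


def document_term_matrix(documents):
    """Return (terms, matrix): one row per document, one 0/1 column per term."""
    tokenized = [set(doc.lower().split()) for doc in documents]
    # Unique terms across all documents, sorted for a stable column order.
    terms = sorted(set().union(*tokenized))
    # Indicator variable: 1 if the term occurs in the document, else 0.
    matrix = [[1 if term in tokens else 0 for term in terms]
              for tokens in tokenized]
    return terms, matrix


terms, matrix = document_term_matrix(docs)
print(len(terms), "unique terms:", terms)
for doc, row in zip(docs, matrix):
    print(row, doc)
```

The only term the two documents share is "lowes", so its column is the only one with a 1 in both rows. Replacing the indicator with a count of occurrences gives a frequency-weighted version of the same matrix.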

Simple Example: Pet Survey

Case Study: Toronto Casino Survey

Case Study: Toronto Casino Survey
Data exploration
  Are respondents generally in favor of or opposed to the casino?
  Does the response depend on gender or age?
Text Explorer
  Term and Phrase Lists: What are the most frequently used terms and phrases? In what context are the terms and phrases used?
  Word Cloud: Do those in favor use different words/phrases than those opposed?
  Clustering: Which terms tend to appear together? Can similar documents be grouped together?
  Topic Analysis: What are the recurring themes?
Predictive Modeling
  Can the topics or terms be used to predict whether a respondent was in favor of or opposed to the casino?

JMP Text Explorer Resources
Info Kit: includes a webinar and book chapter on text mining
Short guides (in the JMP Learning Library): jmp.com/learn
Videos:
  http://www.jmp.com/en_us/events/ondemand/mastering-jmp/text-explorer.html
  http://www.jmp.com/en_us/events/ondemand/technically-speaking/tackling-unstructured-data-with-text-exploration.html
General overview discussion of Text Explorer (Heath Rushing) with a simple example:
  http://www.jmp.com/en_us/events/ondemand/analytically-speaking/analytically-speaking-heath-rushing.html

Discussion and Q&A
Mia.stephens@jmp.com
jmp.com/academic