Download presentation
Presentation is loading. Please wait.
Published byLeon Scott Modified over 9 years ago
1
1
2
Introductions Customer feedback Where does the data come from? Why is this analysis even more important today? Questionnaires with ratings The Likert scale How to visualize the results Are the results significant? Free text responses What can we do with it? Sentiment analysis Count words and phrases Class 4 Outline 2
3
Curry Guinn Hokie (or Fighting Gobbler) Blue Devil Seahawk 10 years at RTI International (in Research Triangle Park) Introductions 3
4
Where do we get the data? How has this changed over the years? Who gets to see customer feedback? Why is it even more important to understand and respond to customer feedback now? Customer Feedback 4
5
Questionnaires with Ratings These rating scales are often called Likert-style questionnaires Free text responses Styles of Data Collection 5
6
Likert Scaling 1234512345 1234512345 3Scan a multitude of information and decide what is important. 1234512345 1234512345 1234512345 1234512345 1234512345 1Manage time effectively 2Manage resources effectively. 3Scan a multitude of information and decide what is important. 4Decide how to manage multiple tasks. 5Organize the work when directions are not specific. 1Manage time effectively Rating Sheet A group of judges rates each item on a scale where: 1=strongly disagree 2=disagree3=undecided4=agree 5=strongly agree
7
To compare across questions, column (or bar) charts with either mean, median, or mode. You can use bar charts to compare across products or services too. To compare responses within a question, histograms. Let’s see examples of these in our exercise. Visualizing the Results of Likert-Like Surveys 7
8
Column Chart Example – Single Product (Exercise 1) 8
9
Bar Chart Example – Multiple Products (Exercise 2) 9
10
Histogram Example – Examining a Single Response (Exercise 3) 10
11
What is the true average? 11 In our Supra Sedan, the overall satisfaction had a mean of 3.4. That result was calculated with 10 customer responses? How certain are we that the “true” mean is 3.4? Would the result be exactly the same with 100 customer responses? What about 1000?
12
Standard Deviation 12 What standard deviation (and the related statistic standard error) tell us is what is the likely range of the true mean. We say that there is a 95% likelihood that the true mean is within twice the standard deviation. This gives us a level of confidence in our results.
13
Standard Deviation 13 If the standard deviation is big, then the likelihood that we have calculated a mean that is close to the real mean is lower.
14
Why is this important? 14 Often we want to know whether the difference between two means is significant. For instance, suppose some customers rate Product 1’s overall satisfaction at 3.4 and Product 2’s satisfaction at 2.9. Should we tell our company that customers prefer Product 1 over Product 2? These means are just an estimate of the real satisfaction. So, is there a chance that customers really don’t prefer Product 1 to Product 2? The answer is yes! Fortunately, we can calculate that chance.
15
Here’s the visual 15
16
Using a T-Test to determine whether two sets of data are significantly different 16 A t-test can return the probability that two sets of data have the same mean If that probability is low (say 5%), then you can have some confidence that the statistical means are truly different. Exercise 4: Is the difference in customer’s opinion of “Price” significant? How about attractiveness? Overall satisfaction?
17
What are some problems with Likert-style surveys? 17 Social Desirability – Respondents often answer trying to put themselves (or others) in a positive light. Real world example I just encountered: Internship Supervisor Survey. Question: Give your overall rating of the intern: Poor Fair Average Good Excellent No one was rated “Poor” or “Fair” How to interpret that? Bimodal distributions – Relatively common T-tests assume a normal distribution Continued on next slide
18
Bimodal distribution 18 Answers tend to cluster around two different “means” Real world example: Student grades in my programming classes Where else might you expect to see a bimodal distribution? Controversial issues
19
Be careful looking at the mean 19 Suppose we have a 7-point scale where 1 means “Hate it!” and 7 means “Love it!”. 4 means “Neutral”. You ask 100 people their opinion of Obamacare and you calculate a mean of 4. Does that mean that people are “Neutral” on Obamacare? The histogram is important! Exercise 4.5: Looking at a bimodal distribution.
20
Let’s Take a Break 20 When we come back … Working with Free-Text Responses
21
Text Analytics 21 What can we do with all the free form text that that comes from sources like: Customer Satisfaction Surveys Amazon.com Tripadvisor.com Angie’s List Social Media – Facebook – Twitter
22
Tools for Text Analytics 22 Goals: – Use the structure and regularities inherent in language to extract useful information – Communicate that information
23
Our Domain for this Example 23 784 customer reviews of a particular Linksys router (Source: Amazon.com) – Cisco-Linksys WRT160N Wireless-N Broadband Router What are some things we can do? – Sentiment analysis – Topic identification
24
24 Determine whether some text is basically “positive”, “negative” or “neutral” in affect What is sentiment analysis?
25
25 Pre-defined dictionary of words and phrases annotated with sentiment information – Example: Finn Arup’s annotated list of words – Advantage: Accuracy – Disadvantages: Poor coverage, not sensitive to how vocabulary is used in a domain, sarcasm Advanced: Use machine learning techniques to enable computer to learn words and phrases associated with particular sentiments – Advantages: Can be sensitive to domain; good accuracy – Disadvantages: Requires corpus for analysis Tools for Sentiment Analysis
26
26 This macro was written by Mike Alexander Uses Arun Frup’s list of affect words Documentation: http://datapigtechnologies.com/blog/index.php/quantifying- subjective-text-with-sentiment-analysis/#more-5356 Documentation Example Spreadsheet: http://www.datapigtechnologies.com/downloads/Text_To_Sentiment.xlsm Example Spreadsheet Exercise 6: Using a VBA macro that analyzes sentiment
27
27 The following was identified as being roughly neutral: Had a real headache to set it up but after it was installed it works just fine. I think the instructions could have been written better. Clearly, we need to identify topics within a review AND – Perform sentiment analysis on these sub- topics Consolidated Sentiment Analysis Can Be Misleading
28
28 Word and phrase counts Advantage: Can be done using Excel tools Disadvantage: Does not automatically combine related words/phrases Pre-defined topic vocabularies Advantage: Accurate Disadvantage: Must pre-defined categories Cluster analysis: Computer figures out categories Advantage: Computer does the work Disadvantage: Not supported well by Excel Tools for Topic Identification
29
29 Exercise 7: Word counting in Excel Using Pivot Table – Cool!
30
30 Advanced Tools A number of commercial products exist for performing more advanced text analysis Semantria https://semantria.com/https://semantria.com/ Repustate https://www.repustate.com/https://www.repustate.com/ Text2Data http://text2data.org/http://text2data.org/ QI Macros http://www.qimacros.com/http://www.qimacros.com/ are examples To see what they can do, let’s look at Semantria
31
31 Exercise 8: Semantria
32
With sufficient data, the customer feedback statistics can provide meaningful and actionable data Performs best with substantial data Vulnerable if the amount of data is low But if the amount of data is small, you probably wouldn’t be using automated tools anyway. Text Analytics 32
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.