Ch 7. What Makes a Great Analysis? Taming The Big Data Tidal Wave 31 May 2012 SNU IDB Lab. Sengyu Rim.

Slides:



Advertisements
Similar presentations
Key Performance Indicators KPI’s
Advertisements

Standardized Scales.
HOW TO PASS AS EVALUATING STATISTICAL REPORTS.
Objectives 10.1 Simple linear regression
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
What is a sample? Epidemiology matters: a new introduction to methodological foundations Chapter 4.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Estimation from Samples Find a likely range of values for a population parameter (e.g. average, %) Find a likely range of values for a population parameter.
Market Intelligence Julie Edell Britton
Mathematics 191 Research Seminar in Mathematical Modeling 28 March 2005 Rapid Modeling and Tractability.
Data Analysis, Interpretation, and Reporting
Using Statistics in Research Psych 231: Research Methods in Psychology.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Simple Linear Regression Analysis
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Quantitative vs. Categorical Data
Determining Sample Size
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Big Idea 1: The Practice of Science Description A: Scientific inquiry is a multifaceted activity; the processes of science include the formulation of scientifically.
Lecture 4 1 Introduction to Systems Planning Lecture 4 2 Objectives n Describe the strategic planning process n Explain the purpose of a mission statement.
A Comparison of Statistical Significance Tests for Information Retrieval Evaluation CIKM´07, November 2007.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Problems with the Design and Implementation of Randomized Experiments By Larry V. Hedges Northwestern University Presented at the 2009 IES Research Conference.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Confidence Intervals: The Basics BPS chapter 14 © 2006 W.H. Freeman and Company.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Section 10.1 Confidence Intervals
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
1 First Day Data Sheet – fill out and bring to lab tomorrow Syllabus – go over.
The Sampling Design. Sampling Design Selection of Elements –The basic idea of sampling is that by selecting some of the elements in a population, we may.
Information, Analysis, and Knowledge Management in the Baldrige Criteria Examines how an organization selects, gathers, analyzes, manages, and improves.
Experimentation in Computer Science (Part 2). Experimentation in Software Engineering --- Outline  Empirical Strategies  Measurement  Experiment Process.
Africa Program for Education Impact Evaluation Dakar, Senegal December 15-19, 2008 Experimental Methods Muna Meky Economist Africa Impact Evaluation Initiative.
Finishing up: Statistics & Developmental designs Psych 231: Research Methods in Psychology.
© 2013 Catalyst The Case for Incrementality How to Measure the Real ROI of Your Marketing Programs By Marc Solomon, Director of Analytics, Catalyst 6/3/2013.
Chapter Nine Building Customer Relationships Through Effective Marketing.
Ch 8 Estimating with Confidence 8.1: Confidence Intervals.
Ch 8. What Makes a Great Analytic Professional? Taming The Big Data Tidal Wave 7 June 2012 SNU IDB Lab. Jee-bum Park.
Course: Research in Biomedicine and Health III Seminar 5: Critical assessment of evidence.
T tests comparing two means t tests comparing two means.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Analyse questions. What to do when you are confronted with an ANALYSE question! Analyse means to break down all the elements of the question then show.
Inferential Statistics Psych 231: Research Methods in Psychology.
Copyright 2010, The World Bank Group. All Rights Reserved. Producer prices, part 2 Measurement issues Business Statistics and Registers 1.
Inferential Statistics Psych 231: Research Methods in Psychology.
Psych 231: Research Methods in Psychology
CHAPTER 9 Testing a Claim
Evaluating a Statistical Article
Strategic Workforce Planning Chris MacRae
Research Process №5.
Chapter 1 Strategic Leadership: Managing the Strategy-Making Process for Competitive Advantage.
Building Customer Relationships Through Effective Marketing
Lecture # 4 Software Development Project Management
EVALUATING STATISTICAL REPORTS
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Inferential Statistics
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Presentation transcript:

Ch 7. What Makes a Great Analysis? Taming The Big Data Tidal Wave 31 May 2012 SNU IDB Lab. Sengyu Rim

Outline  Criterions for a Good Analysis  Frame the Problem Correctly  Making Inferences 2

Criterions for a Good Analysis(1/7) What is Reporting?  Reporting isn’t equal to analysis – Many organizations mistakenly equate reporting with analysis  A reporting environment(business intelligence environment) – Select the reports they want to run – Get the reports executed – View the results 3 report Provide data Predefined form Inflexible

Criterions for a Good Analysis(2/7) What is Analysis?  An analysis is an interactive process of – Tackling problem – Finding the data required – Analyze the data – Interpret the results 4 Analysis Provide answer Customized Flexible

Criterions for a Good Analysis(3/7) Comparison between Reporting and Analysis  Summary of Analysis versus Reporting 5 ReportingAnalysis Provides dataProvides answers Provides what is asked forProvides what is needed Is typically standardizedIs typically customized Does not involve a personInvolves a person Is fairy inflexibleIs extremely flexible

Criterions for a Good Analysis(4/7) G.R.E.A.T criteria  G.R.E.A.T criteria will add value to analysis 6 G R E A T Guided-guided by a business need Relevant-relevant to the business Explainable-analysis needs to be explained effectively Actionable-a great analysis will be actionable Timely-analysis will be delivered in a timely fashion

Criterions for a Good Analysis(5/7) What are Core Analytics?  Core analytics tend to ask simple questions and provide simple answers – What happened – When it happened – What the impact was 7 Sales Promotion 1.How many subscribers signed up? 2.How did the sing- ups occur everyday? 3.How much money did the new subscribers bring in?

Criterions for a Good Analysis(6/7) What are Advanced Analytics?  Advanced analytics go further than core analytics – What caused it to happen – What can be done in the future 8 Customer Web Activity 1. Identify the relationship between browsing and sales 2. Formulate strategy for marketing

Criterions for a Good Analysis(7/7) Cherry Picking  Sometimes the gut feelings of executives conflict with analysis  One of the worst abuses is to cherry pick results  Cherry picking – Use the analytics when the results serve your purpose – Ignore the findings when the results conflict with the original plan 9

Outline  Criterions for a Good Analysis  Frame the Problem Correctly  Making Inferences 10

Frame the Problem Correctly(1/6) How to Frame the Problem?  Great analysis starts with framing the problem correctly – Assess the data correctly – Develop a solid analysis plan – Technical and practical considerations should be taken into account  Framing the problem is the most important step of an analysis 11

Frame the Problem Correctly(2/6) Statistical Significance  Statistical significance – Used to evaluate the parameter estimates  A statistical significance will validate the conclusions 12

Frame the Problem Correctly(3/6) Never Take Shortcuts  Ensure you have all the data you need – Given the part of the story, conclusions may be completely wrong  Who has the higher average batting? 13 Season TomJoeWinner Joe Joe Joe Joe Joe Year Tom Avg Tom At Bats Tom Hits Joe Avg Joe At Bats Joe Hits winner Joe Joe Joe Joe Joe Total Tom

Frame the Problem Correctly(4/6) Business Importance  Statistical significance should match the business perspective – What are the costs to make the recommended changes? – How much additional revenue might be generated? – Is the new approach consistent with the overall strategy? – Are the new changes executable? 14 Statistical significance Business importance

Frame the Problem Correctly(5/6) Samples versus Population  Using today’s scalable systems, it’s possible to work with an entire population – With big data, we have enough data for a sufficient sample  When a sampling process is needed, it needs to be done correctly – The bigger sample is made, the tighter the margin of the error – Sample size should be suitable for the problem 15

Frame the Problem Correctly(6/6) All Data is Needed  Any given problem may require only a small sample of the data  Different samples require different data – Entire data should be kept 16 s1 s2 s4 s7 s8 s6 s5 s3 s9 s10

Outline  Criterions for a Good Analysis  Frame the Problem Correctly  Making Inferences 17

Making Inferences  To produce a great analysis, it is necessary to infer potential actions – Make initial inferences based on analysis 18