D IAMOND A NALYSIS Zachary Baine Comm-486. Project Introduction For the following project, I have created a brief, hypothetical statistical analysis presentation.

Slides:



Advertisements
Similar presentations
(Say each word as it appears on the screen.)
Advertisements

“People do not want to be less informed. They want to be more informed with less information!” To move forward, backward or to a Contents page, move your.
Using the SmartPLS Software “Structural Model Assessment”
Here we add more independent variables to the regression.
Correlation and Regression Statistics Introduction Means etc are of course useful We might also wonder, “how do variables go together?” IQ is a.
 Coefficient of Determination Section 4.3 Alan Craig
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
Classical Regression III
Stat 112: Lecture 15 Notes Finish Chapter 6: –Review on Checking Assumptions (Section ) –Outliers and Influential Points (Section 6.7) Homework.
Part IVA Analysis of Variance (ANOVA) Dr. Stephen H. Russell Weber State University.
Finishing up: Posters, Talks, & Regression Psych 231: Research Methods in Psychology.
Simple Linear Regression
Lecture 3 MGMT © 2011 Houman Younessi Rational Choice – Consumer behavior 50 Bread Potatoes I1 I2 A B Indifference Curves A’
Class 7: Thurs., Sep. 30. Outliers and Influential Observations Outlier: Any really unusual observation. Outlier in the X direction (called high leverage.
Class 2: Tues., Sept. 14th Correlation (2.2) Introduction to Measurement Theory: –Reliability of measurements and correlation –Example that demonstrates.
Biol 500: basic statistics
Covariance and Correlation
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Intro Stats in the 21 st Century Dick De Veaux Williams College USCOTS 15 May 28, 2015.
That Greenhouse Effect!
Lecture 3-2 Summarizing Relationships among variables ©
Relations in Categorical Data 1. When a researcher is studying the relationship between two variables, if both variables are numerical then scatterplots,
Bonds Lecture 8 This lecture is part of Chapter 4: Investing in the Company.
A P STATISTICS LESSON 3 – 2 CORRELATION.
Section 7.3 ~ Best-Fit Lines and Prediction Introduction to Probability and Statistics Ms. Young.
Using the Margins Command to Estimate and Interpret Adjusted Predictions and Marginal Effects Richard Williams
Bivariate Regression Analysis The most useful means of discerning causality and significance of variables.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.4.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Statistical Analysis Topic – Math skills requirements.
Step 3 of the Data Analysis Plan Confirm what the data reveal: Inferential statistics All this information is in Chapters 11 & 12 of text.
9 Simple Steps to Building A Strong and Inspiring “Why or I” Story
Multiplying Signed Numbers © Math As A Second Language All Rights Reserved next #9 Taking the Fear out of Math ×
1 Definition: A variable is something that can have different amounts or different types.
Chapter 10 Correlation and Regression
Critical Reading Strategy
Kendall Klimek Data Culminating Mr. Rancourt. Introduction  Topic: Population vs Transportation Revenues  Purpose: Researching the correlation between.
Brian Macpherson Ph.D, Professor of Statistics, University of Manitoba Tom Bingham Statistician, The Boeing Company.
GEM A Finance Survival Kit GEM2508 Frederick H. Willeboordse
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
Regression BPS chapter 5 © 2010 W.H. Freeman and Company.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.1 One-Way ANOVA: Comparing.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
Sight Word List.
Introduction to Linearization ( No units, no uncertainties, just the core idea ) The purpose of linearization is to get the equation that describes real.
Positive Communication: Defusing Challenging Situations
Quadratic Regression ©2005 Dr. B. C. Paul. Fitting Second Order Effects Can also use least square error formulation to fit an equation of the form Math.
ANOVA, Regression and Multiple Regression March
Multiplication of Common Fractions © Math As A Second Language All Rights Reserved next #6 Taking the Fear out of Math 1 3 ×1 3 Applying.
Regression. Outline of Today’s Discussion 1.Coefficient of Determination 2.Regression Analysis: Introduction 3.Regression Analysis: SPSS 4.Regression.
Writing Reminders Specific Tips on Writing Technical Reports.
Within Subject ANOVAs: Assumptions & Post Hoc Tests.
Sight Words. the and to see it you like me.
Samantha Bellah Adv. Stats Final Project Real Estate Forecasting Regression Model Market: Highland Park Neighborhood Data Sources: Zillow.com E:\PuebloRESales2014Q1Q2.xlsx.
ANNOTATIONANNOTATION Critical Reading Strategy. Why annotate? How many times have you had to read something more than once to comprehend it? How many.
The Law of Averages. What does the law of average say? We know that, from the definition of probability, in the long run the frequency of some event will.
Conjoint Analysis. 1. Managers frequently want to know what utility a particular product feature or service feature will have for a consumer. 2. Conjoint.
HW 19 Key. 22:37 Diamond Rings. The data table contains the listed prices and weights of the diamonds in 48 rings offered for sale in The Singapore Times.
Multivariate vs Univariate ANOVA: Assumptions. Outline of Today’s Discussion 1.Within Subject ANOVAs in SPSS 2.Within Subject ANOVAs: Sphericity Post.
Introduction to Regression Analysis
Multiple Regression: I
Regression Analysis Simple Linear Regression
Correlation A Lecture for the Intro Stat Course
Males and Female College Completion Rates in Wisconsin in 1957
The Scientific Method.
Charles University Charles University STAKAN III
Analyzing Stability in Colorado K-12 Public Schools
Unit 5 Correlation.
Z-test and T-test Chong Ho (Alex) Yu 8/12/2019 1:50 AM
Chapter 7 Excel Extension: Now You Try!
Presentation transcript:

D IAMOND A NALYSIS Zachary Baine Comm-486

Project Introduction For the following project, I have created a brief, hypothetical statistical analysis presentation. The presentation is based off a sample set of 310 diamonds that I have analyzed using JMP statistical software. The data includes 5 variables for each diamond : the diamond’s cut, color, clarity, carat, and price. Each of these variables, excluding price, has its own rating system. The purpose of this presentation is to explain how each of these variables determines the price of a diamond. In a real world application this information could be used to inform a company, investor, etc. how diamonds should be priced in a competitive market. As this is a short presentation I will not delve too far into the mathematics behind a diamond’s pricing, but rather explain what goes into a diamond’s price and short explanation why. That being said, let’s get started.

After conducting my analysis, I have found a few key factors that really seem to determine the price of each diamond. For starters, which of the four c’s (cut, color, clarity, and carat) do you think impacts a diamond’s price the most? (click on the diamond to select your choice) CUT CLARITY COLOR CARAT

Unfortunately not. A diamond’s cut is not the most impactful variable on a diamond’s price. Click on continue to find the most impactful or retry to guess again. CONTINUE

Unfortunately not. A diamond’s color is not the most impactful variable on a diamond’s price. Click on continue to find the most impactful or retry to guess again.

Unfortunately not. A diamond’s clarity is not the most impactful variable on a diamond’s price. Click on continue to find the most impactful or retry to guess again. RETRY

That’s right, a diamond’s carat increases the price of it more than any other variable by far. How do we know this? After formatting the data, I ran a correlation analysis which yields the following table: Looking at the corresponding values between the column LnPrice and the variables LnCarat, Dum Cut, Dum Color, and Dum Clarity we can see (circled in red) that the greatest value is between LnCarat and LnPrice, indicating the most impactful.

How else is price determined? Statistically speaking it is helpful to know that the price and carat of a diamond are strongly related; however, there is more to the price than just the carat. My next step was to run multiple regression models in JMP in order to find out how the other variables (color, cut, and clarity) impact the price. To do this I compared price with only color, cut, and clarity.

A FTER RUNNING THIS MODEL, WHICH OF THE FOLLOWING DO YOU THINK HAS THE MOST IMPACT ON A DIAMOND ’ S PRICE ?

Unfortunately not. A diamond’s cut is not the most impactful remaining variable. Click on continue to find the most impactful remaining variable or retry to guess again.

As it turns out, it is clarity that is the second most impactful variable. In order to avoid being too technical, let me just say that we can determine this since the coefficients yielded by the model were much greater for clarity than for cut or color.

Conclusion From all of this we can now determine that the price placed on a diamond is mainly based on its carat and then by its clarity. As a matter of fact, over 90% of the variability in a diamond’s price can be determined by these two characteristics alone; it turns out the cut and color of the diamond have little to no impact on its price.