What factors are most responsible for height?. Model Specification ERROR??? measurement error model error analysis unexplained unknown unaccounted for.

Slides:



Advertisements
Similar presentations
The Normal Distribution
Advertisements

Apr-15H.S.1 Stata: Linear Regression Stata 3, linear regression Hein Stigum Presentation, data and programs at: courses.
Outline Research Question: What determines height? Data Input Look at One Variable Compare Two Variables Children’s Height and Parents Height Children’s.
Linear regression models
Chapter 3 Bivariate Data
Stem and Leaf Display Stem and Leaf displays are an “in between” a table and a graph – They contain two columns: – The left column contains the first digit.
CTRC Core Curriculum Seminar Series
Describing data with graphics and numbers. Types of Data Categorical Variables –also known as class variables, nominal variables Quantitative Variables.
STAT 104 Section 6 Daniel Moon. Agenda Review Midterm 1 Practice Problems.
Chapter 19 Data Analysis Overview
Data Interpretation and Graphs. The Display Display the data you collect with tables and graphs. Include these in both your Project Binder and on your.
STA220: Practice of Statistics 1 Section L0301: Health & Life Sciences September 17,
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
Ana Jerončić, PhD Department for Research in Biomedicine and Health.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Programming in R Describing Univariate and Multivariate data.
Correlation and Covariance. Overview Continuous Categorical Histogram Scatter Boxplot Predictor Variable (X-Axis) Height Outcome, Dependent Variable (Y-Axis)
Correlation and Covariance
What factors are most responsible for height?
R Example Descriptive Statistics Frequency and Histogram Diagrams Standard Deviation.
Descriptive Statistics Method of collecting, organizing, displaying, and drawing conclusion of data.
Outline Class Intros Overview of Course & Series Example Research Projects Beginning R.
Tuesday, March 18, 2014MAT Tuesday, March 18, 2014MAT 3122.
Graphing Parameters Titles X-Axis Title Y-Axis Title Legend Scales Color Gridlines library(help="graphics") Basic Chart Types The R Graphics Package LineHistogram.
I❤RI❤R Kin Wong (Sam) Game Plan Intro R Import SPSS file Descriptive Statistics Inferential Statistics GraphsQ&A.
A Few Handful Many Time Stamps One Time Snapshot Many Time Series Number of Variables Mobile Phone Galton Height Census Titanic Survivors Stock Market.
Sec 1.5 Scatter Plots and Least Squares Lines Come in & plot your height (x-axis) and shoe size (y-axis) on the graph. Add your coordinate point to the.
Tuesday, February 18, 2014MAT 312. Tuesday, February 18, 2014MAT 312 Suppose the Kick-a-Poo Milling Company made this claim about their Raisin Bran: In.
Physics 270.  y ( x )Data Set #1 t (s) y (cm)
Graphs.  Graphs are used to present numerical information in picture form.  Two common types of graphs are bar graphs and broken-line graphs. New Car.
Thursday, February 27, 2014MAT 312. Thursday, February 27, 2014MAT 312.
Statistics with TI-Nspire™ Technology Module E. Lesson 2: Properties Statistics with TI-Nspire™ Technology Module E.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
GRASPING GRAPHING. Graphs are visual representations of data you collect in an experiment. There are many types of graphs including: bar, line, scatter.
Math 145 September 11, Recap  Individuals – are the objects described by a set of data. Individuals may be people, but they may also be animals.
Chap 18-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 18-1 Chapter 18 A Roadmap for Analyzing Data Basic Business Statistics.
Tuesday, February 3, 2014MAT 312. Tuesday, February 3, 2014MAT 312.
Research Question What determines a person’s height?
Where to Get Data? Run an Experiment Use Existing Data.
Residual Plots Unit #8 - Statistics.
Engineers often: Regress data to a model  Used for assessing theory  Used for predicting  Empirical or theoretical model Use the regression of others.
Outline Research Question: What determines height? Data Input Look at One Variable Compare Two Variables Children’s Height and Parents Height Children’s.
Tuesday, February 25, 2014MAT 312. Tuesday, February 25, 2014MAT 312 Suppose the Kick-a-Poo Milling Company made this claim about their Raisin Bran: In.
Logistic Regression Saed Sayad 1www.ismartsoft.com.
Statistics with TI-Nspire™ Technology Module E Lesson 1: Elementary concepts.
Why do we analyze data?  It is important to analyze data because you need to determine the extent to which the hypothesized relationship does or does.
Steps Continuous Categorical Histogram Scatter Boxplot Child’s Height Linear Regression Dad’s Height Gender Continuous Y X1, X2 X3 Type Variable Mom’s.
Descriptive Statistics  Individuals – are the objects described by a set of data. Individuals may be people, but they may also be animals or things. 
Continuous Outcome, Dependent Variable (Y-Axis) Child’s Height
Unit 4 Review. Starter Write the characteristics of the binomial setting. What is the difference between the binomial setting and the geometric setting?
1 Take a challenge with time; never let time idles away aimlessly.
USING GRAPHING SKILLS. Axis While drawing graphs, we have two axis. X-axis: for consistent variables Y-axis: for other variable.
Graphs with SPSS Aravinda Guntupalli. Bar charts  Bar Charts are used for graphical representation of Nominal and Ordinal data  Height of the bar is.
Descriptive Statistics using R. Summary Commands An essential starting point with any set of data is to get an overview of what you are dealing with You.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Chapter 12: Correlation and Linear Regression 1.
MARCH 18, 2014 DATA ANALYSIS. WHAT TO DO WITH DATA Take a look at your data Histogram Descriptive statistics Mean, mode, range, standard deviation/standard.
Chapter 18 Data Analysis Overview Yandell – Econ 216 Chap 18-1.
Clinical Calculation 5th Edition
MATH-138 Elementary Statistics
Review 1. Describing variables.
Jeopardy Final Jeopardy Chapter 1 Chapter 2 Chapter 3 Chapter 4
Chapter 2 Describing Data: Graphs and Tables
Treat everyone with sincerity,
Program This course will be dived into 3 parts: Part 1 Descriptive statistics and introduction to continuous outcome variables Part 2 Continuous outcome.
Correlation and Covariance
Histograms.
Math 341 January 24, 2007.
Math 10, Spring 2019 Introductory Statistics
Presentation transcript:

What factors are most responsible for height?

Model Specification ERROR??? measurement error model error analysis unexplained unknown unaccounted for missing variables Outcome = (Model) + Error

Analytics & History: 1st Regression Line The first “Regression Line”

Men's average height 'up 11cm since 1870s'

Galton’s Notebook on Families & Height

X1X2X3 X4 X5Y

we find that a 54-loci genomic profile explained 4–6% of the sex- and age-adjusted height variance the Galtonian mid-parental prediction method explained 40% of the sex- and age-adjusted height variance

> getwd() [1] "C:/Users/johnp_000/Documents" > setwd()

Dataset Input Function Filename Object

Data Types: Numbers and Factors/Categorical str() summary()

head() summary() ece

ContinuousCategorical Continuous Categorical Histogram Scatter Bar Cross Table Boxplot Predictor Variable (X-Axis) Pie Child’s Height Smartphone? Yes or No Outcome, Dependent Variable (Y-Axis) Mosaic Cross Table Linear Regression Logistic Regression Regression Model Parents Height Gender Frequency 0 1 Outcome, Dependent Variable (Y-Axis)

Frequency Distribution, Histogram hist(heights$childHeight)

Standard Deviation Mean

Deviation between mean and an actual data point. Calculating Standard Deviation - sd()

Normal Distribution and SD Mean = 66.5 S.D. = = 73.6 SDPct.Z-scoreHeights 190% % % = 59.4

Area = 1 Density Plot plot(density(h$childHeight))

hist(h$childHeight,freq=F, breaks =25, ylim = c(0,0.14)) curve(dnorm(x, mean=mean(h$childHeight), sd=sd(h$childHeight)), col="red", add=T) Bimodal: two modes Mode, Bimodal

ggplot2