Exploring Data in R Introduction to R, Part II

Slides:



Advertisements
Similar presentations
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
Advertisements

TOTAL QUALITY MANAGEMENT
1 STAT 6020 Introduction to Biostatistics Fall 2005 Dr. G. H. Rowell Class 1.
Mother and Child Health: Research Methods G.J.Ebrahim Editor Journal of Tropical Pediatrics, Oxford University Press.
CHAPTER 1: Picturing Distributions with Graphs
STA220: Practice of Statistics 1 Section L0301: Health & Life Sciences September 17,
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
+ The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1: Exploring Data Introduction Data Analysis: Making Sense of Data.
Instruction Guides for David Howell’s Textbook Examples Boxplots and Stem-and-Leaf Plots Sign Test using the Binomial Distribution.
SPSS Instructions for Introduction to Biostatistics Larry Winner Department of Statistics University of Florida.
AP Statistics Semester One Review Part 1 Chapters 1-3 Semester One Review Part 1 Chapters 1-3.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
Univariate EDA. Quantitative Univariate EDASlide #2 Exploratory Data Analysis Univariate EDA – Describe the distribution –Distribution is concerned with.
Statistics with TI-Nspire™ Technology Module E Lesson 1: Elementary concepts.
+ The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1: Exploring Data Introduction Data Analysis: Making Sense of Data.
Bell Ringer You will need a new bell ringer sheet – write your answers in the Monday box. 3. Airport administrators take a sample of airline baggage and.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 1 Exploring Data Introduction Data Analysis:
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Statistical Exploratory Analysis with “EnQuireR” 1.Introduction 2.Installation 3.How to 4.Report.
Prof. Eric A. Suess Chapter 3
Chapter 1: Exploring Data
SFB stats workshop Bodo Winter.
Part Four ANALYSIS AND PRESENTATION OF DATA
EHS 655 Lecture 4: Descriptive statistics, censored data
Exploring Data: Summary Statistics and Visualizations
Intro to AP Statistics and Exam
Welcome to AP Stats!.
CHAPTER 1 Exploring Data
A linear approach to predicting house prices
AP Statistics Variables.
Microsoft Office Illustrated
Chapter 1: Exploring Data
Part Three. Data Analysis
CHAPTER 1: Picturing Distributions with Graphs
CHAPTER 1: Picturing Distributions with Graphs
Warm-Up Today Warm-up Objective Pre-test Statistics Chapter 1
Additional Resources Introduction to R, Part II
AP Exam Review Chapters 1-10
Chapter 1 Data Analysis Ch.1 Introduction
Data Mining: Exploring Data
Investigating associations between categorical variables
Chapter 1: Exploring Data
Writing Functions in R Introduction to R, Part II
Installing Packages Introduction to R, Part II
CHAPTER 1 Exploring Data
Good Morning AP Stat! Day #2
Program This course will be dived into 3 parts: Part 1 Descriptive statistics and introduction to continuous outcome variables Part 2 Continuous outcome.
CHAPTER 1 Exploring Data
CIS 2033 Base on text book: A Modern Introduction to
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Getting Data In and Out of R Introduction to R, Part II
CHAPTER 1: Picturing Distributions with Graphs
Welcome to AP Statistics
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Lecture 1: Descriptive Statistics and Exploratory
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Probability and Statistics
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Producing good data through sampling and experimentation
CHAPTER 1 Exploring Data
Data exploration and visualization
Presentation transcript:

Exploring Data in R Introduction to R, Part II Anna Blackstock Statistician, Biostatistics and Information Management Office (BIMO) NCEZID/DFWED

Exploratory Data Analysis Exploratory Data Analysis (EDA) is a term used to describe the process of exploring general dataset characteristics. Three realms of EDA*: Transformation Visualization Modelling *From the book “R for Data Science” (http://r4ds.had.co.nz/exploratory-data-analysis.html).

Exploratory Data Analysis There is not a specific formula for doing EDA “the right way” Goals of EDA: Checking data quality Any mistakes? Any unusual or unexpected values? Understanding distributions Investigating relationships between variables

EDA Today Will cover a few basic functions that you can use to get started with data exploration Will NOT provide in-depth instruction on EDA tools (especially data transformation—stay tuned for other courses)

Exploring Data When you first read your data into R, you will want to do a few data checks. Functions you may consider:

EDA for Categorical Variables Tables Univariate and multi-way tables Plots Bar plots, stacked bar plots Statistical tests Chi-square tests

EDA for Continuous Variables Summary statistics Mean, variance, percentiles . . . Plots Scatterplots, Box plots Statistical tests T-tests

EDA for Categorical + Continuous Variables Summary statistics Mean, variance, etc. by category Plots Box plots, bubble plots Statistical tests T-tests

Where to next? Take an online course Keep an eye out for future CDC courses See the book “R for Data Science” by Garrett Grolemund and Hadley Wickham: http://r4ds.had.co.nz/ Find an online guide to EDA