Homework 2 Responses and Issues

Slides:



Advertisements
Similar presentations
Bankfull / Effective / Dominant
Advertisements

UNLOCKING THE SECRETS HIDDEN IN YOUR DATA Part 3 Data Analysis.
E(X 2 ) = Var (X) = E(X 2 ) – [E(X)] 2 E(X) = The Mean and Variance of a Continuous Random Variable In order to calculate the mean or expected value of.
BPT2423 – STATISTICAL PROCESS CONTROL
Weather and Climate. Weather Weather: a set of all the phenomena occurring in a given atmosphere at a given time Weather is short-term. What is it like.
What z-scores represent
MOS Performance MOS significantly improves on the skill of model output. National Weather Service verification statistics have shown a narrowing gap between.
by B. Zadrozny and C. Elkan
Testing Theories: Three Reasons Why Data Might not Match the Theory Psych 437.
Introduction to inference Use and abuse of tests; power and decision IPS chapters 6.3 and 6.4 © 2006 W.H. Freeman and Company.
OARE Module 5A: Scopus (Elsevier). Table of Contents About Scopus (Elsevier) Using Scopus Search Page Results/Refine Search Pages Download, PDF, Export,
Awk Dr. Tran, Van Hoai Faculty of Computer Science and Engineering HCMC Uni. of Technology
Scientific Inquiry Learning Goal: We will be able to validate observations as real or inferences.
By Andrea Shelley, Dietetic Intern.  Obesity Epidemic  Leads to: ◦ Cardiovascular Disease ◦ Type II Diabetes ◦ Respiratory Disease ◦ Joint Damage 
Excel: Pivot Tables Exploring Computer Science Lesson Supplemental.
Histograms: Summarizing Data 1. SUMMARIZING DATA Raw data, such as company records, often contains a wealth of information that would be of use in making.
Count and add list of numbers From user input and from file.
Homework 3 Responses This was meant to give you practice with data management/mining given a non ideally formatted input data structure.
Searching and Sorting. Why Use Data Files? There are many cases where the input to the program may come from a data file.Using data files in your programs.
Lecture 6 Complex Sorting 1. Announcements Homework 3 due Monday No participation this week Test is on Thursday Part of Wednesday will be review I will.
Dr Mark Cresswell Scenarios of the Future 69EG6517 – Impacts & Models of Climate Change.
10.11 Data Manipulation 3 Reports. You will need… Specimen 2007 Paper 2 Task C (a PDF) q 43 Read this question carefully before you start The database.
16-Dec-15Advanced Programming Spring 2002 sed and awk Henning Schulzrinne Dept. of Computer Science Columbia University.
UNIX commands Head More (press Q to exit) Cat – Example cat file – Example cat file1 file2 Grep – Grep –v ‘expression’ – Grep –A 1 ‘expression’ – Grep.
Awk- An Advanced Filter by Prof. Shylaja S S Head of the Dept. Dept. of Information Science & Engineering, P.E.S Institute of Technology, Bangalore
MDFP Mathematics and Statistics 1. Univariate Data – Today’s Class 1.STATISTICS 2.Univariate (One Variable) Data 1.Definition 2.Mean, Median, Mode, Range.
Programming Languages Meeting 12 November 18/19, 2014.
Statistics & Evidence-Based Practice
Frequency Distributions
Lecture 14 Programming with awk II
Statistics in SPSS Lecture 3
SUR-2250 Error Theory.
The Marketing Research Process
Exploring Computer Science Lesson Supplemental
Integration of Technology Across the Curriculum Using PowerPoint & the Internet Dr. Rayton Sianjina, Ph.D.
Statistics for the Social Sciences
NAC flat fielding and intensity calibration
The Marketing Research Process
Alternating Current – Learning Outcomes
Introduction to Statistics
Significant Figures Significant figures are those numbers considered to be important in value All non-zero digits are significant Zeros are NOT always.
Lesson 13 - Cleaning Data Lesson 14 - Creating Summary Tables
Interested in our Clinical Encounters: Pain?
Meta-Analytic Thinking
Tools of Environmental Science
UNIT 2 MECHANICS CHAPTER 4 KINEMATICS.
Bringing Organism Observations Into Bioinformatics Networks
MGT 362 Competitive Success/snaptutorial.com
MGT 362 Education for Service-- snaptutorial.com
MGT 362 Teaching Effectively-- snaptutorial.com
Physics 114: Lecture 6 Measuring Noise in Real Data
The Marketing Research Process
Lab 2 Data Manipulation and Descriptive Stats in R
Everyone thinks they know this stuff
Homework 1 Responses and Issues
Investigating associations between categorical variables
Appetizer: Statistical methods in XSPEC
Manipulating and Sharing Data in a Database
Slide Source: Caroline van Schaik
Lesson 13 - Cleaning Data Lesson 14 - Creating Summary Tables
Covalent Bonds.
P.O.D. # 32 2/4/19.
Pod # 32 February 6, 2019.
Statistical Thinking and Applications
Homework Check.
The Nature of Science.
2.1 Inquiry and the Scientific Method
Using Veera with R and Shiny to Build Complex Visualizations
Rony Azouz, Charles M. Gray  Neuron 
Homework Check.
Presentation transcript:

Homework 2 Responses and Issues This was a good exercise in real data mining – while your were given a structured journey you can clearly see how you can interrogate data in open ended ways – a future exercise will involve you finding weather data across the country and find evidence for climate change in that data using any exploratory analysis that you like. This is a potentially publishable result.

The raw data is messy – can be handled with JSON tools but this is not necessary; You just want to sort, by any means necessary, this data to pull out the elevation number

Unix Shell The number of fields per line is not constant Care about elevation which is always in field 12 or greater awk ‘{print $12 FS $13 FS $14 FS $15 FS $ 16 FS }’ data.txt > new.txt Now I have an easier file to work with; can test for presence of a number

Using JSON Tools

See Learning Occurs

Interrogating the data – verification is important

Do this visually!

Interrogated with Fortran

Is the one peak statistically significant? Is increase since 1990 significant? Without the peak average bin count = 36 +/- 6 Peak bin = 52 +/- 7.2 Sigma = (52-36)/sqrt(6*6 +7.2*7.2) = 16/9 = NO

USE the Z-test http://www.stat.yale.edu/Courses/1997-98/101/meancomp.htm

Maybe it’s a sine wave? Reality check

Droughts – much variance

Cyclical or Systematic? Z-test = 4.61 sigma!

When to get married outside?

Science

Panoply