Ivan Ramler St. Lawrence University Canton, New York A Guitar Hero Based Project in Mathematical Statistics.



Outline
1. Background and learning goals
2. Description of project
3. Some student-based results
4. Tips for implementing in your classroom

Background of Students
Mathematical Statistics course
  Estimation
  Hypothesis testing
  Introduction to the bootstrap
Upper-level undergraduates
  Mostly senior math majors/stat minors
  All have had Probability
  Most have had a second applied statistics course
Exposure to R helps
  Students learn some R during the Math Stat course as well

What is Guitar Hero?
Series of popular video games for many platforms
Players use a plastic guitar controller to play notes as they scroll across the screen
Players receive points for hitting notes correctly
While playing, I raised the question “Are missed notes occurring at random?” and the challenge “If not, prove it.”

Learning Goals
Develop statistical thinking to answer a research objective
Improve R programming skills
Learn how to evaluate and compare competing methodologies through simulation experiments
Have fun while learning some interesting statistics!

Steps to the Project
This was a semester-long project with the following parts.
1. Developing an “intuitive” estimator/methodology
2. Programming the method in R
3. Collecting data
4. Developing a hypothesis test
5. Evaluating the power of the methodology
6. Writing up the results

Step 1: Develop an Estimator
Devise an “estimator” that they believe is useful in answering: “Are missed notes occurring at random?”
  3 practice songs to visualize data and test the estimator
  Describe WHY their estimator might be useful
Assigned the second day of class
  Two weeks to work on it
  Students worked with a partner
Provided an example: the maximum number of consecutive misses in a song (Cmax)

Step 1: Practice Songs
[Table with columns Note, Song A, Song B, Song C; the values were not preserved in this transcript]

Step 2: R Programming
Write an R function that, given a song represented by a vector of 0’s and 1’s, returns a numerical value
  This will be their test statistic for future parts
  Students have the R code for Cmax to modify
Assigned shortly after Step 1 is due
  Have one week to struggle by themselves
  Then they sign up for a time with me to go over it with them
  Need to give me their attempted code the day before we meet
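To make the Cmax example concrete, here is a minimal sketch of such a function. It is written in Python rather than the R used in the course, and it is not the instructor's code. It assumes a 1 marks a missed note and a 0 marks a hit, which the slides do not state; the name `cmax` and the example song are made up for illustration.

```python
from itertools import groupby


def cmax(song):
    """Longest run of consecutive misses in a song.

    Assumption: 1 = missed note, 0 = correctly played note.
    """
    # groupby splits the song into runs of equal values; keep the lengths of miss runs.
    miss_runs = [sum(1 for _ in run) for value, run in groupby(song) if value == 1]
    return max(miss_runs, default=0)


# Hypothetical 10-note song with miss runs of length 2, 3, and 1.
example_song = [0, 1, 1, 0, 0, 1, 1, 1, 0, 1]
print(cmax(example_song))
```

Any other statistic a student proposes can keep the same shape: a function that takes the 0/1 vector and returns a single number.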

Step 3: Collecting Data
Took a Friday afternoon for an optional data collection event
  Played Guitar Hero on a projector screen
Showed them the difficulties of collecting accurate data
  By hand… no one could keep up
  Webcams!
Stressed the importance of simulated data

Step 4: Hypothesis Testing
H0: Notes are missed completely at random
HA: Notes are not missed completely at random
Tested these hypotheses using the two (successfully) collected real songs
  Hungry Like the Wolf, Duran Duran (expert player)
  Ring of Fire, Johnny Cash (beginning player)
Obtain a bootstrapped p-value
One week to obtain p-values
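The slides do not say exactly how the bootstrapped p-value was computed, so the following is only one plausible sketch, in Python rather than the course's R. It assumes 1 = miss and that large values of the statistic are evidence against H0. It resamples the song's notes with replacement, which keeps the miss proportion roughly fixed but destroys any ordering, and it adds 1 to the numerator and denominator, a common convention that keeps the p-value above zero. The function names and the example song are hypothetical.

```python
import random
from itertools import groupby


def cmax(song):
    """Longest run of consecutive misses (assumes 1 = miss, 0 = hit)."""
    runs = [sum(1 for _ in run) for value, run in groupby(song) if value == 1]
    return max(runs, default=0)


def bootstrap_p_value(song, statistic, n_boot=2000, seed=0):
    """Upper-tailed bootstrap p-value for H0: notes are missed completely at random."""
    rng = random.Random(seed)
    observed = statistic(song)
    at_least_as_extreme = 0
    for _ in range(n_boot):
        # Draw notes with replacement: same length, similar miss rate, no structure.
        resampled = rng.choices(song, k=len(song))
        if statistic(resampled) >= observed:
            at_least_as_extreme += 1
    return (at_least_as_extreme + 1) / (n_boot + 1)


# Hypothetical song in which all 10 misses occur in one streak.
clustered_song = [0] * 80 + [1] * 10 + [0] * 80
p_value = bootstrap_p_value(clustered_song, cmax)
print(f"bootstrap p-value: {p_value:.4f}")
```

A small p-value here says that a streak as long as the observed one almost never appears when the same notes are shuffled into random order.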

Step 5: Power Study
Learn that statistical power can be more than just a simple calculation from a Normal distribution
  Using Monte Carlo methods
11 scenarios
  Short songs (200 notes) and longer songs (600 notes)
  Various correlation structures
Random sequences of 0’s and 1’s (sampling under H0)
  Used to estimate the Type I error rate
Two days to finish the study
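A Monte Carlo power study of this kind can be sketched as follows, in Python rather than the course's R. The slides do not describe the 11 scenarios or their correlation structures, so the "sticky" Markov-chain generator below is purely an illustrative assumption, as are the function names, the miss rate of 0.1, and the small simulation sizes (chosen so the sketch runs quickly). It again assumes 1 = miss and uses Cmax as the test statistic.

```python
import random
from itertools import groupby


def cmax(song):
    """Longest run of consecutive misses (assumes 1 = miss, 0 = hit)."""
    runs = [sum(1 for _ in run) for value, run in groupby(song) if value == 1]
    return max(runs, default=0)


def simulate_song(n_notes, miss_rate, stickiness, rng):
    """Hypothetical generator: each note repeats the previous outcome with
    probability `stickiness`, otherwise it is a fresh Bernoulli(miss_rate) draw.
    stickiness = 0 gives independent notes, i.e. sampling under H0."""
    song = [1 if rng.random() < miss_rate else 0]
    for _ in range(n_notes - 1):
        if rng.random() < stickiness:
            song.append(song[-1])
        else:
            song.append(1 if rng.random() < miss_rate else 0)
    return song


def bootstrap_p_value(song, statistic, n_boot, rng):
    """Upper-tailed p-value from resampling the notes with replacement."""
    observed = statistic(song)
    exceed = sum(
        statistic(rng.choices(song, k=len(song))) >= observed
        for _ in range(n_boot)
    )
    return (exceed + 1) / (n_boot + 1)


def rejection_rate(n_notes, miss_rate, stickiness,
                   n_songs=100, n_boot=100, alpha=0.05, seed=1):
    """Fraction of simulated songs for which H0 is rejected at level alpha."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_songs):
        song = simulate_song(n_notes, miss_rate, stickiness, rng)
        if bootstrap_p_value(song, cmax, n_boot, rng) <= alpha:
            rejections += 1
    return rejections / n_songs


# Short song (200 notes): independent misses estimate the Type I error rate,
# sticky misses estimate power under one made-up alternative.
type1_rate = rejection_rate(200, 0.1, stickiness=0.0)
power = rejection_rate(200, 0.1, stickiness=0.7)
print(f"estimated Type I error rate: {type1_rate:.2f}")
print(f"estimated power: {power:.2f}")
```

Swapping `cmax` for another statistic and repeating the two calls is one way to compare competing methods under the same scenario.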

Step 6: Written Report
Technical report concerning all aspects of the project
  Professional manuscript style
  LaTeX encouraged/preferred
  About 4 pages, excluding R code attached in an appendix
Rough draft due mid-April
Final draft due last day of class

Some Results: Students’ Estimators
Number of pairs of missed notes in a song
  n consecutive misses counted as n-1 pairs
  E.g., 3 misses in a row = 2 pairs
Variance of number of misses in a partitioned song
  Split the song into (approximately) equal-length sections
  Find the variance of the number of misses per section, “Var(X)”
  Under H0, Var(X) = Kp(1-p)
    K is the length of the section
    p is the overall proportion of missed notes in the song
  Under HA, Var(X) > Kp(1-p)
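Both estimators are simple to compute. The sketch below is in Python rather than the course's R and is not the students' code. It assumes 1 = miss, uses the sample variance (the slides do not say which variance was used), and takes K as the average section length. The function names and the 12-note example are hypothetical. The benchmark Kp(1-p) is the variance of a Binomial(K, p) count, which is what the number of misses in a section would follow if notes were missed independently with probability p.

```python
from statistics import variance


def count_miss_pairs(song):
    """Number of adjacent (miss, miss) pairs; n misses in a row give n - 1 pairs.

    Assumption: 1 = missed note, 0 = hit.
    """
    return sum(1 for a, b in zip(song, song[1:]) if a == 1 and b == 1)


def section_miss_variance(song, n_sections):
    """Return (observed variance of misses per section, binomial benchmark K*p*(1-p))."""
    n = len(song)
    # Section boundaries that split the song into approximately equal lengths.
    bounds = [i * n // n_sections for i in range(n_sections + 1)]
    misses = [sum(song[lo:hi]) for lo, hi in zip(bounds, bounds[1:])]
    observed = variance(misses)   # sample variance across sections
    k = n / n_sections            # (average) section length K
    p = sum(song) / n             # overall proportion of missed notes
    return observed, k * p * (1 - p)


# Hypothetical 12-note song split into 3 sections of 4 notes.
song = [0, 1, 1, 1, 0, 0, 1, 1, 0, 0, 0, 0]
pairs = count_miss_pairs(song)
observed_var, expected_var = section_miss_variance(song, 3)
print(pairs, observed_var, expected_var)
```

In this toy song the observed variance exceeds the benchmark, the direction the slide associates with HA.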

“Best” Students’ Estimator
“Carrying Value”
  Assigns different values to misses during a song
  Each miss gets a flat value of 2 + Carrying Value
  Carrying Value = 4 – number of consecutive correct notes played between misses
  Bounded between 0 and 4
Results
  Great power for nearly all scenarios
  Appropriate Type I error rates, which were a problem for many other methods
  Hungry Like the Wolf (expert player): Reject H0
  Ring of Fire (beginning player): Fail to reject H0
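The slide leaves some details of the Carrying Value statistic open, so the following Python sketch (the course itself used R) is only one possible reading, not the students' implementation. It assumes 1 = miss, that the statistic is the sum of the per-miss values, and that the first miss in a song, which has no earlier miss to measure from, gets a carrying value of 0. The function name and parameter names are made up.

```python
def carrying_value_statistic(song, flat_value=2, max_carry=4):
    """Sum over misses of flat_value + carrying value (one reading of the slide).

    Carrying value = max_carry minus the number of correct notes since the
    previous miss, bounded below by 0. Assumption: 1 = miss, 0 = hit.
    """
    total = 0
    hits_since_miss = None  # stays None until the first miss has been seen
    for note in song:
        if note == 1:
            if hits_since_miss is None:
                carry = 0  # assumption: the first miss carries nothing
            else:
                carry = max(0, max_carry - hits_since_miss)
            total += flat_value + carry
            hits_since_miss = 0
        elif hits_since_miss is not None:
            hits_since_miss += 1
    return total


# Three misses in a row score higher than three misses spread far apart.
clustered = carrying_value_statistic([1, 1, 1])
spread_out = carrying_value_statistic([1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1])
print(clustered, spread_out)
```

Under this reading, misses that follow closely after another miss are weighted more heavily, so the statistic grows when misses cluster.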

Optional Components
Extra credit given to students who presented a poster at the annual “Festival of Science”
  Two posters, 3 people per poster
  Split estimators into two groups
  Presenters had to be able to give basic descriptions of other people’s methods
All six presenters enjoyed this
  Felt they better understood the project
  Especially the power study and using it to evaluate methods

Implementing in your Classroom
Use partners!
  8 groups were about as many as I could handle
  Could easily have made groups of 3+ and required at least 2 estimators
Good tool for having them learn R programming and the bootstrap on a more independent basis
Be open about how not all methods will work well
  Relay to them that research is an ongoing process without a clear/correct answer at all times
  They were only graded on their ability to convey the ideas and evaluate them in an objective way

Things for Next Time
Require all students to participate in the Festival of Science poster session
Make each group devise at least 2 estimators (many of the better students already had)
Shrink the power study and make them focus on why these scenarios were used
Give some type of prize (gift certificate for a pizza?) for the “best” estimator
  While still making sure the project isn’t graded on how well the estimator works

Any Questions? Thank You!