STAT 101: Data Analysis and Statistical Inference Professor Kari Lock Morgan

Introduction to Data
STAT 101: Day 1, 1/11/12
Syllabus, Course Overview
Why Statistics?
Data
Cases and Variables
Categorical and Quantitative Variables
Section 1.1

Sakai
Course Website:
Syllabus available online
Lecture slides available online

Required Course Materials
Textbook: Statistics: Unlocking the Power of Data by Lock, Lock, Lock Morgan, Lock, and Lock
  – To be handed out in lab tomorrow
Clicker: i>clicker
  – Available at the bookstore, Amazon, or from previous students
  – $43 at the bookstore, $20 used on Amazon
  – Need by 1/30
Calculator
  – basic calculator is fine
  – need a non-cell phone calculator

Support
My Office Hours (in Old Chemistry 216):
  – 3 – 5 pm Wednesday
  – 1 – 3 pm Friday
  – or by appointment
Statistics Education Center:
  – 4 – 9 pm Sunday – Thursday in Old Chem 211A or your

Grade Breakdown
Clicker Questions          10%
Homework                   15%
Projects (2 × 10%)         20%
Midterm Exams (2 × 15%)    30%
Final Exam                 25%
Grades ≥ 90 are guaranteed at least an A-
Grades ≥ 80 are guaranteed at least a B-
Grades ≥ 70 are guaranteed at least a C-
Grades ≥ 60 are guaranteed at least a D-
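To see how the weights in the grade breakdown combine, here is a minimal Python sketch. The function names and the example scores are hypothetical, chosen only for illustration, and the sketch assumes each component is scored on a 0 to 100 scale, which the slides do not state.

```python
# Weights from the Grade Breakdown slide, as fractions of the course grade.
WEIGHTS = {
    "clickers": 0.10,
    "homework": 0.15,
    "projects": 0.20,  # 2 projects at 10% each
    "midterms": 0.30,  # 2 midterm exams at 15% each
    "final": 0.25,
}


def course_grade(scores):
    """Weighted average of component scores (assumed to be on a 0-100 scale)."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


def guaranteed_minimum(grade):
    """Lowest letter grade guaranteed by the cutoffs on the slide, or None."""
    for cutoff, letter in [(90, "A-"), (80, "B-"), (70, "C-"), (60, "D-")]:
        if grade >= cutoff:
            return letter
    return None


# Hypothetical component averages, not real student data.
example = {"clickers": 100, "homework": 90, "projects": 85, "midterms": 80, "final": 88}
total = course_grade(example)
print(round(total, 1), guaranteed_minimum(total))
```

The cutoffs are floors, not exact boundaries: the slide guarantees "at least" each letter, so the actual letter grade could be higher.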

Clickers
You need to purchase an i>clicker
Clicker grading will begin 1/30
Review “Quiz” Questions:
  – Credit only for answering correctly
  – Goal: motivate you to keep up with the material
New Questions:
  – Credit simply for clicking in
  – Goal: motivate you to think actively about new material as it is being presented

Class Year What is your class year? (a) First-Year (b) Sophomore (c) Junior (d) Senior (e) Graduate student

Major Your primary major (or potential future major) best falls under the category… (a) Natural Sciences (b) Arts and Humanities (c) Social Sciences (d) Math/Statistics/CS (e) Other

Homework
Weekly homework due
Collaboration and discussion encouraged, but write-up must be done on your own (no copying)
Point of homework:
  – to LEARN!
  – to make sure you are keeping up with the material
  – to prepare you for projects and exams
Graded problems and practice problems
Grading
  – Graded on a 10 point scale
  – Lowest homework grade dropped
  – Penalties for late homework

Projects
Project 1
  – individual
  – confidence intervals, hypothesis tests
  – written report up to 5 pages in length
Project 2
  – group
  – regression
  – 10 minute presentation
  – written report up to 10 pages in length

Exams
Midterm Exam 1: 2/22 and 2/23
Midterm Exam 2: 3/27 and 3/28
In-Class Portion:
  – Closed book: only allowed a calculator and one page of notes prepared only by you
In-Lab Portion:
  – Open book: allowed any materials (including computer) except communication with other humans
Final Exam: 4/30, 9 – 12 pm
SAVE THESE DATES!

Labs
Labs are on Thursday in Old Chem 01
Learn how to use statistical software – RStudio
Familiarity with the software will be necessary for homework, projects, and exams
You should have signed up for a section:
  8:45 – 9:35 am (Jessica Feldman) (new section!)
  10:20 – 11:10 am (Yue Jiang)
  11:55 – 12:45 pm (Yue Jiang)
  1:30 – 2:20 pm (Michael McCreary)
  3:05 – 3:55 pm (Christine Cheng)
I need your Gmail address to set up an account

Keys to Success
1. COME TO CLASS! Come to class on time and ready to pay attention and think.
2. COME TO LAB! Attend every lab, and spend time in lab working on statistics.
3. DO THE HOMEWORK! Try the homework first by yourself, get help where needed, and make sure you understand all the problems by the time you turn it in.
4. Start the projects early and allow adequate time for working on them.
5. Give yourself time to prepare a good cheat sheet for exams. Use this preparation to go through the material, and take time to review concepts you don’t understand.
6. Do lots of practice problems.
7. Stay on top of the material. Clear up confusion as it occurs.

Why Statistics?
Statistics is all about DATA
  – Collecting DATA
  – Describing DATA – summarizing, visualizing
  – Analyzing DATA
Data are everywhere!
Regardless of your field, interests, lifestyle, etc., you will almost definitely have to make decisions based on data, or evaluate decisions someone else has made based on data.

Data
Data are a set of measurements taken on a set of individual units.
The individual units that measurements are taken on are known as cases.
One measurement collected across all the cases is known as a variable.
Usually data are stored and presented in a dataset, where each row represents one case, and each column represents one variable.
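The row-and-column layout described above can be sketched in Python. This is only an illustration (the course software is RStudio); the four rows are the first two rows of each group from the Diet Coke and Calcium slide, and the name `dataset` is a hypothetical choice.

```python
# Each dict is one case (one row); each key is one variable (one column).
dataset = [
    {"Drink": "Diet cola", "Calcium Excreted": 50},
    {"Drink": "Diet cola", "Calcium Excreted": 62},
    {"Drink": "Water", "Calcium Excreted": 48},
    {"Drink": "Water", "Calcium Excreted": 46},
]

n_cases = len(dataset)              # number of rows
variables = list(dataset[0].keys()) # column names

# One variable is one measurement collected across all the cases.
calcium = [case["Calcium Excreted"] for case in dataset]

print(n_cases, variables, calcium)
```

Reading down a column gives a variable, and reading across a row gives everything measured on a single case.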

Countries of the World
Variables (columns): Country, Land Area, Population, Rural, Health, Internet, Birth Rate, Life Expectancy, HIV
Cases (first rows): Afghanistan, Albania, Algeria, American Samoa, Andorra, Angola, Antigua and Barbuda, Argentina, Armenia

Intro Statistics Survey Data

Diet Coke and Calcium
Drink        Calcium Excreted
Diet cola    50
Diet cola    62
Diet cola    48
Diet cola    55
Diet cola    58
Diet cola    61
Diet cola    58
Diet cola    56
Water        48
Water        46
Water        54
Water        45
Water        53
Water        46
Water        53
Water        48
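As one way of describing these data, the sketch below computes the mean calcium excreted for each drink, using the sixteen values from the slide. The slides do not say which summary the course uses here, so the choice of the mean and the variable names are assumptions made for illustration.

```python
from statistics import mean

# Calcium excreted, grouped by drink (values copied from the slide).
calcium = {
    "Diet cola": [50, 62, 48, 55, 58, 61, 58, 56],
    "Water": [48, 46, 54, 45, 53, 46, 53, 48],
}

# Mean of each group, and the difference between the two group means.
group_means = {drink: mean(values) for drink, values in calcium.items()}
difference = group_means["Diet cola"] - group_means["Water"]

print(group_means, difference)
```

A difference in sample means is only a description of these sixteen cases; deciding whether it reflects a real effect is a question for the inference methods covered later in the course.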

Data
US News and World Report National University Rankings
Republican Presidential Nomination Polls
Duke Basketball
Hybrid Cars
Stock Market
Unemployment Rate
Antidepressants and Alzheimer’s

Data Applicable to You Think of a potential dataset (it doesn’t have to actually exist) that you would be interested in analyzing What are the cases? What are the variables?

Kidney Cancer Counties with the highest kidney cancer death rates. Source: Gelman et al., Bayesian Data Analysis, CRC Press.

Kidney Cancer Counties with the lowest kidney cancer death rates. Source: Gelman et al., Bayesian Data Analysis, CRC Press.

Kidney Cancer If the values in the kidney cancer dataset are rates of kidney cancer deaths, then what are the cases? (a) The people living in the US (b) The counties of the US

Kidney Cancer If the values in the kidney cancer dataset are yes/no, then what are the cases? (a) The people living in the US (b) The counties of the US

Variables: Categorical vs Quantitative
A categorical variable divides the cases into groups, placing each case into exactly one of two or more categories.
A quantitative variable measures or records a numerical quantity for each case.
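The distinction can be illustrated with a rough Python heuristic that labels a variable by the type of its values. The function `variable_type` is hypothetical, and the example values are taken from the Diet Coke and Calcium slide. The rule is only an approximation: numbers used as labels, such as zip codes, are numeric but still categorical.

```python
def variable_type(values):
    """Label a variable 'quantitative' if every value is a number, else 'categorical'.

    A rough heuristic only: numeric codes that act as labels would be
    misclassified as quantitative.
    """
    is_numeric = all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in values
    )
    return "quantitative" if is_numeric else "categorical"


drink = ["Diet cola", "Diet cola", "Water", "Water"]  # groups the cases
calcium_excreted = [50, 62, 48, 46]                   # numerical quantity per case

print(variable_type(drink), variable_type(calcium_excreted))
```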

Categorical    Quantitative

Kidney Cancer If the cases in the kidney cancer dataset are people, then the measured variable is… (a) Categorical (b) Quantitative

Kidney Cancer If the cases in the kidney cancer dataset are counties, then the measured variable is… (a) Categorical (b) Quantitative

Let’s Collect Some Data! QUESTION: If you are romantically interested in someone, should you be obvious about it, or should you play hard to get? Using Data to Answer a Question

Romance Which type of person are you generally more romantically interested in? (a) Someone who is obviously into you (b) Someone who plays hard to get

Romance MALES ONLY: Which type of person are you generally more romantically interested in? (a) Someone who is obviously into you (b) Someone who plays hard to get

Romance FEMALES ONLY: Which type of person are you generally more romantically interested in? (a) Someone who is obviously into you (b) Someone who plays hard to get

One or Two Variables
Sometimes we are interested in one variable, as in whether people prefer obvious romantic interest or hard to get.
Other times we are interested in the relationship between two variables, such as
1) prefer obvious interest or hard to get?
2) gender

What do you want to know?
We’ll do a class survey, collecting data you are interested in.
1. What data do you want to collect from your peers?
  – One variable or a relationship between two variables?
  – What are the variables?
  – Are they categorical or quantitative?

What do you want to know?
2. Write a question to measure each variable of interest.
  – Write questions so the resulting data will be accurate and easy to analyze.
  – Quantitative variable? Give units.
  – Categorical variable? Make multiple choice and give the possible categories (no more than 5).
  – Be clear and specific.

Summary
Data are everywhere, and pertain to a wide variety of topics.
A dataset is usually comprised of variables measured on cases.
Variables can be categorical or quantitative.
Data can be used to provide information about essentially anything we are interested in and want to collect data on!

Course Objectives
An understanding of the importance of data collection, the ability to recognize limitations in data collection methods, and an awareness of the role that data collection plays in determining the scope of inference.
The ability to use technology to summarize data numerically and visually, and to perform straightforward data analysis procedures.
A solid conceptual understanding of key concepts such as the logic of statistical inference, estimation with intervals, and testing for significance.
The ability to understand and think critically about data-based claims.
The knowledge of which statistical methods to use in which situations, the technological expertise to use the appropriate method(s), and the understanding necessary to interpret the results correctly, effectively, and in context.
An awareness of the power of data.

To Do
Add your Gmail address to the Google Doc if you haven’t already
Buy a clicker if you don’t already have one
Be on the lookout for data and data-based claims – they are everywhere!