Main Themes Few vs. Many Variables Linear vs. Non-Linear Statistics vs. Machine Learning.

Slides:



Advertisements
Similar presentations
Solutions to Text book HW
Advertisements

Business Statistics - QBM117 Scatter diagrams and measures of association.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Correlation and Regression Analysis
Statistics 350 Lecture 11. Today Last Day: Start Chapter 3 Today: Section 3.8 Mid-Term Friday…..Sections ; ; (READ)
1 i247: Information Visualization and Presentation Marti Hearst Graphing and Basic Statistics.
Data Mining – Intro.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 13 1 MER301: Engineering Reliability LECTURE 13 Chapter 6: Multiple Linear.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Project Categories and Questions How to improve [Financial Metric]? Business Science What Determines Height? Government Sports How are School Districts.
1 Statistics 202: Statistical Aspects of Data Mining Professor David Mease Tuesday, Thursday 9:00-10:15 AM Terman 156 Lecture 9 = Review for midterm exam.
A P STATISTICS LESSON 3 – 2 CORRELATION.
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
Revelation – A Book of Contrasts REVELATION 1:3 TEXT.
Section 4.2 Regression Equations and Predictions.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Outline Class Intros – What are your goals? – What types of problems? datasets? Overview of Course Example Research Project.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-5 Multiple Regression.
Chartese Jones - MCIS Department - Mississippi Valley State University Mentor: Dr. Raymond Williams, Mississippi Valley State University What effect does.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Outline Class Intros Overview of Course & Series Example Research Projects Beginning R.
Time Series 1.
A Few Handful Many Time Stamps One Time Snapshot Many Time Series Number of Variables Mobile Phone Galton Height Census Titanic Survivors Stock Market.
Outline Class Intros Overview of Course Example Research Project.
Regression Analysis Part B Calculation Procedures Read Chapters 3, 4 and 5 of Forecasting and Time Series, An Applied Approach.
STAT 3610/5610 – Time Series Analysis Topics to be covered Chapters 1 – 8 Chapter 9 – Skip Chapters 10 – 12.
Unit 1, Chapter 2 Integrated Science. Unit One: Forces and Motion 2.1 Using a Scientific Model to Predict Speed 2.2 Position and Time 2.3 Acceleration.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Linear Regression Analysis Using MS Excel Tutorial for Assignment 2 Civ E 342.
1 …continued… Part III. Performing the Research 3 Initial Research 4 Research Approaches 5 Hypotheses 6 Data Collection 7 Data Analysis.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Regression and Time Series CHAPTER 11 Correlation and Regression: Measuring and Predicting Relationships.
Syllabus. We covered Regression in Applied Stats. We will review Regression and cover Time Series and Principle Components Analysis. Reference Book.
ECON 338/ENVR 305 CLICKER QUESTIONS Statistics – Question Set #8 (from Chapter 10)
Chapter 9 Correlational Research Designs. Correlation Acceptable terminology for the pattern of data in a correlation: *Correlation between variables.
1 AAEC 4302 STATISTICAL METHODS IN AGRICULTURAL RESEARCH Chapter 1.
No Yes No Yes Longitudinal & Time Series Cross-Sectional & Panel Data PEW Mobile Phone Galton Children Height Census Text Sentiment Old Faithful Web Analytics.
AP Statistics Section 4.1 A Transforming to Achieve Linearity.
Where to Get Data? Run an Experiment Use Existing Data.
1 To Loan or Not to Loan Student Coaching Notes. 2 Concepts Covered Statistics Macroeconomics Ethics.
Least Squares Regression Lines Text: Chapter 3.3 Unit 4: Notes page 58.
Monday, January 11,  INSTRUCTORS  STUDENTS:  Name?  Class?  Hometown?  Major?  Background: Math? Computers? Statistics?  Why did you take.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
Chapter 7 Linear Regression. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable.
1 Regression Line Part II Class Class Objective After this class, you will be able to -Evaluate Regression and Correlation Difficulties and Disasters.
Lecture 1 Outline: Thu, Sep 4 Introduction/Syllabus Course outline Some useful guidelines Case studies and
Chapter 9 Scatter Plots and Data Analysis LESSON 1 SCATTER PLOTS AND ASSOCIATION.
Statistics 3502/6304 Prof. Eric A. Suess Chapter 3.
Data Mining: Neural Network Applications by Louise Francis CAS Convention, Nov 13, 2001 Francis Analytics and Actuarial Data Mining, Inc.
Chapter 2.4 Paired Data and Scatter Plots. Scatter Plots A scatter plot is a graph of ordered pairs of data values that is used to determine if a relationship.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Titanic and Decision Trees Supplement. Titanic Predictions and Decision Trees Variable Selection Approaches – Hypothesis Driven – Data Driven – Kitchen.
Chapter 3 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Section Copyright © 2015, 2011, 2008 Pearson Education, Inc. Lecture Slides Essentials of Statistics 5 th Edition and the Triola Statistics Series.
Yandell - Econ 216 Chap 1-1 Chapter 1 Introduction and Data Collection.
Data Mining – Intro.
DSS & Warehousing Systems
Least Squares Regression Line.
Chapter 5 LSRL.
Chapter 3.2 LSRL.
S519: Evaluation of Information Systems
Lecture Slides Elementary Statistics Twelfth Edition
Elementary Statistics: Picturing The World
Least Squares Regression Line LSRL Chapter 7-continued
Chapter 5 LSRL.
Chapter 5 LSRL.
Lecture Slides Elementary Statistics Twelfth Edition
Descriptive Statistics Univariate Data
Cases. Simple Regression Linear Multiple Regression.
Presentation transcript:

Main Themes Few vs. Many Variables Linear vs. Non-Linear Statistics vs. Machine Learning

Number of Variables Analyzed Week Regression Correlation

Number of Variables / Observations A few variables vs. many variables A few observations vs. a ton of variables

Few Yes No Many Time Series Number of Variables PEW Mobile Phone Galton Children Height Census Text Sentiment Old Faithful Web Analytics Titanic Survivors Bank Loans Stock Market

Principle Components Analysis

People Variables Cluster Analysis

Linear Regression

Compare and Contrast

Tom Brady Not Tom Brady Machine Learning

Excel

R and Data Mining

Intermediate Book We will use the front part of chapters that are more conceptual, along with some of the case studies.