Football for KMS: NFL ‘01 Abhijit Kumar Kaijia Bao Vishal Rupani APRIL 30 TH 2008 Course Instructor: Prof. Hsinchun Chen.

Slides:



Advertisements
Similar presentations
Anthony Wayne Lacrosse
Advertisements

Data Mining in Computer Games By Adib Adam Hussain & Mohammed Sarfraz.
Select Team Man Up Offense: Big Red. Big Red: Level of Play Big Red is the most basic man up available in the curriculum and every team should be able.
Fantasy Football Forecasting
Predicting the Winner of an NFL Football Game Matt Gray CS/ECE 539.
Berlin Miami Welcome. Berlin Miami Agenda 1.Briefing 2.Background 3.Targets 4.Targetgroup 5.SWOT 6.Strategy 7.Measures 8.Conclusion.
Introduction Offensive strategies of the National Football League have seemingly shifted towards a “West Coast” style offense, relying more heavily on.
WFL (Wilburn Football League) Football 101. Table of Contents What is Football? –History and ETHS Legends What is the WFL? –Basic Rules and terminology.
Fantasy Football : NFL Score Predictor Matt Grecco Abhishek Goyal.
Fantasy Football By: Carson Barnette. Fantasy Football Fantasy football is a game based on the performance of NFL players How do you play? What does this.
Knowledge Management for UEFA Champions League By Harsha Gunnam Hetal Mehta Nargis Memon Manish Wadhwa.
1 Lecture 5: Automatic cluster detection Lecture 6: Artificial neural networks Lecture 7: Evaluation of discovered knowledge Brief introduction to lectures.
Using Rubrics for Evaluating Student Learning Office of Assessment and Accreditation Indiana State University.
SLIDE 1IS 257 – Fall 2008 Data Mining and the Weka Toolkit University of California, Berkeley School of Information IS 257: Database Management.
Click to highlight each section of the article one by one Read the section, then click once to view the description of it If you want to read it, you.
The National Curriculum The national curriculum for physical education aims to ensure that all pupils: develop competence to excel in a broad range of.
-- Mohammed Ayub. References Sports Data Mining- Springer by Robert P. Schumaker, Osama K. Solieman, Hsinchun Chen
Simple Linear Regression NFL Point Spreads – 2007.
Emily Johnson.  Field Positions  Basics  Lineouts  Scrums  Rucks  Kickoffs & Running the Ball OUTLINE.
Solve for y when x = 1, 2, 3 and 4. 1.) y = x ) y = 5x 4 3.) y = 3x Solve for y when x is -2, -1, 0, 1. Patterns and Functions Day 2.
Eric Huggins, Ph.D. Fort Lewis College Durango, CO.
April 11, 2008 Data Mining Competition 2008 The 4 th Annual Business Intelligence Symposium Hualin Wang Manager of Advanced.
Football Quiz Game What You Know About The Beautiful Game Of Football Group Members : Ricardo Davis Jermaine Thompson Sanjay Douglas.
The Concept of Fantasy Football By Eric VanRemortel.
8 Station Rotation for football Teams A circuit of drills to help football teams reach their goals Logan W, Black B.
Middle Eastern Women 2005 Optional Business Computer Skills Review Paula Ecklund.
Writing research proposal/synopsis
NFL Play Predictions Will Burton, NCSU Industrial Engineering 2015
SCIENTIFIC INVESTIGATION AND PROCESS Open up to “Scientific Process” NOTE BLANKS 1)Read the introduction 2)Fill in the Scientific Process Skills using.
Sykesville Raiders Coaching Clinic Overview & Defense.
Football Made Easy UNDERSTANDING THE GAME. Offensive Player Positions  Center (C): first person to touch the ball and he snaps to the Quarterback (QB);
April Status Provide Rich Data Set Consistent Imagery Play Metadata Ground Truth Player Positions.
BY SANDY. WHAT IS DATAMINING TYPES OF DATAMINING TOOLS OVERVIEW OF TIBCO TIBCO SPOTFIRE MINER DATA ANALYSIS EXPLORE DATA MANIPULATE DATA CHART VIEW.
An Investigation of Commercial Data Mining Presented by Emily Davis Supervisor: John Ebden.
Select Team Offense: Special Plays (14 Double). 14 Double: Level of Play This play is suited for teams of all skill levels. “14 Double” has very simple.
EA Sports Madden NFL: Football by the Numbers Educator Companion Guide.
FOOTBALL. Football Gameplay Positions-Offense Positions-Defense Field Red is input.
PREDICTING WHICH POSITION AN NFL PLAYER SHOULD BE ON THE VIDEO GAME MADDEN. Forrest Lamp & Matt Nord.
Communicating Marketing Research Findings
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
SnowflakeFootball! 1.All teams need ATLEAST 8 players 2.1 st games on Monday, 11/23 and Tuesday, 11/24 3.Game will be a forfeit if the team plays with.
Integer Football Welcome to Integer Football!
 Football is an inspiration for me. It gives me things to look forward to and develops a good character.
Chapter 1: The Nature of Science Section 1: What is Science?
F OOTBALL. D ID T HAT J UST H APPEN ? &feature=related.
Gameplay Positions-Offense Positions-Defense Field.
MIS 480/580 Final Project Presentation Knowledge Management in Cricket – A Research Project By: Luis Barreda Deepika Nim Jagadish Ramamurthy James Sanford.
PRESENTATION TO MIS480/580 GABE HAZLEWOOD JOSH HOTTENSTEIN SCOTTIE WANG JAMES CHEN MAY 5, 2008 Betting in Super Bowl match ups.
ROBERT MORRIS UNIVERSITY FOOTBALL Formations / Motions / Shifts.
TABLES Why Make a Table Steps to Making a Table Rows Vs. Colums Resources By: Morgan Jahnke 2 nd Grade Math.
Measuring NFL team performance by quarterback stats A neural networks approach By David Michlig EC 539.
Lopez – MA 276 MA 276: Sports and statistics Lecture 5: NFL, expected points, game theory 0.
PLAYER OWNERSHIP Why low player ownership has value in tournaments.
EDUCAUSE Annual Conference
Fantasy football.
An analysis of the nfl runningback
Who Is The Best Player In Major League Baseball?
Worthing College Sports Science Liam Lee 2015
NBA Draft Prediction BIT 5534 May 2nd 2018
Data Mining: Concepts and Techniques Course Outline
Using Neural Networks to Determine NFL Game Outcomes
Artificial Neural Networks for the NFL Draft
Game Theory II Solutions 1
The Game of Football.
Game Theory II Solutions 1
Kaytee Bartley DSCI 101 Fall 2018
Jesse Ohmer DSCI 101 Section 2
NFL Quarterbacks HUNTER king.
Sample Presentation – Mr. Linden
March Madness Data Crunch Overview
Presentation transcript:

Football for KMS: NFL ‘01 Abhijit Kumar Kaijia Bao Vishal Rupani APRIL 30 TH 2008 Course Instructor: Prof. Hsinchun Chen

Agenda Data Collection Client Relations Final Presentation Knowledge Discovery Statistical Analysis Data Mining Techniques Key Findings KMS Demonstration ABHI VISHALKAI Objectives Literature Overview Conclusion Data Cleaning Statistical Analysis Final Paper Data Import Data Transformation Data Mining

Research Objectives  Pattern identification  Descriptive Statistics  Data Mining Techniques  Prediction  Developing a strategy  Fantasy League

Literature Overview  Moneyball: The Art of Winning an Unfair Game Michael Lewis  Las Vegas Odds  NFL Fantasy League

Knowledge Discovery Process DATA Pro-Football -3 Tables -40 Columns -82,346 Rows Lisa Ordonez -1 Table -90 Columns -50,417 Rows SQL 2005 IS TRANSFORMATION Dependent Variables Calculated Variables Independent Variables SQL 2005 AS Play Decision, Intended Player, Play Direction, Yards GameNum, IsPlayChal, PlayZone, TotalOffTO, PlayDecision, QtrTimeLeft, HalfTimeLeft, GameTimeLeft Defense, Down, GAP, Halftime Left, Off Ydl, Offense, Play Zone, QTR, ToGo, Total Off TO

Knowledge Discovery Process DATA Pro-Football -3 Tables -40 Columns -82,346 Rows Lisa Ordonez -1 Table -90 Columns -53,000 Rows SQL 2005 IS TRANSFORMATION Dependent Variables Calculated Variables Independent Variables SQL 2005 AS PROCESSING Simple Statistics -Play Decision -Intended Player -Play Direction -Yards MS Excel 2007 MINING Models - ID3 - Neural Networks Accuracy -Lift Charts -Classification Matrix SQL 2005 AS

Dependency Network

Intended Player: Statistics Top 3 Intended Players for Passes for the 4 teams that played in the semi-finals H.Ward (142), P.Burress (121), B.Shaw (44) T.Brown (143), D.Patten (93), M.Edwards (39) T.Holt (133), M.Faulk (104), I.Bruce (103) J.Thrash (107), D.Staley (89), T.Pinkston (83)

Play Direction: Statistics  Direction of Rushes for all plays in 2001 season Middle Left Tackle Right GuardRight End Middle Right Tackle Left GuardLeft End

Play Direction: Statistics  Direction of Rushes for all plays in 2001 season Number of Rushes Direction

Yardage: Statistics  Yardage during each down for Pass and Rush Yards To Go Average Yards Covered Passes Rushes

Play Decision: Statistics  Play Decisions for the 4 teams that played in the semi-finals Number of Decisions Play Decision Type

Play Decision: Analysis Overview  Discovery of what environmental and/or game factors affect play decision  Discovery of football expert knowledge through data mining  Prediction of play decisions based on game factors

Play Decision: ID3 Analysis

Play Decision: Accuracy

Rush Accuracy: Lift Chart

Field Goal Accuracy: Lift Chart

Play Decision: Classification Matrix

Play Decision: Key Findings  Football strategy can be discovered through data, instead of knowledge experts  Top 3 factors affecting decision:  Down, Off Ydl, Time  Accuracy of the models are different depending on the decision we are trying to predict  Team specific strategies may be discovered with more data.

Play Direction: Analysis Overview  Discover team’s strengths and weakness in their defense and/or offense  Prediction of play directions based on game factors Middle Left Tackle Right GuardRight End Middle Right Tackle Left GuardLeft End

Play Direction: Accuracy

Play Direction: Key Findings (ID3)

Intended Player: Analysis Overview  Discover each team’s favored recipient of a pass  Prediction of intended player based on game factors

Intended Player: Lift Chart

Intended Player: Key Findings  There are 400+ intended players  Not enough data to accurately predict intended players  Not enough data to gain knowledge over statistical models

Conclusions PLAY DECISION - Accurate - Gained Knowledge PLAY DIRECTION - Less accurate - Enough data to gain knowledge INTENDED PLAYERS - Insufficient data - No knowledge gained - Need to increase sample size

Future Direction  Increase sample set  More instances of different scenarios  Incorporate additional information  Pro-football-Reference.com  VegasInsider.com (Odds for favorites)  Extend Analysis  Nested case (Historical performance)

References  Prof. Lisa Ordóñez  Professor in Statistics  Steve Aldrich  Author of Moneyball in Football  About Football  Glossary of terms

DATA Pro-Football -3 Tables -40 Columns -82,346 Rows Lisa Ordonez -1 Table -90 Columns -53,000 Rows SQL 2005 IS TRANSFORMATION Dependent Variables Calculated Variables Independent Variables SQL 2005 AS PROCESSING Simple Statistics -Play Decision -Intended Player -Play Direction -Yards MS Excel 2007 MINING Models - ID3 - Neural Networks Accuracy -Lift Charts -Classification Matrix SQL 2005 AS Knowledge Discovery Process

Research Objectives Literature Overview Knowledge Discovery Statistics: Intended Player Statistics: Play Direction Statistics: Yardage Statistics: Play Decision Accuracy: Lift Chart Charts Analysis: Play Decision Analysis: Play Direction Analysis: Intended Player Conclusions Future Directions System Design

Backup Slide Section

Data Collection Football Outsiders Pro-Football Initial Dataset Cleaning Hierarchy Relevance Processing Dependent Independent Calculated Analysis 55,000 rows 90 columns 47,033 rows 30 columns Dependent – 4 Independent – 10 Calculated - 9

System Design NFL Season 2001 FOOTBALL DATA DB NFL KMS Model Building Testing/ Accuracy Pattern Analysis Formations Substitutions Play Decisions FIELD STRATEGY DEFENSE STRATEGY METRICS Accuracy Performance

Yards Analysis  Yards gained on the play is used as a metric to measure effort  Discover how environmental and/or game factors affect player’s efforts  Key Findings: Top 4 environmental factors  Off Ydl  Time  Down  Gap