What’s the Big Deal about Big Data? Jennifer Lewis Priestley, Ph.D. Professor of Statistics and Data Science
Big Data – What is it? Center for Statistics and Analytical Services at Kennesaw State University 3
Big Data – What is it? Center for Statistics and Analytical Services at Kennesaw State University 4 VOLUME VELOCITY VARIETY
Big Data – What is it? Big Data (noun) – Condition present when the volume, variety, and velocity of data exceeds an organization’s storage or computing capacity for accurate and timely decision making. It is NOT just about size. 5
Big Data – What is it? …but size (volume) is certainly part of the issue… 6 Number of s sent every second? 2.9 Million Video Uploaded to Youtube every minute? 20 Hours Amount of Data processed every day by Google? 24 Petabytes Tweets per Day? 50 million Orders Processed by Amazon every Second? 73
Big Data – What is it? 7 …and the costs of storage are dropping…
The total amount of digital data will reach 2.7 zettabytes by the end of this year. Approximately 80 percent of this data will be unstructured…2.7 zettabytes 8 Big Data – What is it?
Unstructured Data = Data 9
Big Data’s Evil Twin: Dark Data 10 Data which has been collected, often without intention, but not leveraged. …or, has historically been too costly to analyze. Kennesaw State University Department of Statistics and Analytical Sciences
11 Big Data – Does it really matter?
12 Native and Non-Native Companies…
Big Data Company 1: Coca Cola 13 ~ 1500 machines around the world Can dispense about 95 drinks an hour Can dispense about 125 different drinks Submits real time data on: -Syrup consumed/drink configuration -Outlet -Time
14 Big Data Company 2: The Home Depot ~ 2300 stores globally About 40,000 products in each store Product pricing has to be dynamic Thousands of vendors
15 Big Data Company 3: The Southern Company ~ 4.6 million customers ~27,000 power distribution lines Real time data, every customer Advanced Metering Infrastructure
16 Big Data Company 2: GM Cars are an emerging data platform Car-to-Manufacturer Monetization Opportunities Car-to-Customer Telematics changes everything for cars Kennesaw State University Department of Statistics and Analytical Sciences
What do these companies have in common? They have all recognized the value of data to their operations. 17 They have all invested heavily in new hardware and software to capture and store their new data. They have new hiring needs: Computer Scientists, Statisticians, Mathematicians
18 This shift has huge implications for universities.
19 We can’t teach the way we have always taught. The 1950s called…they want their curriculum back…
So, what does a 21 st Century Curriculum look like? 20 Math, Stat, Computer Science… Real Big, Real World Datasets… Better Integration with Practitioners… More Interdisciplinary Degrees…
21