AP CSP: What is Big Data?.

Slides:



Advertisements
Similar presentations
Science Fair Secrets.
Advertisements

1 Computer Logic, Logic Gates, and Building Circuits Image: Intel Museum.
A Wanderer’s Guide to the Data Network Navigating Our Connected World Garrett Shields, GISP, CFM.
1 Dr. Michael D. Featherstone Introduction to e-Commerce Laws of the Web.
Lecture 1: Performance EEN 312: Processors: Hardware, Software, and Interfacing Department of Electrical and Computer Engineering Spring 2013, Dr. Rozier.
Introduction to Earth Science Section 2 Section 2: Science as a Process Preview Key Ideas Behavior of Natural Systems Scientific Methods Scientific Measurements.
BIG DATA. The information and the ability to store, analyze, and predict based on that information that is delivering a competitive advantage.
DNA Computing. What is it?  “DNA computing is a branch of computing which uses DNA, biochemistry, and molecular biology hardware, instead of the traditional.
Chapter 8 Technology and Information Management 1 Chapter 8 Technology and Information Management ©2008 Thomson/South-Western.
Integration Lower sums Upper sums
AP CSP: Identifying People with Data and The Cost of Free
AP CSP: Making Visualizations & Discovering a Data Story
Using a Growth Mindset to Help Our Students Succeed
Unit 2: Lesson 11 & 12 Making Data Visualizations
AP CSP: Cleaning Data & Creating Summary Tables
Vocabulary Big Data - “Big data is a broad term for datasets so large or complex that traditional data processing applications are inadequate.” Moore’s.
AP CSP: Sending Binary Messages with the Internet Simulator
Memory Miss Elliott.
Activity 1 6 minutes Research Activity: What is RAM? What is ROM?
Unit 1: Matter, Measurement, and unit conversions
Introduction to e-Commerce
AP CSP: Data Assumptions & Good and Bad Data Visualizations
UNIT 2 – CHAPTER 1 – LESSON 1 DIGITAL INFORMATION.
LO: Science Fair Projects
The Cost of Free Lesson 4-4.
BIG Data 25 Need-to-Know Facts.
Foundational Services
Exploring Values.
Section 2: Science as a Process
Science Fair Projects.
UNIT 4 – THE COST OF FREE LESSON 4.
Check Your Assumptions
UNIT 4 - BIG DATA AND PRIVACY
Vocabulary Big Data - “Big data is a broad term for datasets so large or complex that traditional data processing applications are inadequate.” Moore’s.
Vocabulary Big Data - “Big data is a broad term for datasets so large or complex that traditional data processing applications are inadequate.”
AP CSP: Bytes, File Sizes, and Text Compression
The Do’s and don’t of studying
Sending Bits on the Internet
What is Product “X”? Name a Product “X” have you recently bought to replace an older version, where the new one was much: Better Faster Cheaper Computer.
Big Data, Big Responsibility
Science Fair Projects.
Riverview STEM Academy School
Research and Its Applications
Science Fair Projects.
Introduction to Digital Electronics
Chapter GS Getting Started.
Chapter 1: Exploring Data
Science Fair Projects.
CHAPTER 12 More About Regression
Accumulation AP Calculus AB Day 13
Chapter GS Getting Started.
Chapter 8 Technology and Information Management
Introduction to Scientific Research
UNIT 4 – THE COST OF FREE LESSON 6.
Unit 4: Lesson 1 & 2- What is Big Data
Vocabulary Big Data - “Big data is a broad term for datasets so large or complex that traditional data processing applications are inadequate.” Moore’s.
Chapter GS Getting Started.
Mathematical Explorations
CHAPTER 12 More About Regression
Using data innovations to improve
Transformations Limit Continuity Derivative
CHAPTER 12 More About Regression
Science Fair Projects.
Science Fair Projects.
Scientific Method.
Science Fair Projects.
Science Fair Projects.
Chapter GS Getting Started.
Science Fair Projects.
Science Fair Projects.
Presentation transcript:

AP CSP: What is Big Data?

Introduction: We’ve just finished up our unit on programming and moving onto a unit related to Big Data and Privacy! Before we can discuss Big Data properly we must first try and understand the concept of Big Data and everything it entails. We are going to watch a 15-minute video which helps describe this concept. The link for you to watch it again is below: https://www.ted.com/talks/kenneth_cukier_big_data_is_better_data?language =en

Video Discussion: So based on what you have just watched, What is Big Data? Discuss concept among your peers Big Data doesn’t have the most exact/clear definition. Big data means different things, at different times, to different people. It can mean devices that are constantly collecting data. It can mean digitizing data that’s been around for a long time (e.g., every book ever written). It can mean machine learning and artificial intelligence. Before we delve too much into what Big Data is and what it is used for, first we must learn about two important topics which will help us understand the phenomenon of big data.

The Exponential Growth of the Digital World:

Exponential Growth of Data: Part of what contributes to data being “big” is the sheer growth of the amount of data in the world. The graphic on the previous page shows how data is being created, replicated, and consumed by year. As you can see from the chart, the amount of data flying around is growing exponentially, doubling every two years or so. Here’s a way to think about how fast this is: The world will produce as much digital data over the next 2 years, as currently existed in all of humanity prior to that. And it will do the same the 2 years after that. And so on. All this data flying around but what to do with it?

Moore’s Law: Another important concept to think about when trying to understand Big Data is Moore’s Law. In 1965, a computer chip designer named Gordon Moore predicted that the number of transistors one could fit on a chip would double every 18 months or so. This prediction has been more or less true to the present day. In other words since about 1970, computers have gotten twice as fast, at half the cost, roughly every 1.5-2 years. With some small differences, the same is true for data storage capacity. This is a very important concept when designing future products.

Moore’s Law Continued: Computers speed and capacity are growing exponentially. With more and more machines that are faster and faster, the amount of data being pushed around, saved, and processed is growing exponentially (Big Data). With speed and capacity growing exponentially, companies need to keep that fact in mind as they plan for the future. For example: f the average hard drive today is 1 TB and you are planning for something or 6 years away, you should expect that average hard drives will be 8-10 TB. Moore's law has encouraged industries that use computers to effectively plan future research and development based on anticipated increases in computing power. Moore’s Law and the amount of data flying are important concepts to understand when thinking of Big Data and how it came about.

Big Data Sleuth Activity: Big data surrounds us but it is sometimes surprisingly challenging to get access to it, use it, or see it. Much of the data out there is in the “wild.” Even when the data is “available,” it can sometimes be challenging to figure out where it came from, or how to use it. Form into pairs and complete the instructions on the handout you were provided in class. Select one of the websites provided in the handout and complete all the questions inside the table.

Wrap-Up: So, after your explorations what do you think “big data” actually means? What makes it “big” as opposed to not? Whether data is “big” often depends on the context of the data itself or how it’s trying to be used A general-purpose definition is that Big data is a broad term for datasets so large or complex that traditional data processing applications are inadequate. The amount of data you have can lend itself to new techniques or methods of analyzing/visualizing all that data and drawing accurate conclusions or predictions about what it is saying. Share your finding with another group and think about the questions below: What kinds of data are out there? What format does it come it? Where does it come from? Did anyone find a link to an actual data source? Did anyone find an API? A way to access this data. (Twitter API)

Utilizing Big Data: With such vast amounts of data flying all around we need a new set of tools and techniques to properly analyze and make predictions about the world around us. The potential impacts are great, but there are many challenges involved in using large amounts of data. Below are some questions to think about as we proceed in this Big Data Unit: What kinds of fields and industries are there that aren’t affected by big data? Are there any areas of life that are impossible to be affected by big data? What are some of the challenges of working with big data? What’s different about working with big data versus other data we’ve seen so far?