Big Data Young Lee BUS 550.


1 Big Data Young Lee BUS 550

2 Big data

3 Big Data: explosion of information; IoT; analytics
Not just SQL (Structured Query Language) but also unstructured data; transformation from entity-based data to transactional databases

4 Industry https://www.youtube.com/watch?v=eVSfJhssXUA
Billion-dollar industry; corporate investments

5 Who: IBM; Google; Sears; Amazon; social media applications (Facebook)

6 Why: insights; predictions; customer value; efficiency; cost savings
Product development; makes AI possible; analytics

7 How: cloud computing; cognitive computing; artificial intelligence
Software implementations

8 Who needs big data: insurance companies; airlines; retail; hospitals
Traffic; manufacturers

9 Concept of big data

10 Data: data is like a dam; Gartner; security
High volume, high velocity and high variety (unstructured); veracity (trustworthy); security

11 IBM take on Big data

12 Tools of big data: MapReduce; Hadoop; Bigtable; Kaggle
MapReduce: tool designed by Google to process large amounts of data
Hadoop: runs MapReduce on large clusters
Bigtable: Google-developed distributed storage
Kaggle
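
To make the MapReduce idea concrete, here is a minimal single-machine sketch of the classic word-count example. It is not from the slides and does not use Hadoop or any Google API; the function names (map_phase, shuffle, reduce_phase, word_count) and the sample documents are assumptions chosen for illustration. On a real cluster, the map and reduce steps would run in parallel on many machines.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key (the word)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce in sequence on one machine."""
    pairs = []
    for doc in documents:
        pairs.extend(map_phase(doc))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical sample input, for illustration only.
docs = ["big data is big", "data is everywhere"]
counts = word_count(docs)
print(counts)
```

Each map call only sees one document and each reduce call only sees one word, which is why the work can be split across a large cluster.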

13 Hadoop The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

14 Kaggle Kaggle is an online community of data scientists and machine learners, owned by Google, Inc. Kaggle allows users to find and publish data sets, explore and build models in a web-based data-science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.

15 Issues: security; personal information; constant monitoring; safety
Storage; errors; scary (Forbes)

16 Questions: Who made MapReduce?

