Presentation is loading. Please wait.

Presentation is loading. Please wait.

SAS users meeting in Halifax

Similar presentations


Presentation on theme: "SAS users meeting in Halifax"— Presentation transcript:

1 SAS users meeting in Halifax
An Implementation of SAS with Hadoop. Roman Tzirulnick - SAS Senior Consultant November 4, 2016 SAS users meeting in Halifax

2 Expert in a wide range of SAS Solutions and Services
SAS Products SAS Administration SAS Base SAS Enterprise Guide SAS Data Integration Studio SAS Enterprise Miner SAS Text Miner SAS BI Platform SAS Visual Analytics SAS Forecast SAS Grid Management SAS High-Performance Analytics Architecture Design Installation and Upgrades Migration Quality Assurance Development Performance Tuning Training and Mentorship and knowledge sharing Support

3 Agenda Why Big Data ? What is Hadoop ? SAS Implementation with Hadoop
Live Demo QA

4 Processing Big Data Google

5 Processing Big Data Facebook

6 Volumes of Data Facebook Youtube Twitter
30 billion pieces of content were added to Facebook this past month by 600 million plus users Youtube More than 2 billion videos were watch on YouTube yesterday Twitter 32 billion searches were performed last month on Twitter

7 What is BIG Data ? Volume Large amount of data Velocity
Needs to be analyzed quickly Variety Different types of structured and unstructured data

8 Key questions enterprises are asking about Big Data
How to store and protect big data? How to backup and restore big data? How to organize and catalog the data that you have backed up? How to keep costs low while ensuring that all the critical data is available when you need it?

9 What is Hadoop? Using Hadoop is cheaper, faster and better. Hadoop is a software framework for storing and processing big data in a distributed fashion on large clusters of commodity hardware. It achieves two tasks: Massive data storage. Faster processing.

10 What is Hadoop? Processing MapReduce HDFS Storage

11 How Hadoop Operates?

12 What is Apache Hive? Hive supports SQL-like language called HiveQL The Apache Hive is data warehouse software facilitates querying and managing large datasets residing in distributed storage Hive provides tools to easy data extract/transform/load Hive supports analysis of large datasets stored in Hadoop’s HDFS

13 HDP - Hadoop Data Platform

14 Hadoop Vendors

15 SAS Implementation with Hadoop HortonWorks

16 SAS Implementation with Hadoop

17 SAS Access to Hadoop

18 SAS Implementation with Hadoop

19 SAS Implementation with Hadoop

20 Questions/ Comments


Download ppt "SAS users meeting in Halifax"

Similar presentations


Ads by Google