Introducing – SAS® Grid Manager for Hadoop

Presentation transcript:

Introducing – SAS® Grid Manager for Hadoop
Cheryl Doninger, SAS
Doug Haigh, SAS
Copyright © 2010, SAS Institute Inc. All rights reserved.

About the presenter
I started with SAS in 1986 and am currently a Senior R&D Director. My teams work on many of the foundation technologies providing the compute capabilities of SAS: SAS Grid, SAS/CONNECT, all host teams, Core, IOM and WorkSpace, as well as SAS Environment Manager. I have a Master’s from NC State and hold 2 patents related to SAS Grid Computing.
#SASGF Copyright © 2016, SAS Institute Inc. All rights reserved.

A Bit of Background…
SAS Grid designed for:
  - Workload management
  - High availability
  - Performance
Architected to support multiple providers:
  - Platform Suite for SAS (LSF, PM)
  - Hadoop (YARN, Oozie)

Why SAS Grid Manager for Hadoop?
Co-location of SAS Grid jobs on Hadoop cluster
Requires:
  - Integration with YARN
  - Supported enterprise Hadoop distribution
  - Kerberos
  - Spare capacity to accommodate additional workload
  - Nodes architected to be compute nodes

What Behavior Can I Expect?
Consistent job submission:
  - Grid launched WS servers
  - Grid servers created with SAS/CONNECT
  - Batch submission with SASGSUB
  - Batch submission with Schedule Manager plug-in
All SAS Grid integration is the same
SAS Grid jobs will run unchanged

What Is Different?
  - Hadoop is not part of the product
  - Monitoring/management via Hadoop interfaces
  - Kerberos is required
  - Need for shared file system is reduced
  - Data will need to be migrated to HDFS
  - Need to understand the capabilities of YARN
  - Each SAS Grid job results in (at least) two YARN containers
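
The last point matters for capacity planning. As a rough sketch only, the lower bound on YARN containers for a given number of concurrent SAS Grid jobs can be computed as follows. The function name and parameters are hypothetical and are not part of any SAS or Hadoop API; the only fact taken from the slide is the "at least two containers per job" figure.

```python
def min_yarn_containers(sas_grid_jobs: int, containers_per_job: int = 2) -> int:
    """Lower bound on YARN containers needed for concurrent SAS Grid jobs.

    Assumption: each job uses at least two containers, as stated on the slide.
    A real job may use more, so treat the result as a floor, not an estimate.
    """
    if sas_grid_jobs < 0:
        raise ValueError("sas_grid_jobs must be non-negative")
    if containers_per_job < 2:
        raise ValueError("each SAS Grid job needs at least two containers")
    return sas_grid_jobs * containers_per_job


if __name__ == "__main__":
    # Illustrative input: 25 concurrent jobs.
    print(min_yarn_containers(25))
```

The cluster's spare capacity would need to cover at least this many containers in addition to its existing Hadoop workload.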

Conclusion
Gas powered vs. electric
