Presentation is loading. Please wait.

Presentation is loading. Please wait.

© 2014 MapR Technologies 1 Ted Dunning February 20, 2015.

Similar presentations


Presentation on theme: "© 2014 MapR Technologies 1 Ted Dunning February 20, 2015."— Presentation transcript:

1 © 2014 MapR Technologies 1 Ted Dunning February 20, 2015

2 © 2014 MapR Technologies 2 Contact Information Ted Dunning Chief Applications Architect, MapR Technologies Committer & PMC for Apache’s Drill, Zookeeper & Mahout Mentor for Myriad & Apache’s Storm, Flink, Datafu, Optiq, Drill Email tdunning@apache.orgtdunning@maprtech.comtdunning@apache.orgtdunning@maprtech.com Twitter @ted_dunning@MapR@ApacheMyriad Hashtag today: #StrataHadoop

3 © 2014 MapR Technologies 3 Myriad Project Very new open source / open community project Started as collaboration between Mesosphere, MapR & eBay Proposal to be an incubator project of the Apache Foundation submitted 12 February 2015 Goal: global resource management for multiple data centers

4 © 2014 MapR Technologies 4 Agenda The need Recap How it works Use Cases Lessons Learned The Future

5 © 2014 MapR Technologies 5 What We Need Tight integration of resources and programming models User specified resources and allocation models Lightweight executive Strong isolation Fast task launch

6 © 2014 MapR Technologies 6 What We Need Very fast scheduling Very careful (slow) scheduling Long-lived system tasks Short-lived tasks Long-lived ephemeral tasks Pre-emption

7 © 2014 MapR Technologies 7 What We Need Very good support of entire Hadoop eco-system –Tight integration of MapReduce2 –Tez –Impala –Drill –Spark Very good support of everything else –Arbitrary containers –Web servers –Systems processes without containers –User defined containers –Licensing constraints

8 © 2014 MapR Technologies 8 This is a problem

9 © 2014 MapR Technologies 9 And an opportunity

10 © 2014 MapR Technologies 10 What We Have - Yarn Resource Manager, NodeManager, heartbeat –Direct lineage from JobTracker, TaskTracker Application Master, Task containers –The other half of the JobTracker and TaskTracker Monolithic scheduling Pre-emption Hadoop standard Pre-defined resources Good Hadoop eco support –MapReduce2, Tez, Impala, Drill, Spark

11 © 2014 MapR Technologies 11 What We Have - Mesos Two level scheduling –Bottom level is application specific –Frameworks to ease complexity –Offers, Returns Actor-based, bidi RPC –Super fast process launch Marathon, Chronos –ISO8601, jboss, jetty, sinatra, rails User defined resources, attributes Some Hadoop (Spark native!)

12 © 2014 MapR Technologies 12 Sound the same Very much not

13 © 2014 MapR Technologies 13 Myriad integrates Mesos and Yarn

14 © 2014 MapR Technologies 14 How It Works Mesos creates virtual clusters YARN uses resources provided by Mesos Myriad can ask YARN to release some resources Or give it more Mesos YARN cluster Web Servers

15 © 2014 MapR Technologies 15

16 © 2014 MapR Technologies 16

17 © 2014 MapR Technologies 17

18 © 2014 MapR Technologies 18

19 © 2014 MapR Technologies 19 How Myriad Works Mesos runs Yarn –Yarn runs Yarn programs –Multiple Yarns supported –Multiple Yarn versions easy Mesos runs program + Yarn fakeout –Gets resources back from Yarn quickly –High priority “Yarn” program –As Yarn executes “tasks”, resources given back to Mesos –Allows fast spinup/spindown of Yarn resources

20 © 2014 MapR Technologies 20 How Myriad Works Mesos Persistence Layer

21 © 2014 MapR Technologies 21 How Myriad Works Mesos Persistence Layer

22 © 2014 MapR Technologies 22 Let’s see some examples

23 © 2014 MapR Technologies 23 #1 – I wanna cluster

24 © 2014 MapR Technologies 24 I Want a Cluster Very common need –Ephemeral clusters for multi-tenancy –Quick dev or QA clusters –Compatibility testing Yarn doesn’t run Yarn well –Especially across incompatible versions –Encapsulation can’t be unrolled Myriad does this trivially, but –Must have data localization, universal name space

25 © 2014 MapR Technologies 25 #2 – Version upgrade

26 © 2014 MapR Technologies 26 YARN Version Upgrade Another very common need –Need to test first –Applications roll over to new cluster –Resources follow applications –Data layer must remain inter-operable Yarn doesn’t run Yarn well (again) –Especially across incompatible versions –Encapsulation can’t be unrolled Myriad does this trivially, but –Must have data localization, universal name space

27 © 2014 MapR Technologies 27 #3 – Resource slosh

28 © 2014 MapR Technologies 28 Resource Slosh Resource slosh –Data ingestion pulse requires many web-servers –After ingestion, analytics pulse requires many Hadoop nodes –Data layer must remain inter-operable Conflict between Sysop/Hadoop viewpoints Myriad does this trivially, but –Must have data localization, universal name space

29 © 2014 MapR Technologies 29 Resource Slosh Resource slosh –Data ingestion pulse requires many web-servers –After ingestion, analytics pulse requires many Hadoop nodes –Data layer must remain inter-operable Conflict between Sysop/Hadoop viewpoints Myriad does this trivially, but –Must have data localization, universal name space

30 © 2014 MapR Technologies 30 Some Lessons Learned Omega paper –Not news –Single scheduler framework not viable Multi-cultural software is actually pretty cool –But you have to value both cultures One incubator project (Slider) doesn’t change that

31 © 2014 MapR Technologies 31 The Future Incubator –Proposal at http://wiki.apache.org/incubator/MyriadProposalhttp://wiki.apache.org/incubator/MyriadProposal –Initial team from Mesosphere, Ebay, MapR Community building –Diversity is good already –Starting with very lean team

32 © 2014 MapR Technologies 32 The Future Incubator –Proposal at http://wiki.apache.org/incubator/MyriadProposalhttp://wiki.apache.org/incubator/MyriadProposal –Initial team from Mesosphere, Ebay, MapR Community building –Diversity is good already –Starting with very lean team Older whisky, faster horses, more features –Apologies to the cowboy and the poet –And Tom T HallTom T Hall

33 © 2014 MapR Technologies 33 World domination

34 © 2014 MapR Technologies 34 World domination

35 © 2014 MapR Technologies 35 World domination Peaceful coexistence via specialization

36 © 2014 MapR Technologies 36 Myriad Project Blog “Project Myriad: No Hadoop is an Island” http://bit.ly/myriad- mapr-bloghttp://bit.ly/myriad- mapr-blog Proposal to be an incubator project of the Apache Foundation submitted 12 February 2015 http://bit.ly/myriad-asf-proposalhttp://bit.ly/myriad-asf-proposal Initial code on github: http://bit.ly/github-myriadhttp://bit.ly/github-myriad Join us! Twitter for Myriad community @ApacheMyriad [no, it’s not an official project logo]

37 © 2014 MapR Technologies 37 Contact Information Ted Dunning Chief Applications Architect, MapR Technologies Committer & PMC for Apache’s Drill, Zookeeper & Mahout Mentor for Myriad & Apache’s Storm, Flink, Datafu, Optiq, Drill Email tdunning@apache.orgtdunning@maprtech.comtdunning@apache.orgtdunning@maprtech.com Twitter @ted_dunning@ApacheMyriad Hashtag today: #StrataHadoop


Download ppt "© 2014 MapR Technologies 1 Ted Dunning February 20, 2015."

Similar presentations


Ads by Google