Download presentation
Presentation is loading. Please wait.
Published byLeonard McKenzie Modified over 8 years ago
1
Learn
2
Hadoop Online training course is designed to enhance your knowledge and skills to become a successful Hadoop developer and In-depth knowledge of core concepts will be covered in the course along with implementation on varied industry use-cases. take a look on HADOOP ADMIN AND DEVELOPER COURSE content HADOOP ADMIN AND DEVELOPER COURSE Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222
3
What is Hadoop? The Hadoop Distributed File System Hadoop Map Reduce Works Anatomy of a Hadoop Cluster Master Daemons Name node Introduction to Hadoop Job Tracker Secondary name node Slave Daemons Job tracker Task tracker Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
4
Blocks and Splits Input and HDFS Splits Data Replication Hadoop Rack Aware Data high availability Data Integrity Cluster architecture and block placement Accessing HDFS JAVA & CLI Approach HDFS (Hadoop Distributed File System) Programming Practices Developing MapReduce Programs in Running without HDFS and MapReduce Running all daemons in a single node Running daemons on dedicated nodes Local Mode Pseudo-distributed Mode Fully distributed mode Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
5
Make a fully distributed Hadoop cluster on a single laptop/desktop Name Node in Safe mode Meta Data Backup Integrating Kerberos security in hadoop Setup Hadoop cluster of Apache, Cloudera and Horton Works Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
6
Examining a Sample MapReduce Program, with several examples Basic API Concepts The Driver Code The Mapper The Reducer Hadoop's Streaming API Writing a MapReduce Program Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
7
The configure and close Methods Sequence Files Record Reader Record Writer Role of Reporter Output Collector Performing several hadoop jobs Processing XML files Counters Directly Accessing HDFS Tool Runner Using The Distributed Cache Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
8
Sorting and Searching Indexing Classification/Machine Learning Term Frequency - Inverse Document Frequency Word Co-Occurrence Common MapReduce Algorithms Creating an Inverted Index Identity Mapper Identity Reducer MapReduce applications Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
9
Testing with MRUnit Logging Other Debugging Strategies Debugging MapReduce Programs Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
10
A Recap of the MapReduce Flow The Secondary Sort Customized Input Formats and Output Formats Advanced MapReduce Programming Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
11
Counters Skipping Bad Records Rerunning failed tasks with Isolation Runner Monitoring and debugging on a Production Cluster Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222
12
Reducing network traffic with combiner Partitioners Using Compression Reusing the JVM Running with speculative execution Refactoring code and rewriting algorithms Parameters affecting Performance Other Performance Aspects Tuning for Performance in MapReduce Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
13
HBase HBase concepts HBase architecture Region server architecture File storage architecture HBase basics Column access Scans HBase use cases Install and configure HBase on a multi node cluster Create database Develop and run sample applications Access data stored in HBase using clients like Java, Python and Pearl HBase and Hive Integration HBase admin tasks Defining Schema and basic operation Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
14
PIG Pig basics Install and configure PIG on a cluster PIG Vs MapReduce and SQL Pig Vs Hive Write sample Pig Latin scripts Modes of running PIG Running in Grunt shell Programming in Eclipse Running as Java program PIG UDFs Pig Macros Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
15
Flume, Chukwa, Avro, Scribe, Thrift Flume and Chukwa concepts Use cases of Thrift Avro and scribe Install and configure flume on cluster Create a sample application to capture logs from Apache using flume Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
16
CDH4 Enhancements Name Node High – Availability Name Node federation Fencing YARN Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
17
Hadoop Challenges Hadoop disaster recovery Hadoop suitable cases Skype Id: info.vibloo Email: info@Vibloo.com USA: +1-248-809-1418 IND: +91-40-3296-5222 www.vibloo.com/Hadoop-Online-Training
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.