CSE 548 Advanced Computer Network Security Email Trust in MobiCloud using Hadoop Framework Updates Sayan Cole Jaya Chakladar Group No: 1.

Slides:



Advertisements
Similar presentations
Digital Library Service – An overview Introduction System Architecture Components and their functionalities Experimental Results.
Advertisements

EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
A Hadoop Overview. Outline Progress Report MapReduce Programming Hadoop Cluster Overview HBase Overview Q & A.
O’Reilly – Hadoop: The Definitive Guide Ch.5 Developing a MapReduce Application 2 July 2010 Taewhi Lee.
Optinuity Confidential. All rights reserved. C2O Configuration Requirements.
XMAS installation instructions Windows Version: 1.0 4/22/2008.
Hadoop Setup. Prerequisite: System: Mac OS / Linux / Cygwin on Windows Notice: 1. only works in Ubuntu will be supported by TA. You may try other environments.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
Undergraduate Poster Presentation Match 31, 2015 Department of CSE, BUET, Dhaka, Bangladesh Wireless Sensor Network Integretion With Cloud Computing H.M.A.
Introduction to MapReduce Programming & Local Hadoop Cluster Accesses Instructions Rozemary Scarlat August 31, 2011.
1 Chapter Overview Introduction to Windows XP Professional Printing Setting Up Network Printers Connecting to Network Printers Configuring Network Printers.
Hadoop Demo Presented by: Imranul Hoque 1. Topics Hadoop running modes – Stand alone – Pseudo distributed – Cluster Running MapReduce jobs Status/logs.
VIRTUALISATION OF HADOOP CLUSTERS Dr G Sudha Sadasivam Assistant Professor Department of CSE PSGCT.
APACHE SERVER By Innovationframes.com »
Secure Search Engine Ivan Zhou Xinyi Dong. Project Overview  The Secure Search Engine project is a search engine that utilizes special modules to test.
Copyright © 2012 Cleversafe, Inc. All rights reserved. 1 Combining the Power of Hadoop with Object-Based Dispersed Storage.
Automating Student Course Profile & Student Record Report Uploads to GaDOE Chris A. McManigal Camden County Schools Kingsland, GA.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Linux Operations and Administration
Using Opal to deploy a real scientific application as a Web service Sriram Krishnan
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Projects. High Performance Computing Projects Design and implement an HPC cluster with one master node and two compute nodes. (Hint: use Rocks HPC Cluster.
Overview Hadoop is a framework for running applications on large clusters built of commodity hardware. The Hadoop framework transparently provides applications.
Secure Search Engine Ivan Zhou Xinyi Dong. Introduction  The Secure Search Engine project is a search engine that utilizes special modules to test the.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
HAMS Technologies 1
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
ZhangGang Since the Hadoop farm has not successfully configured at CC, so I can not do some test with HBase. I just use the machine named.
CSE 548 Advanced Computer Network Security Document Search in MobiCloud using Hadoop Framework Sayan Cole Jaya Chakladar Group No: 1.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
A Brief Documentation.  Provides basic information about connection, server, and client.
Experiment Management System CSE 423 Aaron Kloc Jordan Harstad Robert Sorensen Robert Trevino Nicolas Tjioe Status Report Presentation Industry Mentor:
Apache Hadoop Daniel Lust, Anthony Taliercio. What is Apache Hadoop? Allows applications to utilize thousands of nodes while exchanging thousands of terabytes.
Apache Mahout. Prerequisites for Building MAHOUT Java JDK 1.6 Maven 3.0 or higher ( ). Subversion (optional)
Weekly Report By: Devin Trejo Week of June 14, 2015-> June 20, 2015.
Youngil Kim Awalin Sopan Sonia Ng Zeng.  Introduction  Concept of the Project  System architecture  Implementation – HDFS  Implementation – System.
Programming in Hadoop Guangda HU Huayang GUO
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
Apache Hadoop on the Open Cloud David Dobbins Nirmal Ranganathan.
IS493 INFORMATION SECURITY TUTORIAL # 1 (S ) ASHRAF YOUSSEF.
Enabling Grids for E-sciencE Software installation and setup Viet Tran Institute of Informatics Slovakia.
Hadoop Joshua Nester, Garrison Vaughan, Calvin Sauerbier, Jonathan Pingilley, and Adam Albertson.
CSE 548 Advanced Computer Network Security Trust in MobiCloud using Hadoop Framework Updates Sayan Kole Jaya Chakladar Group No: 1.
Linux Operations and Administration
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
Secure Search Engine Ivan Zhou Xinyi Dong. Project Overview  The Secure Search Engine project is a search engine that utilizes special modules to test.
Cloud Computing project NSYSU Sec. 1 Demo. NSYSU EE IT_LAB2 Outline  Our system’s architecture  Flow chart of the hadoop’s job(web crawler) working.
Hands-On Microsoft Windows Server 2008 Chapter 5 Configuring Windows Server 2008 Printing.
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
INTRODUCTION TO HADOOP. OUTLINE  What is Hadoop  The core of Hadoop  Structure of Hadoop Distributed File System  Structure of MapReduce Framework.
Learn. Hadoop Online training course is designed to enhance your knowledge and skills to become a successful Hadoop developer and In-depth knowledge of.
By: Joel Dominic and Carroll Wongchote 4/18/2012.
Hadoop. Introduction Distributed programming framework. Hadoop is an open source framework for writing and running distributed applications that.
Hadoop Architecture Mr. Sriram
Unit 2 Hadoop and big data
How to download, configure and run a mapReduce program In a cloudera VM Presented By: Mehakdeep Singh Amrit Singh Chaggar Ranjodh Singh.
Chapter 9 Router Configuration (Ospf, Rip) Webmin, usermin Team viewer
Lab 1 introduction, debrief
Meng Cao, Xiangqing Sun, Ziyue Chen May 28th, 2014
INSTALLING AND SETTING UP APACHE2 IN A LINUX ENVIRONMENT
Overview Hadoop is a framework for running applications on large clusters built of commodity hardware. The Hadoop framework transparently provides applications.
Overview Hadoop is a framework for running applications on large clusters built of commodity hardware. The Hadoop framework transparently provides applications.
Lecture 16 (Intro to MapReduce and Hadoop)
Lecture 16B: Instructions on how to use Hadoop on Amazon Web Services
Presentation transcript:

CSE 548 Advanced Computer Network Security Trust in MobiCloud using Hadoop Framework Updates Sayan Cole Jaya Chakladar Group No: 1

Overview Installation of Hadoop Understanding the existing trust system and its suitability as a MapReduce application

Project Tasks (updated) Tasks Responsible Status Learn MapReduce and HadoopJaya & Sayan100 % Install and configure Hadoop in MobiCloud Jaya & Sayan60 % Develop UI web applicationJaya25 % Search mapper algorithmSayan25 % Search reduction algorithmJaya25 % HDFS data store creation and updates Jaya & Sayan Testing and problem resolutionJaya & SayanNot started Delivery and demoJaya & SayanNot started

Software and Hardware Requirements Hadoop Database software e.g. MySQL or Apache HDFS 3 or 4 Android phones mapped to virtual machines in 2 different Linux boxes

Hadoop Single Cluster Installation Prerequisites Java 6 –Add the canonical partner repository to the apt repository –Update the source list –Install JDK –Select Sun’s Java as the default on the machine Add a dedicated Hadoop system user Configure SSH –Configure SSH access for Hadoop system user –Generate an SSH key for Hadoop user –Enable SSH access to local mahine with the new key created Disable IPv6

Hadoop Single Cluster Installation Download Hadoop from Apache mirror sites and extract Set JAVA_HOME in /conf/hadoop-env.sh Configure core-site.xml –Set path for hadoop.tmp.dir to local directory –Set the HDFS variable Configure mapred-site.xml to set the host and port of mapReduce job tracker. Configure hdfs-site.xml to specify the number of replications for each file in the system.

Hadoop Single Cluster Installation Format the Hadoop HDFS name node – make sure data is backed up Start a single node cluster, this starts the name node, data node, job tracker & task tracker.

Hadoop Multiple Cluster Installation Setup two single node clusters to continue Designate one as master and the other one as slave Shutdown clusters in both machines Update /etc/hosts on both machines with appropriate names (master and slave) and addresses SSH configuration between master and slave –Hadoop user must connect to users master and slave –Password less connection

Hadoop Multiple Cluster Installation Master node runs master daemons like name node for HDFS and job tracker Both nodes run slave daemons like data node fro HDFS and task tracker

Hadoop Multiple Cluster Installation Master vs. Slave configuration –On master /conf/master lists the master –On slaves, /conf/slaves lists two entries master and slave Update core-site.xml on all machines to setup fs.default.name as hdfs://master: Update mapred-site.xml on all sites to fix mapread.job.tracker as master: Change dfs.replication variable in hdfs-site.xml to the number of sites avaiable, 4 in our case. Format the name node

Hadoop Multiple Cluster Installation Start up the multi-node cluster –Start HDFS daemons like name node and data node daemons in master and slaves respectively –Start Map reduce daemons like job tracker on master and task tracker in slaves

Challenges faced so far Multi node setup errors

Project Time Line Week 1Week 2Week 3Week 4Week 5Week 6Week 7Week 8Week 9 Week 10 Week 11 Week 12 Study and understand Mapreduce and Hadoop Install and configure Hadoop Run simple application and demonstrate correctness of implementation Create Mapreduce algorithm particular to specific problem/application Develop the user interface/frontend Installation on Mobicloud Stress checking and testing Analyze and interpret the results Present the application