CSE 548 Advanced Computer Network Security Document Search in MobiCloud using Hadoop Framework Sayan Cole Jaya Chakladar Group No: 1.

Slides:



Advertisements
Similar presentations
MapReduce.
Advertisements

Digital Library Service – An overview Introduction System Architecture Components and their functionalities Experimental Results.
EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
CMU SCS : Multimedia Databases and Data Mining Extra: intro to hadoop C. Faloutsos.
CSE 548 Advanced Computer Network Security Long Qiu Xinyi Dong DOSGI APPLICATION PLATFORM FOR MOBICLOUD.
AStudy on the Viability of Hadoop Usage on the Umfort Cluster for the Processing and Storage of CReSIS Polar Data Mentor: Je’aime Powell, Dr. Mohammad.
DOSGi Application Platform for MobiCloud Long Qiu Xinyi Dong.
IS112 – Chapter 1 Notes Computer Organization and Programming Professor Catherine Dwyer 2003.
Poly Hadoop CSC 550 April 26, 2007 Scott Griffin Daniel Jackson Alexander Sideropoulos Anton Snisarenko.
Undergraduate Poster Presentation Match 31, 2015 Department of CSE, BUET, Dhaka, Bangladesh Wireless Sensor Network Integretion With Cloud Computing H.M.A.
Lecture 2 – MapReduce CPE 458 – Parallel Programming, Spring 2009 Except as otherwise noted, the content of this presentation is licensed under the Creative.
Professor Michael J. Losacco CIS 1150 – Introduction to Computer Information Systems System Software Chapter 4.
1 © 2006 Cisco Systems, Inc. All rights reserved. Session Number Presentation_ID Using the Cisco Technical Support & Documentation Website for LAN Issues.
Common Services in a network Server : provide services Type of Services (= type of servers) –file servers –print servers –application servers –domain servers.
Secure Search Engine Ivan Zhou Xinyi Dong. Project Overview  The Secure Search Engine project is a search engine that utilizes special modules to test.
1 © 2006 Cisco Systems, Inc. All rights reserved. Session Number Presentation_ID Using the Cisco Technical Support & Documentation Website for Security.
Identity Management and DNS Services Tianyi XING.
Identity Management and DNS Services Tianyi XING.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
U.S. Department of the Interior U.S. Geological Survey David V. Hill, Information Dynamics, Contractor to USGS/EROS 12/08/2011 Satellite Image Processing.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
Sumit Kumar Archana Kumar Group # 4 CSE 591 : Virtualization and Cloud Computing.
Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.
Secure Search Engine Ivan Zhou Xinyi Dong. Introduction  The Secure Search Engine project is a search engine that utilizes special modules to test the.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
Optimizing Cloud MapReduce for Processing Stream Data using Pipelining 作者 :Rutvik Karve , Devendra Dahiphale , Amit Chhajer 報告 : 饒展榕.
Spatial Tajo Supporting Spatial Queries on Apache Tajo Slideshare Shorten URL : goo.gl/j0VLXpgoo.gl/j0VLXp.
MapReduce Kristof Bamps Wouter Deroey. Outline Problem overview MapReduce o overview o implementation o refinements o conclusion.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
1 Session Number Presentation_ID © 2002, Cisco Systems, Inc. All rights reserved. Using the Cisco TAC Website for Security and Virtual Private Network.
Altman IM Ltd | | process | verify | convert | route | connect Prism Software’s solutions provide advanced workflow.
CSE 548 Advanced Computer Network Security Trust in MobiCloud using Hadoop Framework Updates Sayan Cole Jaya Chakladar Group No: 1.
IPSec VPN on a Android Phone Group 1 Avinash Bhashyam Axel Christiansen.
Programming in Hadoop Guangda HU Huayang GUO
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
CSE 548 Advanced Computer Network Security Trust in MobiCloud using Hadoop Framework Updates Sayan Kole Jaya Chakladar Group No: 1.
Abdullah Alshalan Garrett Drown Group #4 CSE591 - Virtualization and Cloud Computing.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
PROJECT. Topics  Theoretical: Error Performance Analysis for Partitioned Sketch Data Structures  Survey: Security and Privacy for Big Data: A Survey.
Secure Search Engine Ivan Zhou Xinyi Dong. Project Overview  The Secure Search Engine project is a search engine that utilizes special modules to test.
Cloud Computing project NSYSU Sec. 1 Demo. NSYSU EE IT_LAB2 Outline  Our system’s architecture  Flow chart of the hadoop’s job(web crawler) working.
{ Tanya Chaturvedi MBA(ISM) Hadoop is a software framework for distributed processing of large datasets across large clusters of computers.
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
Next Generation of Apache Hadoop MapReduce Owen
Information Systems Design and Development Technical Implications (Software) Computing Science.
INTRODUCTION TO HADOOP. OUTLINE  What is Hadoop  The core of Hadoop  Structure of Hadoop Distributed File System  Structure of MapReduce Framework.
By: Joel Dominic and Carroll Wongchote 4/18/2012.
Lecture 3 – MapReduce: Implementation CSE 490h – Introduction to Distributed Computing, Spring 2009 Except as otherwise noted, the content of this presentation.
Sentiment Analysis of Twitter Data(using HadoopMapreduce)
Big Data is a Big Deal!.
How to download, configure and run a mapReduce program In a cloudera VM Presented By: Mehakdeep Singh Amrit Singh Chaggar Ranjodh Singh.
Spark Presentation.
DCR ARB Presentation Team 5: Tour Conductor.
TYPES OF SERVER. TYPES OF SERVER What is a server.
Ministry of Higher Education
CS6604 Digital Libraries IDEAL Webpages Presented by
Hadoop Basics.
Introduction to Apache
Network+ Guide to Networks, Fourth Edition
MapReduce: Simplified Data Processing on Large Clusters
Presentation transcript:

CSE 548 Advanced Computer Network Security Document Search in MobiCloud using Hadoop Framework Sayan Cole Jaya Chakladar Group No: 1

Project Goal Hadoop MapReduce – software framework to write applications to process huge datasets on large number of computer clusters Apache Hadoop Distributed File System – primary storage systems used by Hadoop applications Project – develop a document search algorithm in MobiCloud based on Hadoop and HDFS

Group project Description Install and configure Hadoop in MobiCloud Develop user interface to enable searching and display result Create and update the HDFS data Mapper function to map users to keywords Reduction function to sort the best matches for a search criteria System testing Delivery and demo

Project Description (cont.) TasksResponsible Install and configure Hadoop in MobiCloud Jaya & Sayan Develop UI web applicationJaya Synchronize phone list with virtual machines Sayan Search mapper algorithmSayan Search reduction algorithmJaya HDFS data store creation and updates Jaya & Sayan Testing and problem resolutionJaya & Sayan Delivery and demoJaya & Sayan

Software and Hardware Requirements Hadoop Database software e.g. MySQL or Apache HDFS 3 or 4 Android phones mapped to virtual machines in 2 different Linux boxes

Network topology Web Application to process Requests Hadoop Master Applicatio n HDFS Data Store VM

Project Time Line Week 1Week 2Week 3Week 4Week 5Week 6Week 7Week 8Week 9 Week 10 Week 11 Week 12 Study and understand Mapreduce and Hadoop Install and configure Hadoop Run simple application and demonstrate correctness of implementation Create Mapreduce algorithm particular to specific problem/application Develop the user interface/frontend Installation on Mobicloud Stress checking and testing Analyze and interpret the results Present the application

Technical Details Install and configure Hadoop in MobiCloud Create a UI web application –The user interface of the application will be web based –The output will be shown back in the same web page Create an application to synchronize contact list with virtual machines

Technical Details Develop an application to search for documents –The user requests a document search using some keywords. The input is handled in a webpage hosted as part of the web application –The web application hands over the request to the Hadoop Master application –The master application accesses the contact list of the requesting user

Technical Details –For each entry in the contact list, it creates a job to map the user to the keywords. –The reduction function creates a sorted list of users based on their degree of match –The result set is returned to the web application to update the web page displayed to the user. Creation and updating of Apache HDFS distributed file storage system

Risks and Benefits Novel aspects of this project: –Distributed computing in mobile clouds –Enable meaningful search in a mobile device Risks/challenges: –Cluster failures, debugging and solution –Handling errors e.g. a mobile is switched off or network issues Potential applications & benefits: –Can be used to search for multimedia objects in mobile environment –Access data handling capabilities in a mobile device