KUST: Korea University Study Site. College of Information and Communication. 2012210038 Lee Mina, 2012210084 Baek Dae Hyun, 2009210017 Joo Jae Woo.

Presentation transcript:

KUST: Korea University Study Site. College of Information and Communication. Lee Mina, Baek Dae Hyun, Joo Jae Woo

Outline
1. Again, what are we making?
2. What is done for now?
3. What will be available in the near future?

1. Again, what are we making?
KUST: a study site for Korea University students that encourages them to share knowledge from the classes they are currently taking.

2. What is done for now?

This is what we did during the last month: “Hello Hadoop…?”

This is what we did during the last month: “Hello SVN…?”

Let’s see the appearance first.

login.html

signup.html

findpw.html

welcome.html

exam.jsp

qna.jsp

qna_post.jsp

qna_write.jsp

Let’s see the web process

JSP working process (Sign up)
Signup.html -> Signup.jsp -> Welcome.html
Signup.jsp stores the Portal ID, Password, and Nickname in the Database.
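The sign-up flow above could be sketched in plain Java roughly as follows. This is a minimal sketch, not the project's actual Signup.jsp: the class `SignupSketch`, the in-memory map that stands in for the database, and the rule that empty fields or a duplicate Portal ID send the user back to the form are all assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the logic behind Signup.jsp (not the project's code).
class SignupSketch {
    // One registered user, holding the three fields listed on the slide.
    static final class User {
        final String portalId;
        final String password; // a real site should store a salted hash instead
        final String nickname;

        User(String portalId, String password, String nickname) {
            this.portalId = portalId;
            this.password = password;
            this.nickname = nickname;
        }
    }

    // In-memory map used in place of the real database (assumption).
    private final Map<String, User> users = new HashMap<>();

    private static boolean blank(String s) {
        return s == null || s.isEmpty();
    }

    // Returns the page to show next: Welcome.html on success, Signup.html otherwise.
    String signUp(String portalId, String password, String nickname) {
        if (blank(portalId) || blank(password) || blank(nickname)) {
            return "Signup.html";
        }
        if (users.containsKey(portalId)) {
            return "Signup.html"; // Portal ID already taken
        }
        users.put(portalId, new User(portalId, password, nickname));
        return "Welcome.html";
    }

    boolean isRegistered(String portalId) {
        return users.containsKey(portalId);
    }
}
```

In a real JSP the three values would come from the request parameters of the submitted form, and the map would be replaced by an insert into the users table.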

JSP working process (Login & Logout)
Login.html -> LoginPro.jsp (sets Cookie, using Database) -> LoginConfirm.jsp (reads Cookie) -> Exam.jsp
Logout.jsp removes the Cookie.
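The cookie-based login and logout steps above might look like the following in plain Java. This is only a sketch under assumptions: the slides do not say what the cookie contains, so a random token that maps to the Portal ID is assumed here, and the class `LoginSketch` with its two in-memory maps is hypothetical.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.UUID;

// Hypothetical model of the LoginPro / LoginConfirm / Logout steps.
class LoginSketch {
    // Portal ID -> password; stands in for the users table (assumption).
    private final Map<String, String> passwords = new HashMap<>();
    // Cookie value -> Portal ID of the logged-in user.
    private final Map<String, String> sessions = new HashMap<>();

    void addUser(String portalId, String password) {
        passwords.put(portalId, password);
    }

    // LoginPro step: check the credentials and return a cookie value, or null on failure.
    String login(String portalId, String password) {
        String stored = passwords.get(portalId);
        if (stored == null || !stored.equals(password)) {
            return null;
        }
        String cookie = UUID.randomUUID().toString();
        sessions.put(cookie, portalId);
        return cookie;
    }

    // LoginConfirm step: resolve the cookie to a user, or null if not logged in.
    String confirm(String cookie) {
        return cookie == null ? null : sessions.get(cookie);
    }

    // Logout step: forget the cookie so later confirm() calls fail.
    void logout(String cookie) {
        if (cookie != null) {
            sessions.remove(cookie);
        }
    }
}
```

Pages such as Exam.jsp would call the equivalent of `confirm` first and redirect to Login.html when it returns null.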

JSP working process (Q&A)
qna.jsp -> Qna_write.jsp -> Qna_writePro.jsp -> Database
qna.jsp -> qna_post.jsp (reads from Database)
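As a rough illustration of the Q&A pages, the list, write, and read steps can be modeled with a small in-memory board. The class `QnaBoardSketch`, its `Post` fields, and the newest-first ordering are assumptions; the slides show only the page names.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical in-memory model of the Q&A board.
class QnaBoardSketch {
    static final class Post {
        final int id;
        final String author;
        final String title;
        final String body;

        Post(int id, String author, String title, String body) {
            this.id = id;
            this.author = author;
            this.title = title;
            this.body = body;
        }
    }

    // List used in place of the database table (assumption).
    private final List<Post> posts = new ArrayList<>();

    // Write step: save a new post and return its id (ids start at 1).
    int write(String author, String title, String body) {
        int id = posts.size() + 1;
        posts.add(new Post(id, author, title, body));
        return id;
    }

    // List step: titles of all posts, newest first.
    List<String> listTitles() {
        List<String> titles = new ArrayList<>();
        for (int i = posts.size() - 1; i >= 0; i--) {
            titles.add(posts.get(i).title);
        }
        return titles;
    }

    // Read step: one post by id, or null if there is no such post.
    Post read(int id) {
        return (id >= 1 && id <= posts.size()) ? posts.get(id - 1) : null;
    }
}
```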

JSP working process (Resource up & download)
Resource_post.jsp and Resource_down.jsp, both using the Database

Let’s see the System

System & Database
4 Linux servers at the Science Library (과도), 6F
- OS: Linux version generic
- CPU: Intel(R) Core(TM) i5 CPU 2.80GHz
- Memory: 2GB
- Disk: 500GB
- Software: Hadoop + HBase + ZooKeeper + (Hive or Tajo)

Hadoop
What is Hadoop? An open-source framework for distributed computing.
- Flexibility: supports various data formats
- Scalability: capacity scales roughly linearly as servers are added
- MapReduce: splits a large data problem into smaller tasks and distributes them across servers
- HDFS (Hadoop Distributed File System): splits data into smaller blocks, replicates them, and stores the copies on other servers
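To make the MapReduce idea concrete, here is a word count written in plain Java with no Hadoop dependency. It is a single-process sketch of the map, shuffle, and reduce phases, not Hadoop's API: the class `MapReduceSketch` and its method names are made up for this example, and in a real cluster each input split would be mapped on a different node.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Single-process illustration of the MapReduce phases (not the Hadoop API).
class MapReduceSketch {
    // Map phase: turn one input split into a list of (word, 1) pairs.
    static List<Map.Entry<String, Integer>> map(String split) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (String word : split.toLowerCase().split("\\s+")) {
            if (!word.isEmpty()) {
                out.add(new AbstractMap.SimpleEntry<String, Integer>(word, 1));
            }
        }
        return out;
    }

    // Shuffle phase: group all values that share the same key.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: sum the counts collected for one word.
    static int reduce(List<Integer> values) {
        int sum = 0;
        for (int v : values) {
            sum += v;
        }
        return sum;
    }

    // Runs the three phases over all splits and returns word -> count.
    static Map<String, Integer> wordCount(List<String> splits) {
        List<Map.Entry<String, Integer>> all = new ArrayList<>();
        for (String split : splits) {
            all.addAll(map(split));
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> e : shuffle(all).entrySet()) {
            result.put(e.getKey(), reduce(e.getValue()));
        }
        return result;
    }
}
```

Calling `MapReduceSketch.wordCount` on the two splits "hello hadoop" and "hello hbase" counts "hello" twice and the other two words once each.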

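Hadoop architecture
- master: Job Tracker (MapReduce layer), Name Node (HDFS layer)
- slave: Task Tracker (MapReduce layer), Data Node (HDFS layer)
- slave: Task Tracker (MapReduce layer), Data Node (HDFS layer)
- slave: Task Tracker (MapReduce layer), Data Node (HDFS layer)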
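The split between the Name Node, which keeps track of where blocks live, and the Data Nodes, which hold the blocks, can be illustrated with a toy placement planner. This is a deliberately simplified sketch: real HDFS uses a rack-aware placement policy, not the round-robin rule assumed here, and the class `HdfsPlacementSketch` and its parameters are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model of how a name node might plan block replicas (not HDFS's real policy).
class HdfsPlacementSketch {
    // Number of blocks needed to hold a file of the given size (rounds up).
    static int blockCount(long fileSize, long blockSize) {
        return (int) ((fileSize + blockSize - 1) / blockSize);
    }

    // For each block, the indices of the data nodes that hold a copy.
    // Assumption: simple round-robin, block b goes to nodes b, b+1, ... (mod dataNodes).
    static List<List<Integer>> place(int blocks, int dataNodes, int replication) {
        // A block cannot have more distinct copies than there are data nodes.
        int copies = Math.min(replication, dataNodes);
        List<List<Integer>> plan = new ArrayList<>();
        for (int b = 0; b < blocks; b++) {
            List<Integer> nodes = new ArrayList<>();
            for (int r = 0; r < copies; r++) {
                nodes.add((b + r) % dataNodes);
            }
            plan.add(nodes);
        }
        return plan;
    }
}
```

With three data nodes, as in the diagram, and a replication factor of 3, every block ends up on all three slaves, so losing one server does not lose data.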

HBase
What is HBase?
- HBase is the Hadoop database: a distributed, scalable, big data store
- Column-oriented store model, NoSQL DB
- Strictly consistent reads and writes
- Automatic sharding of tables
- Easy-to-use Java API for client access
- No SQL
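The column-oriented model and the sharding by row key can be pictured with a small in-memory sketch. It does not use the HBase client API: `ColumnStoreSketch`, the "family:qualifier" string convention for column names, and the `regionFor` helper are assumptions chosen to mirror how HBase keeps rows sorted by key and divides them into key ranges.

```java
import java.util.Map;
import java.util.TreeMap;

// In-memory illustration of a column-oriented table (not the HBase client API).
class ColumnStoreSketch {
    // Row key -> (column name -> value); both levels are kept sorted by key.
    private final TreeMap<String, TreeMap<String, String>> rows = new TreeMap<>();

    // Store one cell; rows need not share the same set of columns.
    void put(String rowKey, String column, String value) {
        rows.computeIfAbsent(rowKey, k -> new TreeMap<>()).put(column, value);
    }

    // Read one cell, or null if the row or column does not exist.
    String get(String rowKey, String column) {
        Map<String, String> row = rows.get(rowKey);
        return row == null ? null : row.get(column);
    }

    // Toy sharding: the index of the key range a row key falls into,
    // counted as the number of split points that are <= the row key.
    static int regionFor(String rowKey, String[] splitPoints) {
        int region = 0;
        for (String split : splitPoints) {
            if (rowKey.compareTo(split) >= 0) {
                region++;
            }
        }
        return region;
    }
}
```

For example, with split points "g" and "n", row keys starting with "a" land in range 0, "g" in range 1, and "z" in range 2, which is the idea behind automatic sharding of a table.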

3. What will be available in the near future?
- Web server programming
- Database implementation
- Android app
- Get A+