

PROJECT

Topics
- Theoretical: Error Performance Analysis for Partitioned Sketch Data Structures
- Survey: Security and Privacy for Big Data: A Survey and Future Directions
- Experiments:
  - Citizen Behavior of 7-21 Storm in Beijing, 2012
  - Music Knowledge Mining
  - Hadoop for Video Streaming on the Web
  - MapReduce Jobs for Video Conversion
- Your proposed one…

1. Error Performance Analysis for Partitioned Sketch Data Structures
- We have already discussed the time complexity (in terms of update time).
- TASK: What about the error performance? How should the depth of each sketch be allocated optimally (Zipfian)?
- Start by learning how the CM sketch analyzes its error performance (Theorem 1 and similar results): full.pdf
- Learn about P(d)-CU
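As a starting point for the error analysis, the standard Count-Min point-query guarantee can be written out as below. This is the well-known bound from the CM sketch analysis, which is presumably what "Theorem 1" refers to. It assumes non-negative updates and pairwise-independent hash functions. The symbols $w$ (width), $d$ (depth), $\varepsilon$, $\delta$ and $X_{i,j}$ are notation introduced here, not taken from the slides.

```latex
% Count-Min sketch with width w and depth d, applied to a frequency vector a >= 0.
\[
  w = \left\lceil \frac{e}{\varepsilon} \right\rceil, \qquad
  d = \left\lceil \ln\frac{1}{\delta} \right\rceil, \qquad
  \hat{a}_i = \min_{1 \le j \le d} \mathrm{count}[j, h_j(i)] .
\]
% The estimate never underestimates, and overestimates by little with high probability.
\[
  a_i \le \hat{a}_i, \qquad
  \Pr\!\left[\hat{a}_i > a_i + \varepsilon \lVert a \rVert_1\right] \le \delta .
\]
% Proof sketch: let X_{i,j} be the collision mass added to item i in row j.
\[
  \mathbb{E}[X_{i,j}] \le \frac{\lVert a \rVert_1}{w} \le \frac{\varepsilon}{e}\lVert a \rVert_1
  \;\Longrightarrow\;
  \Pr\!\left[X_{i,j} > \varepsilon \lVert a \rVert_1\right] \le \frac{1}{e}
  \quad\text{(Markov)},
\]
\[
  \Pr\!\left[\forall j:\; X_{i,j} > \varepsilon \lVert a \rVert_1\right] \le e^{-d} \le \delta .
\]
```

The last line shows where the depth enters: the failure probability decays as $e^{-d}$, so the depth given to each partition directly controls that partition's failure probability. That is one possible handle for the allocation question in the task.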

How to determine this?

Result
- Analysis (e.g., mathematical derivations)
- Some initial simulation (to check correctness)
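A minimal sketch of what an initial simulation could look like: a plain (unpartitioned) Count-Min sketch fed with a Zipfian stream, then compared against exact counts. It does not implement the partitioned P(d)-CU structure. All names here (`CountMinSketch`, `zipf_stream`, `simulate`) and the parameter values are illustrative assumptions, and the hash family `((a*x + b) mod P) mod w` is one common pairwise-independent choice.

```python
import math
import random
from collections import Counter

P = 2**31 - 1  # Mersenne prime for the (assumed) hash family ((a*x + b) mod P) mod w


class CountMinSketch:
    """Plain Count-Min sketch for integer items with non-negative updates."""

    def __init__(self, eps, delta, seed=0):
        self.w = math.ceil(math.e / eps)          # width from the error target
        self.d = math.ceil(math.log(1 / delta))   # depth from the failure target
        rng = random.Random(seed)
        self.hashes = [(rng.randrange(1, P), rng.randrange(P)) for _ in range(self.d)]
        self.table = [[0] * self.w for _ in range(self.d)]

    def _cols(self, x):
        return [((a * x + b) % P) % self.w for a, b in self.hashes]

    def update(self, x, c=1):
        for row, col in zip(self.table, self._cols(x)):
            row[col] += c

    def query(self, x):
        # Point query: minimum over the d counters the item maps to.
        return min(row[col] for row, col in zip(self.table, self._cols(x)))


def zipf_stream(n_items, n_updates, s=1.0, seed=1):
    """Draw n_updates items from {1..n_items} with probability proportional to 1/rank^s."""
    rng = random.Random(seed)
    items = list(range(1, n_items + 1))
    weights = [1 / r**s for r in items]
    return rng.choices(items, weights=weights, k=n_updates)


def simulate(eps=0.01, delta=0.01, n_items=1000, n_updates=20000):
    """Return the sketch, per-item overestimation, and the fraction of items
    whose error exceeds eps * (stream length)."""
    stream = zipf_stream(n_items, n_updates)
    true = Counter(stream)
    cms = CountMinSketch(eps, delta)
    for x in stream:
        cms.update(x)
    errors = {x: cms.query(x) - c for x, c in true.items()}
    bad = sum(1 for e in errors.values() if e > eps * n_updates)
    return cms, errors, bad / len(errors)
```

Calling `simulate()` and checking that every error is non-negative is a quick correctness test. Comparing the returned fraction against `delta` gives a first empirical look at the bound before moving on to per-partition depths.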

2. Survey
- Write a good survey in English on "Security and Privacy for Big Data: A Survey and Future Directions"
- Cite at least 40 references (IEEE Xplore and ACM Digital Library)
- Paper organization:
  - Classify the works into categories, from different angles
  - Extensive comparisons
  - Identify future directions (i.e., what are the missing pieces?)

Some Materials
- intelligence-driven-security-io.pdf
- data/GroupDocuments/Big_Data_Top_Ten_v1.pdf
- papers/wp_addressing-big-data-security-challenges.pdf
- Think about:
  - Storage
  - Analysis
  - Applications
  - Cloud, Internet-of-Things

3. Analyze Citizen Behaviors of 7-21 Storm in Beijing, 2012
- The Power of Social Networks and Public Crowd
- Use social network APIs such as Sina Weibo: open.weibo.com/wiki
- Use keyword search to retrieve all related data
- Hashtags: #望京人赴机场免费救援# ("Wangjing residents go to the airport to offer free rescue"), #双闪车队# ("hazard-light convoy") (100+)
- 菠菜 X6, 望京网 (Wangjing Net)

4. Music Knowledge Mining
- Million Song Dataset
- For example, calculating music density: process-a-million-songs-in-20-minutes/
- YOUR TASK: Predict which songs a user will listen to
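The density example can be sketched as a small map/reduce-style computation. This assumes "density" means the number of audio segments per second of a track, which is an interpretation and is not stated on the slide. The record fields (`artist`, `duration`, `segment_starts`) and the function names are hypothetical placeholders, not the dataset's actual schema.

```python
from collections import defaultdict


def segment_density(num_segments, duration_s):
    """Segments per second for one track (assumed definition of density)."""
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    return num_segments / duration_s


def map_track(track):
    """Map step: emit (artist, density) for one track record (hypothetical fields)."""
    yield track["artist"], segment_density(len(track["segment_starts"]), track["duration"])


def reduce_mean(pairs):
    """Reduce step: average the emitted values per key."""
    sums = defaultdict(lambda: [0.0, 0])
    for key, value in pairs:
        sums[key][0] += value
        sums[key][1] += 1
    return {key: total / n for key, (total, n) in sums.items()}


def mean_density_by_artist(tracks):
    """Run the map and reduce steps in-process over an iterable of track records."""
    return reduce_mean(pair for t in tracks for pair in map_track(t))
```

The same two functions could be moved into a real MapReduce job once the records are read from the dataset files. The in-process version is only meant for checking the logic on a handful of tracks.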

5. Video Streaming on the Web
- Store your video as chunks in HDFS
- Case: the user suddenly jumps to a specific part of the video
- Seek in the file to position the cursor at a specific location
- HDFS can only be accessed through a Hadoop client, and an Apache server is not one.
- Apache/FUSE: all file system operations (directory browsing, file opening, and content access) are enabled over HDFS content through the FUSE interface.
- oop_for_video_streaming/
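The seek case can be illustrated with ordinary file operations, on the assumption that an HDFS directory mounted through FUSE behaves like a regular local path. The helper names `locate` and `read_range` are hypothetical, and the demo uses a temporary local file rather than a real mount.

```python
import os
import tempfile


def locate(byte_offset, chunk_size):
    """Return (chunk index, offset inside that chunk) for a byte position."""
    if byte_offset < 0 or chunk_size <= 0:
        raise ValueError("offset must be >= 0 and chunk size must be > 0")
    return divmod(byte_offset, chunk_size)


def read_range(path, start, length):
    """Seek to `start` and read `length` bytes, as a player would after a jump."""
    with open(path, "rb") as f:
        f.seek(start)
        return f.read(length)


def demo():
    """Exercise both helpers on a temporary file standing in for a mounted video."""
    data = bytes(range(256)) * 4  # 1024 bytes of known content
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(data)
        path = tmp.name
    try:
        return locate(300, 128), read_range(path, 300, 4)
    finally:
        os.remove(path)
```

In a real setup the web server would typically handle byte-range requests itself. The sketch only shows that a jump in the player reduces to a byte offset, which in turn maps to one chunk and a position inside it.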

Result
- A demo:
  - Choose at least 1 video format (e.g., flv)
  - A client to play video
  - A web server (Apache with FUSE)
  - HDFS to store your videos

6. MapReduce For Video Conversion
- Convert a huge number of video files from one format to another
- Use the open-source video converter FFmpeg
- Data stored on HDFS
- Create an app that does this (running on Google App Engine)
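One way the map step might look is sketched below: each input line names one video file, and the mapper invokes FFmpeg on it. The function names and the tab-separated output format are assumptions made for illustration. The sketch also assumes the file is already available on the local disk of the worker, so copying from and back to HDFS is left out. The `run` parameter is injectable so the logic can be checked without FFmpeg installed.

```python
import os
import subprocess


def output_name(src, new_ext):
    """Replace the extension of `src` with `new_ext` (e.g. a.flv -> a.mp4)."""
    base, _ = os.path.splitext(src)
    return f"{base}.{new_ext.lstrip('.')}"


def build_ffmpeg_cmd(src, dst):
    """Basic FFmpeg invocation: -y overwrites the output, -i names the input."""
    return ["ffmpeg", "-y", "-i", src, dst]


def map_lines(lines, new_ext="mp4", run=subprocess.run):
    """Map step: convert each listed file and emit 'src<TAB>dst<TAB>status'."""
    for line in lines:
        src = line.strip()
        if not src:
            continue  # skip blank input lines
        dst = output_name(src, new_ext)
        result = run(build_ffmpeg_cmd(src, dst))
        status = "ok" if result.returncode == 0 else "failed"
        yield f"{src}\t{dst}\t{status}"
```

Used as a streaming-style mapper, `map_lines` would be fed the lines of standard input and its results printed one per line. No reduce step is needed for pure conversion, although one could collect the failed files for a retry.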

Mechanism
- Work in groups: 3-5 students, with clear roles
- Send me the following by this Friday (Nov 22):
  - Team leader, team members
  - Topic
- Deadline: 28 December 2013!
- Deliverable: project report in Chinese
  - Introduction (motivation, WHY?)
  - Related Work (what others have done)
  - Your proposal (HOW?)
  - Performance Evaluation
  - Conclusion
- Presentation

Suggested Arrangement
- Week 1: Define your roles and start literature research
- Weeks 2 and 3: Propose solutions
- Weeks 4 and 5: Implement and obtain results
- Week 6: Write the report