© 2012 IBM Corporation IBM Security Systems 1 © 2013 IBM Corporation 1 Ecommerce Antoine Harfouche.

Slides:



Advertisements
Similar presentations
R and HDInsight in Microsoft Azure
Advertisements

Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
Big Data What is Big Data? Recently much good science, whether physical, biological, or social, has been forced to confront - and has often benefited from.
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
Open Government Vlora Ademi, Business Development Manager-Edu, Microsoft Macedonia &Kosovo
Observation Pattern Theory Hypothesis What will happen? How can we make it happen? Predictive Analytics Prescriptive Analytics What happened? Why.
Dunja Mladenić Marko Grobelnik Jožef Stefan Institute, Slovenia.
Axis Intelligent Video Intelligence where you need it.
Fraud Detection in Banking using Big Data By Madhu Malapaka For ISACA, Hyderabad Chapter Date: 14 th Dec 2014 Wilshire Software.
ONS Big Data Project. Plan for today Introduce the ONS Big Data Project Provide a overview of our work to date Provide information about our future plans.
Knowledge Portals and Knowledge Management Tools
Meeting of the Management of Statistical Information System (MSIS 2014) Innovation – Open Data Initiative of Government of India – Fostering Innovations,
Big Data. What is Big Data? Analog starage vs digital. The FOUR V’s of Big Data. Who’s Generating Big Data The importance of Big Data. Optimalization.
Big Data A big step towards innovation, competition and productivity.
Copyright © 2014 Pearson Education, Inc. 1 It's what you learn after you know it all that counts. John Wooden Key Terms and Review (Chapter 6) Enhancing.
BIG DATA – WHAT’S THE BIG DEAL The call would start soon, please be on mute. Thanks for your time and patience.
This presentation was scheduled to be delivered by Brian Mitchell, Lead Architect, Microsoft Big Data COE Follow him Contact him.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
© 2013 IBM Corporation Version 1.0 The New Eye Insight through Big Data and Analytics: A Case Study on Citizen Sentiment Analysis Sandipan Sarkar, Executive.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
BIGDATA AND DATASCIENCE By Sigma Analytics and Computing.
Big Data. What is Big Data? Big Data Analytics: 11 Case Histories and Success Stories
Charles Tappert Seidenberg School of CSIS, Pace University
© Hortonworks Inc Hortonworks Page 1. © Hortonworks Inc Big Data Changes the Game Megabytes Gigabytes Terabytes Petabytes Purchase detail.
Big Data: A definition Big data is the realization of greater intelligence by storing, processing, and analyzing data that was previously ignored due to.
+ Hbase: Hadoop Database B. Ramamurthy. + Motivation-0 Think about the goal of a typical application today and the data characteristics Application trend:
Data Science and Big Data Analytics Chap1: Intro to Big Data Analytics
Graph Data Analytics Arka Mukherjee, Ph.D. Global IDs Resolving Complexity at an Enterprise Scale.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
Machine Learning Extract from various presentations: University of Nebraska, Scott, Freund, Domingo, Hong,
Transforming video & photo collections into valuable resources John Waugaman President - Tygart Technology, Inc.
Cyberspace Law Committee Meeting, August 3, 2012 Big Data Lois Mermelstein The Law Office of Lois D. Mermelstein
1 Melanie Alexander. Agenda Define Big Data Trends Business Value Challenges What to consider Supplier Negotiation Contract Negotiation Summary 2.
Cloud Computing & Big Data Group 9 Femme L H Sabaru | Aditya Gisheila N P | Aninda Harapan | Harry | Andrew Khosugih.
© 2012 IBM Corporation Converting Big Data into Big Knowledge.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
What we know or see What’s actually there Wikipedia : In information technology, big data is a collection of data sets so large and complex that it.
IoT Meets Big Data Standardization Considerations
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
Big Data Yuan Xue CS 292 Special topics on.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
Course : Study of Digital Convergence. Name : Srijana Acharya. Student ID : Date : 11/28/2014. Big Data Analytics and the Telco : How Telcos.
BIG DATA. The information and the ability to store, analyze, and predict based on that information that is delivering a competitive advantage.
Big Data Javad Azimi May First of All… Sorry about the language  Feel free to ask any question Please share similar experiences.
B IG D ATA A NALYTICS A Presentation by Meg Monsen, Michael Leonard, and Eric Zeng.
BIG DATA. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database.
B ig D ata Analysis for Page Ranking using Map/Reduce R.Renuka, R.Vidhya Priya, III B.Sc., IT, The S.F.R.College for Women, Sivakasi.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
BIG Data What is it?. There are some things that are so big that they have implications for everyone, whether we want it or not. Big Data is one of those.
Big Data-An Analysis. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult.
Data Analytics (CS40003) Introduction to Data Lecture #1
CNIT131 Internet Basics & Beginning HTML
Understanding Big Data
April 25, 2012 The Three R’s Are Old School – Now It Is All About Volume, Velocity & Variety Peter Guest Alberta Public Sector Client Technical Advisor.
BIG Data 25 Need-to-Know Facts.
BIG DATA IN ENGINEERING APPLICATIONS
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Department of Information Systems
April 25, 2012 The Three R’s Are Old School – Now It Is All About Volume, Velocity & Variety Peter Guest Alberta Public Sector Client Technical Advisor.
Big Data.
Big Data Young Lee BUS 550.
IT Megatrends that shape the Digital Future…
Zoie Barrett and Brian Lam
Big DATA.
UNIT 6 RECENT TRENDS.
Big Data.
Presentation transcript:

© 2012 IBM Corporation IBM Security Systems 1 © 2013 IBM Corporation 1 Ecommerce Antoine Harfouche

© 2012 IBM Corporation IBM Security Systems 2 © 2013 IBM Corporation 2 Big Data Analytics Lecture Series Adapted from Kalapriya Kannan IBM Research Labs July, 2013

© 2012 IBM Corporation IBM Security Systems 3 © 2013 IBM Corporation 3 What is the aim of the course Focus is on “Systems” and applications for cloud-based storage and processing of BIG DATA. +Big Data - Definition +Big Data - Analytics +Big Data - Storage (HDFS) +Big Data - Computing (Map/Reduce) +Big Data - Database (HBase) +Big Data – Graph DB (Titan) +Big Data - Streaming (Strom)

© 2012 IBM Corporation IBM Security Systems 4 © 2013 IBM Corporation 4 “Learning is not just restricted to listening, it is actively asking relevant questions”

© 2012 IBM Corporation IBM Security Systems 5 © 2013 IBM Corporation 5  Get Convinced about “Big Data”  Understand why we need a different paradigm.  Ascertain with confidence the need to look at data computing in a different way.  Realize the potential of big data – All of you are skilled enough to get into it.  What we will not do – Do research on why things have evolved into the current trends as it stands. – Try to be hands-on – But not guaranteed Aim

© 2012 IBM Corporation IBM Security Systems 6 © 2013 IBM Corporation 6 What are we going to understand  What is Big Data?  Why we landed up there?  To whom does it matter  Where is the money?  Are we ready to handle it?  What are the concerns?  Tools and Technologies – Is Big Data Hadoop

© 2012 IBM Corporation IBM Security Systems 7 © 2013 IBM Corporation 7 Simple to start  What is the maximum file size you have dealt so far? –Movies/Files/Streaming video that you have used? –What have you observed?  What is the maximum download speed you get?  Simple computation –How much time to just transfer.

© 2012 IBM Corporation IBM Security Systems 8 © 2013 IBM Corporation 8 What is big data?  “ Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone. This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals to name a few. This data is “ big data.”

© 2012 IBM Corporation IBM Security Systems 9 © 2013 IBM Corporation 9 Huge amount of data  There are huge volumes of data in the world: + From the beginning of recorded time until 2003, + We created 5 billion gigabytes (exabytes) of data. + In 2011, the same amount was created every two days + In 2013, the same amount of data is created every 10 minutes.

© 2012 IBM Corporation IBM Security Systems 10 © 2013 IBM Corporation 10 Big data spans three dimensions: Volume, Velocity and Variety  Volume: Enterprises are awash with ever-growing data of all types, easily amassing terabytes—even petabytes—of information. – Turn 12 terabytes of Tweets created each day into improved product sentiment analysis – Convert 350 billion annual meter readings to better predict power consumption  Velocity: Sometimes 2 minutes is too late. For time-sensitive processes such as catching fraud, big data must be used as it streams into your enterprise in order to maximize its value. – Scrutinize 5 million trade events created each day to identify potential fraud – Analyze 500 million daily call detail records in real-time to predict customer churn faster – The latest I have heard is 10 nano seconds delay is too much.  Variety: Big data is any type of data - structured and unstructured data such as text, sensor data, audio, video, click streams, log files and more. New insights are found when analyzing these data types together. – Monitor 100’s of live video feeds from surveillance cameras to target points of interest – Exploit the 80% data growth in images, video and documents to improve customer satisfaction

© 2012 IBM Corporation IBM Security Systems 11 © 2013 IBM Corporation 11 Finally…. `Big- Data’ is similar to ‘Small-data’ but bigger.. But having data bigger it requires different approaches: Techniques, tools, architecture … with an aim to solve new problems Or old problems in a better way

© 2012 IBM Corporation IBM Security Systems 12 © 2013 IBM Corporation 12 Whom does it matter  Research Community  Business Community - New tools, new capabilities, new infrastructure, new business models etc.,  On sectors Financial Services..

© 2012 IBM Corporation IBM Security Systems 13 © 2013 IBM Corporation 13 How are revenues looking like….

© 2012 IBM Corporation IBM Security Systems 14 © 2013 IBM Corporation 14 The Social Layer in an Instrumented Interconnected World 2+ billion people on the Web by end billion RFID tags today (1.3B in 2005) 4.6 billion camera phones world wide 100s of millions of GPS enabled devices sold annually 76 million smart meters in 2009… 200M by TBs of tweet data every day 25+ TBs of log data every day ? TBs of data every day

© 2012 IBM Corporation IBM Security Systems 15 © 2013 IBM Corporation 15 What does Big Data trigger?  From “Big Data and the Web: Algorithms for Data Intensive Scalable Computing”, Ph.D Thesis, Gianmarco

© 2012 IBM Corporation IBM Security Systems 16 © 2013 IBM Corporation 16 BIG DATA is not just HADOOP Manage & store huge volume of any data Hadoop File System MapReduce Manage streaming data Stream Computing Analyze unstructured data Text Analytics Engine Data Warehousing Structure and control data Integrate and govern all data sources Integration, Data Quality, Security, Lifecycle Management, MDM Understand and navigate federated big data sources Federated Discovery and Navigation

© 2012 IBM Corporation IBM Security Systems 17 © 2013 IBM Corporation 17 Types of tools typically used in Big Data Scenario  Where is the processing hosted? – Distributed server/cloud  Where data is stored? – Distributed Storage (eg: Amazon s3)  Where is the programming model? – Distributed processing (Map Reduce)  How data is stored and indexed? – High performance schema free database  What operations are performed on the data? – Analytic/Semantic Processing (Eg. RDF/OWL)

© 2012 IBM Corporation IBM Security Systems 18 © 2013 IBM Corporation 18 When dealing with Big Data is hard  When the operations on data are complex: – Eg. Simple counting is not a complex problem. – Modeling and reasoning with data of different kinds can get extremely complex  Good news with big-data: – Often, because of the vast amount of data, modeling techniques can get simpler (e.g., smart counting can replace complex model-based analytics)… – …as long as we deal with the scale.

© 2012 IBM Corporation IBM Security Systems 19 © 2013 IBM Corporation 19 Why Big-Data?  Key enablers for the appearance and growth of ‘Big-Data’ are: + Increase in storage capabilities + Increase in processing power + Availability of data

© 2013 IBM Corporation IBM Security Systems 20 IBM big data IBM big data IBM big data IBM big data THINK