Big Data Javad Azimi May 2015. First of All… Sorry about the language  Feel free to ask any question Please share similar experiences.

Slides:



Advertisements
Similar presentations
R and HDInsight in Microsoft Azure
Advertisements

Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
Big Data Workflows N AME : A SHOK P ADMARAJU C OURSE : T OPICS ON S OFTWARE E NGINEERING I NSTRUCTOR : D R. S ERGIU D ASCALU.
Chapter 14 The Second Component: The Database.
Business Intelligence: Essential of Business
Evolution in Coming 10 Years: What's the Future of Network? - Evolution in Coming 10 Years: What's the Future of Network? - Big Data- Big Changes in the.
Big Data. What is Big Data? Analog starage vs digital. The FOUR V’s of Big Data. Who’s Generating Big Data The importance of Big Data. Optimalization.
Chapter 2: Business Intelligence Capabilities
Copyright © 2014 Pearson Education, Inc. 1 It's what you learn after you know it all that counts. John Wooden Key Terms and Review (Chapter 6) Enhancing.
Basic Marketing Research Customer Insights and Managerial Action
© 2012 TeraMedica, Inc. Big Data: Challenges and Opportunities for Healthcare Joe Paxton Healthcare and Life Sciences Sales Leader.
This presentation was scheduled to be delivered by Brian Mitchell, Lead Architect, Microsoft Big Data COE Follow him Contact him.
Chapter 11 Databases.
Big Data. What is Big Data? Big Data Analytics: 11 Case Histories and Success Stories
Cloud Computing and Big Data
© 2012 IBM Corporation IBM Security Systems 1 © 2013 IBM Corporation 1 Ecommerce Antoine Harfouche.
Big Data: A definition Big data is the realization of greater intelligence by storing, processing, and analyzing data that was previously ignored due to.
Case 2: Emerson and Sanofi Data stewards seek data conformity
Big Data Bijan Barikbin Denisa Teme Matthew Joseph.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
By N.Gopinath AP/CSE. There are 5 categories of Decision support tools, They are; 1. Reporting 2. Managed Query 3. Executive Information Systems 4. OLAP.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
Innovation Work Circle: Big Data Presented By: Innovation Work Circle Group.
What’s a mobile app? A mobile app is a software program you can download and access directly using your phone or another mobile device, like a tablet.
Big Data – Big Opportunity Mohammad Khansari ITRC President Jan 2015 ITRC, Tehran, Iran.
© 2012 IBM Corporation Converting Big Data into Big Knowledge.
SUPPLY CHAIN OF BIG DATA. WHAT IS BIG DATA?  A lot of data  Too much data for traditional methods  The 3Vs  Volume  Velocity  Variety.
What we know or see What’s actually there Wikipedia : In information technology, big data is a collection of data sets so large and complex that it.
IoT Meets Big Data Standardization Considerations
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
BUSINESS INTELLIGENCE & ADVANCED ANALYTICS DISCOVER | PLAN | EXECUTE JANUARY 14, 2016.
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
B IG D ATA : S TORAGE, A NALYSIS AND I MPACT Justinas Bisikirskas 1.
Course : Study of Digital Convergence. Name : Srijana Acharya. Student ID : Date : 11/28/2014. Big Data Analytics and the Telco : How Telcos.
Big Data Quality Panel Norman Paton University of Manchester.
Big Data ---a statistician’s perspective Ming Ji, PhD College of Nursing USF.
BIG DATA. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Big Data-An Analysis. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult.
Reinventing Customer Experiences
CNIT131 Internet Basics & Beginning HTML
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
SNS COLLEGE OF TECHNOLOGY
Visualizing Complex Software Systems
Sushant Ahuja, Cassio Cristovao, Sameep Mohta
Understanding Big Data
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
April 25, 2012 The Three R’s Are Old School – Now It Is All About Volume, Velocity & Variety Peter Guest Alberta Public Sector Client Technical Advisor.
Big Data.
BIG DATA IN ENGINEERING APPLICATIONS
Contextual Intelligence as a Driver of Services Innovation
Mohammad J. Mansourzadeh
Introduction Data Mining for Business Analytics.
April 25, 2012 The Three R’s Are Old School – Now It Is All About Volume, Velocity & Variety Peter Guest Alberta Public Sector Client Technical Advisor.
Big Data.
Big Data Young Lee BUS 550.
Zoie Barrett and Brian Lam
Big Data: Four Vs Salhuldin Alqarghuli.
Big DATA.
V. Uddameri Texas Tech University
Big Data.
Presentation transcript:

Big Data Javad Azimi May 2015

First of All… Sorry about the language  Feel free to ask any question Please share similar experiences

Outline Why Big Data? Hadoop and Next Distributed Machine Learning Algorithm A few Projects to Share

Definition Big data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. The challenges include capture, curation, storage, search, sharing, transfer, analysis, and visualization.

The importance of Big Data The real issue is not that you are acquiring large amounts of data. It's what you do with the data that counts. The hopeful vision is that organizations will be able to take data from any source, harness relevant data and analyze it to find answers that enable: Cost reductions Time reductions New product development and optimized offerings Smarter business decision making

The FOUR V’s of Big Data From traffic patterns and music downloads to web history and medical records, data is recorded, stored, and analyzed to enable that technology and services that the world relies on every day. But what exactly is big data be used? According to IBM scientists big data can be break into four dimensions: Volume, Velocity, Variety and Veracity.

The FOUR V’s of Big Data

Volume. Many factors contribute to the increase in data volume. Transaction-based data stored through the years. Unstructured data streaming in from social media. Increasing amounts of sensor and machine-to-machine data being collected. In the past, excessive data volume was a storage issue. But with decreasing storage costs, other issues emerge, including how to determine relevance within large data volumes and how to use analytics to create value from relevant data. Lesson: We have enough data for many application

The FOUR V’s of Big Data

Variety. Data today comes in all types of formats. Structured, numeric data in traditional databases. Information created from line-of-business applications. Unstructured text documents, , video, audio, stock ticker data and financial transactions. Managing, merging and governing different varieties of data is something many organizations still grapple with. Lesson: Data from difference sources can be used for a specific target application.

The FOUR V’s of Big Data

Velocity. Data is streaming in at unprecedented speed and must be dealt with in a timely manner. RFID tags, sensors and smart metering are driving the need to deal with torrents of data in near-real time. Reacting quickly enough to deal with data velocity is a challenge for most organizations. Lesson: Online processing time MUST be very quick!

The FOUR V’s of Big Data

Veracity - Big Data Veracity refers to the biases, noise and abnormality in data. Is the data that is being stored, and mined meaningful to the problem being analyzed. Inderpal feel veracity in data analysis is the biggest challenge when compares to things like volume and velocity. In scoping out your big data strategy you need to have your team and partners work to help keep your data clean and processes to keep ‘dirty data’ from accumulating in your systems. Lesson: Noise is different from minority data and this is the problem of everyone Lesson: Noise is different from minority data and this is the problem of everyone

Applications (1) User Movie Netflix Collaborative Filtering

Applications (2)

Applications (3)

More…

Age Detection using social media Break up estimation using tweeter data Gender detection using profile Avatar Leveraging information across web (Entity detection using table in web) Matching patients with drug (CollabRX) What can we do with tweeter?