CSCI5570 Large Scale Data Processing Systems A General Introduction to Big Data Applications and Infrastructures James Cheng CSE, CUHK

Slides:



Advertisements
Similar presentations
Copyright © 2012, SAS Institute Inc. All rights reserved. INTRODUCTION TO DATA AND TEXT MINING ANDREW PEASE, 8 MARCH 2013.
Advertisements

Fashion Marketing Basics
Chapter 1 Business Driven Technology
Ruckus Smart Wi-Fi for Retail
Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
IBM SPSS Solutions A SELECT INTERNATIONAL COMPANY.
SAS solutions SAS ottawa platform user society nov 20th 2014.
An intuitive online e-commerce store. A complete solution to build & manage your online store. It's a proven technology platform with integrated payment,
Data Mining By Jason Baltazar, Phil Cademas, Jillian Latham, Rachel Peeler & Kamila Singh.
Presentation to the IAB Audience Measurement Leadership Forum New York City; November 29, 2007.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
1 Melanie Alexander. Agenda Define Big Data Trends Business Value Challenges What to consider Supplier Negotiation Contract Negotiation Summary 2.
Information Management and Market Research. Marketing Research Links…. Consumer, Customer, and Public Marketer through information Marketing Research:
DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER.
Big Data Yuan Xue CS 292 Special topics on.
1© 2015 IBM Corporation Unlocking the power of the API economy Client Briefing Nov.
Target Market - Review What is a market? ◦ People who share similar needs and wants and have the ability to purchase a given product are a market What.
Calendars Made Easy Simple strategies to boost your profit potential Norwood, the Norwood logo, norwood.com, and all related trademarks, logos, and trade.
BIG DATA USES CASES & LESSONS LEARNED Marrakech – March 2016 Alexandre AKROUR, CEO 1.
Green IT: Sustainability A History Computing Research: Roles and Opportunities for Information Technology in Meeting Sustainability Challenges.
Medicine medical imaging patient monitoring record keeping surgery diagnostics pharmaceuticals prosthetics remote care nanotechnology Art digital.
Marketing and the Marketing Concept 1.1
Place – Marketing Mix 4.5 The four Ps.
Teck Chia Partner, Exponent.vc
CSCI5570 Large Scale Data Processing Systems
Connected Infrastructure
UNIT C The Business of Fashion
Bennu Advanced Research Institute.
Jenna Spivak Evans Digital and eCommerce Capabilities Manager Unilever
UNIT C The Business of Fashion
5 Ways to Optimize eCommerce Search Performance Presented by:
Makes Insurance Smarter.
ACTi Retail Big Data Solutions
Connected Living Connected Living What to look for Architecture
Marketing in Today’s World
Smart Building Solution
Native Ads by YeahMobi.
Analytics Reports Available on 500 Different U. S. Industries
Discovering Computers 2010: Living in a Digital World Chapter 14
Built on Microsoft Azure, 11Ants Retail Analytics Customer Science Solution Delivers Real Growth Opportunities to Retailers with Loyalty Programs MICROSOFT.
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
Big Data –An Overview
Brookshire Grocery Company: Personalizing the Customer Shopping Experience in Real Time with SAP® Solutions Company Brookshire Grocery Company Headquarters.
The Internet of Things (IoT) and Analytics
What is Marketing?.
Smart Building Solution
Connected Living Connected Living What to look for Architecture
WELCOME Mobile Applications Testing
Connected Infrastructure
Location Analytics and Competitive Advantage of Place
Assessment Criteria Course Project: 50%
Project Hermes Artificial Intelligence Initiative
Machine Learning’s Growing Impact on E-Commerce
Top Emerging E-commerce Magento trends. The progress of E-commerce industry is changing year by year, this evolution has made super easy for the online.
Big Data For Indian SMEs
Marketing Foundations
Marketing Functions Marketing Co-Op.
Analytics Reports Available on 500 Different U. S. Industries
Introduction to Business
Big Data Young Lee BUS 550.
Selling IIoT Solutions to End Users
Chapter 17 Promotional Concepts and Strategies
Digital Twin Market
Unit 3 Review Questions.
Mobile Commerce and Ubiquitous Computing
Marketing Foundations
Network Attached Storage (NAS) Market
Lesson 3.2 Product Planning
Presentation transcript:

CSCI5570 Large Scale Data Processing Systems A General Introduction to Big Data Applications and Infrastructures James Cheng CSE, CUHK

Big Data Applications Big data applications in science Big data applications in industry Big data applications for social good

Big Data Applications Big data applications in science – Genomic studies (Big Data Institute at CUHK-SZ, working with hospitals and companies) – Astronomical data analysis – Complex physics simulations – Biology and environmental research – …

Big Data Applications Big data applications in industry – Sales conversion optimization – Consumer behavior analysis – Customer segmentation – Security threat prediction – Predictive support – Market basket analysis – Pricing optimization – Other industry-specific applications Source: Big data use cases by Dell net/Dell/big-data-use- cases

Sales Conversion Optimization Collect data from the process how consumers go through Internet advertising or search, to conversion into sales Analyze data for the entire sales conversion process: from a click on an ad to the final transaction Uncover insights on how the conversion process can be improved

Sales Conversion Optimization Example 1: – Industry: communication – Companies: T-Mobile, Celcom Axiata Berhad – Usage: customer retention, product promotion, market share acquiring … Example 2: – Industry: finance – Companies: Credem – Usage: financial product/service prediction, consumer targeting …

Consumer Behavioral Analytics Analyze customers’ purchasing habits, as well as data about customers from multiple sources Understand why customers like certain products, and craft personalized marketing campaigns to boost profits

Consumer Behavior Analysis Example 1: – Industry: food & beverage – Companies: Starbucks, McDonald’s, Nestle – Usage: customer loyalty building, customer experience enhancing, customer sentiment monitoring, brand image maintaining, crisis control … Example 2: – Industry: finance – Companies: Mastercard – Usage: customers’ spending patterns, shoppers’ interests, consumer benchmarking …

Customer Segmentation Analyze data about consumers from multiple sources, e.g., social media data and transaction history Classify customers into different groups, and target each group with personalized offers

Customer Segmentation Example 1: – Industry: retail – Companies: Walmart, Nordstrom – Usage: customized marketing, customized shopping experience, targeted product promotion … Example 2: – Industry: hotel/travel – Companies: IHG (International Hotel Group) – Usage: personalized web experience, marketing mix adjusting, sales boosting …

Security Threat Prediction Track trends in IT security breaches Analyze anomalies that indicate a potential security breach Proactively go after threats before they strike

Security Threat Prediction Example 1: – Industry: banking – Companies: Rabobank, Zion’s Bank – Usage: fraud detection, financial criminal activity prediction … Example 2: – Industry: e-commerce – Companies: Amazon – Usage: warehouse security, …

Predictive Support Analyze sensor data and other machine- generated data Predict potential equipment malfunctions Reduce lost profits due to downtime Improve safety for employees and customers Examples: – Travel industry: Southwest Airlines – Transportation industry: Union Pacific Railroad – Cloud storage industry: Engine Yard

Market Basket Analysis and Pricing Optimization Analyze market basket data, pricing data, and data from multiple sources Optimize product selection and pricing, and decide where to target ads. Examples: – Household retail industry: P&G – Beverage industry: Coca-Cola – Travel industry: Etihad – Car manufacturing industry: Ford

Industry-Specific Applications Insurance (e.g., Discovery Health): identify fraudulent claims Healthcare (e.g., Aurora Health Care): control healthcare quality, find trends in diseases Travel (e.g., Kayak): flight price prediction HR/Recruiting (e.g., Catalyst IT): screen job candidates Farming (e.g., John Deere): plan farming, boost efficiency and yields

Summary of Industrial Applications 6 big categories of big data use cases (plus 5 more industry-specific use cases) A wide range of industries: communication, finance, food & beverage, retail, hotel, travel, banking, e- commerce, transportation, Cloud storage, car manufacturing, insurance, healthcare, HR/recruiting, farming Proven working in many companies, with thousands to 2 million employees If the nature of a company falls into any of the industries, or is similar to the business of any of the companies just presented, then the large scale data processing systems taught in this course will be very helpful!

Big Data Applications Big data applications for social good – Physical education in junior schools (a project involving hundreds of schools in Hong Kong, each school with students) – Healthy ageing – Health care monitoring – Air pollution control – …

Big Data Solutions Deep learning – The universal big data solution? – The best big data solution? Source: 马小平 THU

Big Data Solutions A big data application often requires a combination of multiple types of systems to develop a good solution What types of systems are generally available today for big data solutions?

Systems for Big Data Solutions General-purpose big data platforms: Hadoop, Spark, Flink, Dato, Naiad, Husky … NoSQL: MongoDB, Cassandra, CouchDB … Key-value stores: Redis, Memcached … Search engines: ElasticSearch, Solr … Machine learning systems: Petuum, GraphLab, TensorFlow, mxnet, Angel, DMTK … Graph computing systems: Pregel, Giraph, GraphLab, …

Systems for Big Data Solutions General- purpose platforms NoSQL Key-value stores Search engines Graph systems Machine learning systems Great! So many big data tools available!

A Typical Big Data Solution Graph Analytics Machine Learning Map Reduce Stream Processing Stream Processing SQL OLAP SQL OLAP Powerful Computing Engine (e.g., Husky) APIs Search Engine Messaging System Key-value Stores NoSQL Hadoop Ecosystem Data Storage Data Collection Data Processing Graph Analytics Machine Learning Map Reduce Stream Processing Stream Processing SQL OLAP SQL OLAP User-Friendly Application Interface With such a platform (e.g., Husky), you can easily build high- performance end-to-end big data business solutions! Smart city Finance Marketing Scientific research Anything about big data