MIS 3500 Instructor: Bob Travica Newer DB Topics 2015.

Slides:



Advertisements
Similar presentations
MongoDB PostgreSQL SaaS Quality Measure Storage
Advertisements

R and HDInsight in Microsoft Azure
OVERVIEW OF NETWORKING RESEARCH IN NETLAB 1 Dr. Jim Martin Associate Professor School of Computing Clemson University
Skills: none Concepts: LAN, data link functions – moving data within a LAN and medium access, data link protocols – Ethernet and WiFi, why protocols standards.
Observation Pattern Theory Hypothesis What will happen? How can we make it happen? Predictive Analytics Prescriptive Analytics What happened? Why.
Blog A Blog is a website where entries are written in chronological order and commonly displayed in reverse chronological order.
Architecting for the Internet of Things
25 Need-to-Know Facts. Fact 1 Every 2 days we create as much information as we did from the beginning of time until 2003 [Source]Source © 2014 Bernard.
Fraud Detection in Banking using Big Data By Madhu Malapaka For ISACA, Hyderabad Chapter Date: 14 th Dec 2014 Wilshire Software.
Chapter 3 Foundations of Business Intelligence: Databases and Information Management.
PROTOCOLSSTANDARDSEQUIPMENTBLUETOOTH CELL PHONE DATA NETWORKS ADVANTAGES/ DISADVANTAGES GENERAL INFORMATION Main Menu.
SoNDa Sensor Network for Data Explore! 1. SoNDa Sensor Network for Data Explore! KEYWORDS Wireless Sensors Communication 2.
Basic Marketing Research Customer Insights and Managerial Action
AMBIENT INTELLIGENCE Presented by GOKUL SURESH. INTRODUCTION  Evolution of Ambient Intelligence.  Science with a fictional view.  Enriching environment.
How Life Will Change With Smart Homes Yakovlev Artyom BCG account manager.
This presentation was scheduled to be delivered by Brian Mitchell, Lead Architect, Microsoft Big Data COE Follow him Contact him.
1 CSCE 5013: Hot Topics in Mobile and Pervasive Computing Nilanjan Banerjee Hot Topic in Mobile and Pervasive Computing University of Arkansas Fayetteville,
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
© 2013 IBM Corporation Version 1.0 The New Eye Insight through Big Data and Analytics: A Case Study on Citizen Sentiment Analysis Sandipan Sarkar, Executive.
SOFTWARE SYSTEMS DEVELOPMENT MAP-REDUCE, Hadoop, HBase.
Big Data. What is Big Data? Big Data Analytics: 11 Case Histories and Success Stories
4G-LTE: Enhancing Efficiency in Organizations. Factors Impacting Digitization Processes and Systems January Powerful Platforms and Devices Storage.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Specification section 6.2. What do you need to learn? The application and advantages/disadvantages of the following digital media and new technology in.
What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
SEMINAR ON Guided by: Prof. D.V.Chaudhari Seminar by: Namrata Sakhare Roll No: 65 B.E.Comp.
CONFIDENTIAL 1. 2 Designing the Intelligent Energy Gateway 2009 CONFIDENTIAL.
IoT, Big Data and Emerging Technologies
Google’s Big Table 1 Source: Chang et al., 2006: Bigtable: A Distributed Storage System for Structured Data.
Hadoop Ali Sharza Khan High Performance Computing 1.
Ethics of Big Data Eduardo Felipe Zecca da Cruz. What is Big Data? Stamford, Conn.-based IT research firm Gartner Inc. defines "big data" as "high-volume,
Database Essentials. Key Terms Big Data Describes a dataset that cannot be stored or processed using traditional database software. Examples: Google search.
Data Warehousing Data Mining Privacy. Reading Bhavani Thuraisingham, Murat Kantarcioglu, and Srinivasan Iyer Extended RBAC-design and implementation.
What Is Big Data? And What Does It Mean to Marketers? Frank Cotignola October 17, 2013.
REU 2004 Computer Science and Engineering Department The University of Texas at Arlington Research Experiences for Undergraduates in Distributed Rational.
Smarter Transportation Data… the next natural resource for Smarter Cities Eric-Mark Huitema MD Smarter Transportation, IBM
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
By Jack Stewart. Cloud computing, or something being in the cloud, is a colloquial expression used to describe a variety of different types of computing.
What we know or see What’s actually there Wikipedia : In information technology, big data is a collection of data sets so large and complex that it.
REU 2007 Computer Science and Engineering Department The University of Texas at Arlington Research Experiences for Undergraduates in Information Processing.
Chapter 17 Types of Careers. I. Exploring Careers  A. Career cluster: a group of careers that are related in some way  total clusters identified.
Computer Science and Engineering Department The University of Texas at Arlington MavHome: An Intelligent Home Environment.
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
AZ PASS User Group Azure Data Factory Overview Josh Sivey, Solution Partner October
What if your app could put the power of analytics everywhere decisions are made? Modern apps with data visualizations built-in have the power to inform.
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
Internet of Things – Getting Started
1 Enabling Smart Cities/Campuses to Serve the Internet of People Florence Hudson Senior Vice President & Chief Innovation Officer Internet2 TNC16 June.
Unlock your Big Data with Analytics and BI on Office365 Brian Culver ● SharePoint Fest Seattle● BI102 ● August 18-20, 2015.
MIS 3500 Instructor: Bob Travica Trendy Database Topics 2016.
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
Tutorial: Big Data Algorithms and Applications Under Hadoop
Connected Infrastructure
Big Data Enterprise Patterns
Software Systems Development
Where do we need it ? Why do we need it ? What is Data Analytics ?
BIG Data 25 Need-to-Know Facts.
Hadoopla: Microsoft and the Hadoop Ecosystem
Connected Infrastructure
Cloud DX Connected Health Kits Depend on Azure to Deliver Cloud Storage and Securely Host Data for its Remote Patient Monitoring MICROSOFT AZURE APP BUILDER.
Federico Perrero – Plant Manager
System And Application Software
Microsoft Connect /22/2018 9:50 PM
Accelerate Your Self-Service Data Analytics
Big Data Young Lee BUS 550.
Big DATA.
Mark Quirk Head of Technology Developer & Platform Group
Presentation transcript:

MIS 3500 Instructor: Bob Travica Newer DB Topics 2015

Big Data  3 big V:  Volume: terabytes (15 zeroes), petabytes (18 zeroes)  Variety: Social media, communications, sensors everywhere*, Internet of Things, video feeds, GPS… Implication: various formats  Velocity: wired and wireless continuous feeds 2

Goals and Uses  Goals:  Integrate data on the same object across sources (Customer, Citizen etc.; spatial mashups)  Analysis: Existing patterns, Predictive analysis  Application domains:  Monitoring for business & other purposes (sensors)  Marketing (relationship mktg., Sentiment analysis is social media…)  Energy grid management  Transportation networks management  Health (analysis of cancer cell behavior and of patient vital signs)  Science (human genome)  Policy analysis (United Nations’ system for predicting social problems) 3

Big Data Tasks 4

 Machine-generated data (sensors); automatic creation and transfer *  Home appliances (security, energy consumption, heating, food, entertainment)  Monitoring/Control (cars, athletic equipment, machinery, appliances)*  Example: Smart power grid** 5 Smart meter; Internet & Wi-Fi connectivity

Technologies  Hadoop (framework for file system and processing of large datasets on server clusters)*  Machine learning – automated construction of models to fit data (instead of hypothesis testing as with DW and Analytics)  Open source  Notable developers: Yahoo, Facebook, Yahoo!, Google, Microsoft 6 Microsoft Azure-based Hadoop

7 DATA PROCESSING

 A database for Big Data  Distributed, non-relational, scalable  Based on Google’s BigTable * 8 Row Key (reversed URL)Time StampColumn Key – “Anchor” (Family) + URLpart (Qualifier) "com.cnn.www"t9anchor:cnnsi.com = "CNN" "com.cnn.www"t8anchor:my.look.ca = "CNN.com" Row KeyTime StampColumn Key – “Contents” + keyword in tagged content "com.cnn.www"t6 contents:html = " … ​ " "com.cnn.www"t5 contents:html = " … ​ " "com.cnn.www"t3 contents:html = " … ​ " DATA are cites of “CNN*” Referencing sites DATA are webpages Compressed. There can be any Number of unbound Contents Columns. All columns put together make a “BigTable”.

NoSQL – Not Only SQL 9

Modern Database environments 10

Modern Database environments 11