Technology of Data Analytics. INTRODUCTION OBJECTIVE  Data Analytics mindset – shallow and wide, deep when you need it  Quick overview, useful tidbits,

Slides:



Advertisements
Similar presentations
Utility SQL Bin (v3.3). Agenda  Purpose  Target User  Benefits  System Requirement  User Guide Introduction Navigation Add New SQL Add New Version.
Advertisements

Information literacy B Lecture 4 Database and Spreadsheet Takeshi Tokuyama Tohoku University Graduate School of Information Sciences System Information.
Technical BI Project Lifecycle
BigData Tools Seyyed mohammad Razavi. Outline  Introduction  Hbase  Cassandra  Spark  Acumulo  Blur  MongoDB  Hive  Giraph  Pig.
Formulas and Function Course Overview Logical Functions Data and Time Functions Math Functions Trigonometric Functions Statistical Functions Text Functions.
Computer Science & Engineering 2111 Data Validation and Macros 1 CSE 2111 Lecture-Data Validation and Macros.
This presentation was scheduled to be delivered by Brian Mitchell, Lead Architect, Microsoft Big Data COE Follow him Contact him.
Microsoft Excel 2013: Excel Power User Course with Jim Mollé Learn iT! Computer Software Training.
Introduction to VBA. What is VBA? VBA stands for Visual Basic for Applications. It is a programming language used exclusively by the Microsoft Office.
Analytics Map Reduce Query Insight Hive Pig Hadoop SQL Map Reduce Business Intelligence Predictive Operational Interactive Visualization Exploratory.
MapReduce April 2012 Extract from various presentations: Sudarshan, Chungnam, Teradata Aster, …
Fundamentals of Information Systems, Fifth Edition
1 Overview of Databases. 2 Content Databases Example: Access Structure Query language (SQL)
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
Chapter 9 Database Management Discovering Computers Fundamental.
Presented by CH.Anusha.  Apache Hadoop framework  HDFS and MapReduce  Hadoop distributed file system  JobTracker and TaskTracker  Apache Hadoop NextGen.
CIS 103 — Applied Computer Technology Last Edited: September 17, 2010 by C.Herbert Using Database Management Systems.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
Databases From A to Boyce Codd. What is a database? It depends on your point of view. For Manovich, a database is a means of structuring information in.
An Introduction to HDInsight June 27 th,
Database A database is a collection of data organized to meet users’ needs. In this section: Database Structure Database Tools Industrial Databases Concepts.
When bet365 met Riak and discovered a true, “always on” database.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Database Systems Basic Data Management Concepts
Microsoft Access 2013 Overview of Microsoft Access Databases.
Foundations of Business Intelligence: Databases and Information Management.
Hadoop implementation of MapReduce computational model Ján Vaňo.
Matthew Winter and Ned Shawa
Mining real world data RDBMS and SQL. Index RDBMS introduction SQL (Structured Query language)
HEMANTH GOKAVARAPU SANTHOSH KUMAR SAMINATHAN Frequent Word Combinations Mining and Indexing on HBase.
RoOUG Iunie Bucuresti, 26 Iunie Agenda Inregistrarea participantilor ODI – Common Use Cases 2Iunie 2013.
CS 157B: Database Management Systems II April 10 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
PREPARED BY: PN. SITI HADIJAH BINTI NORSANI. LEARNING OUTCOMES: Upon completion of this course, students should be able to: 1. Understand the structure.
1 HBASE – THE SCALABLE DATA STORE An Introduction to HBase XLDB Europe Workshop 2013: CERN, Geneva James Kinley EMEA Solutions Architect, Cloudera.
Analytics Plus Product Overview. Introduction Analytics Plus is a self-service Business Intelligence and advanced analytics software. On-premise reporting.
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
1 Seattle University Master’s of Science in Business Analytics Key skills, learning outcomes, and a sample of jobs to apply for, or aim to qualify for,
Beyond Hadoop The leading open source system for processing big data continues to evolve, but new approaches with added features are on the rise. Ibrahim.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Slide 1 © 2016, Lera Technologies. All Rights Reserved. SAP BO vs SPLUNK vs OBIEE By Lera Technologies.
Microsoft Power Query: an Excel Users Dream for Data Extraction and Cleansing Presented by: Belinda Allen Smith & Allen Consulting, Inc.
BIG DATA/ Hadoop Interview Questions.
What is it and why it matters? Hadoop. What Is Hadoop? Hadoop is an open-source software framework for storing data and running applications on clusters.
Microsoft Power Query 101 Belinda Allen Smith & Allen Consulting, Inc.
Microsoft Ignite /28/2017 6:07 PM
BI 202 Data in the Cloud Creating SharePoint 2013 BI Solutions using Azure 6/20/2014 SharePoint Fest NYC.
Business Intelligence MSCS 6931 Compare Tableau and Power BI Haochen(Bamboo) Sun Sep 30, 2015.
Microsoft Access 2016 Overview of Microsoft Access Databases
Serve as Director Funded by the Louisiana Department of Transportation and Development Developed LaCrash application to electronically capture crash.
SNS COLLEGE OF TECHNOLOGY
Leveraging the Business Intelligence Features in SharePoint 2010
Introduction to R Programming with AzureML
Innovative Technology Solutions
Created by Kamila zhakupova
Central Florida Business Intelligence User Group
Tableau Overview  Tableau is widely used data visualization and BI tool. Tableau is simple to use and has extensive visualization capability that make.
Ministry of Higher Education
Overview of Microsoft Access Databases Chapter 1
Ch 4. The Evolution of Analytic Scalability
Introduction to Apache
Analytics Plus Product Overview 1.
Building your First Cube with SSAS
Charles Tappert Seidenberg School of CSIS, Pace University
Module 2B - Data Systems Marco MERENS Nevin MURAD Lima, March 2019
Overview of Databases and Salesforce Chapter 1
Big-Data Analytics with Azure HDInsight
Presentation transcript:

Technology of Data Analytics

INTRODUCTION OBJECTIVE  Data Analytics mindset – shallow and wide, deep when you need it  Quick overview, useful tidbits, provide a jumping off point AGENDA/ TOPICS  Excel  VBA  Access  SQL  Tableua  Hadoop  Analytical Packages: R/ SAS/ SPSS/ Minitab

SQUARE 1 Business and Technology  Entity  Attributes  Schema  Relational Database  ETL - Extract Transform Load  Data Mining

START WITH EXCEL  It’s the easiest and most available platform  Can teach others to maintain Collect Data Validation Drop Downs Store vLookups Analyze Formulas If, And Pivot Table Report/ Visualize Charts Conditional Formatting Offset

VISUAL BASIC FOR APPLICATIONS Microsoft language Object Oriented- noun.verb; noun.adjective=“adjective” Record macro and play around Modules and Userforms Cell Referencing - cells(x,y).select For loop – for index = startingnumber to ending number If logicalstuff then stuff that happens end if Use it for: Moving data Changing charts

GOOGLE DOCS: COLLECTION  Somebody already did everything for you  Google people are smarter than you  You can use the interwebs: instead of local drive

ACCESS  Beginning of databasing Table Like Excel spreadsheet Tightly defined values allowed View Pulling info from tables using logic A lasting query that is used to populate reports Form Data input Report Generates reports

SQL  Big Boy Access  Same as Access without the bumpers and hand holding  Real deal use in software world  Can be used for maintenance and diagnosing software back ends Table Like Excel spreadsheet Tightly defined values allowed View Pulling info from tables using logic A lasting query that is used to populate reports Query Viewing data Stored Procedures Loading and moving data I don’t really know SRS Web based reports

TABLEAU  Connections  Worksheets  Views  Dashboards  Stories

HADOOP  Virtualizes multiple computers/ servers to create a cloud computing unit  Hadoop Common – contains libraries and utilities needed by other Hadoop modules.  Hadoop Distributed File System (HDFS) – a distributed file-system that stores data on commodity machines, providing very high aggregate bandwidth across the cluster.  Hadoop YARN – a resource-management platform responsible for managing compute resources in clusters and using them for scheduling of users' applications.  Hadoop MapReduce – a programming model for large scale data processing.  Get started at:

Analyze: SAS/ R/ SPSS/ Minitab S.A.S. Academic/ Common R Open source S.P.S.S IBM Minitab Analytical Excel

Other  iTunes U: Data Visualization  CoursEra: Introduction to Data Science  Code Academy: other programming languages

EDUCATION PROJECTS Open Source Education – BDAA Book of Knowledge  Stats Cheat Sheet  Excel Guide  SQL Guide  How to Guides in General….