Download presentation
Presentation is loading. Please wait.
Published byAsia Retter Modified over 9 years ago
1
Technology of Data Analytics
2
INTRODUCTION OBJECTIVE Data Analytics mindset – shallow and wide, deep when you need it Quick overview, useful tidbits, provide a jumping off point AGENDA/ TOPICS Excel VBA Access SQL Tableua Hadoop Analytical Packages: R/ SAS/ SPSS/ Minitab
3
SQUARE 1 Business and Technology Entity Attributes Schema Relational Database ETL - Extract Transform Load Data Mining
4
START WITH EXCEL It’s the easiest and most available platform Can teach others to maintain Collect Data Validation Drop Downs Store vLookups Analyze Formulas If, And Pivot Table Report/ Visualize Charts Conditional Formatting Offset
5
VISUAL BASIC FOR APPLICATIONS Microsoft language Object Oriented- noun.verb; noun.adjective=“adjective” Record macro and play around Modules and Userforms Cell Referencing - cells(x,y).select For loop – for index = startingnumber to ending number If logicalstuff then stuff that happens end if Use it for: Moving data Changing charts
6
GOOGLE DOCS: COLLECTION Somebody already did everything for you Google people are smarter than you You can use the interwebs: instead of local drive
7
ACCESS Beginning of databasing Table Like Excel spreadsheet Tightly defined values allowed View Pulling info from tables using logic A lasting query that is used to populate reports Form Data input Report Generates reports
8
SQL Big Boy Access Same as Access without the bumpers and hand holding Real deal use in software world Can be used for maintenance and diagnosing software back ends Table Like Excel spreadsheet Tightly defined values allowed View Pulling info from tables using logic A lasting query that is used to populate reports Query Viewing data Stored Procedures Loading and moving data I don’t really know SRS Web based reports
9
TABLEAU Connections Worksheets Views Dashboards Stories
10
HADOOP Virtualizes multiple computers/ servers to create a cloud computing unit Hadoop Common – contains libraries and utilities needed by other Hadoop modules. Hadoop Distributed File System (HDFS) – a distributed file-system that stores data on commodity machines, providing very high aggregate bandwidth across the cluster. Hadoop YARN – a resource-management platform responsible for managing compute resources in clusters and using them for scheduling of users' applications. Hadoop MapReduce – a programming model for large scale data processing. Get started at: http://hadoop.apache.org/docs/current/
11
Analyze: SAS/ R/ SPSS/ Minitab S.A.S. Academic/ Common R Open source S.P.S.S IBM Minitab Analytical Excel
12
Other iTunes U: Data Visualization CoursEra: Introduction to Data Science Code Academy: other programming languages
13
EDUCATION PROJECTS Open Source Education – BDAA Book of Knowledge Stats Cheat Sheet Excel Guide SQL Guide How to Guides in General….
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.