Tableau Overview and Publicly Available Data Sources

Presentation transcript:

Tableau Overview and Publicly Available Data Sources
Sagar Samtani and Hsinchun Chen, with updates from Hongyi Zhu
MIS 464, Spring 2019

Tableau Background
Tableau is a powerful data visualization tool, capable of creating a variety of interactive visualizations from many kinds of data sources.
Tableau is commercial software, but it is available to students for free. Download from http://www.tableau.com/academic/students
Tableau is primarily a drag-and-drop tool.

Data Sources and Types of Visualizations
Tableau can connect to a variety of data sources, including:
  Local files: Excel, text, Access
  Traditional databases: SQL Server, MySQL, Oracle, PostgreSQL, DB2
  Cloud technologies: Amazon Aurora, EMR, Redshift, BigQuery
  Big data technologies: Hadoop, Hive, Spark SQL
Tableau can create a variety of visualizations, including:
  Basic bar and line charts (e.g., temporal charts, box plots)
  Geospatial analysis
  Word clouds
  Treemaps
  Network analysis, although there are better tools for this (e.g., Gephi)
These visualizations can be combined into interactive dashboards, which can later be published online or shared easily.

Tableau Interface
Screen areas labeled on the slide: Data, Dimensions, Measures, Drag-n-drop, Worksheet, Format/Encode, Plot types, Tabs.
Color coding: blue marks discrete data; green marks continuous data.
Dimensions: data fields that cannot be aggregated; qualitative values (such as names, dates, or geographical data).
Measures: data fields that can be measured, aggregated, or used for math operations; numeric, quantitative values.
Reference: https://onlinehelp.tableau.com/current/pro/desktop/en-us/datafields_typesandroles.htm
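The dimension/measure split described above can be illustrated with a rough Python sketch. It is only a simplification of the idea on the slide, not Tableau's actual role-assignment logic: a column is guessed to be a measure when every value parses as a number, and a dimension otherwise. The sample rows and column names (`name`, `position`, `height_in`, `wins`) are made up for illustration.

```python
def is_number(value):
    """Return True when the value can be parsed as a float."""
    try:
        float(value)
    except (TypeError, ValueError):
        return False
    return True


def guess_roles(rows):
    """Guess a role per column: 'measure' if all values are numeric, else 'dimension'."""
    roles = {}
    for column in rows[0]:
        values = [row[column] for row in rows]
        roles[column] = "measure" if all(is_number(v) for v in values) else "dimension"
    return roles


# Hypothetical rows; the real workbook's columns may differ.
sample = [
    {"name": "A. Example", "position": "QB", "height_in": "75", "wins": "42"},
    {"name": "B. Sample", "position": "WR", "height_in": "72", "wins": "17"},
]

roles = guess_roles(sample)
for column, role in roles.items():
    print(f"{column}: {role}")
```

Note that this heuristic would label numeric identifiers such as zip codes as measures, even though they are qualitative; in practice a field's role may need to be changed by hand.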

Walkthrough Example
The following example shows how to load data into Tableau, make three basic visualizations (a bar chart, a word cloud, and a geospatial visualization), and put them into a dashboard.
The data used in this example is an Excel spreadsheet about NFL offensive players from 1999-2013. It contains:
  ~40,000 rows of data
  Player information (physically measurable traits, birthplace, college attended)
  Positions played
  Wins achieved in career

Connecting to a Data Source
We have to connect to a data source before we can start making visualizations.
First, since our data is in an Excel workbook, we select that option.
Second, we join two of the sheets in the workbook to get access to a larger set of data. Drag the “Unique players” and “Zip codes” sheets to the right.
Finally, select the “Inner” join option. We will join the sheets based on zip code.
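An inner join keeps only the rows whose zip code appears in both sheets. The following is a minimal Python sketch of that behavior, not of Tableau's internals; the rows and the column names (`player`, `birth_zip`, `zip`, `latitude`, `longitude`) are hypothetical stand-ins for the two sheets.

```python
# Hypothetical sample rows; the real workbook's columns may differ.
players = [
    {"player": "A. Example", "birth_zip": "85721"},
    {"player": "B. Sample", "birth_zip": "10001"},
    {"player": "C. Unknown", "birth_zip": "00000"},  # no matching zip row
]
zip_codes = [
    {"zip": "85721", "latitude": 32.23, "longitude": -110.95},
    {"zip": "10001", "latitude": 40.75, "longitude": -73.99},
]


def inner_join(left, right, left_key, right_key):
    """Return merged rows for every left/right pair with equal key values."""
    # Index the right-hand rows by key so each left row is a dictionary lookup.
    index = {}
    for row in right:
        index.setdefault(row[right_key], []).append(row)
    joined = []
    for row in left:
        for match in index.get(row[left_key], []):
            joined.append({**row, **match})
    return joined


joined_rows = inner_join(players, zip_codes, "birth_zip", "zip")
for row in joined_rows:
    print(row["player"], row["latitude"], row["longitude"])
```

The player with no matching zip code is dropped from the result, which is the trade-off of an inner join: the joined data has coordinates for every row, at the cost of losing unmatched rows.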

Creating a Bar Chart
Suppose we want to know which major college conferences have the most combined wins since 1999.
First, drag the “Conference” dimension onto the “Rows” shelf and the “College Wins” field onto the “Columns” shelf. Open the drop-down on “College Wins” and select “Sum.”
Second, select the bar chart on the right-hand side.
To add a little color, drag “Conference” onto the “Color” mark.
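Selecting “Sum” on “College Wins” with “Conference” on the rows amounts to a group-by-and-sum. The sketch below shows that aggregation in plain Python and prints a crude text bar chart; the conference names and win counts are invented, and the column names are assumptions.

```python
from collections import defaultdict

# Hypothetical rows; conference names and win counts are made up.
rows = [
    {"conference": "SEC", "college_wins": 10},
    {"conference": "Big Ten", "college_wins": 8},
    {"conference": "SEC", "college_wins": 7},
    {"conference": "Pac-12", "college_wins": 9},
]


def sum_by(rows, dimension, measure):
    """Sum the measure column for each distinct value of the dimension column."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[dimension]] += row[measure]
    return dict(totals)


wins_by_conference = sum_by(rows, "conference", "college_wins")

# Print one text bar per conference, largest total first.
for conference, wins in sorted(
    wins_by_conference.items(), key=lambda item: item[1], reverse=True
):
    print(f"{conference:10s} {'#' * wins} {wins}")
```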

Creating a Word Cloud
Suppose now we want to get a general sense of which conferences are most popular in terms of player enrollment. A word cloud is a great way to represent this visually.
First, switch the “Marks” option to “Text”.
Second, drag the “Conference” dimension into the “Text” marks box.
Then drag the “Conference” dimension into the “Size” marks box. Adjust the measurement on this by opening the drop-down and selecting “Measure (Count)”.
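The word cloud sizes each conference name by how many player rows mention it. A small Python sketch of that idea follows; the linear scaling from count to font size is an assumption made for illustration and is not Tableau's documented sizing rule, and the sample values are invented.

```python
from collections import Counter

# Hypothetical "Conference" values, one per player row.
conferences = ["SEC", "Big Ten", "SEC", "Pac-12", "SEC", "Big Ten"]

# Equivalent of "Measure (Count)": number of rows per conference.
counts = Counter(conferences)


def font_sizes(counts, min_size=10, max_size=40):
    """Linearly scale counts to font sizes (an assumed scaling, for illustration)."""
    low, high = min(counts.values()), max(counts.values())
    span = (high - low) or 1  # avoid division by zero when all counts are equal
    return {
        word: min_size + (count - low) * (max_size - min_size) / span
        for word, count in counts.items()
    }


sizes = font_sizes(counts)
for word, size in sizes.items():
    print(f"{word}: count={counts[word]}, font size={size}")
```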

Creating a Geospatial Visualization
Consider now that we are interested in the birthplaces of all of the NFL players. We can easily create a map representation.
Drag the “Longitude” dimension to columns and the “Latitude” dimension to rows.
Select the map visualization.
Add some color by dragging “Birth Zip Code” into the “Color” mark.

Combining Visualizations into a Dashboard
To tell a more comprehensive story, we can create a dashboard combining all of the visualizations.
Simply open a dashboard view and start dragging sheets into the dashboard.
You can format the dashboard and add filters to it as you wish.

Further Examples
It is useful to explore other Tableau visualizations to get ideas. https://public.tableau.com/s/gallery contains many great visualizations.
Examples shown on the slide: Endangered Safari; US Flights Delayed by Precipitation; Domestic Violence in Spain.

Tableau Resources
Gallery of Tableau visualizations: https://public.tableau.com/s/gallery
Tableau training videos: http://www.tableau.com/learn/training
Sample Tableau data sources: https://public.tableau.com/s/resources
Reference book: Tableau Your Data!: Fast and Easy Visual Analysis with Tableau Software. Daniel Murray, 2nd edition, 2015. Available online through UA Library.
Companion materials: http://tableauyourdata.com/downloads/

Publicly Available Data Sources
Name of Data Source | # Entries | Description | Data Formats | URL
US Data.gov | > 300,000 | Agriculture, business, climate, consumer, ecosystem, education, energy, finance, health, local government, manufacturing, public safety, science and research | HTML, XML, XLSX, CSV, PDF, shapefile, txt, zip | http://www.data.gov/
EU OpenData | > 15,000 | Same categories as US Data.gov | Same formats as US Data.gov | http://data.europa.eu/euodp/data/dataset
Kaggle | 14,072 | Product, insurance, forum comments, Twitter data, images | CSV, XLSX, SQL | https://www.kaggle.com/datasets
UC Irvine Machine Learning Repository | 468 | Research datasets used in past machine learning publications | HTML, XML, XLSX, CSV, PDF, txt, zip | https://archive.ics.uci.edu/ml
Amazon Opendata on AWS | 90 | Public transportation, satellite images, web pages, genome, ecosystem, etc. | Data API (CSV, JSON) | https://registry.opendata.aws/
Microsoft Research Open Data | 53 | Biology, engineering, healthcare, physics, math, science and research | CSV, TXT, TSV, PDF | https://msropendata.com/

Publicly Available Data Sources (continued)
Name of Data Source | # Entries | Description | Data Formats | URL
Awesome Public Datasets (GitHub repo) | > 600 | Agriculture, biology, climate, data challenges, economics, education, finance, government, healthcare, machine learning, NLP, search engines, sports, transportation | XLSX, JSON, XML, Zip, CSV, PDF | https://github.com/awesomedata/awesome-public-datasets
Figshare | > 50 | Data from various sciences (astronomy, biological, environmental, information, etc.), engineering, commerce, management, tourism | XLSX, Zip, XML, CSV, PDF | https://figshare.com/
KD Nuggets | - | Data sets designed specifically for data mining tasks | JSON, CSV, SQL, XLSX | http://www.kdnuggets.com/datasets/index.html
VisualData | 247 | Computer vision datasets | JPG, PNG, … | https://www.visualdata.io/
ML Vis | 48 | Repository of scientific datasets for visualization | CSV | http://www.mlvis.com/
Google Dataset Search | - | Search engine for publicly available datasets | - | https://toolbox.google.com/datasetsearch
Enigma | - | - | - | https://public.enigma.com/
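Many of the sources listed offer CSV downloads. Before loading a file into Tableau, it can help to check its columns and row count. The sketch below does this with Python's standard `csv` module; the CSV text is made up and stands in for a downloaded file, and `summarize_csv` is a hypothetical helper name.

```python
import csv
import io

# Made-up CSV text standing in for a downloaded file; not real data.
SAMPLE_CSV = """city,year,avg_temp_c
Tucson,2018,21.4
Tucson,2019,21.9
Boston,2018,11.2
"""


def summarize_csv(text):
    """Return the column names, row count, and first row of CSV text."""
    reader = csv.DictReader(io.StringIO(text))
    rows = list(reader)
    return {
        "columns": reader.fieldnames,
        "row_count": len(rows),
        "first_row": rows[0] if rows else None,
    }


summary = summarize_csv(SAMPLE_CSV)
print("Columns:", summary["columns"])
print("Rows:", summary["row_count"])
print("First row:", summary["first_row"])
```

For a file on disk, the same `csv.DictReader` can be given a file object opened with `open(path, newline="")` instead of the in-memory text.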

US Data.gov
Screenshot callouts: Introduction; Dataset Search; Browse by Category; Metadata and Additional Info; Data Download.

Kaggle
Screenshot callouts: Dataset Search; Browse with Filters; Metadata and Description; Data Demo and Explore Panel; Other users’ projects using this dataset.

UCI Repository
Screenshot callouts: Search and Browse; Metadata and Description.

Amazon OpenData
Screenshot callouts: Dataset Search; Browsing; User Project Examples with This Dataset.