1 Business Intelligence and the Statistician: Tutorial EPA Statistics Users Group Meeting Brand Niemann Senior Enterprise Architect U.S. EPA April 28,

Presentation transcript:

1 Business Intelligence and the Statistician: Tutorial
EPA Statistics Users Group Meeting
Brand Niemann, Senior Enterprise Architect, U.S. EPA
April 28,
Note: This is also my 2010 Individual Development Plan.

2 Overview
1. What is it?
2. How does it affect the statistician?
3. How is it useful?
4. Who in EPA is using it?
5. Where do I find out more?
6. What is Wolfram Alpha?

3 1. What is it?
EPA Statistics Training teaches one how to interpret environmental data and do Exploratory Data Analysis (e.g., S-PLUS).
–See epadata.wik.is - Interpretation of Environmental Statistics.
State-of-the-art statistical tools have evolved from Exploratory Data Analysis to comprehensive Business Intelligence Assessments (e.g., Spotfire).
–See Wikipedia – Business Intelligence.
State-of-the-art statistical techniques and tools are needed to provide quantitative data quality assessments and business intelligence results for Data.gov.
–Data quality is especially important when using Data.gov/semantic for Linking Open Data.

4 2. How does it affect the statistician?
S-PLUS:
–Started with Statistical Sciences (1988) and ended up with TIBCO as Spotfire (2008). Spotfire is a business intelligence company whose origins trace back to the Human-Computer Interaction Laboratory at the University of Maryland, College Park, MD, in the early 1990s, and that was bought by TIBCO.
Interpretation of Environmental Statistics:
–OEI training on how simple pictures, graphs, and summaries can be used to interpret data (without equations and computers).
Exploratory Data Analysis:
–An approach, named by John Tukey, to analyzing data for the purpose of formulating hypotheses worth testing.
Business Intelligence:
–Computer-based techniques used in spotting, digging out, and analyzing business data to support better business decision-making.

5 3. How is it useful?
Data.gov:
–Increase public access to high-value, machine-readable datasets generated by the executive branch of the federal government.
Data.gov/semantic:
–Implement the principles of Linking Open Data in Data.gov.
Linking Open Data:
–Tim Berners-Lee outlined four principles, paraphrased as follows:
  1. Use URIs (like URLs) to identify things.
  2. Use HTTP URIs so that these things can be referred to and looked up ("dereferenced") by people and user agents.
  3. Provide useful information (i.e., a structured description, or metadata) about the thing when its URI is dereferenced.
  4. Include links to other, related URIs in the exposed data to improve discovery of other related information on the Web.
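The four principles above can be sketched in a few lines of Python. This is only a toy illustration under stated assumptions: the `example.org` URIs, the `DATASET` dictionary, and the `dereference` and `discover` helpers are all hypothetical, and an in-memory lookup stands in for a real HTTP request.

```python
"""Toy illustration of the four Linking Open Data principles (no network access)."""

# Hypothetical URIs under example.org, used purely for illustration.
OZONE = "http://example.org/indicator/ozone"
AIR = "http://example.org/topic/air"

# Principles 1 and 2: every thing is named by an HTTP URI.
# Principle 3: each URI maps to a structured description (metadata).
# Principle 4: each description links to other, related URIs.
DATASET = {
    OZONE: {"label": "Ambient Concentrations of Ozone", "links": [AIR]},
    AIR: {"label": "Air", "links": [OZONE]},
}


def dereference(uri):
    """Stand-in for an HTTP GET: return the description stored for a URI."""
    if not uri.startswith(("http://", "https://")):
        raise ValueError("Principle 2 asks for HTTP URIs: " + uri)
    return DATASET[uri]


def discover(start):
    """Follow links from one URI and return every URI reached, in visit order."""
    seen, queue = [], [start]
    while queue:
        uri = queue.pop(0)
        if uri in seen:
            continue  # already visited; avoids looping on circular links
        seen.append(uri)
        queue.extend(dereference(uri)["links"])
    return seen


if __name__ == "__main__":
    for uri in discover(OZONE):
        print(uri, "->", dereference(uri)["label"])
```

Starting from a single URI, `discover` reaches the related one only because the description exposes links, which is the point of the fourth principle.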

6 4. Who in EPA is using it?
EPA Business Intelligence and Analytics Center, Tim Hinds, Manager (Intranet Site):
–EPA provides business intell, analytics tools in SaaS model, FCW, June 18, 2009 (reports as PDF or Web pages).
Oracle Business Intelligence Enterprise Edition, Business Objects XI (discontinued use), Informatica PowerCenter, and SAS.
–Recent Oracle Business Intelligence Workshop: Most use Excel, but we think Oracle can do more.
Put Your Desktop in the Cloud to Support the Open Government Directive and Data.gov/semantic, Brand Niemann, Senior Enterprise Architect:
–Apply to EPA Statistics Training – Interpretation of Environmental Data as a pilot. See next slides of screen captures.
–Complete the statistical data stories for the 2008 ROE Indicators and other high-value data sets. See next slides of screen captures.

7 4. Who in EPA is using it?
[Screen captures: EPA Environmental Statistics Training; Histogram of Data Rounded to Nearest Tenth; Normal Distribution Parameters; Table of Contents]

8 4. Who in EPA is using it?
[Screen captures: Spotfire Help: Curve Fit Models; Spotfire on the Web: Non-interactive]

9 4. Who in EPA is using it?
[Screen captures: EPA Ontology; Ozone Concentrations Indicator]

10 4. Who in EPA is using it?
[Screen captures: Spotfire Help: Scatterplots; Spotfire on the Web: Non-interactive]

11 5. Where do I find out more?
EPA Environmental Statistics Training:
–
Histogram of Data Rounded to Nearest Tenth:
–http://epadata.wik.is/Statistics_Users_Group/Environmental_Statistics_Training/Interpretation_of_Environmental_Data#Histogram_of_Data_Rounded_to_Nearest_Tenth
Spotfire Help: Curve Fit Models:
–
Spotfire on the Web: Noninteractive:
–
EPA Ontology:
–
ROE Ambient Concentrations of Ozone:
–http://epaontology.wik.is/2_Air/2.2_What_Are_the_Trends_in_Outdoor_Air_Quality_and_Their_Effects_on_Human_Health_and_the_Environment%3f/2.2.2_ROE_Indicators/ _Ambient_Concentrations_of_Ozone
Spotfire Help: Scatterplots:
–
Spotfire on the Web: Noninteractive:
–
Spotfire Web Player (Makes Spotfire on the Web Interactive):
–

12 5. Where do I find out more?
Seven Basic Tools of Quality:
–A designation given to a fixed set of graphical techniques identified as being most helpful in troubleshooting issues related to quality. They are called basic because they are suitable for people with little formal training in statistics and because they can be used to solve the vast majority of quality-related issues.
–The tools are:
  The cause-and-effect or Ishikawa (fishbone) diagram (Online)
  The check sheet (Spreadsheet)
  The control chart (Spotfire)
  The histogram (Spotfire)
  The Pareto chart (Spreadsheet)
  The scatter diagram (Spotfire)
  Stratification (alternately flow chart or run chart) (Visio)
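As one concrete instance of these tools, the arithmetic behind a Pareto chart can be sketched as follows. This is a minimal illustration, not the slide author's method: the function name `pareto_table` and the `SAMPLE` tallies are hypothetical, and the data is invented for the example.

```python
from collections import Counter


def pareto_table(defects):
    """Return (category, count, cumulative_percent) rows, largest count first."""
    counts = Counter(defects)
    total = sum(counts.values())
    rows, running = [], 0
    # most_common() yields categories in descending order of count.
    for category, count in counts.most_common():
        running += count
        rows.append((category, count, round(100.0 * running / total, 1)))
    return rows


# Hypothetical check-sheet tallies, invented for illustration only.
SAMPLE = ["label"] * 5 + ["leak"] * 3 + ["dent"] * 2

if __name__ == "__main__":
    for category, count, cum_pct in pareto_table(SAMPLE):
        print(f"{category:<8}{count:>4}{cum_pct:>8.1f}%")
```

The sorted counts give the bars of the chart and the cumulative percentage gives its line, so the few categories that account for most of the issues appear first.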

13 6. What is Wolfram Alpha?
[Screen capture: Computational Knowledge Engine; Answer; Source Information]

14 6. What is Wolfram Alpha?
Wolfram Alpha:
–
How long is the US Coastline?:
–
–Answer: miles
–Source: https:// factbook/geos/us.html
iPhone App:
–
General Reference:
–