Data Science for Energy Outlook 2015 Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science.

Slides:



Advertisements
Similar presentations
Trends in Number of High School Graduates: National
Advertisements

PARTISAN CONTROL AND STATE DECISIONS ABOUT OBAMACARE FULL GO STATES (n = 22) Arkansas Michigan CALIFORNIA MINNESOTA COLORADO NEVADA CONNECTICUT New Hampshire.
Hwy Ops Div1 THE GREAT KAHUNA AWARD !!! TEA 2004 CONFERENCE, MOBILE, AL OCTOBER 09-11, 2004 OFFICE OF PROGRAM ADMINISTRATION HIPA-30.
The West` Washington Idaho 1 Montana Oregon California 3 4 Nevada Utah
TOTAL CASES FILED IN MAINE PER 1,000 POPULATION CALENDAR YEARS FILINGS PER 1,000 POPULATION This chart shows bankruptcy filings relative to.
5 Year Total LIHEAP Block Grant Allotment (FY ) While LIHEAP is intended to assist low-income families with their year-round home energy needs,
BINARY CODING. Alabama Arizona California Connecticut Florida Hawaii Illinois Iowa Kentucky Maine Massachusetts Minnesota Missouri 0 Nebraska New Hampshire.
What are the states in the Northeast Region?
U.S. Civil War Map On a current map of the U.S. identify and label the Union States, the Confederate States, and U.S. territories. Create a map key and.
Chart 6. 12: Impact of Community Hospitals on U. S
Regions By Katelyn Ebenkamp Picture background with textured caption
Hwy Ops Div1 THE GREAT KAHUNA AWARD !!! TEA 2003 CONFERENCE, BURLINGTON, VT SEPTEMBER 3-5, 2003 OFFICE OF PROGRAM ADMINISTRATION HIPA-30.
This chart compares the percentage of cases filed in Maine under chapter 13 with the national average between 1999 and As a percent of total filings,
Fasten your seatbelts we’re off on a cross country road trip!
Map Review. California Kentucky Alabama.
Judicial Circuits. If You Live In This State This Is Your Judicial Circuit Alabama11th Circuit Alaska 9th Circuit Arkansas 8th Circuit Arizona 9th Circuit.
1. AFL-CIO What percentage of the funds received by Alabama K-12 public schools in school year was provided by the state of Alabama? a)44% b)53%
Let’s See What You Know: Draw the outline of the United States Draw California Identify and label our three major bodies of water Star the location of.
The United States.
Medicare Advantage Enrollment: State Summary Five Slide Series, Volume 2 July 2013.
Directions: Label Texas, Arkansas, Louisiana, Mississippi, Tennessee, Alabama, Georgia, Florida, South Carolina, North Carolina, Virginia--- then color.
 As a group, we thought it be interesting to see how many of our peers drop out of school.  Since in the United States education is so important, we.
Warm Up Complete the Coordinate Practice #10. Content Objective: – Compare the physical and political regions. Language Objectives: – SWBAT define region.
CHAPTER 7 FILINGS IN MAINE CALENDAR YEARS 1999 – 2009 CALENDAR YEAR CHAPTER 7 FILINGS This chart shows total case filings in Maine for calendar years 1999.
By Carol Fahringer. I.The United States: Divided Into 8 Different Political Regions.
Study Cards The East (12) Study Cards The East (12) New Hampshire New York Massachusetts Delaware Connecticut New Jersey Rhode Island Rhode Island Maryland.
Hawaii Alaska (not to scale) Alaska GeoCurrents Customizable Base Map text.
US MAP TEST Practice
UNITED STATES HISTORY REGION PROJECT MONDAY, AUGUST 25, 2014.
Education Level. STD RATE Teen Pregnancy Rates Pre-teen Pregnancy Rate.
TOTAL CASE FILINGS - MAINE CALENDAR YEARS 1999 – 2009 CALENDAR YEAR Total Filings This chart shows total case filings in Maine for calendar years 1999.
The United States is a system that can be broken into 5 major parts or regions.
Can you locate all 50 states? Grade 4 Mrs. Kuntz.
1st Hour2nd Hour3rd Hour Day #1 Day #2 Day #3 Day #4 Day #5 Day #2 Day #3 Day #4 Day #5.
NEADA Winter Meeting February 28, 2017.
2012 IFTA / IRP MANAGERS’AND LAW ENFORCEMENT WORKSHOP
The United States Song Wee Sing America.
Expanded State Agency Use of NMLS
The United States.
Supplementary Data Tables, Utilization and Volume
Sales Tax Raw Data State Sales Tax 1 Alabama 4% 2 Alaska 0% 3 Arizona
Physicians per 1,000 Persons
Visa Bankruptcy Education Services
USAGE OF THE – GHz BAND IN THE USA
Visa Bankruptcy Education Services Bankruptcy Statistics May 19, 2016.
Content Objective: Language Objectives:
Table 3.1: Trends in Inpatient Utilization in Community Hospitals, 1992 – 2012
Name the State Flags Your group are to identify which state the flag belongs to and sign correctly to earn a point.
GLD Org Chart February 2008.
Membership Update July 13, 2016.
2008 presidential election
Table 3.1: Trends in Inpatient Utilization in Community Hospitals, 1987 – 2007
State Adoption of Uniform State Test
The States How many states are in the United States?
State Adoption of NMLS ESB
Supplementary Data Tables, Trends in Overall Health Care Market
AIDS Education & Training Center Program Regional Centers
Table 2.3: Beds per 1,000 Persons by State, 2013 and 2014
Regions of the United States
DO NOW: TAKE OUT ANY FORMS OR PAPERS YOU NEED TO TURN IN
Regions of the United States
Supplementary Data Tables, Utilization and Volume
Presidential Electoral College Map
WASHINGTON MAINE MONTANA VERMONT NORTH DAKOTA MINNESOTA MICHIGAN
Expanded State Agency Use of NMLS
Regions Of The United States
CBD Topical Sales Restrictions by State (as of May 23, 2019)
Percent of adults aged 18 years and older who have obesity †
AIDS Education & Training Center Program Regional Centers
USAGE OF THE 4.4 – 4.99 GHz BAND IN THE USA
Presentation transcript:

Data Science for Energy Outlook 2015 Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for Energy Outlook 2015 July 30,

2 My Note: I decided to participate! Pick excellent government energy content. Make it a Data Science Data Publication.

Data Mining - Data Science – Data Publication Process Data Mining Process: Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment Data Science Process: Data Preparation Data Ecosystem Data Story Data Science Questions: How was the data collected? Where is the data stored? What are the data results? and Why should we believe the data results? Data Science Data Publication: Knowledge Base Spreadsheet Index Web & PDF Tables to Spreadsheet Data Browser Dynamically Linked Adjacent Visualizations 3

4 Overview

5 Data: All Tables My Note: See Executive Summary: Table ES-1 in Next Slide

6 My Note: Web Table to Spreadsheet. Also PDF Tables in Appendix

7 Interactive Table Viewer (Beta testing): Provides custom data views of the AEO2015 Reference case and previous Reference cases. All available cases can be charted and the data for them downloaded.

8 My Note: Click Path 1.Data 2.Reference or Side Cases 3.Summary Case Tables 4.Table 1 My Note: This could be Filtered Tables in A Data Viewer Like Spotfire My Note: Lots of Options

9 Semantic CommunitySemantic Community Data Science Data Science for Energy Outlook 2015Data ScienceData Science for Energy Outlook 2015 Data Science Data Publication: Knowledge Base

10 Data Science Data Publication: Spreadsheet Index AEO2015.xlsx

11 Data Science Data Publication: Web & PDF Tables to Spreadsheet AEO2015.xlsx

12 Data Science Data Publication: Data Browser

AEO2015 Figure ES-1 Spreadsheet 13 My Note: Copied Data Here for Spotfire fig-es1_data.xls

14 AEO2015 Figure ES-1 Spreadsheet in Spotfire

15 Data Science Data Publication: Dynamically Linked Adjacent Visualizations Cover Page: Content Index and Analytics

Conclusions and Recommendations The Annual Energy Outlook 2015 is both a Web and PDF document with PDF and Excel figure tables which uses an Interactive Table Viewer in Beta testing. I decided to participate in the Data Owls Meetup and selected the excellent Annual Energy Outlook 2015 and made it a Data Science Data Publication. I followed the Federal Big Data Working Group Meetup’s Data Mining - Data Science – Data Publication Process. A Data Science Data Publication has been created with a Knowledge Base in MindTouch, the Knowledge Base Index and Report Tables in Excel, and a Data Viewer in Spotfire. 16

Data Science DC: Algorithms for Geospatial Data Analysis Meetup Description For the July Data Science DC Meetup we're having a themed evening where we'll look at the intersection of data science with mapping and spatial analysis. We will feature two presentations - the first by Anthony Fox from CCRI, who will discuss GeoMesa and how they analyze high-velocity streaming spatio-temporal data. The second speaker is Jason Dalton of Azimuth1, who will discuss using spatial graph analysis to model the US fuel energy infrastructure for the Department of Energy. 17

Data Science DC: Algorithms for Geospatial Data Analysis Meetup Comments 1 I regret to say that I was disappointed. The presentations were rough, especially the second one, and not as technically strong as I'd hoped and expected. This material is far from the leading edge of what's being done in geospatial analysis now, and there wasn't enough for someone who doesn't know the field to pick up how to pursue it. I agree. The first presentation was applying sophisticated statistics to disaggregated data (artificial data). Why should we believe that? There must me some real system data somewhere in the US to use to apply data science to reality. The second demo reminded me of the excellent work at the MIT with MapD: Mapping Twitter Trends in Real- Time: and 18

19 and

Data Science DC: Algorithms for Geospatial Data Analysis Meetup Comments 2 Aggregated data can be real data systematically summarized by some process. These datasets look interesting, let's explore them at Data Owls tonight! I am. Please see: Data Science for Energy Outlook 2015Data Science for Energy Outlook 2015 Good content, well presented. Why look at the numbers when the graphs are so pretty? Maybe to get some idea of where the stuff depicted in the graphs came from? Google Search for PADD (Petroleum Administration for Defense Districts) and API (Application Programming Interface or American Petroleum Institute?) 20

21 My Note: Print Publication Only!

Petroleum Administration for Defense District (PADD) A geographic aggregation of the 50 States and the District of Columbia into five Districts, with PADD 1 further split into three subdistricts. The PADDs include the States listed below: PADD 1 (East Coast): PADD 1A (New England): Connecticut, Maine, Massachusetts, New Hampshire, Rhode Island, and Vermont. PADD 1B (Central Atlantic): Delaware, District of Columbia, Maryland, New Jersey, New York, and Pennsylvania. PADD 1C (Lower Atlantic): Florida, Georgia, North Carolina, South Carolina, Virginia, and West Virginia. PADD 2 (Midwest): Illinois, Indiana, Iowa, Kansas, Kentucky, Michigan, Minnesota, Missouri, Nebraska, North Dakota, Ohio, Oklahoma, South Dakota, Tennessee, and Wisconsin. PADD 3 (Gulf Coast): Alabama, Arkansas, Louisiana, Mississippi, New Mexico, and Texas. PADD 4 (Rocky Mountain): Colorado, Idaho, Montana, Utah, and Wyoming. PADD 5 (West Coast): Alaska, Arizona, California, Hawaii, Nevada, Oregon, and Washington. Map of the PADD districts 22

23 Glossary Map of the PADD districts

24 Overview, and see below for Interactive Visualizations, Data, & Multimedia

25 Interactive Visualizations, Data, & Multimedia: One of Multiple Examples

26 Analysis and Projections: One of Many Examples

27 Data

28 My Note: Monthly and Annual City Average from Average of Individual Cities!? Data: Prices

29

4. Provisions Regarding Disclosure of Information All PSRS survey forms, with the exception of the Form EIA-814, “Monthly Imports Report,” have the same general disclosure information statement. The information reported on Form EIA-814 will be considered “public information” and may be publicly released in company or individually identifiable form, and will not be protected from disclosure in identifiable form. Disclosure limitation procedures are not applied to the statistical data published from this survey’s information. Thus, there may be some statistics that are based on data from fewer than three respondents, or that are dominated by data from one or two large respondents. In these cases, it may be possible for a knowledgeable person to estimate the information reported by a specific respondent. In addition to the use of the information by EIA for statistical purposes, the information may be made available, upon request, to other Federal agencies authorized by law to receive such information for any nonstatistical purposes such as administrative, regulatory, law enforcement, or adjudicatory purposes. Company specific data are also provided to other DOE offices for the purpose of examining specific petroleum operations in the context of emergency response planning and actual emergencies. My Note: So one can use real raw data for Geospatial Data Analysis. My client will be very interested in that! 30