Data Science for NOAA Chief Data Officer and Big Data Predictive Analytics Meetup Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist.

Presentation transcript:

Data Science for NOAA Chief Data Officer and Big Data Predictive Analytics Meetup Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community November 3,

Agenda
6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring), Slides
– Background: Data Science for NOAA Big Data and DHS Global Terrorism Database. See Big Data Symposia (Google Find: Department of Homeland Security) and see Recent Presentation below
– Data Science for the NOAA Chief Data Officer: Story and Slides
7:00 p.m. Brief Member Introductions
7:10 p.m. Treeminer.com Video, Mark Silverman and Biplab Pal
– Background: Data Science for Vertical Data Mining
– Video Demo 1: HL7 classification, and Demo 2: How Treeminer works for document classification, with reference to a patent invalidation
7:30 p.m. Predictive Analytics in the Era of Big Data, Dave Vennergrund, Director, Data Analytics Center of Excellence, SalientFed (Slides)
8:30 p.m. Open Discussion
8:45 p.m. Networking
9:00 p.m. Depart

Calendar
October 23, Big Data Symposium at the National Research Council, National Academy of Sciences Keck Building, Room 100, 500 Fifth Street, NW, Washington, DC, :45 a.m.
– Contact the Board director, Paul Uhlir, to register in advance
IEEE International Conference on Big Data, October 27-30, Washington DC. Panel on October 27th with Dr. Joan Aron.
November 4 to December 16, 2014, Tackling the Challenges of Big Data, MITProfessionalX Online Course, $545.
November 4, "Diverse Data Analytics Applications", a joint George Mason University and IBM ASC Symposium.

FDA Data Innovation Lab and Predictive Analytics Meetup
– Great discussion and very informative.
– The meeting was extraordinary: an onion-peeling exercise in which discussions stripped away layers of issues and led to what appears to be basics of predictive analytics and data science.
– Good discussion on FDA and open data. Lots of potential value, and much remaining to be done. Also enjoyed the predictive analytics discussions.
– The formatted data is truly helpful.

Conference for NSF Data Scientists, Data Infrastructure, and Data Publication
– Excellent content, very informative!
– A hearty thank you on behalf of the Digital Government Institute event team for your presentation and participation in the 2014 Government Big Data Conference. We very much appreciate your willingness to share with the government audience today. I am sure the attendees thoroughly enjoyed your presentation and the feedback will be extremely positive. DGI is very fortunate to have had the opportunity to work with the FBDWG team. Your participation is appreciated.
– Thank you all very much! I had so many great conversations and contacts at the event.

Semantic Insights Followup
Looking for interested individuals who wish to participate in our Natural Language Understanding and Reasoning research. We welcome educational institutions and individual researchers interested in working collaboratively with us. Accounts are available for beta test.
– Applying High-speed Pattern Recognition to Generate Queryable Semantics from Big Data: Big Data is filtered and reduced in real time for event and pattern discovery.

NOAA Embraces the Business of Big Data
In February 2014, NOAA issued an RFI seeking solutions for how to make its 20 terabytes of daily data available quickly and at scale.
– The request for information drew 70 responses from individuals, academia and industry organizations. Now sharing only about 10 percent of that information, NOAA wanted to hear about ways to get more information into the hands of users, and maybe make a little money on the side.
– "It gives you a good idea of what they see as a potential for value," said David McClure, lead analyst for open government services at NOAA and the man behind the agency's big data partnership business model.

NOAA Big Data Partnership Model
My Comments: This makes something overly complicated. Get a Chief Data Officer and Data Scientists to do this!

NOAA Big Data Industry Day
At the Big Data Industry Day on October 17, NOAA will review the draft Statement of Objectives (SOO) and offer industry an opportunity to ask questions. All comments and suggestions on the SOO will be due on October 24.
My response (see Story) is that we can do this RFI in a Community of Data Science Practice like the Federal Big Data Working Group Meetup.

Start: Government Data Hubs NOAA! Three NOAA Data Hubs: See Next Slide My Comment: Should do more Data Hubs!

Click To: Three NOAA Data Hubs
– The CHAMP Program gathers near real-time data from instrumented arrays and satellites covering important coral reef areas from all over the globe. Please choose an area of interest from the Data drop-down menu.
– A primary focus of U.S. IOOS is integration of, and expedited access to, ocean observation data for improved decision making. The Data Management and Communication (DMAC) subsystem of U.S. IOOS serves as a central mechanism for integrating all existing and projected data sources.
– The MarineCadastre.gov Data Registry provides direct access to data currently available through MarineCadastre.gov. Filter the data by provider, thematic category, geographic region, and service type. If you are looking for a data set that is currently not available on MarineCadastre.gov, please contact us.

Click To: Department of Commerce

Click To: Data.gov Department of Commerce

Click To: Department of Commerce APIs Agencies with APIs NOAA has 11 APIs!

Also Filter To: Data.gov for NOAA
Only 3,560 Data Sets at Data.gov, But 55,602 Data Sets at data.NOAA.gov! (See Next Slide)

Also See Prototype: data.noaa.gov
This NOAA Data Catalog is a prototype under active development. Availability and completeness are not guaranteed.
55,602 Data Sets, But No SHP and Only One CSV and 222 Excel!

Filter to Excel in Prototype: data.noaa.gov Excel Data Sets and For Only One Project
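
The format counts and the Excel filter on the last two slides can be reproduced on any catalog export with a few lines of code. The sketch below is only an illustration: the `catalog` records, their titles, and the `formats` field name are made-up assumptions, not the actual data.noaa.gov schema or API.

```python
from collections import Counter

# Hypothetical catalog records; titles and the "formats" field are
# illustrative assumptions, not the real data.noaa.gov schema.
catalog = [
    {"title": "Example buoy observations", "formats": ["CSV"]},
    {"title": "Example project workbook A", "formats": ["XLS"]},
    {"title": "Example project workbook B", "formats": ["xlsx", "PDF"]},
    {"title": "Example metadata-only record", "formats": []},
]

EXCEL_FORMATS = {"xls", "xlsx"}


def count_formats(records):
    """Count how many records offer each format (case-insensitive)."""
    counts = Counter()
    for record in records:
        # Use a set so a format listed twice on one record counts once.
        for fmt in {f.lower() for f in record["formats"]}:
            counts[fmt] += 1
    return counts


def filter_by_formats(records, wanted):
    """Return the records that offer at least one of the wanted formats."""
    wanted = {w.lower() for w in wanted}
    return [r for r in records if wanted & {f.lower() for f in r["formats"]}]


format_counts = count_formats(catalog)
excel_records = filter_by_formats(catalog, EXCEL_FORMATS)

print(dict(format_counts))
print([r["title"] for r in excel_records])
```

A tally like this makes gaps such as "No SHP and Only One CSV" visible at a glance, and the filtered list shows whether the Excel data sets come from many projects or only one.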

Also See: Environmental Research Division's Data Access Program RESTful Web Services

Data Science for NOAA Big Data: Build Knowledge Base

Data Science for NOAA Big Data: Knowledge Base Contents
6. Government Data Hubs
– 6.1. Department of Commerce Data Sets and Information for Developers
  Complete catalogue of publicly-available Commerce data sets (DoC 22,365 and NOAA 3,560)
  – Topics
  – Topic Categories
  – Dataset Type
  – Tags
  – Formats
  – Organization Types
  – Organizations
  – Publisher
– 6.2. Department of Commerce Developer Application Programming Interfaces (APIs)
  Welcome
  Bureau of Economic Analysis
  Census Bureau
  International Trade Administration
  National Institute of Standards and Technology
  National Oceanic and Atmospheric Administration (11 APIs)
  National Telecommunications and Information Administration
  Patent and Trademark Office

Data Science for NOAA Big Data: Spreadsheet Knowledge Base Index

Data Science for NOAA Big Data: Spreadsheet Data.gov DoC Index

Data Science for NOAA Big Data: Spreadsheet Data.gov NOAA Index

Data Science for NOAA Big Data: Spreadsheet data.NOAA.gov Index
My Note: In Process

Data Science for NOAA Big Data: Spotfire Cover Page (Web Player)

Stephen Dennis, Director, Innovation, Science and Technology Directorate, Department of Homeland Security: Big Data Analytics and Homeland Security
My Notes on Recent Presentation and Slides:
– I don’t know what this “data” stuff is, but I want some of it…
– DHS S&T Mission: Strengthen America’s security and resiliency by providing knowledge products and innovative technology solutions for the Homeland Security Enterprise (HSE)
– Superstorm Sandy (Initial Findings) from NUSTL. My Note: FAIRport this!
– Statement of Big Data Problem in DHS S&T’s Big Data Survey:
  Goal is to improve operational effectiveness and efficiency within the Department and HSE
  Continue to work cultural issues that tend to plague big data
  FEMA: Improved Utilization of Data Sets
  – My Note: I worked on this!
  Leveraging Leading-edge Data Science Research
  – My Note: That is what he asked me to show him!
  Big Data Lessons Learned
  – Determine what data exists and how it can be manipulated to make it useful

Data Science for DHS: Global Terrorism Database Knowledge Base

Data Science for DHS: Big Data Symposia Knowledge Base
Google Chrome Find: Department of Homeland Security (14)

Global Terrorism Database Experience: Spotfire Cover Page (Web Player)

Agenda
7:00 p.m. Brief Member Introductions
7:10 p.m. Treeminer.com Video, Mark Silverman and Biplab Pal
– Background: Data Science for Vertical Data Mining
– Video Demo 1: HL7 classification, and Demo 2: How Treeminer works for document classification, with reference to a patent invalidation: https://docs.google.com/file/d/0B8SK...JiMjAtYXM/edit
7:30 p.m. Predictive Analytics in the Era of Big Data, Dave Vennergrund, Director, Data Analytics Center of Excellence, SalientFed (Slides)
8:30 p.m. Open Discussion
8:45 p.m. Networking
9:00 p.m. Depart