Big Data Symposium: Analytics and Applications for Federal Big Data – Bureau of Justice Statistics Dr. Brand Niemann Director and Senior Enterprise Architect.

Slides:



Advertisements
Similar presentations
Federal Transparency.gov As Data For the Digital Government Strategy Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Advertisements

Data Science for Natural Medicines: Dead Doctors Don't Lie Radio
OMB Data Visualization Tool Requirements Analysis: Information Builders Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Data Science for Business: Semantic Verses Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Data Science for Tackling the Challenges of Big Data
Director and Senior Data Scientist/Data Journalist
OMB Data Visualization Tool Requirements Analysis: Oracle Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Alcohol Misuse and Crime: Update on Federal Reporting Fourth meeting of National Partnership on Alcohol Misuse and Crime National Partnership on Alcohol.
OMB Data Visualization Tool Requirements Analysis: Birst Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
A Search for Veterans Benefits Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community December 22,
Data Science for MyFamilySearch.org Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community My Personal Family History.
OMB Data Visualization Tool Requirements Analysis: IBM Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
OMB Data Visualization Tool Requirements Analysis: Logi Analytics Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
OMB Data Visualization Tool Requirements Analysis: Microsoft Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
NLM-Semantic Medline Data Science Data Publication Commons Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Big Data and Social Media & Web Analytics Innovation Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
NIST Scientific Data for Data Science United Nations Open Data / Open Government Conference, April 26-28, Abu Dhabi
EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Linked Data Visualizations for Eurostat Linked Data Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
OMB Data Visualization Tool Requirements Analysis: SAP Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Hubway Data Visualization Challenge: Spotfire Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Mandates for Data Transparency in 113th Congress: DataCoalition.org Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic.
3 Round Stones: All Content As Big Data Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
Big Data Conference: Analytics and Applications for Federal Big Data Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic.
A TEDMED Data Reveal: Big and Little Dr. Brand Niemann Director and Senior Data Scientist Semantic Community AOL Government.
Imagine Everything is Before You: Past, Present, and Future Paper and Demonstration for the 2014 Family History Technology BYU Dr. Brand Niemann.
Semantic Knowledge Bases and Be Informed for the FAA Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
Information Sharing Begins With Me Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
GIS Data Science for Collaboration Across Communities: GIScience 2.0 and Beyond Dr. Brand Niemann Director and Senior Data Scientist Semantic Community.
Data Science Publication for NSF Polar Cyberinfrastructure Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Using Data Science as Evidence in Public Policy With Big Data and Elections Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist.
EPA Indicators of Our Health and Environment Updated and Improved Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic.
Big Data Symposium: Analytics and Applications for Federal Big Data - FEMA Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist.
Data Science for Agency Initiatives 2015 Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Data Science for DataBay DataBay "Reclaim the Bay" Innovation Challenge: August 1-3, 2014, Smithsonian Environmental Research Center, 647 Contees Wharf.
Data Science ESIP Publication Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Data Science for USGS Minerals Big Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science.
Data Science for DTIC Data Ecosystem Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
The 2012 EuroStat Regional Yearbook for Semantic Interoperability Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic.
Why Doesn't EPA Have a Self- Contained Statistical Unit?: A Tribute to Doug Engelbart Dr. Brand Niemann Director and Senior Data Scientist Semantic Community.
Data Science for USDA Big Data
Data Science for EPA Big Data Analytics: Oregon Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Open DATA METI: All Content As Big Data Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
Data Science for Migration Data Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Health Datapalooza IV: Child and Adolescent Health Data App Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
SmartGrid and Spotfire Cloud Computing - Similarities in Innovation Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic.
Research on US Federal Government Handling of Data Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
Binary Group Knows What It Knows Because of It’s Information Attitude Brand Niemann Senior Enterprise Architect and Data Scientist August 26,
Data Science for the NOAA Chief Data Officer Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Harnessing Data to Address Diabetes in the US Dr. Brand Niemann Director and Senior Data Scientist Semantic Community AOL.
Data Science for HealthCare.gov Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Data Science for Semantics Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for Semantics.
View Club By Weldon Christin Lily Willow Madeline.
Department of Commerce App Challenge: Big Data Dashboards Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community.
Data Science for DoI BSEE Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for DoI BSEE.
Data Science for Joint Doctrine Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for Joint.
Data Science for FDA RFI Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Data Science for Conservation International's Big Ecosystem Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community.
NGA Demo Participant Collaboration Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
Cross Information Sharing and Integration for the Intelligence Community: 13 th SOA for eGovernment Conference Dr. Brand Niemann Director and Senior Enterprise.
NIEM 3.0 Data Analytics App Dr. Brand Niemann Director and Senior Data Scientist Semantic Community AOL Government Blogger.
Harnessing Health.Data.gov Data to Address Diabetes in the US Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
SEXUAL OFFENSES: BACKGROUND, CAUSES AND PREVENTION.
Data Science for NIST Big Data Framework Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Data Science for EarthCube 2015 Key Documents Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
U.S. Federal Government Handling of Data for Open Government Data in Japan Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist.
Data Science for Global Ebola Response Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science.
HealthIT.gov Dashboard: Spotfire not Flash Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Data Science for the National Big Data R&D Initiative Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Spotfire 5 Users Guide Dashboard
Presentation transcript:

Big Data Symposium: Analytics and Applications for Federal Big Data – Bureau of Justice Statistics Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community AOL Government Blogger March 5-6,

Begin With the End in Mind Ms. Jo Strang, Associate Administrator, Safety, Federal Railroad Administration, Department of Transportation “Open Gov 2.0 and Safety.Data.Gov” – New safety data sources and challenge from the National Institute of Justice and new data from Open FEMA Open FEMA – Could not find with Google search – Start with See Safety.Data.Gov – Could not find at that Web site – Start with Bureau of Justice Statistics 2

My Process Found New Releases and Female Victims of Sexual Violence, at BJS.gov Built a Knowledge Base of the Web Site, Metadata, and Data Sources for: – Press Release Press Release – PDF (1.4M) PDF – ASCII file (34K) ASCII file – Comma-delimited format (CSV) (Zip format 26K) Comma-delimited format (CSV) – Help for using BJS products Help for using BJS products – About the Source Data: National Crime Victimization Survey (NCVS)National Crime Victimization Survey (NCVS) Did Extensive Pre-Conditioning of the CSV Spreadsheets for Use and Display of the Data Sets Imported the Data Sets Into Spotfire and Created a Guided Analysis Documented My Data Science Work in a Story and PowerPoint Slides Provided Conclusions and Recommendations 3

Bureau of Justice Statistics 4 Female Victims of Sexual Violence,

5 Build a Knowledge Base: Press Release Report (PDF and ASCII) Help About the Source Data

Knowledge Base*: MindTouch 6 *Well-defined URLs for everything: PDF Text CSV Images See next slide!

Knowledge Base*: MindTouch 7 *Well-defined URLs for everything: PDF Text CSV Images

Knowledge Base in Spreadsheet: Excel 8 This is Linked Open Data! My 5 Steps to Getting to 5 Stars!

Spreadsheet in Dashboard: Spotfire 9 Readme.txt to Master Data Management and Unified Data Architecture : Figures: 3; Tables: 11; and Appendix Tables 16

Spreadsheet in Dashboard: Spotfire 10 From 1995 to 2005, the total rate of sexual violence committed against U.S. female residents age 12 or older declined 64% from a peak of 5.0 per 1,000 females in 1995 to 1.8 per 1,000 females in It then remained unchanged from 2005 to Sexual violence against females includes completed, attempted, or threatened rape or sexual assault. In 2010, females nationwide experienced about 270,000 rape or sexual assault victimizations, compared to about 556,000 in 1995.

Spreadsheet in Dashboard: Spotfire 11 Males had lower rates of rape or sexual assault than females from1995 to 2010 Due to the relatively small number of sample cases, coupled with a low rate of victimization, estimates of male sexual violence from the NCVS cannot be used reliably for further disaggregation by victim and incident characteristics. Therefore, this report focuses exclusively on females.

Spreadsheet in Dashboard: Spotfire 12 The percentage of sexual violence reported to police increased to a high of 56% in 2003 before dropping to 35% in 2010, a level last seen in 1995 The percentage of victimizations known to police because they were reported by another household member declined from 26% in to 10% in , while the percentage reported by an official other than the police increased from 4% to 14%.

Spreadsheet in Dashboard: Spotfire 13 All the tables were carefully formatted in the spreadsheet for display on Spotfire. This is what Data Science does!

Conclusions and Recommendations Built a Knowledge Base of the Web Site, Metadata, and Data Sources with Well-Defined URLs for Everything Did Extensive Pre-Conditioning of the 30 CSV Spreadsheets for Use and Display of the Data Sets Made the Reame.txt File a Master Data Management, a Unified Data Architecture, and Linked Open Data: My 5 Steps to Getting to 5 Stars Imported the Data Sets Into Spotfire and Created a Guided Analysis That Augments the Original Report Documented My Data Science Work in a Story and PowerPoint Slides The Fairfax County Domestic Violence Fatality Review Team 2012 Annual Report is next 14