Presentation is loading. Please wait.

Presentation is loading. Please wait.

Data Science for International Data Week 2016: Concept Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science.

Similar presentations


Presentation on theme: "Data Science for International Data Week 2016: Concept Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science."— Presentation transcript:

1 Data Science for International Data Week 2016: Concept Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for Global Ebola Response Data July 16, 2015 1

2 Purpose White House interest in Data Science Meetups helping the US National Big Data Initiative. Research Data Alliance (RDA) interest in a pipeline for reproducible data science. CODATA (Committee on Data for Science and Technology) and the WDS (World Data System) interest in aligning the SciDataCon 2016 and the RDA 8th Plenary (P8) as part of an International Data Week 2016. Data Science Meetups, like the DC Data Community and Federal Big Data Working Group Meetup, interest in Data Science for an International Data Week 2016. United Nations interest in Crowdsourcng Data Science for Global Ebola Response Data. 2

3 White House Interest in Data Science Meetups Inform Data Science Meetups about White House Big Data, Data Science, and Open Data Initiatives. Explore contributions that Data Science Meetups could make at the national and/or local level. Discuss applications of government data sources by Data Science Meetups. Create a lightweight mechanism for data sharing and collaboration between the Federal Government and Data Science Meetups. Contact Person: Dr. Renata Rawlings-Goss, Big Data AAAS Fellows at NSF http://www.nsf.gov/od/oia/activities/interns/aaas_fellows/renata-rawlings-goss.jsp 3

4 Research Data Alliance (RDA) New data standards or harmonization of existing standards. Greater data sharing, exchange, interoperability, usability, and re- usability. Greater discoverability of research data sets. Better management, stewardship, and preservation of research data. Contact Person: Dr. Berman, Chair of RDA/US (all U.S. members of the Research Data Alliance) and Co-Chair of RDA's International Leadership Council 4 https://rd-alliance.org/about/organization/key-profiles/fran-berman.html

5 Concept Note: International Data Week 2016 The vision is for an International Data Week hosted where it can attract the greatest level of attention and be focused on promoting the best exploitation of research data assets to benefit science and society. The week of September 11-16 would have two major events: the RDA 8th Plenary (P8) and the CODATA SciDataCon 2016, in Washington, DC. Collocation of the two events will achieve the greatest impact. Additionally, it will demonstrate that the three organizations (World Data System, CODATA, and RDA) are collaborating closely and on an international scale. Contact Person: Dr. Simon Hodson, CODATA Secretariat and Executive Director 5 https://www.rd-alliance.org/concept-note-international-data-weel.html http://www.codata.org/about-codata/secretariat

6 Meetup.com Meetup is the world's largest network of local groups. Meetup makes it easy for anyone to organize a local group or find one of the thousands already meeting up face-to-face. More than 9,000 groups get together in local communities each day, each one with the goal of improving themselves or their communities. Meetup's mission is to revitalize local community and help people around the world self-organize. Meetup believes that people can change their personal world, or the whole world, by organizing themselves into groups that are powerful enough to make a difference. White House Tech Meetup on April 17 th with leaders of the 50 most successful with coding, web design, etc. Meetup of Data Science Meetup, early November (in planning), based on collaboration between and networking with the 65 largest. Contact Persons: Dr. Harlan Harris, Director, Data Science, Education Advisory Board, and Secretary and Co-Founder, Data Community DC Meetup, Inc., and Dr. Brand Niemann, Director and Senior Data Scientist. Semantic Community, LLC, and Founder and Co-organizer, Federal Big Data Working Group Meetup. 6 http://data-science.meetup.com/ http://www.meetup.com/Data-Science-DC/ http://www.meetup.com/Federal-Big-Data-Working-Group/

7 https://ebolaresponse.un.org/data 7 53 Data Sets and many more out there! Collecting data and mapping the outbreak is crucial for ensuring the right response.

8 Federal Big Data Working Group Meetup A different Data Science Meetup than most: Data Science Teams working with Government Big Data sets to produce Data Science Publication Products that support the US National Big Data Initiative. http://www.meetup.com/Federal-Big-Data-Working-Group/ A pipeline for reproducible data science: Data Science for RDA Climate Change Data Challenge and Meetup http://semanticommunity.info/Data_Science/Data_Science_for_RDA_Climate_Change_Data_Challenge A Data FAIRPort (Free, Accessible, Interoperable, and Reusable): A Data Science Commons or Hub as a community service. http://semanticommunity.info A source of big data, data science, analytics startups: Eastern Foundry is a new Crystal City-based incubator catering to the needs of small business federal contractors. http://eastern-foundry.com/ 8

9 Upcoming Meetups July 20th: Data Science for ACA Data & Semantic Medline Precision Medicine August 3rd: Data Science for Agency Initiatives 2015 August 17th: A NIH – Semantic Medline Data Science Data Publication Commons September 14th: USDA Big Data Science for Precision Farming With FarmLogs September 28th: Climate Change Data - Data Science Meetup of Meetups October 5th: Data Science for American Community Survey October 19th: Sensing Our Air: The Quest for Big Data About Our Air Quality Date and Data Science Team to be announced: Data Science for International Data Week 2016: Ebola Data 9

10 Data Science for International Data Week 2016: Concept Plan for Ebola Data Follow the Cross Industry Standard Process for Data Mining: Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, Deployment, and Iterate https://en.wikipedia.org/wiki/Cross_Industry_Standard_Process_for_Data_Mining Integrate structured (relational) and unstructured (graph) data: Use semantic knowledge bases (ontologies, if necessary) and semantic technologies (Cray Graph Computer) Recent example: http://semanticommunity.info/Data_Science/Data_Science_for_Health_Datapalooza_2015/NLM_- _Semantic_Medline_Data_Science_Data_Publicationhttp://semanticommunity.info/Data_Science/Data_Science_for_Health_Datapalooza_2015/NLM_- _Semantic_Medline_Data_Science_Data_Publication Answer the four Data Science Questions: How were the data collected? Where are they stored? What are the results?, and Why should we believe the results? Recent example: http://semanticommunity.info/Data_Science/USDA_Data_Science_MOOChttp://semanticommunity.info/Data_Science/USDA_Data_Science_MOOC Document (curate) the results in Data Science Data Publications: Story, Visualizations, Data, Metadata, Attachments, etc. Recent example: http://semanticommunity.info/Data_Science/Data_Science_for_USGS_Minerals_Big_Datahttp://semanticommunity.info/Data_Science/Data_Science_for_USGS_Minerals_Big_Data UN Global Ebola response: Collecting data and mapping the outbreak is crucial for ensuring the right response. Starting point: https://ebolaresponse.un.org/datahttps://ebolaresponse.un.org/data 10


Download ppt "Data Science for International Data Week 2016: Concept Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science."

Similar presentations


Ads by Google