Climate Change & Genomic Data - Data Science Meetup of Meetups Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community.

Slides:



Advertisements
Similar presentations
BELMONT FORUM E-INFRASTRUCTURES AND DATA MANAGEMENT PROJECT Updates and Next Steps to Deliver the final Community Strategy and Implementation Plan Maria.
Advertisements

Data Science for NSF Polar Cyberinfrastructure & MIT Big Data Course Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community.
1 Services and Cloud Computing Work Groups: Status Update Brand Niemann US EPA January 8, 2010.
EarthCube Data Science Publications Dr. Joan Aron Dr. Sophia Liu Dr. Brand Niemann May 29, 2015
Data Science for Big Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Build the Binary Group in the Cloud Brand Niemann Senior Enterprise Architect Binary Group August 5, Updated August 8,
Data Science for MyFamilySearch.org and FamilyTree DNA Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
NLM-Semantic Medline Data Science Data Publication Commons Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Big Data and Social Media & Web Analytics Innovation Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
NIST Scientific Data for Data Science United Nations Open Data / Open Government Conference, April 26-28, Abu Dhabi
Federal Big Data Working Group Meetup Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Data Science for RDA Climate Change Data Challenge and Meetup Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Semantic Data Discovery: Proof of Concept for DHS
Cloud: SOA, Semantics, & Data Science Welcome and Overview Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Transforming Data-Driven Publications and Decision Support Joan L. Aron, Ph.D. Consultant Federal Big Data Working Group COM.BigData 2014.
Data Science for USGS Minerals Big Data Meetup Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data.
Information Sharing Begins With Me Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
Data Science for Agency Initiatives 2015 Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science.
1 Briefing for EPA and OEI Communications Coordinators and Press Officers Brand Niemann US EPA Senior Enterprise Architect and Federal CoP Leader January.
Data Science Publication for NSF Polar Cyberinfrastructure Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Federal Big Data Working Group Meetup: The Yosemite Project: A Roadmap for Healthcare Information Interoperability and The New Book: Building Ontologies.
Farm Data Dashboards: USDA and Microsoft Innovation Challenge Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Data Science for RDA Climate Change Data Challenge and Meetup Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Federal Big Data Working Group Meetup Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Data Science for International Data Week 2016: Concept Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science.
Director and Senior Data Scientist/Data Journalist
Data Science for DataBay DataBay "Reclaim the Bay" Innovation Challenge: August 1-3, 2014, Smithsonian Environmental Research Center, 647 Contees Wharf.
Data Science for USGS Minerals Big Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science.
Data Science for EPA & USGS Fracturing & Fracking­­­­­ Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Data Science for DTIC Data Ecosystem Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
The 2012 EuroStat Regional Yearbook for Semantic Interoperability Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic.
Why Doesn't EPA Have a Self- Contained Statistical Unit?: A Tribute to Doug Engelbart Dr. Brand Niemann Director and Senior Data Scientist Semantic Community.
Data Science for USDA Big Data
Data Science for HealthData.gov Developers & Family Caregivers Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community.
Data Science for the National Big Data R and D Initiative Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Katie A. Learning Collaborative For Audio, please call: Participant code: Please mute your phone Building Child Welfare and Mental.
FY13 Accomplishments 1 Update to the Board of Research Data on Information CENDI INCREASING THE IMPACT OF FEDERALLY FUNDED SCIENCE September 23, 2013 Jerry.
Data Science for NSF Data Science Workshop 2015 Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science NSF.
Research on US Federal Government Handling of Data Dr. Brand Niemann Director and Senior Enterprise Architect – Data Scientist Semantic Community
Data Science for the NOAA Chief Data Officer Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Semantic Data Science for the US Census Bureau Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Data Driven Farming: Week 6: Deployment Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Week 6 Deployment.
Data Science for Semantics Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for Semantics.
Data Science for Joint Doctrine Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for Joint.
STEPHANIE SHIPP SCIENCE & TECHNOLOGY POLICY INSTITUTE AMERICAN EVALUATION ASSOCIATION NOVEMBER 12,2009 Science of Science Policy: Making.
11 Welcome to All! October 26-28, 2009 Washington, D.C. Welcome to All! Accelerators for America’s Future Symposium and Workshop October 26-28, 2009 Washington,
1 Tutorial for the EAWG: Solution Architecture for 2010 Brand Niemann Senior Enterprise Architect U.S. EPA January 28, 2010.
1 Social Business Intelligence from Open Government Data Brand Niemann Senior Enterprise Architect US EPA November 27, 2010 DISCLAIMER: While allowed to.
1 Promoting Careers in Knowledge Management: My Experiences Brand Niemann Library of Congress June 3, 2010.
Government Technology & Innovation Incubator for Big Data Analytics Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community.
Director’s Report 6 h BRDI Meeting Washington, DC 31 January 2012 Paul F. Uhlir, J.D. Director, Board on Research Data and Information National Academy.
Executive Director Update November 14, HLSC Executive Director met one-on-one with leaders of member agencies and compiled a list of their needs.
Data Science for NIST Big Data Framework Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Defense Strategies Institute Professional Educational Forum Harnessing the Power of Big Data for The Intelligence Community November 17-18, 2015 Mary M.
Midwest Big Data Hub Letters of Intent for NSF Edward Seidel Director, NCSA Founder Prof. of Physics, Prof of Astronomy On behalf of the Midwest.
CENDI Update to the Board of Research Data and Information Bonnie C. Carroll Executive Director CENDI Secretariat.
Chaitan Baru Senior Advisor for Data Science CISE Directorate National Science Foundation NIEHS Webinar October 27, 2015 Image Credit: Exploratorium. Integrating.
Data Science for EarthCube 2015 Key Documents Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Digital Data Collections ARL, CNI, CLIR, and DLF Forum October 28, 2005 Washington DC Chris Greer Program Director National Science Foundation.
Data Science for Global Ebola Response Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science.
Sectoral Operational Programme “INCREASE OF ECONOMIC COMPETITIVENESS” October 2005 MINISTRY OF ECONOMY AND TRADE.
National Data Science Organizers Lightning Talks From Around the Country Dr. Brand Niemann Founder and Co-Organizer Federal Big Data Working Group Meetup.
Data Science and Semantic Insights for DoD Joint Doctrine Meetup Dr. Brand Niemann Founder and Co-Organizer Federal Big Data Working Group Meetup Director.
Data Science for the National Big Data R&D Initiative Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community
Data Science for UN, HDX, OSTP, RDA, etc. July 15 th : Data Science for RDA Climate Change Data Challenge and Meetup Goals: Goal 1: Digital Catalog.
South Big Data Innovation Hub
Data Science for RDA Climate Change Data Challenge and Meetup
First Meetup: Data Science for the Data Act at Treasury
Beyond Vendor Fairs: Partnering with Vendors to Engage End Users
Charting Your Path to Success
Presentation transcript:

Climate Change & Genomic Data - Data Science Meetup of Meetups Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Federal Big Data Working Group Meetup Data Science NSF Data Science Workshop 2015 September 28,

Agenda 6:30 p.m. Dr. Renata Rawlings-Goss (last minute conflict), DRAFT Potential Sessions for Meetup Meeting: November 5-6, 2015DRAFT Potential Sessions for Meetup Meeting: November 5-6, :45 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science for RDA Climate Change Data Challenge and NSF Data Science Workshop 2015Data Science for RDA Climate Change Data ChallengeNSF Data Science Workshop :15 p.m. Brief Member Introductions 7:30 p.m. Dr. Ben Busby, Computational Biology Branch, NCBI, NLM, NIH SlidesDr. Ben BusbyNCBINLMNIHSlides 8:15 p.m. Open Discussion ​8:45 p.m. Networking 9:00 p.m. Depart 2

Schedule October 5th, Data Science for EPA & USGS Fracturing & Fracking Data (Dr. Sophia Liu, USGS and USGS Staff). See July 13 th Meetup: Data Science for USGS Minerals Big DataData Science for USGS Minerals Big Data October 19th, Sensing Our Air: The Quest for Big Data About Our Air Quality (EPA’s New Chief Data Scientist, Robin A Thottungal, Invited)Robin A Thottungal November 2 nd, Data Science for Random Forests: TIBCO Enterprise Runtime for R. See June 1 st Meetup: Data Science for Homeless Data: QlikView. Tableau, & Spotfire BakeoffData Science for Homeless Data: QlikView. Tableau, & Spotfire Bakeoff November 5-6 th, OSTP/NSF Data Science Meetup of Meetups, Ballston, VA November 16 th, Data Science for the DataAct Datathon December 7 th, Data Science for DoD Joint Doctrine January 4 th, 2016, Data Science for Semantics: MarkLogic and Cray Graph Appliance Update (in process) February 1st, 2016, Data Science for Census American Community Survey (in process) 3

Member Participation Barry Smith, Professor of Philosophy and Co-Author of New Book on Ontology, Meeting with Intelligence Community on September th. December 7 th Meetup. Jonathan Hines, ORNL science writer, doing a story on Semantic Medline and the ORNL CADES – Compute and Data Environment for Science (January 4 th, 2016?) Joan Aron and Brand Niemann, Data Mining - Science - Questions - Publication Process (2016?) Most Recent: Homelessness in Metropolitan WashingtonHomelessness in Metropolitan Washington Weifeng Li (Lexie) and Brand Niemann, FDA Precision Medicine (2016?) Chris Crawford and Jay Patkar, TIBCO Software Federal, Random Forests for Kaggle Competitions and Spotfire TERR. November 2 nd Meetup. Washington DC Homeless Data Bakeoff Part 2, November 4 th (Downtown Hotel in process) Steve Hanmer, Mission Source, Allyson Ugarte, Treasury, co-planning Data Science for Data Act Datathon Meetup. He attended the Data Act Datathon and Forum will report. November 16 th Meetup. 4

NIH Data Commons FAIR Principles: Findable Accessible Interoperable Reusable Cloud: Data Software Results Federal Science Policy: OSTP Public Access to Scientific Data Memo (February 2013) New Program: Big-Data-to- Knowledge (2013) New Position: Associate Director of Data Science (2014) Digital Enterprise (2015): Data Commons Metadata Open APIs Digital Objects Containers A NIH – Semantic Medline Data Science Data Publication Commons 5

OSTP/NSF Data Science Meetup of Meetups Week of November 2 nd : NSF Data Science/Big Data Principal Investigators (About 300) NSF Data Hubs (4) Organizers of Largest Data Science/Big Data Meetups (About 65) Pipeline for Return on Investment: PIs put their data, tools and research results in the Data Hubs Data Hubs provide those data, tools, and research results to the world, but especially to the Data Science/Big Data Meetups Data Science/Big Data Meetups collaborate with PIs and Data Hubs to increase usage and feedback 6

We Already Do This! Semantic Community: Provides a Community Sandbox that is like a GitHub, Data Hub, Data Commons, etc. Metadata (MindTouch) Open APIs (MIndTouch) Digital Objects (MindTouch) Containers (Spotfire) Organize the Federal Big Data Working Group Meetup Support Agencies and Programs in Crowdsourcing Their Data Sets Mentor Data Scientists (Tutorials and MOOCs) and Entrepreneurs (Eastern Foundry) Federal Big Data Working Group Meetup: Federal: Supports the Federal Big Data Initiative, but not endorsed by the Federal Government or its Agencies; Big Data: Supports the Federal Digital Government Strategy which is "treating all content as data", so big data = all your content; Working Group: Data Science Teams composed of Federal Government and Non- Federal Government experts producing big data products; and Meetup: The world's largest network of local groups to revitalize local community and help people around the world self- organize like MOOCs (Massive Open On-line Classes) now embraced by the White House. 7

Data Mining - Science - Questions - Publication Process Data Mining Process: Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment Data Science Process: Data Preparation Data Ecosystem Data Story Data Science Questions: How was the data collected? Where is the data stored? What are the data results? and Why should we believe the data results? Data Science Data Publication: Knowledge Base Spreadsheet Index Web & PDF Tables to Spreadsheet Data Browser Dynamically Linked Adjacent Visualizations 8

Data Science Data Curation for Sustainable Data Science Meetups of Meetups I just finished four data science ecosystems: RDA Climate Data Challenge (July 15): ata_Challenge ata_Challenge RDA Information Week 2016 (Ebola Response and Nepal Earthquake) (July 17): _Data _Data USDA Microsoft Innovation Challenge (July 27): Business#Story Business#Story US Data Act (July 28): 9

Collaboration for Data Science Win-Wins USDA Open Government Data Training, Innovation Competition, and Online Course in Data-Driven Farming: _Farming_Business#Story _Farming_Business#Story Many Curated Government Data Sets and Data Science Products: Pick an Agency and/or a Data Set and Look for a Meetup on That: Mentor Startups Partnership with Eastern Foundry: Group/events/ / Group/events/ / 10

AAAS Fellow Biography Dr. Renata Rawlings-Goss AAAS Fellow Directorate for Computer & Information Science & Engineering Office of the Assistant Director Class of 2015 Dr. Renata Rawlings-Goss is a biophysicist who completed her doctorate work at the University of Michigan-Ann Arbor. She was successfully awarded a competitive AGEP postdoctoral fellowship with the Center for Computational Medicine, where she developed new predictive statistics for patient monitored diabetes. Subsequently, she became a Penn-Port fellow in the department of genetics at the University of Pennsylvania, where her research interests included data- driven analysis of genetic/expression variation among worldwide populations for diseases such as cancer. In addition, Dr. Rawlings-Goss has served as a scientific consultant to lawyers, physicians, and the non-profit sector around data analytics as well as taught courses at Rutgers University and Lincoln University in bioinformatics, biostatistics, genetics and mathematical modeling a continuing effort to increase participation of under- represented minorities in science. Currently, Dr. Rawlings- Goss is a AAAS science policy fellow at the National Science Foundation working on Big Data policy. 11

DRAFT Potential Sessions for Meetup Meeting: November 5, 2015 National Data Science Organizers Workshop Please note: This will be in-person by invitation and remote for all Day 1 - November 5th, 2015 (Half Day) 12:00 pm (Pre-conference Lunch with Big Data Regional Hubs Leaders) 1:30 pm Session 1: Data Science for the Nation Keynote: What are the National Priorities?, White House Office of Science and Technology Policy - Deputy Director for Technology and Innovation Impacts of Data Science on National Priorities Data Kind: Speaker Data Science for Social Good: Speaker Federal Meetup: Speaker Discussion: Using Meetups to explore National Challenges 5:00 pm: Evening Event at AAAS: Grassroots Data Science Across the Nation Lighting Talks: Every group gets 10 slides and 3 minutes Highlight past events in National Priority Areas or of national interest, state plans for the future, and give challenges, and ideas for how a Network of Data Science Organizers can solve national problems. Networking Reception: Highlight AAAS, S&T Fellows, and Affinity Groups 12

DRAFT Potential Sessions for Meetup Meeting: November 6, 2015 Day 2 - November 6th, 2015 (Full Day) 8:00 am Session 2: Exposing Data Available Datasets: Speakers Socrata Open Data Portal demo: Speaker Open Data.gov / Open Data Working Group: Speaker Exposing data resources Meetup Contributions Product Creation: Connecting data sources among regions. 13

DRAFT Potential Sessions for Meetup Meeting: November 6, :00 am Break 10:30 am Session 3: Coordination and Support of Data Science Meetups Resources for Meetups: Federal Support for Meetup groups Coordination mechanisms: You Tube Channel, Podcast, White Papers, listserv Meetup of Data Science Meetup groups Online Discussion: Mechanisms to spread good ideas among regions. 14

DRAFT Potential Sessions for Meetup Meeting: November 6, :30 pm Lunch Speaker 1:30 pm Session 4: The National Priority Challenge National Priority Challenge-Speaker National Data Science Challenges and Hackathons: Proposed by steering committee RDA Research Data Alliance (RDA): P8 venue to announce specific challenge (2016) Working Session: Launching National Priority Challenge :45 pm Closing Remarks: TBA 15

Agenda 6:30 p.m. Dr. Renata Rawlings-Goss (last minute conflict), DRAFT Potential Sessions for Meetup Meeting: November 5-6, 2015DRAFT Potential Sessions for Meetup Meeting: November 5-6, :45 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science for RDA Climate Change Data Challenge and NSF Data Science Workshop 2015Data Science for RDA Climate Change Data ChallengeNSF Data Science Workshop :15 p.m. Brief Member Introductions 7:30 p.m. Dr. Ben Busby, Computational Biology Branch, NCBI, NLM, NIH SlidesDr. Ben BusbyNCBINLMNIHSlides 8:15 p.m. Open Discussion ​8:45 p.m. Networking 9:00 p.m. Depart 16