EGI and Data Scientists: Demand Sy Holsinger EGI.eu Senior Strategy and Policy Officer EGI Community Forum 2015 12 November 2015, Bari EDISON – Education.

Presentation transcript:

EGI and Data Scientists: Demand
Sy Holsinger, EGI.eu Senior Strategy and Policy Officer
EGI Community Forum 2015, 12 November 2015, Bari
EDISON – Education for Data Intensive Science to Open New science frontiers
Grant (INFRASUPP : CSA)

Outline
Overview of EGI
  – Role in EDISON
  – Focus on EGI Data Services
EGI and Data Scientists
  – Profiles and Scope
  – General Market
  – Current Situation
  – Needs
  – Recruitment
Summary/Conclusions
EGI and Data Scientists: Demands. EGI CF’15, Bari – 12 Nov 2015

About EGI
EGI.eu in Amsterdam
EGI [Infrastructure]
  – Federation of 350 Resource Centres across 50 countries
  – Provides distributed computing and storage resources to accelerate data-intensive research
EGI.eu [Coordination Body]
  – Non-profit foundation based in Amsterdam (~20 staff)
  – 26 participants (e.g. NGIs, EIROs) form governing body (EGI Council)
Support projects
  – EGI-Engage: towards Open Science Commons
    – Includes 9 research communities (competence centers)
  – AARC: Federated AAI between e-Infrastructures
  – INDIGO-DataCloud: EGI FedCloud and beyond
  – EDISON: Building the data science profession
  – +others

Role of EGI in EDISON
WP4 Leader: Sustainability and certification of the Data Scientist Profession
  – Business models definition
  – Definition of a certification scheme
WP3 task leader on: EDISON Online Educational Environment
  – Development of the model curricula for the e-Infrastructure specialization
  – Operate the training marketplace and cloud IaaS for running of the hands-on activities with students
WP2 support: Educational Focus and Data Science Body of Knowledge (BoK)
  – Education and training needs and required competencies
    – Computer Science, Scientific Computing, Scientific Infrastructure
  – Data Scientist: Body of Knowledge (DS-BoK) and Data Science Competence Framework (CF-DS) profiles
    – Provide input on required competencies and skills as well as available training courses in the EGI.eu community

EGI Data Services
Data management is performed by interoperable components
Different components address different needs
  – Storage management at site level
  – Transfer between sites
  – File Transfer
  – Content Distribution
  – Federated Data Manager
  – Metadata Catalogue
  – Security
  – Standards

EGI Data Services
EGI and Data Scientists

Data Scientist: Profiles
Oscar Corcho, BDVA Summit 2015, Madrid

Data Scientist: Profiles
Oscar Corcho, BDVA Summit 2015, Madrid
EGI Community
e.g. e-Science centers or experts at universities/research centers
EGI.eu USCT
EGI Resource Providers
EGI end-users

Data Scientist: Scope
Expected increase in demand over next 5 years
  – Both e-Infrastructures and research infrastructures have similar needs
Difficulty finding complete profile needed
  – Required knowledge in a wide range of topics – many with some
    – Need to understand data requirements and translate them into technical services and solutions, e.g. scalability of access; type of data; integration needs
  – Staff filling the DS role (fully or in part) from another position experience challenges
Role of Data Science
  – No data analysis of scientific data (that is done by the researchers)
  – Support researchers in developing tools that allow them to do that analysis

Data Scientist: General Market in EGI
Why is a Data Scientist needed?
  – Demand from user communities to be able to understand requirements and adapt services to their needs
  – Data-driven market
  – Need to evolve the data infrastructure through innovation from those requirements
  – Complexity requires high level of knowledge across a range of technical topics and issues
Applicable Areas
  – Data management
  – Data analysis
  – Storage management
  – Operations
  – Software integration
  – Federated service management

Data Scientist: Current Situation
Data Scientists in EGI
  – Internal: 4 EGI.eu staff providing data scientist activities (20%)
    – No one with the title “Data Scientist”
    – Typically part of the User Community Support Team / Technical Outreach
  – External: ~4 per Competence Center (~35)
  – NGIs have user support teams: ~3 per NGI = ~100
    – Specialize in different technologies (support users based on requirements)
  – Distributed, multi-domain nature of EGI requires reliance on the EGI Champion network to target a wider range of domains
Data Science Skills in EGI
  – Data science knowledge required to provide outreach
    – Know the science workflow to be able to simplify it
    – Know the scientific tools that are being used in order to optimise them and integrate them with other EGI tools and services
  – Need access to data scientists for domain expertise
    – Quite important when required tools don’t exist or apply; then a deep dive happens for more customizable solutions, on a use case basis only

Data Scientist: Needs
Data Scientist skills
  – Hard skills currently higher priority than soft skills
    – However, soft skills are still required, as DS is not a job carried out in a dark corner
What’s a good number?
  – More data scientists increase
    – Number of communities that can be supported
    – Innovation through better translation of requirements
  – However, this needs to be balanced with overall budget and diversified strategy
Education/Training
  – Master’s degree, but most have a PhD
    – Not necessarily required if having specialized training/certification
  – Often required and mainly provided in-house
  – Externally as opportunities arise (e.g. FitSM)
EDISON Training and Certification Scheme fills this need

Data Scientist: Recruitment
Timeline
  – Can range from 3 months to 1 year to find and select new employees (depends on the role and specialization required)
Position Creators
  – Top management approves position to be made available
  – Senior management designs the profile with the line manager
Employment Contracts
  – The more specialized the role, the more flexible hiring needs to be (e.g. remote working)
  – Support in providing work permits for non-EU members
Visibility of Open Positions
  – Rely on EGI community network
  – Use of external agencies rare (if ever)
  – No formal internship programme or partnership with organisation

Summary/Conclusions
Expected increase in demand for Data Scientists
  – Data-driven market
Difficult to find profile with full skill set
  – e-Infrastructures are a complex environment, but still not common even in industry (opportunity!)
Limitations
  – Available budget vs. overall needs (balance)
Need to develop partnerships
  – With universities so they know what skills are needed
  – Professional training organisations and certification authorities (for those post-university)
EDISON is an excellent opportunity to support this field and the needs of e-Infrastructures

Thank you for your attention
@syholsinger