Presentation is loading. Please wait.

Presentation is loading. Please wait.

Richelle Serrano Clindata Insight September 4-8, 2018

Similar presentations


Presentation on theme: "Richelle Serrano Clindata Insight September 4-8, 2018"— Presentation transcript:

1 Richelle Serrano Clindata Insight September 4-8, 2018
Innovation and Your Career in Rapidly Changing World of Job Titles: How Different is Data Scientist from What I do? Richelle Serrano Clindata Insight September 4-8, 2018

2 Agenda Explosion of the Term “Data Scientist”
Short History of Data Science – Highlights Job Market : Data Scientist and Statistical Programmer Job Differences and Similarities Comparisons Data Scientist vs Statistical Programmer Pharma Domain Expertise Data Science Initiatives How to Stay Current in Today’s Job Market WUSS 2018

3 Explosion of the term “Data Science”
Why? The amount of data has skyrocketed Data-driven decisions are more profitable Machine learning has changed how business is conducted WUSS 2018

4 Numerous Data Science Articles Harvard Business Review Publishes
Short History of Data Science Numerous Data Science Articles What is Data Science? A Taxonomy of Data Science The Data Science Venn Diagram William S. Cleveland’s Data Science: An Action Plan for Expanding the Technical Areas of the Field of Statistics Harvard Business Review Publishes Tom Davenport and D.J. Patil’s “Data Scientist: The Sexiest Job of the 21st Century” Dataology & Data Science Research Center Established Members of the International Federation of Classification Societies (IFCS) include the term data science in conference title International Association for Statistical Computing (IASC) is Formed Peter Naur’s Concise Survey of Computer Methods is Published Explosion of Data Science Talk 1974 1977 1996 2001 2007 2009 2010 2012 1960’s 1970’s 1980’s 1990’s 2000 2010 65 70 75 90 95 00 05 10 15 18 80 85 Today WUSS 2018 1962 1977 1989 1997 2003 2005 2008 2011 Articles Abound & Continue… Why the Term ‘Data Science’ is Flawed but Useful - Pete Warden ‘Data Science’: What's in a name? - David Smith Data Science, Moore’s Law, and Moneyball - Harlan Harris John W Tukey’s Exploratory Data Analysis is Published The Future of Data Analysis written by John Tukey, published in The Annals of Mathematical Statistics First Knowledge Discovery in Databases (KDD) Workshop hosted Prof. C. F. Jeff Wu Calls for statistics to be renamed data science and statisticians to be renamed data scientists Launch of Data Science Journals National Science Board publishes “Long-lived Digital Data Collections: Enabling Research and Education in the 21st Century.” JISC Publishes final report, “The Skills, Role & Career Structure of Data Scientists & Curators: Assessment of Current Practice & Future Needs. WUSS 2018

5 Job Comparison Data Scientist vs Statistical Programmer
US CA SF Statistical Programmer (Pharmaceutical) 360 84 27 Data Scientist (Pharmaceutical) 1645 339 145 Approx. 1 in 5 (22%) 1 in 4 (25%) 1 in 5 (19%) Statistical Programmer vs Data Scientist All Industries: 1000:16,000 WUSS 2018

6 Pharmaceutical Programmers have industry specific knowledge and domain expertise
CDISC Standards Pharma programmer needs ability to implement CDISC SDTM and ADaM data standards. Statistical Methods For Clinical Trials Therapeutic Area Knowledge Regulated Environment Pharma Programmer understands a regulated environment, knows and follows GCP and ICH guidelines. Data scientist in other disciplines may face much less regulation. Pharma Programmer must apply the right statistical models for clinical trial data to study design, data collection, and analysis. Data Scientist is tasked with building predictive models and developing machine learning capabilities to analyze data. Vs. WUSS 2018

7 Comparison of Job Descriptions
Data Scientist Statistical Programmer Select and apply a computational approach to advance the product development. Identify data that supports multi-functional groups progress and objectives. Designs, develops, modifies programs to analyze & evaluate clinical trial data. Create study specific and ad hoc listings summary tables figures. Summary Apply variety of analytics to identify trends or insights from multiple data sources/streams. Knowledge of many prog. languages and technologies, less of any expert in any one. Construct prototypes of analytics workflows. Design and construct new processes for data modeling and production using prototypes, algorithms, predictive models, custom analysis. Communicates to multiple functional groups across organization. Statistical programming expertise (SAS, R) CDISC hands on experience Study lifecycle from Protocol to TFL Therapeutic area expertise Regulatory support, ISS/ISE data integration, reports, SDRG and ADRG Manage and audit programming documentation Communicates with Biometrics groups, CRO’s, Data Managers, Medical Affairs and statisticians. Skills Software Python, R, Tensorflow, Tableau used more SAS, R Programming, SAS datasets, CSV files Spotfire used more Edu. MS or PhD in Mathematics, Statistics, Data Science BS or MS in Computer Science, Mathematics, Statistics Work in Predictive models Use of machine learning or AI to automate process for Business Analytics and other business or functional groups Ways to make use of streaming data (wearables), where the amount of data is enormous, taking a record every second. Customize the data to produce an output which displays the behavior of the data.  Contribute to data standards creation, management, and governance. Systems development - macros and utility programs, evaluation and introduction of new technologies including visualization Resourcing and Project Management Current Initiatives WUSS 2018

8 R O C K E T S How to Stay Relevant
- Real-World Evidence to enrich the understanding of data - Open mind to open source (R, Python, etc.) and new technologies (Tableau, Spotfire, ggplot2, shiny, etc.). - Communication. The ability to share findings in ways the organization can understand and act on. - Knowledge; take courses, go to workshops, or study statistics on your own - Engage in the full lifecycle of the study - Therapeutic area expertise cannot be replaced - Structured and Unstructured Data; propose solutions to handle difficult and new types of data WUSS 2018

9 CONCLUSION Data Scientist roles are quite broad and job role is applied differently across industries. Data Science business initiatives are focused on Development of Predictive Models Use of Machine Learning or AI to Automate Process The Challenges Ahead to Make Use of Streaming Data and Turning That into an Endpoint Statistical Programmers specialized work and industry specific knowledge cannot be replaced. Rockets to Relevance -embrace todays challenges in data analysis and continue to build your domain and industry expertise. WUSS 2018

10 WUSS 2018


Download ppt "Richelle Serrano Clindata Insight September 4-8, 2018"

Similar presentations


Ads by Google