What is Data Science and Who is Data Scientist

Slides:



Advertisements
Similar presentations
Information and Communication Technology Senior Secondary Subject Selection Information.
Advertisements

Technology of Data Analytics. INTRODUCTION OBJECTIVE  Data Analytics mindset – shallow and wide, deep when you need it  Quick overview, useful tidbits,
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
Social Media Friend or foe?
Machine Learning Case study. What is ML ?  The goal of machine learning is to build computer systems that can adapt and learn from their experience.”
1 genSpace: Community- Driven Knowledge Sharing for Biological Scientists Gail Kaiser’s Programming Systems Lab Columbia University Computer Science.
Chapter 14 The Second Component: The Database.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Introduction to Data Science Kamal Al Nasr, Matthew Hayes and Jean-Claude Pedjeu Computer Science and Mathematical Sciences College of Engineering Tennessee.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Mining Large Data at SDSC Natasha Balac, Ph.D.. A Deluge of Data Astronomy Life Sciences Modeling and Simulation Data Management and Mining Geosciences.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
#GPUGSummit | #INreno15 #GPUGSummit CALLING ALL GEEKS! FIND OUT HOW APPS WORK WITH MICROSOFT DYNAMICS GP David Musgrave MVP Managing Director, Winthrop.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Introduction of Data Mining and Association Rules cs157 Spring 2009 Instructor: Dr. Sin-Min Lee Student: Dongyi Jia.
Next Back MAP 3-1 Management Information Systems for the Information Age Copyright 2002 The McGraw-Hill Companies, Inc. All rights reserved Chapter 3 Data.
 Understand a variety of online social networking tools  Explain to job seekers the benefits of using online social networking methods  Evaluate the.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
1 Melanie Alexander. Agenda Define Big Data Trends Business Value Challenges What to consider Supplier Negotiation Contract Negotiation Summary 2.
BOARD OF ADVOCATES October 9, UPDATES ABET Accredited until 2021 Passed with only 1 concern Must be addressed by next visit “[Students will have]
Freedom to think: The Science of Data Dr Quentin Williams.
MIS2502: Data Analytics Advanced Analytics - Introduction.
ABOUT ME ADAPTIVE SOFTWARE | Samudra Kanankearachchi Senior Software Data Science Specialist NEXT GENERATION OF ADAPTIVE ENTERPRISE.
What’s the Big Deal about Big Data? Jennifer Lewis Priestley, Ph.D. Professor of Statistics and Data Science.
LECTURE 2: DATA MINING. WHAT IS DATA MINING? 2 D ATA M INING AND D ATA W AREHOUSES ? It evolved in to being as the science of databases evolved Database.
ISQS 3358, Business Intelligence Anatomy of Business Intelligence Zhangxi Lin Texas Tech University 1.
D ATA S CIENTISTS Who are they and what do they do?
Machine Learning. Definition Machine learning is a subfield of computer science that evolved from the study of pattern recognition and computational.
SAP BO ONLINE TRAINING B Y H YDERABADSYS O NLINE T RAINING Contact Us: INDIA: USA:
Business Intelligence Overview. What is Business Intelligence? Business Intelligence is the processes, technologies, and tools that help us change data.
Dato Confidential 1 Danny Bickson Co-Founder. Dato Confidential 2 Successful apps in 2015 must be intelligent Machine learning key to next-gen apps Recommenders.
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
AZURE MACHINE LEARNING Bringing New Value To Old Data SQL Saturday #
FACULTY EXTERNSHIP OPPORTUNITIES IN DATA SCIENCE AND DATA ANALYTICS Facilitated by: FilAm Software Technology, Clark Freeport Zone Ecuiti, San Francisco,
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Lecture 1 Book: Hadoop in Action by Chuck Lam Online course – “Cloud Computing Concepts” lecture notes by Indranil Gupta.
SQL Server Performance Tuning
Denis Reznik Data Architect, Intapp, Inc. Microsoft Data Platform MVP
SNS COLLEGE OF TECHNOLOGY
MIS2502: Data Analytics Advanced Analytics - Introduction
Thank You! #sqlsatdnipro Denis
What is a Data Scientist and How Do I Become One?
Query Execution Expectation-Reality Denis Reznik
به نام خدا Big Data and a New Look at Communication Networks Babak Khalaj Sharif University of Technology Department of Electrical Engineering.
Data Mining Modified from
Global Enterprise Search
It’s Always a Hard Choice
Everything you ever wanted to ask but were too shy
SQL Server 2014 Hidden Treasures Denis Reznik Microsoft SQL Server MVP
Hidden Gems of SQL Server 2014
Hidden gems of SQL Server 2016
SQL Server Performance Tuning Nowadays
Big Data Young Lee BUS 550.
Hidden Gems of SQL Server 2016
Data Science Meetup Matthew Renze Data Science Consultant
Hidden Gems of SQL Server 2014
Moving Social Science into the Fourth Paradigm: Opportunity Abounds
Deadlocks Everything you ever wanted to ask but were too shy
Hidden Gems of SQL Server 2014
McGraw-Hill Technology Education
Hidden Gems of SQL Server 2014
Denis Reznik SQL Server 2017 Hidden Gems.
Why should I care about SQL, if I have ORM?
Denis Reznik SQL Server 2017 Hidden Gems.
Presentation transcript:

What is Data Science and Who is Data Scientist Denis Reznik Data Architect at Intapp Kyiv

About me Denis Reznik Data Architect at Intapp, Inc. Microsoft Data Platform MVP Co-Founder of Ukrainian Data Community Father of a boy :) 2 |

Agenda What is Data Science? Who is Data Scientist? Discover some info about Data Scientists using Data Science

Data Science

Data Science is a new term Data Science is a new term. But in the same sense as Columbus was discovered NEW continent 1000 years ago (c) Hector Garcia-Molina. Professor in the Departments of Computer Science and Electrical Engineering at Stanford University

Fourth Paradigm of Science Thousands of years Empirical Few hundreds of years Theoretical Last fifty years Computational “Query the world” Last twenty years eScience (Data Science) “Download the world”

Data Science and Others Business Intelligence Statistics Data(base) Management Visualization Machine Learning Data Mining Artificial Intelligence Big Data

Big Data Science Tasks Facebook Amazon Google LinkedIn Netflix Rozetka Microsoft

Regular Data Science Tasks Data analysis What percentage of users back to our site? Which products usually bought together? Modeling/statistics How many cars we are going to sell next year? Which city is better for opening new office? Engineering/prototyping Product to use a prediction model Visualization of analytics

Human VS. Machine

Human vs. Machine Human Machines Naturally can work with small amount of data Have a knowledge about domain Good image recognition Machines Can make intensive computations Knows only numbers and strings (well, actually only numbers)

(c) Russ Thompson Senior Research Scientist at Alexa A data scientist is the adult version of the kid who can’t stop asking “Why?” (c) Russ Thompson Senior Research Scientist at Alexa

A data scientist is a statistician who lives in San Francisco (c) somebody

Data Scientist vs Software Professional Both terms are very wide

Who is Data Scientist? Scientist Data Scientist Someone who find new discoveries Make a hypothesis Investigate that hypothesis Data Scientist Do the same with data Look for meaning, knowledge in the data Answering questions and rely on data

Data Science

Data Scientist is the sexiest job of 21st century (c) Harvard Business Review  Oh, Really?

Data Dilemma Cost vs. Value Big Data How much Value can I extract from the data? How much it will cost to me to store that data? Big Data No individual record is particularly valuable Having every record is incredibly valuable

Data Science Project Should have a goal (aka Problem) Result How many customers will buy this car next month? Which capacity should I reserve for my database growth? (well, I’ve just discovered that my everyday job become a Data Science problem) In which city should I open the office? Result Prototype of a working algorithm Deploy prediction model to use on a daily basis Visualize trends Find hidden correlations between parameters

Let’s do some Data Science

Where to Learn? University Online Resources Books Coursera Pluralsight Etc. Books

How to start? Your own company Open competitions (Kaggle)

Summary What is Data Science? Who is Data Scientist? Discover info about Data Science using Data Science

Resources The Fourth Paradigm Cloudera: Training A New Generation Of Data Scientists The Future of Data Science - Data Science @ Stanford Coursera: Data Manipulation at Scale Coursera: Machine Learning Understanding Data Science and Why It’s So Important Data Scientist: The Sexiest Job of the 21st Century

Thank you! Denis Reznik Twitter: @denisreznik Email: denisreznik@live.ru Blog: http://reznik.uneta.com.ua Facebook: https://www.facebook.com/denis.reznik.5 LinkedIn: http://ua.linkedin.com/pub/denis-reznik/3/502/234