D ATA S CIENTISTS Who are they and what do they do?

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

Data Mining in Industry: Putting Theory into Practice Bhavani Raskutti.
Business Analytics for the 21 st Century TRENDS AND HOT TOPICS.
Quantitative Research and Analytics, Proprietary and Confidential1 Ryan Michaluk
PROVEN. RESULTS. FASTER. University of Tennessee © Bradley, Miles and Pannell Business Analytics Strategic Approaches.
Overview of Data Mining & The Knowledge Discovery Process Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining By Archana Ketkar.
Data Analytics Program at Drake Brad C. Meyer, Chair Information Management and Business Analytics.
Data Mining – Intro.
Introduction to Data Science Kamal Al Nasr, Matthew Hayes and Jean-Claude Pedjeu Computer Science and Mathematical Sciences College of Engineering Tennessee.
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Data Mining By Jason Baltazar, Phil Cademas, Jillian Latham, Rachel Peeler & Kamila Singh.
Chapter 5: Data Mining for Business Intelligence
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Lecture 9: Knowledge Discovery Systems Md. Mahbubul Alam, PhD Associate Professor Dept. of AEIS Sher-e-Bangla Agricultural University.
Analytics for Smart Decisions The Wonderful World of Big Data, Business Analytics and Business Intelligence - Demystified.
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Banking on Analytics Dr A S Ramasastri Director, IDRBT.
Vision + Focus + Execution Meiliu Lu, RVR 5016, For CSc 209 Spring 2003, 5/6/03.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Business Solutions. Agenda Overview Business Solutions Benefits Company Summary.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
Understanding the field & setting expectations.  Personal  International  UNT Alumni (Mathematics)  Academic  Economics & Mathematics  Professional.
Keuze semester big data
Data Mining Tools some examples.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Site Technology TOI Fest Q Celebration From Keyword-based Search to Semantic Search, How Big Data Enables That?
Business Analytics Skills
OMIS 694, Big Data Analytics
Big Data Analytics Platforms. Our Team NameApplication Viborov MichaelApache Spark Bordeynik YanivApache Storm Abu Jabal FerasHPCC Oun JosephGoogle BigQuery.
Academic Year 2014 Spring Academic Year 2014 Spring.
1 Seattle University Master’s of Science in Business Analytics Key skills, learning outcomes, and a sample of jobs to apply for, or aim to qualify for,
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
What is Data Science and Who is Data Scientist
The United States Postal Service processed over 150 billion pieces of mail in 2013—far too much for efficient human sorting. But as recently as 1997,
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
AZURE MACHINE LEARNING Bringing New Value To Old Data SQL Saturday #
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Unlock your Big Data with Analytics and BI on Office365 Brian Culver ● SharePoint Fest Seattle● BI102 ● August 18-20, 2015.
Zohreh Raghebi.  A software platform provides an integrated environment  Machine learning  Data mining  Text mining  Predictive analytics  Business.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
What Will You Learn Today?
Data Mining – Intro.
Big Data A Quick Review on Analytical Tools
MIS2502: Data Analytics Advanced Analytics - Introduction
Make Predictions Using Azure Machine Learning Studio
Information technology skills for the 21st century
What is Business Intelligence?
Data Mining Tools some examples.
Next Generation cargo ERP Enterprise Resource Prediction
Coding - The Ultimate Survival Skill
A GACP and GTMCP company
Experiences with Business Analytics Curriculum Implementation
MIS5101: Data Analytics Advanced Analytics - Introduction
Data Warehousing and Data Mining
Machine Learning with R
Data Science in Practice
orange.biolab.si A general-purpose open source component-based
Big DATA.
Welcome! Knowledge Discovery and Data Mining
Igor Stančin, Alan Jović to: {igor.stancin,
An Introduction to Data Science using Python
Presentation transcript:

D ATA S CIENTISTS Who are they and what do they do?

W HO DO Y OU T HINK W E A RE ? What my friends think I doWhat my mom thinks I do What society thinks I do What my boss thinks I doWhat I think I doWhat I really do

W HAT B UZZ IS A ROUND ?

“Data Scientist is an statistician who lives in California.” “Data Scientist is an statistics on Mac.” “A data scientist is someone who's is better at statistics than any developer and better at development than any statistician” worse statistician worse developer “A data scientist is someone who's is better at statistics than any developer and better at development than any statistician”

“I keep saying that the sexy job in the next 10 years will be statisticians. And I’m not kidding.” − Chief Economist, Google

“Data Scientist: The Sexiest Job of the 21st Century. New key player in organizations is the “data scientist.” It’s a high-ranking professional with the training and curiosity to make discoveries in the world of big data” − Harvard Business School

W HAT S KILLS ARE N ECESSARY ? StatisticsMathVisualization High Performance Computing Computer Vision Machine Learning Data Engineering Data Warehousing Data Science Domain Expertise

W HAT O PPORTUNITIES W E O FFER ? Recommendation Systems Targeted Advertisement Market Segmentation Fraud Detection Customer Behavior Analysis Stock Price Forecasting

W HAT T ASKS W E S OLVE ? Regression Classification Similarity matching Clustering Behavior Description Among all the our customers, which are likely to respond to a given offer? What is the chance that the new contract will be signed? How much will a given customer use the service? How much money can we earn selling the service? What items are commonly purchased together? Do our customers form natural groups or segments? What is the typical cell phone usage of this customer segment? Among all user transactions, which are likely be marked as fraud? Link Prediction What items could be recommended for user, based on his previous purchases? What are the companies similar to our best business customers?

W HAT T OOLS DO W E U SE Big Data Technologies Programming Languages IDEs and Libraries Visualization Enterprise Software Python, R, Java (bonus Clojure, Haskell, Scala) Hadoop, HDFS & MapReduce... (bonus: Spark, Storm) D3.js, Gephi, ggplot2, Tableau SPSS, MatLab, SAS ( the enterprise man) NoSQLMongoDB, Couchbase, Cassandra, HBase Knime, Weka, RapidMiner, SciPy, NumPy, scikit-learn, MS Excel: The most used and most understood DS tool

B USINESS A NALYTICS Descriptive Analytics Predictive Analytics Prescriptive Analytics What happened? What is happening? Business reporting Dashboards Scorecards Data warehousing Well defined business problems and opportunities What will happen? Why will it happen? What should I do? Why should I do it? Data Mining Text Mining Web Mining Forecasting Optimization Simulation Decision Modeling Expert Systems Accurate projections of the future states and conditions Best possible business decisions and transactions Questions Enablers Outcomes

C OMMON SDLC WILL NOT F IT H ERE Business Understanding Data Data Understanding Data Preparation Modeling Evaluation Deployment

T HE R OAD IS NOT E ASY

Q UESTIONS ?