An Introduction to Data Science using Python

Slides:



Advertisements
Similar presentations
Data warehouse example
Advertisements

About ISoft … What is Decision Tree? Alice Process … Conclusions Outline.
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Machine Learning Queens College Lecture 1: Introduction.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Machine Learning CUNY Graduate Center Lecture 1: Introduction.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
Project MLExAI Machine Learning Experiences in AI Ingrid Russell, University.
Machine Learning for Language Technology Introduction to Weka: Arff format and Preprocessing.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Data Mining In contrast to the traditional (reactive) DSS tools, the data mining premise is proactive. Data mining tools automatically search the data.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
D ATA S CIENTISTS Who are they and what do they do?
Machine Learning. Definition Machine learning is a subfield of computer science that evolved from the study of pattern recognition and computational.
Data Science Interview Questions 1.What do you mean by word Data Science? Data Science is the extraction of knowledge from large.
AZURE MACHINE LEARNING Bringing New Value To Old Data SQL Saturday #
Introduction to Machine Learning, its potential usage in network area,
What we mean by Big Data and Advanced Analytics
Introducing Precictive Analytics
What Business Analytics Can Do For You!
Bhakthi Liyanage SQL Saturday Atlanta 15 July 2017
Teck Chia Partner, Exponent.vc
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
IBM Predictive Analytics Virtual Users’ Group Meeting March 30, 2016
Machine Learning with Spark MLlib
Software Testing.
SNS COLLEGE OF TECHNOLOGY
Machine Learning for Computer Security
Machine Learning overview Chapter 18, 21
Machine Learning overview Chapter 18, 21
DATA MINING © Prentice Hall.
It’s All About Me From Big Data Models to Personalized Experience
CS 445/545 Machine Learning Spring, 2017
Data Mining 101 with Scikit-Learn
CH. 1: Introduction 1.1 What is Machine Learning Example:
Prepared by Kimberly Sayre and Jinbo Bi
AV Autonomous Vehicles.
Azure Machine Learning 101
Introduction Data Mining for Business Analytics.
THE ENTERPRISE ANALYTICAL JOURNEY
Introduction to Azure Machine Learning Studio
What is Pattern Recognition?
FUNDAMENTALS OF MACHINE LEARNING AND DEEP LEARNING
Data Mining Modified from
Brief Intro to Python for Statistics
Data Analytics at CNU Dmitriy Shaltayev
Data Science introduction.
Data Science with Python
Overview of Machine Learning
3.1.1 Introduction to Machine Learning
INNOvation in TRAINING BUSINESS ANALYSTS HAO HElEN Zhang UniVERSITY of ARIZONA
Course Introduction CSC 576: Data Mining.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Python for Data Analysis
A Method for the Comparison of Criminal Cases using digital documents
Machine Learning with Databricks
Machine Learning overview Chapter 18, 21
CSE591: Data Mining by H. Liu
Igor Stančin, Alan Jović to: {igor.stancin,
Machine Learning in Business John C. Hull
What is Artificial Intelligence?
R for Data Science Data science Data science is a booming field in today’s world. Since Artificial Intelligence is the main focus of today’s technology,
An Introduction to Data Science using Python
An Introduction to Data Science using Python
Presentation transcript:

An Introduction to Data Science using Python Ganesh Lohani Sr Data Analyst Lockheed Martin ganeshlohani@hotmail.com

What is happening? Massive amount of data 90% of the data in the world today has been created in the last two years alone Big Data (3 Vs):Volume Velocity Variety Data is Everywhere and in many formats: Structured Data Semi structured Data Unstructured Data Data has been considered as assets More opportunities to work on data platform Turn data into Information, Decision Making and Business Value

What is Data Science Data science is a field to extract insights/trends/ intelligence that supports the business leaders to make the better decision Data Science is also a process of validating assumption model hypothesis related to business activities Data science is a relatively new field and deeply rooted to Statistics and Decision Support System It is a Multidisciplinary field ( Domain Knowledge, Tools & technology, Mathematics & Statistics, Programmimg languages)

Data Science Methodology Statement of the problem/Objective of the Study Data Preparation Feature selection Exploratory Data Analysis Model development Test the Model/Hypothesis Communicate the findings to the stakeholders Deployment ( data as a product) Feedback/Lesson Learned and Continuous improvement

Python For Data Science/Data Analysis Python is a open source software used as Data Science tool It is user friendly The Code syntax is simple to read and follow. It supports functional, object oriented, and structural programming languages

Python For Data Science/Data Analysis Python Basics: Variables and Data Types Data Frame ( holds the data, like table in SQL Server) Tuples ( initialized with small brackets, inmutables ) List ( collection of values, mutable), Dictionary ( key value pair) Operations ( comparison Mathematical and and Boolean) Function Methods, Conditional Statement ( If Else, While Loop, For Loop) Python Libraries NumPy (Numerical Computation) Pandas ( Data Analysis) Matplotlib ( Data Visualization) SciKit-Learn ( Machine learning Algorithms)

Machine Learning It is a technique to teach the computer that use data instead of explicitly writing the code. It is a branch of Artificial Intelligence (AI) and deeply rooted to Statistics and Mathematics The output is never 100 accurate. Our goal is to optimize the algorithm/model Example: Weather Forecast: 50 % Chance of rain today

Common Types of Machine Learning Algorithms Supervised Learning Classification ( Email: Spam, No Spam) Regression ( Forecast the Car price, Share price over time) Decision Tree ( Will Rain Today? Yes, No) Unsupervised Learning Clustering ( Customer Segmentation: Gold, Silver, Bronge) Reinforcement Learning React to the environment ( Autonomous Car) Natural Language Processing Text Mining ( Twitter Data Analysis, Customer Survey Data )

Machine Learning Model Simple Regression Demo Machine Learning Model Simple Regression

What Feedback do you have for me? Question & Answer What Feedback do you have for me?

Useful Links https://www.python.org/downloads/ https://www.python.org/doc/ https://numpy.org/devdocs/user/quickstart.html https://pandas.pydata.org/pandas-docs/stable/getting_started/index.html https://matplotlib.org/tutorials/index.html https://scikit-learn.org/stable/index.html https://www.datasciencecentral.com/ https://www.kaggle.com/ https://azure.microsoft.com/en-us/services/machine-learning-studio/ https://machinelearningmastery.com/machine-learning-in-python-step-by-step/