Data Science introduction.

Slides:



Advertisements
Similar presentations
Data Mining Sangeeta Devadiga CS 157B, Spring 2007.
Advertisements

Classical Techniques: Statistics, Neighborhoods, and Clustering.
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Dr. Awad Khalil Computer Science Department AUC
Knowledge Discovery & Data Mining process of extracting previously unknown, valid, and actionable (understandable) information from large databases Data.
Data Mining By Jason Baltazar, Phil Cademas, Jillian Latham, Rachel Peeler & Kamila Singh.
Data Mining Techniques
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
CS490D: Introduction to Data Mining Prof. Chris Clifton April 14, 2004 Fraud and Misuse Detection.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Understanding Data Analytics and Data Mining Introduction.
Tang: Introduction to Data Mining (with modification by Ch. Eick) I: Introduction to Data Mining A.Short Preview 1.Initial Definition of Data Mining 2.Motivation.
© 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke Slide 1 Chapter 9 Competitive Advantage with Information Systems for Decision Making.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Knowledge Discovery and Data Mining Evgueni Smirnov.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Fox MIS Spring 2011 Data Mining Week 9 Introduction to Data Mining.
1 Statistical Techniques Chapter Linear Regression Analysis Simple Linear Regression.
EXAM REVIEW MIS2502 Data Analytics. Exam What Tool to Use? Evaluating Decision Trees Association Rules Clustering.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
What is Data Mining? process of finding correlations or patterns among dozens of fields in large relational databases process of finding correlations or.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
DATA MINING By Cecilia Parng CS 157B.
Business Intelligence - 2 BUS 782. Topics Data warehousing Data Mining.
DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER.
Data Mining. Overview the extraction of hidden predictive information from large databases Data mining tools predict future trends and behaviors, allowing.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
DECISION TREES Asher Moody, CS 157B. Overview  Definition  Motivation  Algorithms  ID3  Example  Entropy  Information Gain  Applications  Conclusion.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Ahmed K. Ezzat, SQL Server 2008 and Data Mining Overview 1 Data Mining and Big Data.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
 Analysis of statistics generated by market activity such as past price and volume to come up with reasonable outcome in future using charts as a primary.
Supplemental Chapter: Business Intelligence Information Systems Development.
Computer Information Technology
Data Mining Functionalities
Data Mining.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Introduction Data Mining for Business Analytics (3rd ed.)
Machine Learning for Computer Security
By Arijit Chatterjee Dr
MIS2502: Data Analytics Advanced Analytics - Introduction
Project 5 Data Mining & Structural Equation Modeling
DATA MINING © Prentice Hall.
Data Mining 101 with Scikit-Learn
ALZHEIMER DISEASE PREDICTION USING DATA MINING TECHNIQUES P.SUGANYA (RESEARCH SCHOLAR) DEPARTMENT OF COMPUTER SCIENCE TIRUPPUR KUMARAN COLLEGE FOR WOMEN.
Waikato Environment for Knowledge Analysis
Adrian Tuhtan CS157A Section1
CALIFORNIA STATE UNIVERSITY, SACRAMENTO
Chapter 2 Describing Data: Graphs and Tables
Introduction to Azure Machine Learning Studio
Machine Learning & Data Science
Sangeeta Devadiga CS 157B, Spring 2007
Data Analysis.
Supporting End-User Access
Data Mining: Introduction
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Displaying Data – Charts & Graphs
Welcome! Knowledge Discovery and Data Mining
Presentation transcript:

Data Science introduction

What is Data Science? Data science is a "concept to unify statistics, data analysis and their related methods" in order to "understand and analyze actual phenomena" with data.[3] It employs techniques and theories drawn from many fields within the broad areas of mathematics, statistics, information science, and computer science, in particular from the subdomains of: - machine learning (computers learn without being explicitly programmed) - classification (identifying sets and categories) - cluster analysis (grouping sets of objects into clusters that have similar properties) - data mining (discovering patterns in large datasets) - databases (organized collection of data) - visualization (representation of data, graphs, plots etc.)

Linear Regression (Machine Learning) Other questions Linear Regression can guess at. What will the temperature be in 3 days? What will my sales be next quarter? What will the price of a stock be tomorrow

Decision Tree (Machine Learning)

Abnormal events or behavior Purchases on credit cards that are not consistent with previous purchases. - outside your usual price range - outside your usual radius of purchasing area - outside your usual gap if time between purchases

Which is better (or Which will happen)? Which is would customers like more? “25% off” or “buy one get one free” If the stock price of coke goes up? Will the price of Pepsi go up, down, unchanged?

Clustering Algorithms Which viewers like the same types of Movies? Walmart – Beer and Diapers on Friday story

Answering a specific question? What will happen to the stock price next week? May go up, may go down, may stay the same What will the stock price be next week? (Data must include stock price history to predict) Which stock is the best to buy for the next week? What will the effect on derivatives be on changes in stock prices? What stocks are correlated to each other? How will changes in correlated stocks effect derivatives?

Popular Tools for Data Science

Data Science Process