CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.

Slides:



Advertisements
Similar presentations
Clustering Basic Concepts and Algorithms
Advertisements

Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
An Overview of Machine Learning
SAK 5609 DATA MINING Prof. Madya Dr. Md. Nasir bin Sulaiman
Principal Component Analysis
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Machine Learning Group University College Dublin 4.30 Machine Learning Pádraig Cunningham.
CSE 546 Data Mining Machine Learning Instructor: Pedro Domingos.
A Technique for Advanced Dynamic Integration of Multiple Classifiers Alexey Tsymbal*, Seppo Puuronen**, Vagan Terziyan* *Department of Artificial Intelligence.
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
Data Mining – Intro.
Multimedia Data Mining Arvind Balasubramanian Multimedia Lab The University of Texas at Dallas.
Graph Classification.
Introduction to machine learning
General Information Course Id: COSC6342 Machine Learning Time: MO/WE 2:30-4p Instructor: Christoph F. Eick Classroom:SEC 201
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Introduction to Machine Learning for Information Retrieval Xiaolong Wang.
Anomaly detection with Bayesian networks Website: John Sandiford.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 10a-11:30a Instructor: Christoph F. Eick Classroom:AH123
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
LOGO Ensemble Learning Lecturer: Dr. Bo Yuan
Machine Learning Lecture 1. Course Information Text book “Introduction to Machine Learning” by Ethem Alpaydin, MIT Press. Reference book “Data Mining.
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Last Words DM 1. Mining Data Steams / Incremental Data Mining / Mining sensor data (e.g. modify a decision tree assuming that new examples arrive continuously,
Machine Learning Documentation Initiative Workshop on the Modernisation of Statistical Production Topic iii) Innovation in technology and methods driving.
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 6. Dimensionality Reduction.
Data Visualization Michel Bruley Teradata Aster EMEA Marketing Director April 2013 Michel Bruley Teradata Aster EMEA Marketing Director.
Syllabus. We covered Regression in Applied Stats. We will review Regression and cover Time Series and Principle Components Analysis. Reference Book.
Topic Modeling using Latent Dirichlet Allocation
Collaborative Filtering Zaffar Ahmed
CSC 594 Topics in AI – Text Mining and Analytics
Team Dogecoin: An Experience in Predicting Hospital Readmissions Acknowledgements The Problem Hospitals in the UK must keep track of which patients, once.
Summary „Data mining” Vietnam national university in Hanoi, College of technology, Feb.2006.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Summary „Rough sets and Data mining” Vietnam national university in Hanoi, College of technology, Feb.2006.
Data Mining and Decision Support
Data Analytics CMIS Short Course part II Day 1 Part 1: Introduction Sam Buttrey December 2015.
Competition II: Springleaf Sha Li (Team leader) Xiaoyan Chong, Minglu Ma, Yue Wang CAMCOS Fall 2015 San Jose State University.
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301
Knowledge Discovery in a DBMS Data Mining Computing models and finding patterns in large databases current major challenge in database systems & large.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Machine Learning with Spark MLlib
Data Mining – Intro.
Who am I? Work in Probabilistic Machine Learning Like to teach 
Sentiment analysis algorithms and applications: A survey
Machine Learning overview Chapter 18, 21
DATA MINING © Prentice Hall.
Introduction to Data Mining
Data Mining 101 with Scikit-Learn
Techniques for Dimensionality Reduction
Special Topics in Data Mining Applications Focus on: Text Mining
Data Mining: Concepts and Techniques Course Outline
Machine Learning Dimensionality Reduction
Mining and Analyzing Data from Open Source Software Repository
Machine Learning for High-Throughput Stress Phenotyping in Plants
כריית מידע -- מבוא ד"ר אבי רוזנפלד.
Data Warehousing and Data Mining
Classification and Prediction
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Machine Learning – a Probabilistic Perspective
Presentation transcript:

CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University

What we did  Data Mining Overview  The KDD Process  Data Preprocessing and Understanding  Using Python and Numpy  Using Scikit-learn modules  Some emphasis on visualizing and understanding characteristics of the data  Supervised Knowledge Discovery  Regression Analysis  Classification  Techniques such as KNN, Ridge Regression, Decision Tree and Bayesian classification  Lots of emphasis on model evaluation  Evaluation metrics  Train-Test methodologies such as cross-validation 2

What we did  Unsupervised Knowledge Discovery  Cluster analysis  Using PCA and SVD for dimensionality reduction, data characterization, and noise reduction.  Association rule discovery  Emphasis on using unsupervised approaches as components of larger knowledge discovery efforts  E.g., using PCA before clustering; using clustering as the basis for classification  Real application domains  Text Mining and document analysis/filtering  Recommender systems  Predictive modeling for marketing/business applications  Image analysis 3

What we did not do (and you should learn later)  Approaches for mining sequential/temporal data  Markov models; time series analysis, sequential pattern mining  Ensemble and Hybrid Classifiers/Predictors  Combining multiple classifiers  Random Forest classifiers  AdaBoost and meta-learners  Support Vector Machines and Kernel-Based Classifiers  Topic modeling with Latent factor models  LDA  Latent Dirichlet Allocation  Non-Negative Matrix Factorization 4