Last Words DM 1. Mining Data Steams / Incremental Data Mining / Mining sensor data (e.g. modify a decision tree assuming that new examples arrive continuously,

Slides:



Advertisements
Similar presentations
Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
Advertisements

1 Undergraduate Curriculum Revision Department of Computer Science February 10, 2010.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
CS/CMPE 535 – Machine Learning Outline. CS Machine Learning (Wi ) - Asim LUMS2 Description A course on the fundamentals of machine.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Intelligent Systems Group Emmanuel Fernandez Larry Mazlack Ali Minai (coordinator) Carla Purdy William Wee.
Computer Science Prof. Bill Pugh Dept. of Computer Science.
Data Mining – Intro.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Introduction to Data Science Kamal Al Nasr, Matthew Hayes and Jean-Claude Pedjeu Computer Science and Mathematical Sciences College of Engineering Tennessee.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining Techniques
A REVIEW OF FEATURE SELECTION METHODS WITH APPLICATIONS Alan Jović, Karla Brkić, Nikola Bogunović {alan.jovic, karla.brkic,
University of Toronto 8/30/20151 Data Mining The Art and Science of Obtaining Knowledge from Data Dr. Saed Sayad.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 10a-11:30a Instructor: Christoph F. Eick Classroom:AH123
Chapter 1 Introduction to Data Mining
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
University of Colorado Denver Department of Mathematical and Statistical Sciences Graduate program in Applied Mathematics Applications Continuous models:
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Millions of points of measurement Dense spatial and temporal data Need visual analytic tools as conventional analyses are too inefficient Visualization.
RESOURCES, TRADE-OFFS, AND LIMITATIONS Group 5 8/27/2014.
Machine Learning Lecture 1. Course Information Text book “Introduction to Machine Learning” by Ethem Alpaydin, MIT Press. Reference book “Data Mining.
1 ICDM 2004 Business Meeting 11/4/2004 Data Mining on ICDM Submission Data Shusaku Tsumoto Ning Zhong and Xindong Wu.
Guiding Principles. Goals First we must agree on the goals. Several (non-exclusive) choices – Want every CS major to be educated in performance including.
Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics.
ICDM 2003 Review Data Analysis - with comparison between 02 and 03 - Xindong Wu and Alex Tuzhilin Analyzed by Shusaku Tsumoto.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
GeoSpatial and GeoTemporal Informatics for dynamic and complex systems May Yuan.
Slides for “Data Mining” by I. H. Witten and E. Frank.
9/03 Data Mining – Introduction G Dong (WSU)1 CS499/ Data Mining Fall 2003 Professor Guozhu Dong Computer Science & Engineering WSU.
Data Mining and Decision Support
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Data Mining in Germany IIM Conference, Oct. 24, 2012 Gottfried Schwarz, DLR > Lecture > Author Document > Datewww.DLR.de Chart 1.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301
Data Science Interview Questions 1.What do you mean by word Data Science? Data Science is the extraction of knowledge from large.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Why Intelligent Data Analysis? Joost N. Kok Leiden Institute of Advanced Computer Science Universiteit Leiden.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
July 7, 2016 Data Mining: Concepts and Techniques 1 1.
Why Should You Apply to Graduate School? Masters Degree
Data Mining – Intro.
2009: Topics Covered in COSC 6368
School of Computer Science & Engineering
Data-Drive Analytics for Precision Medicine
Topics Covered in COSC 6368 More general topics:
Data Mining: Concepts and Techniques Course Outline
What is Pattern Recognition?
Topics Covered in COSC 6340 Data models (ER, Relational, XML (short))
Research Areas Christoph F. Eick
Data Warehousing and Data Mining
Topics Covered in COSC 6340 Data models (ER, Relational, XML)
CS577/EE537 Advanced Computer Networks
Course Summary ChengXiang “Cheng” Zhai Department of Computer Science
CS577 / EE537 Advanced Computer Networks
Evaluation of Relational Operations: Other Techniques
2004: Topics Covered in COSC 6368
CSE591: Data Mining by H. Liu
Presentation transcript:

Last Words DM 1. Mining Data Steams / Incremental Data Mining / Mining sensor data (e.g. modify a decision tree assuming that new examples arrive continuously, and old examples are discarded) 2. Text Mining 3. Mining the Web/Mining Graphs and other complex structures 4. Mining spatial-temporal data, particularly environmental, cell-phone, and traffic data 5. Contrast mining (e.g. how do two groups of people differ) 6. Data Mining and Privacy 7. Mining Social Networks (kind of hot these days) 8. Statistical Techniques (Principal component analysis, multi- dimensional scaling, feature selection, statistical testing, Bayesian classifier,...)  typically taught in a Machine Learning class. 9. Preprocessing probably deserves more coverage 10. High Performance Data Mining  Parallel Programming Course Other Important Topics in Data Mining that we didn’t or very little discuss in this class

Last Words DM 1. Develop a unifying theory for data mining (e.g. explaining how and when over-fitting occurs) 2. Mining data streams / mining sensor networks / mining sequential data 3. High performance data mining platforms / combining parallel computing and data mining ( 4. Spatial data mining / temporal data mining / spatial temporal 5. Mining graphs and other complex types of data 6. More research on the interestingness of knowledge 7. Distributed data mining (cannot pass the complete data set; distributed decision making, e.g. in sensor networks) 8. Data mining for genomic and earth science problems 9. What is the data mining process --- kind of software engineering for data mining; development of data mining methodologies… 10. Data Mining without violating privacy and security New Challenges for the Field of Data Mining

Last Words DM Complementary Knowledge For Getting Jobs in Data Mining Data Mining Databases Data Structures & Algorithms Software Design Machine Learning AI High Performance Computing Evolutionary Computing Pattern Recognition Statistics Optimization Information Retrieval Image Processing GIS Data Visualization Search Techniques Experimental Evaluation Software Engineering

Last Words DM 2008 Student Textbook Evaluation Overall positive evaluation but –Some felt that algorithms were not explained in sufficient detail, particularly examples are missing –A few felt the material should be better indexed –Some felt it lack highlighting of key points –Some felt it is at an intermediate level, and does not give sufficient depth if the textbook is your only source of knowledge; it also introduces topics more intuitively and not formally, as some more advanced textbook do. 2 students felt that the textbook does not introduce topics very clearly, and that it is not comprehensive.