Panel: The Art of Data Mining, and the Quest for Greater Insight


Moderator: Kate Smith-Miles, Deakin University, Australia

Panelists:
- Kristin Bennett, Rensselaer Polytechnic Institute, USA
- Sven Crone, Lancaster University, UK
- Wlodzislaw Duch, Nicolaus Copernicus University, Poland
- Isabelle Guyon, ClopiNet, USA
- Nik Kasabov, Auckland University of Technology, New Zealand
- Zhi-Hua Zhou, Nanjing University, China

Overview

The data mining process requires a number of decisions to be made in each stage:
- selection of data and variables,
- choice of suitable sampling methods,
- data pre-processing steps,
- selection of the best knowledge discovery algorithms,
- selection of parameters.

With so many choices that can have a significant impact upon the eventual success of the results, data mining can sometimes be seen as more art than science unless the user is highly knowledgeable.

Is there a science to data mining? Or is it still more art than science? What insights do our experts have about which methods to use when?
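To make the scale of these choices concrete, the short Python sketch below enumerates every pipeline that results from picking one option per decision stage. The stage names follow the overview; the options listed for each stage are hypothetical placeholders chosen for illustration and do not come from the panel.

```python
from itertools import product

# One entry per decision stage named in the overview.
# The options are illustrative assumptions, not taken from the panel.
STAGES = {
    "variables": ["all", "top_10_by_correlation"],
    "sampling": ["none", "random_undersample", "stratified"],
    "preprocessing": ["none", "standardize", "min_max"],
    "algorithm": ["decision_tree", "knn", "svm"],
    "parameters": ["default", "small_grid", "large_grid"],
}


def enumerate_pipelines(stages):
    """Return every combination of one option per stage as a list of dicts."""
    names = list(stages)
    return [dict(zip(names, combo)) for combo in product(*stages.values())]


pipelines = enumerate_pipelines(STAGES)
print(len(pipelines))  # 2 * 3 * 3 * 3 * 3 = 162 candidate pipelines
```

Even with only two or three options per stage, exhaustive trial and error means evaluating 162 pipelines, which is one way to see why guidance on which choices matter is valuable.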

Aims

This panel discussion aims to bring together experts in data mining to see if we can come up with some ideas about:
- Our collective knowledge of when certain techniques (algorithms, pre-processing methods, etc.) are expected to perform well.
- How much insight do we have into the most effective data mining process?
- How can recent research in model selection and meta-learning help us to gain greater insight into the most effective data mining steps for a given problem?
- Can we take some of the mystery and need for trial and error out of the process, come up with some expert guidelines, and lay the foundations for merging this information with large-scale empirical analysis in the future?
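As a minimal sketch of what empirical model selection can look like, the Python example below compares two deliberately simple classifiers by k-fold cross-validation and keeps the one with the higher held-out accuracy. The toy data, the two candidate models, and all function names are assumptions made for illustration; the panel does not prescribe any particular procedure.

```python
def k_fold_indices(n, k):
    """Split indices 0..n-1 into k contiguous folds of near-equal size."""
    folds, start = [], 0
    for i in range(k):
        size = n // k + (1 if i < n % k else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def majority_class(train_x, train_y, x):
    """Baseline: always predict the most frequent training label."""
    return max(sorted(set(train_y)), key=train_y.count)


def one_nearest_neighbour(train_x, train_y, x):
    """Predict the label of the closest training point (1-D inputs)."""
    distances = [abs(tx - x) for tx in train_x]
    return train_y[distances.index(min(distances))]


def cross_val_accuracy(model, xs, ys, k=4):
    """Fraction of points predicted correctly while held out of training."""
    correct = 0
    for fold in k_fold_indices(len(xs), k):
        held_out = set(fold)
        train_x = [x for i, x in enumerate(xs) if i not in held_out]
        train_y = [y for i, y in enumerate(ys) if i not in held_out]
        correct += sum(model(train_x, train_y, xs[i]) == ys[i] for i in fold)
    return correct / len(xs)


# Toy one-dimensional data set: low values are class 0, high values class 1.
xs = [1.0, 9.0, 2.0, 8.0, 1.5, 8.5, 2.5, 7.5]
ys = [0, 1, 0, 1, 0, 1, 0, 1]

candidates = {
    "majority_class": majority_class,
    "one_nearest_neighbour": one_nearest_neighbour,
}
scores = {name: cross_val_accuracy(model, xs, ys) for name, model in candidates.items()}
best = max(scores, key=scores.get)
print(scores, best)
```

Meta-learning, as raised in the aims, would go one step further: record scores like these across many data sets, together with descriptive features of each data set, and learn which method tends to win under which conditions.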

Questions for discussion

- Is there a science to data mining?
- Do you have your own rules (developed by experience) about when certain methods should be used, or not used, for:
  - selection of data and variables,
  - choice of suitable sampling methods,
  - data pre-processing steps,
  - selection of the best knowledge discovery algorithms,
  - selection of parameters?
- What about empirical studies (meta-learning, model selection, etc.) aimed at learning these rules?
- What would we need to do to take the trial and error and art out of the process to make data mining more user-friendly and effective?
- Next steps?