ECML/PKDD Discovery Challenges Petr Berka University of Economics, Prague
ECML/PKDD Challenge Workshop Petr Berka, LISp, Discovery Challenge Idea Realistic data mining conditions collaborative rather then competitive nature rather vague specification of the problem Differences to real KDD projects short time for analysis (2-3 months) only indirect access to domain and data experts during the KDD process
ECML/PKDD Challenge Workshop Petr Berka, LISp, Challenge Settings Data and their full description available on the web for all participants Submissions evaluated by domain experts ( and by data mining experts ) Workshop at ECML/PKDD to present the results and discus them with domain experts Results and comments of experts available on the web (after the workshop)
ECML/PKDD Challenge Workshop Petr Berka, LISp, Discovery Challenges
ECML/PKDD Challenge Workshop Petr Berka, LISp, Discovery Challenge 2005 Data about chronic hepatitis (thanks to Shimane Medical University and Chiba University Hospital – S. Hirano & S. Tsumoto) Gene expression data (thanks to Université Claude Bernard, Lyon – O. Gandrillon) Clickstream data (thanks to an Czech e-shop)
ECML/PKDD Challenge Workshop Petr Berka, LISp, Discovery Challenge 2005 workshop program 10:30 – 12:30 Click-stream data 12:30 – 14:00 lunch break 14:00 – 16:00 Gene Expression Data 16:00 – 16:30 coffee break 16:30 – 18:30 Hepatitis Data