You are here

Workshop Details

PTDM: Workshop on Practical Theories of Data Mining

Organized by Tijl De Bie, Kleanthis-Nikolaos (Akis) Kontonasios and Eirini Spyropoulou
08:30 - 18:00
Room: Rembrandt Room
https://sites.google.com/site/ptdm2012

The goal of this ICDM 2012 workshop is to help closing the gap between data mining practice and theory. To this end, we intend to explore what is the essence of exploratory data mining and how to formalize it in a useful but theoretically well-founded way.

The workshop is motivated by a widely perceived discrepancy between theoretical data mining prototypes and practitioners' requirements. A notable example is frequent pattern mining. Despite its attractive theoretical foundations, the practical use of frequent pattern mining methods has been limited. This is due to a difficulty to overcome issues, such as the pattern explosion problem and a discrepancy between usefulness and frequency. These issues have been addressed to some extent in the past 15 years, through heuristic post-processing steps and through rigorously motivated adaptations. The multitude of possible solution strategies has unfortunately to a large extent undermined the original elegance, and made it hard for practitioners to understand how to use these techniques.

The problem is however not restricted to frequent pattern mining alone. The multitude of available methods for typical exploratory data mining problems such as (subspace) clustering and dimensionality reduction is such that practitioners face a daunting task in selecting a suitable method. Additionally to the usability issues, less attention has been given on pattern mining methods for relational databases. Although most real world databases are relational, most pattern mining research has focused on one-table data.

We believe the core reasons for these difficulties are:


  • Different users inevitably have different prior beliefs and goals, whereas most exploratory data mining algorithms have a rigid objective function and do not consider this.
  • Formally comparing the quality of different data mining patterns is hard due to their widely varying nature (e.g. comparing a dimensionality reduction with a frequent itemset), unless their "interestingness" can be quantified in a comparable manner.
  • The iterative process of data mining is often not considered.
  • Data mining in complex relational data is hard to fit into standard data mining prototypes.
  • More generally, data mining methods tend to be rigid, defined for highly specific tasks, for highly specific and idealized data, and for very specific types of patterns./li>


The purpose of this workshop will be to serve as a forum of exchanging ideas on how to formalize exploratory data mining in order to make it useful in practice. This workshop will survey some existing attempts at addressing the problems mentioned above. We particularly encourage papers that present principled theoretical contributions motivated by real world requirements.

event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?> event_type == 1) {?> event_type == 2) {?> event_type == 3) {?> event_type == 4) {?> event_type == 5) {?>
08:30 - 09:00 Introduction
Tijl De Bie, Akis Kontonasios and Eirini Spyropoulou
08:30 - 09:00 Introduction
Tijl De Bie, Akis Kontonasios and Eirini Spyropoulou
08:30 - 09:00 Introduction Introduction 08:30 - 09:00 Introduction
Tijl De Bie, Akis Kontonasios and Eirini Spyropoulou
09:00 - 10:00 Keynote Talk
Kathleen Marchal, Universiteit Gent, Belgium
09:00 - 10:00 Keynote Talk
Kathleen Marchal, Universiteit Gent, Belgium
09:00 - 10:00 Keynote Talk Keynote Talk 09:00 - 10:00 Keynote Talk
Kathleen Marchal, Universiteit Gent, Belgium
10:00 - 10:30 Coffee Break
10:00 - 10:30 Coffee Break

10:00 - 10:30 Coffee Break Coffee Break 10:00 - 10:30 Coffee Break
10:30 - 11:30 Keynote Talk: From Inductive Querying to Declarative Modelling for Data Mining
Luc De Raedt, KU Leuven, Belgium
10:30 - 11:30 Keynote Talk: From Inductive Querying to Declarative Modelling for Data Mining
Luc De Raedt, KU Leuven, Belgium
10:30 - 11:30 Keynote Talk: From Inductive Querying to Declarative Modelling for Data Mining Keynote Talk: From Inductive Querying to Declarative Modelling for Data Mining 10:30 - 11:30 Keynote Talk: From Inductive Querying to Declarative Modelling for Data Mining
Luc De Raedt, KU Leuven, Belgium
11:30 - 11:50 Thorough analysis of log data with dependency rules: Practical solutions and theoretical challenges
Wilhelmiina Hämäläinen
11:30 - 11:50 Thorough analysis of log data with dependency rules: Practical solutions and theoretical challenges
Wilhelmiina Hämäläinen
11:30 - 11:50 Thorough analysis of log data with dependency rules: Practical solutions and theoretical challenges Thorough analysis of log data with dependency rules: Practical solutions and theoretical challenges 11:30 - 11:50 Thorough analysis of log data with dependency rules: Practical solutions and theoretical challenges
Wilhelmiina Hämäläinen
11:50 - 12:10 Enhancing the Analysis of Large Multimedia Applications Execution Traces with FrameMiner
Christiane Kamdem Kengne, Leon Constantin Fopa, Noha Ibrahim, Alexandre Termier, Marie-Christine Rousset and Takashi Washio
11:50 - 12:10 Enhancing the Analysis of Large Multimedia Applications Execution Traces with FrameMiner
Christiane Kamdem Kengne, Leon Constantin Fopa, Noha Ibrahim, Alexandre Termier, Marie-Christine Rousset and Takashi Washio
11:50 - 12:10 Enhancing the Analysis of Large Multimedia Applications Execution Traces with FrameMiner Enhancing the Analysis of Large Multimedia Applications Execution Traces with FrameMiner 11:50 - 12:10 Enhancing the Analysis of Large Multimedia Applications Execution Traces with FrameMiner
Christiane Kamdem Kengne, Leon Constantin Fopa, Noha Ibrahim, Alexandre Termier, Marie-Christine Rousset and Takashi Washio
12:10 - 12:30 Generalized Expansion Dimension
Michael Nett, Michael E. Houle, and Hisashi Kashima
12:10 - 12:30 Generalized Expansion Dimension
Michael Nett, Michael E. Houle, and Hisashi Kashima
12:10 - 12:30 Generalized Expansion Dimension Generalized Expansion Dimension 12:10 - 12:30 Generalized Expansion Dimension
Michael Nett, Michael E. Houle, and Hisashi Kashima
12:30 - 14:00 Lunch Break
12:30 - 14:00 Lunch Break

12:30 - 14:00 Lunch Break Lunch Break 12:30 - 14:00 Lunch Break
14:00 - 15:00 Keynote Talk
Kai Puolamäki, Aalto University, Finland
14:00 - 15:00 Keynote Talk
Kai Puolamäki, Aalto University, Finland
14:00 - 15:00 Keynote Talk Keynote Talk 14:00 - 15:00 Keynote Talk
Kai Puolamäki, Aalto University, Finland
15:00 - 15:20 Generating Diverse Realistic Data Sets for Episode Mining
Albrecht Zimmermann
15:00 - 15:20 Generating Diverse Realistic Data Sets for Episode Mining
Albrecht Zimmermann
15:00 - 15:20 Generating Diverse Realistic Data Sets for Episode Mining Generating Diverse Realistic Data Sets for Episode Mining 15:00 - 15:20 Generating Diverse Realistic Data Sets for Episode Mining
Albrecht Zimmermann
15:20 - 15:40 Logical Itemset Mining
Shailesh Kumar, Chandrashekar V. and C.V. Jawahar
15:20 - 15:40 Logical Itemset Mining
Shailesh Kumar, Chandrashekar V. and C.V. Jawahar
15:20 - 15:40 Logical Itemset Mining Logical Itemset Mining 15:20 - 15:40 Logical Itemset Mining
Shailesh Kumar, Chandrashekar V. and C.V. Jawahar
15:40 - 16:00 Coffee Break
15:40 - 16:00 Coffee Break

15:40 - 16:00 Coffee Break Coffee Break 15:40 - 16:00 Coffee Break
16:00 - 17:00 Keynote Talk
Pieter Adriaans, Universiteit van Amsterdam, The Netherlands
16:00 - 17:00 Keynote Talk
Pieter Adriaans, Universiteit van Amsterdam, The Netherlands
16:00 - 17:00 Keynote Talk Keynote Talk 16:00 - 17:00 Keynote Talk
Pieter Adriaans, Universiteit van Amsterdam, The Netherlands
17:00 - 18:00 Panel discussion
TBC
17:00 - 18:00 Panel discussion
TBC
17:00 - 18:00 Panel discussion Panel discussion 17:00 - 18:00 Panel discussion
TBC