Knowledge-Intensive Subgroup Mining


Techniques for Automatic and Interactive Discovery

Autor/en:
Martin Atzmüller
Umfang:
214
EAN/ISBN:
978-3-89838-307-3
Band:
307
Ausgabe:
softcover
Buchreihe:
Dissertationen zur Künstlichen Intelligenz
Kategorien:
Buch
Informatik
Künstliche Intelligenz
Dissertationen zur Künstlichen Intelligenz
Englisch
Gesamtverzeichnis AKA Verlag#Complete Index AKA Publisher
Preis:
inkl. 7% MWSt
50,00 €
Subgroup mining is a powerful and broadly applicable data mining approach: In general, the goal is to efficiently discover novel, potentially useful and ultimately interesting knowledge given by subgroup patterns. However, in real-world situations these requirements often cannot be fulfilled, e.g., if the applied methods do not scale for large data sets, if too many results are presented, or if many of the discovered patterns are already known to the user. This thesis proposes a combination of several techniques in order to cope with the sketched problems: Concerning automatic methods we present the novel SD-Map algorithm that is fast and effective. We describe interactive techniques for subgroup introspection and analysis, and we present advanced visualization methods that can be used for optimization, comparison and exploration. Furthermore, we propose to include background knowledge into the mining process. The techniques are combined into a knowledge-intensive process supporting both automatic and interactive methods for subgroup mining. The evaluation consists of two parts: With respect to objective evaluation criteria (efficiency and effectiveness), we provide an experimental evaluation using synthetic data. Subjective evaluation criteria include the user acceptance, the benefit, and the interestingness of the results. The approach has been successfully implemented in medical and technical applications, for which we present five case studies, using real-world data sets.