Multi-Perspective Clustering of Process Execution Traces
Process mining techniques extract process models from event logs. Problems arise when process mining is applied to event logs of flexible processes, because such logs are extremely heterogeneous. Here, trace clustering can be used to reduce the complexity of logs by splitting them into more homogeneous subsets. Common techniques cluster on isolated criteria such as activity profiles. Especially in flexible environments, however, the additional data attributes stored in event logs are an unused source of knowledge for trace clustering. In this paper, we present a multi-perspective trace clustering approach that improves the homogeneity of trace subsets. Our approach provides an integrated definition of similarity between traces through a distance measure that combines information about executed activities, performing resources, and data values. An evaluation with two real-life event logs, one from a hospital and one containing traffic fine data, shows that the homogeneity of the resulting clusters can be significantly improved compared to existing techniques.
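The idea of a distance measure that combines several perspectives can be illustrated with a minimal sketch. It is not the measure defined in the paper: the event layout (dictionaries with the keys `activity`, `resource`, and `data`), the function names, the per-perspective distances, and the equal default weights are all assumptions chosen for illustration.

```python
from collections import Counter
from math import sqrt


def profile_distance(a: Counter, b: Counter) -> float:
    """Euclidean distance between relative-frequency profiles, scaled to [0, 1]."""
    total_a = sum(a.values()) or 1
    total_b = sum(b.values()) or 1
    keys = set(a) | set(b)
    raw = sqrt(sum((a[k] / total_a - b[k] / total_b) ** 2 for k in keys))
    # sqrt(2) is the largest possible distance between two frequency vectors.
    return raw / sqrt(2)


def data_distance(x: dict, y: dict) -> float:
    """Share of data attributes whose values differ (assumed mismatch measure)."""
    keys = set(x) | set(y)
    if not keys:
        return 0.0
    return sum(1 for k in keys if x.get(k) != y.get(k)) / len(keys)


def merged_data(trace: list) -> dict:
    """Collect the data attributes of a trace; later events overwrite earlier ones."""
    merged = {}
    for event in trace:
        merged.update(event.get("data", {}))
    return merged


def trace_distance(t1: list, t2: list, weights=(1 / 3, 1 / 3, 1 / 3)) -> float:
    """Weighted sum of activity, resource, and data distances (hypothetical weights)."""
    w_act, w_res, w_data = weights
    d_act = profile_distance(Counter(e["activity"] for e in t1),
                             Counter(e["activity"] for e in t2))
    d_res = profile_distance(Counter(e["resource"] for e in t1),
                             Counter(e["resource"] for e in t2))
    d_data = data_distance(merged_data(t1), merged_data(t2))
    return w_act * d_act + w_res * d_res + w_data * d_data


# Two made-up traces; the values are illustrative, not taken from the evaluated logs.
t1 = [{"activity": "register", "resource": "alice", "data": {"age": 40}},
      {"activity": "triage", "resource": "bob", "data": {}}]
t2 = [{"activity": "pay", "resource": "carol", "data": {"age": 70}}]
print(trace_distance(t1, t2))
```

A pairwise distance matrix built with such a function could then be passed to any distance-based clustering method. Changing the weights shifts the emphasis between the control-flow, resource, and data perspectives.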
Copyright (c) 2019 Stefan Jablonski, Maximilian Röglinger, Stefan Schönig, Katrin Maria Wyrtki
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms: Authors retain copyright and grant the journal 'Enterprise Modelling and Information Systems Architectures - International Journal of Conceptual Modeling' and the Gesellschaft für Informatik e.V. (GI) the permission of first publication, and the non-exclusive, irrevocable and non-time limited publication permission for the submitted work including the permissions to store, copy, distribute and reproduce their work in printed and electronic form for the duration of the legal copyright. This includes the right of translation. Authors grant the journal 'Enterprise Modelling and Information Systems Architectures - International Journal of Conceptual Modeling' and the Gesellschaft für Informatik e.V. (GI) the permission to license their work under a Creative Commons BY-SA 4.0 license that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book) given an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges as well as earlier and greater citation of published work (see The Effect of Open Access). The submitting corresponding author, on behalf of all co-authors, asserts that she/he is entitled to grant the above-mentioned permissions for the submitted work.