Cut to the Trace! Process-Aware Partitioning of Long-Running Cases in Customer Journey Logs

  • Conference paper
  • First Online:
Advanced Information Systems Engineering (CAiSE 2021)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12751))

Included in the following conference series:

Abstract

Customer journeys are recordings of customer interactions with organizational information systems. These interactions are often recorded into so-called customer journey logs. Customer journeys often correspond to long-running and flexible traces that may temper the use of process analytics techniques such as process mining. A common method to make long-running traces suitable for process mining algorithms is to partition them at the largest temporal differences between consecutive events. However, these techniques ignores process context that journeys are often influenced by. In this work, we propose a probabilistic framework that generalizes previous techniques and introduces two novel process-aware partitioning approaches. The first method is inspired by the directly-follows relation, a predominant abstraction in process discovery. The second approach leverages LSTMs, a type of Neural Networks that learn long-term dependencies in sequences. We show that both approaches outperform existing time partitioning methods on both synthetic and real-world customer journey data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 93.08
Price includes VAT (Germany)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 117.69
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    K is a hyper-parameter that controls for the number of cases in the journey log.

  2. 2.

    https://github.com/gaelbernard/cjp.

References

  1. Aalst, W.: Data science in action. In: Process Mining, pp. 3ā€“23. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49851-4_1

    ChapterĀ  Google ScholarĀ 

  2. Benevenuto, F., Rodrigues, T., Cha, M., Almeida, V.: Characterizing user behavior in online social networks. In: Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement, pp. 49ā€“62 (2009)

    Google ScholarĀ 

  3. Berendt, B., Mobasher, B., Nakagawa, M., Spiliopoulou, M.: The impact of site structure and user environment on session reconstruction in web usage analysis. In: ZaĆÆane, O.R., Srivastava, J., Spiliopoulou, M., Masand, B. (eds.) WebKDD 2002. LNCS (LNAI), vol. 2703, pp. 159ā€“179. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-39663-5_10

    ChapterĀ  Google ScholarĀ 

  4. Bernard, G., Andritsos, P.: A process mining based model for customer journey map**. In: Forum and Doctoral Consortium Papers Presented at the 29th International Conference on Advanced Information Systems Engineering (CAiSE Forum), pp. 49-56. CEUR workshop proceedings (2017)

    Google ScholarĀ 

  5. Bernard, G., Andritsos, P.: CJM-ab: abstracting customer journey maps using process mining. In: Mendling, J., Mouratidis, H. (eds.) CAiSE 2018. LNBIP, vol. 317, pp. 49ā€“56. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-92901-9_5

    ChapterĀ  Google ScholarĀ 

  6. Bernard, G., Andritsos, P.: Truncated trace classifier. removal of incomplete traces from event logs. In: Nurcan, S., Reinhartz-Berger, I., Soffer, P., Zdravkovic, J. (eds.) BPMDS/EMMSAD -2020. LNBIP, vol. 387, pp. 150ā€“165. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-49418-6_10

    ChapterĀ  Google ScholarĀ 

  7. Bernard, G., Senderovich, A., Andritsos, P.: Cut to the trace: Technical report. Technical report, University of Toronto (Mar 2021). https://github.com/gaelbernard/cjp/raw/master/TechnicalReport/TechnicalReport.pdf

  8. Bose, R.P.J.C., van der Aalst, W.M.P.: Analysis of patient treatment procedures. In: Daniel, F., Barkaoui, K., Dustdar, S. (eds.) BPM 2011, Part I. LNBIP, vol. 99, pp. 165ā€“166. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-28108-2_17

    ChapterĀ  Google ScholarĀ 

  9. Bose, R.J.C., Mans, R.S., van der Aalst, W.M.: Wanna improve process mining results? In: IEEE Symposium on Computational Intelligence and Data Mining (CIDM), pp. 127ā€“134. IEEE (2013)

    Google ScholarĀ 

  10. Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn. 30(7), 1145ā€“1159 (1997)

    ArticleĀ  Google ScholarĀ 

  11. Daigler, J., Davies, J., Manusama, B., Bharaj, G.: Market guide for customer journey analytics. Technical report, Gartner (Feb 2019)

    Google ScholarĀ 

  12. Dev, H., Liu, Z.: Identifying frequent user tasks from application logs. In: Proceedings of the 22nd International Conference on Intelligent User Interfaces, pp. 263ā€“273 (2017)

    Google ScholarĀ 

  13. van Dongen, B., Ferreira, D.R., Weber, B.: Business processing intelligence challenge 2011 (bpic 11). Technical report, IEEE Task Force on Process Mining (2011). https://www.win.tue.nl/bpi/doku.php?id=2011:challenge

  14. Dustdar, S., Gombotz, R.: Discovering web service workflows using web services interaction mining. Int. J. Bus. Process Integr. Manag. 1(4), 256ā€“266 (2006)

    ArticleĀ  Google ScholarĀ 

  15. Evermann, J., Rehse, J.-R., Fettke, P.: A deep learning approach for predicting process behaviour at runtime. In: Dumas, M., Fantinato, M. (eds.) BPM 2016. LNBIP, vol. 281, pp. 327ā€“338. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58457-7_24

    ChapterĀ  Google ScholarĀ 

  16. Fazzinga, B., Flesca, S., Furfaro, F., Masciari, E., Pontieri, L.: Efficiently interpreting traces of low level events in business process logs. Inf. Syst. 73, 1ā€“24 (2018)

    ArticleĀ  Google ScholarĀ 

  17. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. textbf9(8), 1735ā€“1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735

  18. Kumar, A., Salo, J., Li, H.: Stages of user engagement on social commerce platforms: analysis with the navigational clickstream data. Int. J. Electron. Commer. 23(2), 179ā€“211 (2019)

    ArticleĀ  Google ScholarĀ 

  19. Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Discovering block-structured process models from event logs - a constructive approach. In: Colom, J.-M., Desel, J. (eds.) PETRI NETS 2013. LNCS, vol. 7927, pp. 311ā€“329. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38697-8_17

    ChapterĀ  Google ScholarĀ 

  20. Lemon, K.N., Verhoef, P.C.: Understanding customer experience throughout the customer journey. J. Mark. 80(6), 69ā€“96 (2016)

    ArticleĀ  Google ScholarĀ 

  21. Leno, V., Augusto, A., Dumas, M., La Rosa, M., Maggi, F.M., Polyvyanyy, A.: Identifying candidate routines for robotic process automation from unsegmented ui logs. In: 2020 2nd International Conference on Process Mining (ICPM), pp. 153ā€“160. IEEE (2020)

    Google ScholarĀ 

  22. Leonardi, G., Striani, M., Quaglini, S., Cavallini, A., Montani, S.: Towards semantic process mining through knowledge-based trace abstraction. In: Ceravolo, P., van Keulen, M., Stoffel, K. (eds.) SIMPDA 2017. LNBIP, vol. 340, pp. 45ā€“64. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-11638-5_3

    ChapterĀ  Google ScholarĀ 

  23. Mannhardt, F., Tax, N.: Unsupervised event abstraction using pattern abstraction and local process models, pp. 55ā€“63 (2017)

    Google ScholarĀ 

  24. Srivastava, J., Cooley, R., Deshpande, M., Tan, P.N.: Web usage mining: discovery and applications of usage patterns from web data. ACM SIGKDD Explor. Newsl. 1(2), 12ā€“23 (2000)

    ArticleĀ  Google ScholarĀ 

  25. Tax, N., Sidorova, N., Haakma, R., van der Aalst, W.: Mining process model descriptions of daily life through event abstraction. In: Bi, Y., Kapoor, S., Bhatia, R. (eds.) IntelliSys 2016. SCI, vol. 751, pp. 83ā€“104. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-69266-1_5

    ChapterĀ  Google ScholarĀ 

  26. Tax, N., Sidorova, N., Haakma, R., van der Aalst, W.M.: Mining local process models. J. Innov. Digit. Ecosyst. 3(2), 183ā€“196 (2016). https://doi.org/10.1016/j.jides.2016.11.001. http://www.sciencedirect.com/science/article/pii/S2352664516300232

  27. Tax, N., Verenich, I., La. Rosa, M., Dumas, M.: Predictive business process monitoring with LSTM neural networks. In: Dubois, E., Pohl, K. (eds.) CAiSE 2017. LNCS, vol. 10253, pp. 477ā€“492. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59536-8_30

    ChapterĀ  Google ScholarĀ 

  28. van der Aalst, W., Weijters, T., Maruster, L.: Workflow mining: discovering process models from event logs. IEEE Trans. Knowl. Data Eng. 16(9), 1128ā€“1142 (2004)

    ArticleĀ  Google ScholarĀ 

  29. Verenich, I., Dumas, M., Rosa, M.L., Maggi, F.M., Teinemaa, I.: Survey and cross-benchmark comparison of remaining time prediction methods in business process monitoring. ACM Trans. Intell. Syst. Technol. 10(4) (2019). https://doi.org/10.1145/3331449

  30. van Zelst, S.J., Mannhardt, F., de Leoni, M., Koschmider, A.: Event abstraction in process mining: literature review and taxonomy. Granular Comput. 1ā€“18 (2020). https://doi.org/10.1007/s41066-020-00226-2

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gaƫl Bernard .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bernard, G., Senderovich, A., Andritsos, P. (2021). Cut to the Trace! Process-Aware Partitioning of Long-Running Cases in Customer Journey Logs. In: La Rosa, M., Sadiq, S., Teniente, E. (eds) Advanced Information Systems Engineering. CAiSE 2021. Lecture Notes in Computer Science(), vol 12751. Springer, Cham. https://doi.org/10.1007/978-3-030-79382-1_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-79382-1_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-79381-4

  • Online ISBN: 978-3-030-79382-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation