Abstract
Knowledge Tracing (KT) is one of the fundamental tasks in intelligent education, which mainly predicts students’ performance on new questions based on their historical learning interaction behavior. Although many deep learning methods have been successfully applied to KT tasks, there are still some problems in real scenarios. The data acquisition of student interactions is often the key problem in many KT tasks, since most KT models require a large amount of student interaction data for model training. However, such data is difficult to obtain because it may contain personal information about students. Our goal is to leverage existing student exercises from another subject/dataset (namely source domain) to train a KT model and adapt it to the current subject/dataset (namely target domain) using only a small amount of target domain data. Therefore, we propose a novel Domain Adaptive Knowledge Tracing model (DAKT). To achieve the adaptation of knowledge tracing, we design a domain-shared answer embedding module to capture the behavioral features of students’ past answers and a domain-adaptive knowledge state module to adapt the model to the target domain. We conduct extensive experiments on four benchmark datasets to demonstrate the effectiveness of our model.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs13042-024-02219-y/MediaObjects/13042_2024_2219_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs13042-024-02219-y/MediaObjects/13042_2024_2219_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs13042-024-02219-y/MediaObjects/13042_2024_2219_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs13042-024-02219-y/MediaObjects/13042_2024_2219_Figa_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs13042-024-02219-y/MediaObjects/13042_2024_2219_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs13042-024-02219-y/MediaObjects/13042_2024_2219_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs13042-024-02219-y/MediaObjects/13042_2024_2219_Fig6_HTML.png)
Similar content being viewed by others
Data availability
The datasets used in this study’s experiments can be found at the following link: 1. https://sites.google.com/site/assistmentsdata/home/assistment2009-2010-data/skill-builder-data-2009-2010. 2. https://sites.google.com/site/assistmentsdata/home/2015assistments-skillbuilder-data. 3. https://sites.google.com/view/assistmentsdatamining/dataset?authuser=0. 4. https://pslcdatashop.web.cmu.edu/KDDCup/downloads.jsp.
Notes
References
Abdelrahman G, Wang Q, Nunes B (2023) Knowledge tracing: a survey. ACM Computi Surv 55(11):1–37
Corbett AT, Anderson JR (1994) Knowledge tracing: modeling the acquisition of procedural knowledge. User Model User Adapt Interact 4:253–278
Piech C, Bassen J, Huang J, Ganguli S, Sahami M, Guibas LJ, Sohl-Dickstein J (2015) Deep knowledge tracing. Adv Neural Inform Process Syst 28:505–513
Ghosh A, Heffernan N, Lan AS (2020) Context-aware attentive knowledge tracing. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2330–2339
Choi Y, Lee Y, Cho J, Baek J, Kim B, Cha Y, Shin D, Bae C, Heo J (2020) Towards an appropriate query, key, and value computation for knowledge tracing. In: Proceedings of the Seventh ACM Conference on Learning@ Scale, pp. 341–344
Cheng S, Liu Q, Chen E, Zhang K, Huang Z, Yin Y, Huang X, Su Y (2022) Adaptkt: A domain adaptable method for knowledge tracing. In: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, pp. 123–131
Cheng B, Misra I, Schwing AG, Kirillov A, Girdhar R (2022) Masked-attention mask transformer for universal image segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1290–1299
Corbett A (2000) Cognitive mastery learning in the act programming tutor. In: Adaptive User Interfaces. AAAI Technical Report SS-00-01
Embretson SE, Reise SP (2013) Item response theory
Lord F, Novick MR (1968) Statistical theories of mental test scores. Addison. Reading
Cen H, Koedinger K, Junker B (2008) Comparing two irt models for conjunctive skills. In: Intelligent Tutoring Systems: 9th International Conference, ITS 2008, Montreal, Canada, June 23-27, 2008 Proceedings 9:796–798. Springer
Cen H, Koedinger K, Junker B (2006) Learning factors analysis–a general method for cognitive model evaluation and improvement. In: Intelligent Tutoring Systems: 8th International Conference, ITS 2006, Jhongli, Taiwan, June 26-30, 2006. Proceedings 8:164–175. Springer
Lipton ZC, Berkowitz J, Elkan C (2015) A critical review of recurrent neural networks for sequence learning. ar**v preprint ar**v:1506.00019
Graves A, Graves A (2012) Long short-term memory. In: Supervised Sequence Labelling with Recurrent Neural Networks, 37–45
Zhang J, Shi X, King I, Yeung D-Y (2017) Dynamic key-value memory networks for knowledge tracing. In: Proceedings of the 26th International Conference on World Wide Web, pp. 765–774
Pandey S, Karypis G (2019) A self-attentive model for knowledge tracing. ar**v preprint ar**v:1907.06837
Liu Z, Liu Q, Chen J, Huang S, Luo W (2023) simplekt: a simple but tough-to-beat baseline for knowledge tracing. ar**v preprint ar**v:2302.06881
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Adv Neural Inform Process Syst 30
Nakagawa H, Iwasawa Y, Matsuo Y (2019) Graph-based knowledge tracing: modeling student proficiency using graph neural network. In: IEEE/WIC/ACM International Conference on Web Intelligence, pp. 156–163
Liu Y, Yang Y, Chen X, Shen J, Zhang H, Yu Y (2020) Improving knowledge tracing via pre-training question embeddings. ar**v preprint ar**v:2012.05031
Su Y, Liu Q, Liu Q, Huang Z, Yin Y, Chen E, Ding C, Wei S, Hu G (2018) Exercise-enhanced sequential modeling for student performance prediction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32
Liu Q, Huang Z, Yin Y, Chen E, **ong H, Su Y, Hu G (2019) Ekt: Exercise-aware knowledge tracing for student performance prediction. IEEE Trans Knowl Data Eng 33(1):100–115
Wang J, Lan C, Liu C, Ouyang Y, Qin T, Lu W, Chen Y, Zeng W, Yu P (2022) Generalizing to unseen domains: A survey on domain generalization. IEEE Trans Knowl Data Eng
Wang M, Deng W (2018) Deep visual domain adaptation: a survey. Neurocomputing 312:135–153
Bruzzone L, Marconcini M (2009) Domain adaptation problems: a dasvm classification technique and a circular validation strategy. IEEE Trans Pattern Anal Mach Intell 32(5):770–787
Gong B, Grauman K, Sha F (2013) Connecting the dots with landmarks: Discriminatively learning domain-invariant features for unsupervised domain adaptation. In: International Conference on Machine Learning, pp. 222–230. PMLR
Ganin Y, Lempitsky V (2015) Unsupervised domain adaptation by backpropagation. In: International Conference on Machine Learning, pp. 1180–1189. PMLR
Venkateswara H, Eusebio J, Chakraborty S, Panchanathan S (2017) Deep hashing network for unsupervised domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5018–5027
Saito K, Kim D, Sclaroff S, Darrell T, Saenko K (2019) Semi-supervised domain adaptation via minimax entropy. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8050–8058
Liang J, Hu D, Feng J (2020) Do we really need to access the source data? source hypothesis transfer for unsupervised domain adaptation. In: International Conference on Machine Learning, pp. 6028–6039. PMLR
Ganin Y, Lempitsky V (2015) Unsupervised domain adaptation by backpropagation. In: International Conference on Machine Learning, pp. 1180–1189. PMLR
Tzeng E, Hoffman J, Saenko K, Darrell T (2017) Adversarial discriminative domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7167–7176
Hajian-Tilaki K (2013) Receiver operating characteristic (roc) curve analysis for medical diagnostic test evaluation. Casp J Inter Med 4(2):627
Fawcett T (2006) An introduction to roc analysis. Pattern Recogn Lett 27(8):861–874
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China under Grant 62276138 and Grant 62076135 and in part by the Jiangsu “Qing Lan” Project.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Tang, Y., Yang, W., **e, Y. et al. Domain adaptive knowledge tracing. Int. J. Mach. Learn. & Cyber. (2024). https://doi.org/10.1007/s13042-024-02219-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13042-024-02219-y