Abstract
In recent years, online education has been given more and more attention with the widespread use of the internet. The teaching procedure divides space and makes time for online learning; though teachers cannot control the learners accurately, the state of education calculates learners’ learning situation. This paper explains that the discourse analysis method is utilized to examine the online teaching behavior of teachers and student behavior like class attendance, how many are active in class, and learning behavior in online education. Also, discourse analysis will optimize and enhance the classification of the text understood according to their language. After pre-processing, feature extraction was done by utilizing Term Frequency-Inverse Document Frequency, and feature selection was calculated via utilizing chi-square examination for teacher discourse like learning behavior, languages understood by students, and language types. Moreover, the machine learning-based classification technique Support Vector Machine (SVM) is considered to analyze the teacher discourse in class automatically, and results are compared with existing techniques.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-024-12515-3/MediaObjects/10639_2024_12515_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-024-12515-3/MediaObjects/10639_2024_12515_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-024-12515-3/MediaObjects/10639_2024_12515_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-024-12515-3/MediaObjects/10639_2024_12515_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-024-12515-3/MediaObjects/10639_2024_12515_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10639-024-12515-3/MediaObjects/10639_2024_12515_Fig6_HTML.png)
Similar content being viewed by others
Data availability
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
Code availability
Not applicable.
References
Arguel, A., et al. (2017). Inside out: Detecting learners’ confusion to improve interactive digital learning environments. Journal of Educational Computing Research, 55(4), 526–551.
Asaly, S., et al. (2022). Using support vector machine (SVM) with GPS ionospheric TEC estimations to potentially predict earthquake events. Remote Sensing, 14(12), 2822.
Aswad, M., & Sardi, A. (2023). The discourse analysis of Diction effects on teachers used in Teaching English as a Foreign Language. AL-ISHLAH. Jurnal Pendidikan, 15(2), 2647–2654.
Casanova, M., & Alvarez, I. M. (2012). Online cooperative learning and key interpsychological mechanisms: An exploratory study through the analysis of the discourse content. Creative Education, 3(08), 1345.
Chonggao, P. (2021). Simulation of student classroom behavior recognition based on cluster analysis and random forest algorithm. Journal of Intelligent & Fuzzy Systems, 40(2), 2421–2431.
Ewing, B. (2017). Theorizing critical discourse theory and analysis for investigating mathematics classrooms. Creative Education, 8(13), 797592064–2090.
Ghani, T., Jahan, N., Khan, M. M., Rahman, S. T., & Islam, S. T. A. (2021). Development and analysis of a machine learning based Software for assisting online classes during COVID-19. Journal of Software Engineering and Applications, 14(3), 83–94.
Hasram, S., Arif, F. K. M., Nasir, M. K. M., Mohamad, M., Daud, M. Y., Abd Rahman, M. J., & Mohammad, W. M. R. W. (2020). Online games for primary school vocabulary teaching and learning: A literature review. Creative Education, 11(11), 2327.
Hew, K. F., et al. (2020). What predicts student satisfaction with MOOCs: A gradient boosting trees supervised machine learning and sentiment analysis approach. Computers & Education, 145, 103724.
Janssona, M., Hrastinski, S., Stenbom, S., & Enoksson, Fredrik. (2021). Online question and answer sessions: How students support their own and other students’ processes of inquiry in a text-based learning environment. The Internet and Higher Education, 51, 100817.
Kumari, S. (2022). Text mining and pre-processing methods for social media data extraction and processing. In Handbook of research on opinion mining and text analytics on literary works and social media (pp. 22–53). IGI Global. https://doi.org/10.4018/978-1-7998-9594-7.ch002
Li, G., et al. (2021). A convolutional neural network (CNN) based approach for the recognition and evaluation of classroom teaching behavior. Scientific Programming, 2021, 1–8.
Liu, S. (2021). Analysis of the current Situation and trends of Online Education Research based on VOSviewer. Open Journal of Social Sciences, 9(9), 426–432.
Lu, Y. L. (2020). How do nurses acquire English medical discourse ability in nursing practice? Exploring nurses’ medical discourse learning journeys and related identity construction. Nurse Education Today, 85, 104301.
Ma, H., Yao, J., & Liu, L. (2017). Research on the correlation between Learning Effectiveness and Online Learning Behavior based on Online Education Scene. Creative Education, 8, 13.
Muthuprasad, T., et al. (2021). Students’ perception and preference for online education in India during COVID-19 pandemic. Social Sciences & Humanities open, 3(1), 100101.
Naeem, M. Z., et al. (2022). Classification of movie reviews using term frequency-inverse document frequency and optimized machine learning algorithms. PeerJ Computer Science, 8, e914.
Omaima, A., Aditya, J., & Huzefa, R. (2018). Needle in a haystack: Identifying learner posts that require urgent response in MOOC discussion forums. Computers & Education, 118, 1–9.
Rai, P. (2022). Co** strategies of University EFL Teachers for Online Instruction during COVID-19 in Nepal. Journal of NELTA, 27(1–2), 40–52.
Rivera, J. L. (2019). Blended learning-effectiveness and application in teaching and learning foreign languages. Open Journal of Modern Linguistics, 9(2), 129–144.
Shen, C., Panda, S., & Vogelstein, J. T. (2022). The chi-square test of distance correlation. Journal of Computational and Graphical Statistics, 31(1), 254–262.
Wang, X., & Yuan, D. (2022). An empirical analysis of social interaction in online teaching in open universities based on flipped classroom. Computational Intelligence and Neuroscience, Article ID 3089239.
Wang, Y., Stein, D., & Shen, S. (2021). Students’ and teachers’ perceived teaching presence in online courses. Distance Education, 42(3), 373–390.
Warren, A. N., & Lester, J. N. (2020). How teachers deliberate policy: Taking a stance on third grade reading legislation in online language teacher education. Linguistics and Education, 57, 100813.
Wu, Q. (2019). MOOC learning behavior analysis and teaching intelligent decision support method based on improved decision tree C4. 5 algorithm. International Journal of Emerging Technologies in Learning (Online), 14(12), 29.
**ng, W., Popov, V., Zhu, G., Horwitz, P., & McIntyre, C. (2019). The effects of transformative and non-transformative discourse on individual performance in collaborative-inquiry learning. Computers in Human Behavior, 98, 267–276.
Zeng, Z., Chaturvedi, S., & Bhat, S. (2017). Learner affect through the looking glass: Characterization and detection of confusion in online courses. In Proceedings of the 10th International Conference on Educational Data Mining.
Zhang, H., et al. (2020). A learning style classification approach based on deep belief network for large-scale online education. Journal of Cloud Computing, 9, 1–17.
Zhu, Y. (2020). The expectation of TikTok in international media: A critical discourse analysis. Open Journal of Social Sciences, 8(12), 136.
Zhu, M., Herring, S. C., & Bonk, C. J. (2019). Exploring presence in online learning through three forms of computer-mediated discourse analysis. Distance Education, 40(2), 205–225.
Funding
None.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Human and animal rights
This article does not contain any studies with human or animal subjects performed by any of the authors.
Informed consent
Informed consent does not apply as this was a retrospective review with no identifying patient information.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gothai, E., Saravanan, S., Selvan, C.T. et al. Optimized machine learning model discourse analysis. Educ Inf Technol (2024). https://doi.org/10.1007/s10639-024-12515-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10639-024-12515-3