Abstract
Data cleaning is a complex, multi-step process of modifying input data to ensure that it is free of irrelevant, incorrect, duplicated and/or incomplete data. It accounts for about 60% of the work by data scientists. Data cleaning in companies, business and organizations requires 1) an availability of knowledgeable and experienced expert(s) in a designated area of business operation (for example, teaching/learning, enrollment, sales, manufacturing, etc.), 2) a specialized set of data cleaning tools for automatic and/or manual modification of data, and 3) well-defined data cleaning techniques/procedures and secure protocols of working with corporate/organizational data. These days it is important not just to teach a new subject or topic, but also to teach it in a smart way - using smart pedagogy - with the goal to maximize student learning outcomes. This paper presents the up-to-date findings and outcomes of a multi-aspect project in the Department of Computer Science and Information Systems, the Department of Education, Counseling and Leadership, and the InterLabs Research Institute in Bradley University (IL, USA) that is aimed to design, develop and test the innovative data cleaning curriculum based on corresponding features of smart pedagogy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Gartner: Gartner Says Advanced Analytics Is a Top Business Priority. https://www.gartner.com/en/newsroom/press-releases/2014-10-21-gartner-says-advanced-analytics-is-a-top-business-priority
Stevens, E.: The 7 Most Useful Data Analysis Methods and Techniques (2022). https://careerfoundry.com/en/blog/data-analytics/data-analysis-techniques/
IBM Analytics: Working with Messy Data. https://developer.ibm.com/tutorials/ba-cleanse-process-visualize-data-set-1/
Forbes: Cleaning Big Data. https://www.forbes.com/sites/gilpress/2016/03/23/data-preparation-most-time-consuming-least-enjoyable-data-science-task-survey-says/?sh=6aad0a716f63
Tableau: Componenst of Quality Data. https://www.tableau.com/learn/articles/what-is-data-cleaning
Uskov, V.L., Bakken, J., Ganapathi, K.S., Gayke, K., Galloway, B., Fatima, J.: Data cleaning and data visualization systems for learning analytics. In: Uskov, V.L., Howlett, R.J., Jain, L.C.L. (eds.) Smart Education and e-Learning 2020, pp. 183–197. Springer (2020). ISBN 978–981–15–5584–8, https://doi.org/10.1007/978-981-15-5584-8, https://www.springer.com/gp/book/9789811555831
Verusen Marketing: The Death of the Data Cleanse. https://verusen.com/the-death-of-the-data-cleanse/
University of Illinois at Urbana-Champaign: Theory and Practice of Data Cleaning course. https://ws.engr.illinois.edu/sitemanager/getfile.asp?id=506
The Pennsylvania State University: Data Collection and Cleaning course. https://bulletins.psu.edu/search/?caturl=%2Fsearch&search=daan+822
Cornell University: eCornell – Data Cleaning with the Tidyverse course. https://ecornell.cornell.edu/courses/data-science/data-cleaning-with-the-tidyverse/
Harvard University: Data Science – Wrangling course, a part of the Professional Certificate in Data Science. https://pll.harvard.edu/course/data-science-wrangling?delta=2
Stanford University: Python Data Cleaning course, a part of Technology Training. https://uit.stanford.edu/service/techtraining/class/python-data-cleaning
University of Texas at Arlington: Texas: Data Cleaninj with OpenRefine and Cleaning Data in Python, a part of library services for studenst, faculty and tsaff. https://libguides.uta.edu/humanitiesdata/clean
Coursera: (offered by Johns Hopkins University), Getting and Cleaning Data course. https://www.coursera.org/learn/data-cleaning
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Uskov, V.L. et al. (2022). Smart Pedagogy-Focused Design and Development of Innovative Curriculum for Data Cleaning. In: Uskov, V.L., Howlett, R.J., Jain, L.C. (eds) Smart Education and e-Learning - Smart Pedagogy. SEEL-22 2022. Smart Innovation, Systems and Technologies, vol 305. Springer, Singapore. https://doi.org/10.1007/978-981-19-3112-3_3
Download citation
DOI: https://doi.org/10.1007/978-981-19-3112-3_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-3111-6
Online ISBN: 978-981-19-3112-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)