Smart Pedagogy-Focused Design and Development of Innovative Curriculum for Data Cleaning

  • Conference paper
  • First Online:
Smart Education and e-Learning - Smart Pedagogy (SEEL-22 2022)

Abstract

Data cleaning is a complex, multi-step process of modifying input data to ensure that it is free of irrelevant, incorrect, duplicated and/or incomplete data. It accounts for about 60% of the work by data scientists. Data cleaning in companies, business and organizations requires 1) an availability of knowledgeable and experienced expert(s) in a designated area of business operation (for example, teaching/learning, enrollment, sales, manufacturing, etc.), 2) a specialized set of data cleaning tools for automatic and/or manual modification of data, and 3) well-defined data cleaning techniques/procedures and secure protocols of working with corporate/organizational data. These days it is important not just to teach a new subject or topic, but also to teach it in a smart way - using smart pedagogy - with the goal to maximize student learning outcomes. This paper presents the up-to-date findings and outcomes of a multi-aspect project in the Department of Computer Science and Information Systems, the Department of Education, Counseling and Leadership, and the InterLabs Research Institute in Bradley University (IL, USA) that is aimed to design, develop and test the innovative data cleaning curriculum based on corresponding features of smart pedagogy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 245.03
Price includes VAT (Germany)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 320.99
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info
Hardcover Book
EUR 320.99
Price includes VAT (Germany)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Gartner: Gartner Says Advanced Analytics Is a Top Business Priority. https://www.gartner.com/en/newsroom/press-releases/2014-10-21-gartner-says-advanced-analytics-is-a-top-business-priority

  2. Stevens, E.: The 7 Most Useful Data Analysis Methods and Techniques (2022). https://careerfoundry.com/en/blog/data-analytics/data-analysis-techniques/

  3. IBM Analytics: Working with Messy Data. https://developer.ibm.com/tutorials/ba-cleanse-process-visualize-data-set-1/

  4. Forbes: Cleaning Big Data. https://www.forbes.com/sites/gilpress/2016/03/23/data-preparation-most-time-consuming-least-enjoyable-data-science-task-survey-says/?sh=6aad0a716f63

  5. Tableau: Componenst of Quality Data. https://www.tableau.com/learn/articles/what-is-data-cleaning

  6. Uskov, V.L., Bakken, J., Ganapathi, K.S., Gayke, K., Galloway, B., Fatima, J.: Data cleaning and data visualization systems for learning analytics. In: Uskov, V.L., Howlett, R.J., Jain, L.C.L. (eds.) Smart Education and e-Learning 2020, pp. 183–197. Springer (2020). ISBN 978–981–15–5584–8, https://doi.org/10.1007/978-981-15-5584-8, https://www.springer.com/gp/book/9789811555831

  7. Verusen Marketing: The Death of the Data Cleanse. https://verusen.com/the-death-of-the-data-cleanse/

  8. University of Illinois at Urbana-Champaign: Theory and Practice of Data Cleaning course. https://ws.engr.illinois.edu/sitemanager/getfile.asp?id=506

  9. The Pennsylvania State University: Data Collection and Cleaning course. https://bulletins.psu.edu/search/?caturl=%2Fsearch&search=daan+822

  10. Cornell University: eCornell – Data Cleaning with the Tidyverse course. https://ecornell.cornell.edu/courses/data-science/data-cleaning-with-the-tidyverse/

  11. Harvard University: Data Science – Wrangling course, a part of the Professional Certificate in Data Science. https://pll.harvard.edu/course/data-science-wrangling?delta=2

  12. Stanford University: Python Data Cleaning course, a part of Technology Training. https://uit.stanford.edu/service/techtraining/class/python-data-cleaning

  13. University of Texas at Arlington: Texas: Data Cleaninj with OpenRefine and Cleaning Data in Python, a part of library services for studenst, faculty and tsaff. https://libguides.uta.edu/humanitiesdata/clean

  14. Coursera: (offered by Johns Hopkins University), Getting and Cleaning Data course. https://www.coursera.org/learn/data-cleaning

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vladimir L. Uskov .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Uskov, V.L. et al. (2022). Smart Pedagogy-Focused Design and Development of Innovative Curriculum for Data Cleaning. In: Uskov, V.L., Howlett, R.J., Jain, L.C. (eds) Smart Education and e-Learning - Smart Pedagogy. SEEL-22 2022. Smart Innovation, Systems and Technologies, vol 305. Springer, Singapore. https://doi.org/10.1007/978-981-19-3112-3_3

Download citation

Publish with us

Policies and ethics

Navigation