Text Classification for Mining Massive Aviation Inspection Reports

  • Conference paper
Statistical Data Analysis Based on the L1-Norm and Related Methods

Part of the book series: Statistics for Industry and Technology ((SIT))

  • 848 Accesses

Abstract

There are massive numbers of aviation inspection reports collected each year in the USA. These reports record findings from aviation surveillance inspections as well as accident or incident investigations. The goal of this paper is to apply text classification to the mining of these reports, and to show that the text classification methodology can be a critical element of the aviation safety decision support system. The performances of several text classification models are evaluated in the context of mining aviation inspection reports. The evaluation is given in terms of misclassification rates. Further breakdowns of the misclassification rates and related findings from the dataset suggest ways for improving data quality and for gathering information which are more pertinent for filing inspection reports.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 42.79
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 53.49
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. A. Cheng, R. Liu, and J. Luxhøj. Monitoring multivariate aviation safety data by data depth: control charts and threshold systems. HE Transactions on Operations Engineering 32 (2000), 861–872.

    Google Scholar 

  2. GAO (United States General Accounting Office). AVIATION SAFETY: Weakness in Inspection and Enforcement Limit FAA in Identifying and Responding to Risks. (1998) GAO/RCED-98-6.

    Google Scholar 

  3. T. Hastie, R. Tibshirani and J. Friedman. The Elements of Statistical Learning, Data Mining, Inference, and Prediction. (2001), Springer.

    Google Scholar 

  4. D. Lewis. Naive (Bayes) at forty: the independence assumption in information retrieval. In ECML ′98: Tenth European Conference on Machine Learning (1998), 4-15.

    Google Scholar 

  5. R. Liu. BootQC: Bootstrap for Robust Analysis of Aviation Safety Data. Developments in Robust Statistics, ICOR 2001, ed. R. Dutter, P. Filzmoser, U. Gather, and P. Rousseeuw. Springer, Heidelberg, (2001) press.

    Google Scholar 

  6. D. Madigan, H. Ju and Y. Vardi. On the naive Bayes model for text classification. (2002) Technical report, Dept. of Statistics, Rutgers University.

    Google Scholar 

  7. A. McCallum, and K. Nigam. A comparison of event models for naive Bayes text classification. In Proceedings of the AAAI-98 Workshop on Machine Learning for Text Categorization, (1998).

    Google Scholar 

  8. A. Ng and M. Jordan. On Discriminative vs. Generative classifiers: A comparison of logistic regression and naive Bayes. Advances in Neural Information Processing Systems, 14 (2001).

    Google Scholar 

  9. R. Schapire and Y. Singer. A boosting-based system for the text categorization. Machine Learning, 39(2/3) (2000), 135–168.

    Article  MATH  Google Scholar 

  10. D. Spiegelhalter and R. Knill-Jones. Statistical and knowledge based approaches to clinical decision support systems, with an application in gastroenterology (with discussion). Journal of the Royal Statistical Society — Ser. A147 (1984), 35–77

    Article  MATH  Google Scholar 

  11. Y. Yang and X. Liu. A re-examination of text categorization methods. Proceedings of the 22nd ACM SIGIR Conference on Research and Development in Information Retrieval (1999), 42-49.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer Basel AG

About this paper

Cite this paper

Liu, R.Y., Madigan, D., Eyheramendy, S. (2002). Text Classification for Mining Massive Aviation Inspection Reports. In: Dodge, Y. (eds) Statistical Data Analysis Based on the L1-Norm and Related Methods. Statistics for Industry and Technology. Birkhäuser, Basel. https://doi.org/10.1007/978-3-0348-8201-9_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-0348-8201-9_31

  • Publisher Name: Birkhäuser, Basel

  • Print ISBN: 978-3-0348-9472-2

  • Online ISBN: 978-3-0348-8201-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Navigation