Text Classification for Mining Massive Aviation Inspection Reports

Liu, Regina Y.; Madigan, David; Eyheramendy, Susana

doi:10.1007/978-3-0348-8201-9_31

Part of the book series: Statistics for Industry and Technology (SIT)

Abstract

Massive numbers of aviation inspection reports are collected each year in the USA. These reports record findings from aviation surveillance inspections as well as accident or incident investigations. The goal of this paper is to apply text classification to the mining of these reports, and to show that text classification methodology can be a critical element of an aviation safety decision support system. The performance of several text classification models is evaluated in the context of mining aviation inspection reports, in terms of misclassification rates. Further breakdowns of the misclassification rates and related findings from the dataset suggest ways of improving data quality and of gathering information that is more pertinent for filing inspection reports.

Author information

Authors and Affiliations

Department of Statistics, Rutgers University, Hill Center, Piscataway, NJ, 08854-8019, USA
Regina Y. Liu, David Madigan & Susana Eyheramendy

Editor information

Editors and Affiliations

Statistics Group, University of Neuchâtel, P.O. Box 805, CH-2002, Neuchâtel, Switzerland
Yadolah Dodge (Prof. of Statistics and Operation Research)

About this paper

Cite this paper

Liu, R.Y., Madigan, D., Eyheramendy, S. (2002). Text Classification for Mining Massive Aviation Inspection Reports. In: Dodge, Y. (eds) Statistical Data Analysis Based on the L₁-Norm and Related Methods. Statistics for Industry and Technology. Birkhäuser, Basel. https://doi.org/10.1007/978-3-0348-8201-9_31

DOI: https://doi.org/10.1007/978-3-0348-8201-9_31
Publisher Name: Birkhäuser, Basel
Print ISBN: 978-3-0348-9472-2
Online ISBN: 978-3-0348-8201-9
eBook Packages: Springer Book Archive
