Exploiting the Duality of Maximal Frequent Itemsets and Minimal Infrequent Itemsets for I/O Efficient Association Rule Mining

  • Conference paper
  • First Online:
Database and Expert Systems Applications (DEXA 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1873))

Included in the following conference series:

Abstract

Any algorithm for mining association rules must discover the set of all maximal frequent itemsets (maxL) from a database. Given a set of itemsets X, to verify that X is maxL, two conditions must be checked: (1) any itemset x in X is frequent, and (2) the dual of X must be the set of all minimal infrequent itemsets (minS). This observation leads us to a family of algorithms for mining association rules. Given a reasonable guess of minS and maxL, we verify their duality relationship, and refine the two sets until the above two conditions hold. We note that previously proposed algorithms such as Apriori and Pincer-Search are all members of our algorithm family. Also, we study a member algorithm called FlipFlop. Through a series of experiments, we show that FlipFlop significantly reduces the I/O requirement of mining association rules.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Charu C. Aggarwal and Philip S. Yu. Data mining techniques for associations, clustering and classification. In Proc. of the 3rd PAKDD Conference, Bei**g, 1999.

    Google Scholar 

  2. R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proc. ACM SIGMOD, Washington, D.C., May 1993.

    Google Scholar 

  3. R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In Proc. of the 20th VLDB Conference, Santiago, Chile, 1994.

    Google Scholar 

  4. Alex A. Freitas. On objective measures of rule surprisingness. In Proc. of the 2nd PKDD Conference, Nantes, France, September 1998.

    Google Scholar 

  5. Christian Hidber. Online association rule mining. In Proc. of ACM SIGMOD International Conference on Management of Data, Philadephia, May 1999.

    Google Scholar 

  6. Dao-I Lin and Zvi M. Kedem. Pincer-search: A new algorithm for discovering the maximum frequent set. In Proc. of the 6th EDBT Conference, 1998.

    Google Scholar 

  7. K.K. Loo, C.L. Yip, B. Kao, and D. Cheung. Exploiting the duality of maximal frequent itemsets and minimal infrequent itemsets for I/O efficient association rule mining. Technical report TR-2000-03, Dept. of CSIS, HKU, 2000.

    Google Scholar 

  8. Jong Soo Park, Ming-Syan Chen, and Philip S. Yu. An effiective hash-based algorithm for mining association rules. In Proc. ACM SIGMOD, California, 1995.

    Google Scholar 

  9. Hannu Toivonen. Sampling large databases for association rules. In Proc. of the 22th VLDB Conference, Bombay, India, September 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Loo, K.K., Lap, Y.C., Kao, B., Cheung, D. (2000). Exploiting the Duality of Maximal Frequent Itemsets and Minimal Infrequent Itemsets for I/O Efficient Association Rule Mining. In: Ibrahim, M., Küng, J., Revell, N. (eds) Database and Expert Systems Applications. DEXA 2000. Lecture Notes in Computer Science, vol 1873. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44469-6_66

Download citation

  • DOI: https://doi.org/10.1007/3-540-44469-6_66

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67978-3

  • Online ISBN: 978-3-540-44469-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Navigation