Discovering New Intents with Deep Aligned Clustering

  • Chapter
  • First Online:
Intent Recognition for Human-Machine Interactions

Abstract

Discovering new intents is a crucial task in dialogue systems. Most existing methods are limited in transferring the prior knowledge from known intents to new intents. These methods also have difficulties in providing high-quality supervised signals to learn clustering-friendly features for grou** unlabeled intents. In this work, we introduce an effective method (Deep Aligned Clustering) to discover new intents with the aid of limited known intent data. Firstly, by leveragin a few labeled known intent samples as prior knowledge to pre-train the model. Then, k-means is performed to produce cluster assignments as pseudo-labels. Moreover, an alignment strategy is proposed to tackle the label inconsistency problem during clustering assignments. Finally, the intent representations are learned under the supervision of the aligned pseudo-labels. With an unknown number of new intents, the number of intent categories is predicted by eliminating low-confidence intent-wise clusters. Extensive experiments on two benchmark datasets show that the method presented is more robust and achieves substantial improvements over the state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Perkins, H., Yang, Y.: Dialog intent induction with deep multi-view clustering. Proceedings of the 58th Conferenceon Empirical Methods in Natural Language Processing, pp. 4016–4025 (2019)

    Google Scholar 

  2. Min, Q.K, Qin, L.B., Teng, Z.Y., et al.: Dialogue State Induction Using Neural Latent Variable Models. Proceedings of the 29th International Joint Conference on Artificial Intelligence, pp. 3845–3852 (2020)

    Google Scholar 

  3. Vedula, N., Lipka, N., Maneriker, P., Parthasarathy, S.: Open intent extraction from natural language interactions. Proceedings of the 29th Web Conference, pp. 2009–2020 (2020)

    Google Scholar 

  4. Platt, J.: Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Adv. Large Marg. Classif. 10(3), 61–74 (1999)

    Google Scholar 

  5. Devlin, J., Chang, M.-W., Lee, K., et al.: BERT: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 17th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 4171–4186 (2019)

    Google Scholar 

  6. Han, K., Vedaldi, A., Zisserman, A.: Learning to discover novel visual categories via deep transfer clustering. Proceedings of the 16th International Conference on Computer Vision, pp. 8400–8408 (2019)

    Google Scholar 

  7. Caron, M., Bojanowski, P., Joulin, A., et al.: Deep clustering for unsupervised learning of visual features. Proceedings of the 15th International Conference on Computer Vision, pp. 132–149 (2018)

    Google Scholar 

  8. Zhan, X., **e, J., Liu, Z., et al.: Online deep clustering for unsupervised representation learning. Proceedings of the 43rd Institute of Electrical and Electronics Engineers Conference on Computer Vision and Pattern Recognition. pp. 6688–6697 (2020)

    Google Scholar 

  9. Kuhn, H.W.: The Hungarian method for the assignment problem. Naval Res. Logist. Quart. 2(1–2), 83–97 (1955)

    Article  MathSciNet  MATH  Google Scholar 

  10. Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)

    Article  MATH  Google Scholar 

  11. Larson, S., Mahendran, A., Peper, J.J., et al.: An evaluation dataset for intent classification and out-of-scope prediction. Proceedings of the 58th Conferenceon Empirical Methods in Natural Language Processing, pp. 1311–1316 (2019)

    Google Scholar 

  12. Casanueva, I., Temcinas, T., Gerz, D., et al.: Efficient intent detection with dual sentence encoders. Proceedings of the 2nd Workshop on Natural Language Processing for Conversational Artificial Intelligence, pp. 38–45 (2020)

    Google Scholar 

  13. MacQueen, J., et al. Some methods for classification and analysis of multivariate observations. Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)

    Google Scholar 

  14. Gowda, K.C., Krishna, G.: Agglomerative clustering using the concept of mutual nearest neighbourhood. Pattern Recogn. 10(2), 105–112 (1978)

    Article  MATH  Google Scholar 

  15. **e, J., Girshick, R., Farhadi, A.: Unsupervised deep embedding for clustering analysis. Proceedings of the 33rd International Conference on Machine Learning, pp. 478–487 (2016)

    Google Scholar 

  16. Schölkopf, B., Platt, J.C., Shawe-Taylor, J., et al.: Estimating the support of a high-dimensional distribution. Neural Comput. 13(7), 1443–1471 (2001)

    Article  MATH  Google Scholar 

  17. Chang, J., Wang, I., Meng, G., et al.: Deep adaptive image clustering. Proceedings of the 15th Institute of Electrical and Electronics Engineers International Conference on Computer Vision, pp. 5879–5887 (2017)

    Google Scholar 

  18. Pennington J, Socher R, Manning C.: Glove: Global vectors for word representation. Proceedings of the 19th Conference on Empirical Methods in Natural Language Processing, pp. 1532–1543 (2014)

    Google Scholar 

  19. Basu, S., Banerjee, A., Mooney, R.J.: Active semi-supervision for pairwise constrained clustering. Proceedings of the 6th Society for Industrial and Applied Mathematics International Conference on Data Mining, pp. 333–344 (2004)

    Google Scholar 

  20. Hsu, Y.C., Lv, Z., Kira, Z.: Learning to cluster in order to transfer across domains and tasks. Proceedings of the 6th International Conference on Learning Representations (2018)

    Google Scholar 

  21. Hsu, Y.-C., Lv, Z., Schlosser, J., Odom, P., Kira, Z.: Multi-class classification without multi-class labels. Proceedings of the 16th International Conference on Computer Vision (2019)

    Google Scholar 

  22. Lin, T.E., Xu, H.: Deep unknown intent detection with margin loss. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 5491–5496 (2019)

    Google Scholar 

  23. Thomas, W., Lysandre, D., Victor, S., et al.: Transformers: state-of-the-art natural language processing. Proceedings of the 26th Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Online. Association for Computational Linguistics, pp. 38–45 (2020)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Xu, H., Zhang, H., Lin, TE. (2023). Discovering New Intents with Deep Aligned Clustering. In: Intent Recognition for Human-Machine Interactions . SpringerBriefs in Computer Science. Springer, Singapore. https://doi.org/10.1007/978-981-99-3885-8_9

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-3885-8_9

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-3884-1

  • Online ISBN: 978-981-99-3885-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation