deepGTTM-III: Multi-task Learning with Grouping and Metrical Structures

Hamanaka, Masatoshi; Hirata, Keiji; Tojo, Satoshi

doi:10.1007/978-3-030-01692-0_17

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11265)

Included in the following conference series:

International Symposium on Computer Music Multidisciplinary Research

Abstract

This paper describes an analyzer that simultaneously learns grouping and metrical structures on the basis of the generative theory of tonal music (GTTM) by using a deep learning technique. GTTM is composed of four modules arranged in series, and it includes a feedback loop in which an earlier module uses the result of a later module. However, because each module has been implemented independently in previous GTTM analyzers, those analyzers did not form a feedback loop. For example, deepGTTM-I and deepGTTM-II each use a deep learning technique but learn grouping and metrical structures independently of each other. In light of this, we present deepGTTM-III, a new analyzer that incorporates the concept of feedback and enables simultaneous learning of grouping and metrical structures by integrating the deepGTTM-I and deepGTTM-II networks. The experimental results showed that deepGTTM-III outperformed deepGTTM-I and performed similarly to deepGTTM-II.
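The idea of learning two structures at once can be illustrated with a generic multi-task network. The following is a minimal sketch, not the architecture of deepGTTM-III, which the abstract does not specify. It assumes a single shared hidden layer feeding two sigmoid output heads (one for grouping boundaries, one for metrical strength) and a weighted sum of two binary cross-entropy losses. All names (`init_params`, `forward`, `joint_loss`, `weight`) and all layer sizes are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def init_params(n_in, n_hidden, n_group, n_metric, seed=0):
    # Hypothetical layout: one shared layer plus one weight matrix per task.
    rng = np.random.default_rng(seed)
    return {
        "W_shared": rng.normal(0.0, 0.1, (n_in, n_hidden)),
        "W_group": rng.normal(0.0, 0.1, (n_hidden, n_group)),
        "W_metric": rng.normal(0.0, 0.1, (n_hidden, n_metric)),
    }

def forward(params, x):
    # Both heads read the same hidden representation, so training either
    # task also changes the features the other task sees.
    h = sigmoid(x @ params["W_shared"])
    p_group = sigmoid(h @ params["W_group"])
    p_metric = sigmoid(h @ params["W_metric"])
    return p_group, p_metric

def bce(p, y, eps=1e-9):
    # Mean binary cross-entropy; eps guards against log(0).
    return float(-np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps)))

def joint_loss(params, x, y_group, y_metric, weight=0.5):
    # Assumed multi-task objective: a weighted sum of the two task losses.
    p_group, p_metric = forward(params, x)
    return weight * bce(p_group, y_group) + (1 - weight) * bce(p_metric, y_metric)

# Usage with made-up data: 4 note transitions, 8 input features each.
params = init_params(n_in=8, n_hidden=16, n_group=1, n_metric=1)
x = np.zeros((4, 8))
loss = joint_loss(params, x, np.ones((4, 1)), np.zeros((4, 1)))
```

Minimizing `joint_loss` with respect to all three weight matrices would let errors on either task adjust the shared layer, which is one common way to couple two otherwise independent learners.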

Acknowledgments

This work was supported by JSPS KAKENHI Grant Numbers 17H01847, 25700036, 16H01744, and 23500145.

Author information

Authors and Affiliations

RIKEN, Tokyo, Japan
Masatoshi Hamanaka
Future University Hakodate, Hakodate, Japan
Keiji Hirata
JAIST, Nomi, Japan
Satoshi Tojo

Corresponding author

Correspondence to Masatoshi Hamanaka.

Editor information

Editors and Affiliations

Laboratoire PRISM, AMU-CNRS, Marseille, France
Mitsuko Aramaki
INESC TEC, Porto, Portugal
Matthew E. P. Davies
Laboratoire PRISM, AMU-CNRS, Marseille, France
Richard Kronland-Martinet
Laboratoire PRISM, AMU-CNRS, Marseille, France
Sølvi Ystad

About this paper

Cite this paper

Hamanaka, M., Hirata, K., Tojo, S. (2018). deepGTTM-III: Multi-task Learning with Grouping and Metrical Structures. In: Aramaki, M., Davies, M., Kronland-Martinet, R., Ystad, S. (eds) Music Technology with Swing. CMMR 2017. Lecture Notes in Computer Science, vol 11265. Springer, Cham. https://doi.org/10.1007/978-3-030-01692-0_17

DOI: https://doi.org/10.1007/978-3-030-01692-0_17
Published: 24 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01691-3
Online ISBN: 978-3-030-01692-0
eBook Packages: Computer Science, Computer Science (R0)
