Chinese Chorales Dataset: A High-Quality Music Dataset for Score Generation

  • Conference paper
  • First Online:
Music Intelligence (SOMI 2023)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2007))

Included in the following conference series:

  • 114 Accesses

Abstract

For a long time, the JSB Chorales Dataset has served as the benchmark for choral composition generation, with numerous models and algorithms achieving remarkable results on this dataset, which is designed to generate Bach-style choral music. However, when we aim to tackle the task of generating Chinese vocal choral compositions, we encounter a lack of suitable Chinese music datasets for this purpose. The Chinese Chorales Dataset presented in this paper is a high-quality collection of Chinese choral music, comprising 125 Chinese choral songs stored in MusicXML format, divided into 441 musical segments. This dataset has been professionally crafted to meet the needs of Chinese composers seeking to create high-quality choral compositions. We also provide a compressed .npz file version containing pitch, fermata, tempo, and chord information, split into training, validation, and test sets. Additionally, we conducted multiple experiments on this dataset to validate the effectiveness of the information contained within. For access to the dataset and usage details, please visit https://github.com/123654ad/Chinese-Chorales-Dataset/tree/main.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Boulanger-Lewandowski, N., Bengio, Y., Vincent, P.: Modeling temporal dependencies in high-dimensional sequences: application to polyphonic music generation and transcription. ar**v preprint ar**v:1206.6392 (2012)

  2. Chen, K., Zhang, W., Dubnov, S., **a, G., Li, W.: The effect of explicit structure encoding of deep neural networks for symbolic music generation. In: 2019 International Workshop on Multilayer Music Representation and Processing (MMRP), pp. 77–84. IEEE (2019)

    Google Scholar 

  3. Cuthbert, M.S., Ariza, C.T.: music21: a toolkit for computer-aided musicology and symbolic music data. In: Proceedings of the 11th International Society for Music Information Retrieval Conference, ISMIR 2010, Utrecht, Netherlands, 9–13 August 2010. DBLP (2010)

    Google Scholar 

  4. Elowsson, A., Friberg, A.: Algorithmic composition of popular music. In: The 12th International Conference on Music Perception and Cognition and The 8th Triennial Conference of the European Society for The Cognitive Sciences of Music, pp. 276–285 (2012)

    Google Scholar 

  5. Gardner, J., Simon, I., Manilow, E., Hawthorne, C., Engel, J.: Mt3: multi-task multitrack music transcription. ar**v preprint ar**v:2111.03017 (2021)

  6. Hadjeres, G., Pachet, F., Nielsen, F.: DeepBach: a steerable model for Bach chorales generation. In: International Conference on Machine Learning, pp. 1362–1371. PMLR (2017)

    Google Scholar 

  7. Hernandez-Olivan, C., Beltran, J.R.: Music composition with deep learning: a review. In: Advances in Speech and Music Technology: Computational Aspects and Applications, pp. 25–50 (2022)

    Google Scholar 

  8. Hernandez-Olivan, C., Puyuelo, J.A., Beltran, J.R.: Subjective evaluation of deep learning models for symbolic music composition. ar**v preprint ar**v:2203.14641 (2022)

  9. Hernandez-Olivan, C., Zay Pinilla, I., Hernandez-Lopez, C., Beltran, J.R.: A comparison of deep learning methods for timbre analysis in polyphonic automatic music transcription. Electronics 10(7), 810 (2021)

    Article  Google Scholar 

  10. Ji, S., Luo, J., Yang, X.: A comprehensive survey on deep music generation: multi-level representations, algorithms, evaluations, and future directions. ar**v preprint ar**v:2011.06801 (2020)

  11. Liang, F.T., Gotham, M., Johnson, M., Shotton, J.: Automatic stylistic composition of Bach chorales with deep LSTM. In: ISMIR, pp. 449–456 (2017)

    Google Scholar 

  12. Manilow, E., Wichern, G., Seetharaman, P., Le Roux, J.: Cutting music source separation some Slakh: a dataset to study the impact of training data quality and quantity. In: 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 45–49. IEEE (2019)

    Google Scholar 

  13. Peracha, O.: Improving polyphonic music models with feature-rich encoding. ar**v preprint ar**v:1911.11775 (2019)

  14. Peracha, O.: JS fake chorales: a synthetic dataset of polyphonic music with human annotation. ar**v preprint ar**v:2107.10388 (2021)

  15. Raffel, C.: Learning-based methods for comparing sequences, with applications to audio-to-MIDI alignment and matching. Doctoral dissertation (2016)

    Google Scholar 

  16. Su, L.: Attend to chords: improving harmonic analysis of symbolic music using transformer-based models (2021)

    Google Scholar 

  17. Wang, Y., et al.: Opencpop: a high-quality open source Chinese popular song corpus for singing voice synthesis. ar**v preprint ar**v:2201.07429 (2022)

  18. Wang, Z., et al.: POP909: a pop-song dataset for music arrangement generation. ar**v preprint ar**v:2008.07142 (2020)

  19. Wu, S., Li, X., Sun, M.: Chord-conditioned melody harmonization with controllable harmonicity. In: ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1–5. IEEE (2023)

    Google Scholar 

  20. Zhou, J., Zhu, H., Wang, X.: Choir transformer: generating polyphonic music with relative attention on transformer. ar**v preprint ar**v:2308.02531 (2023)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhenyu Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Peng, Y., Zhang, L., Wang, Z. (2024). Chinese Chorales Dataset: A High-Quality Music Dataset for Score Generation. In: Li, X., Guan, X., Tie, Y., Zhang, X., Zhou, Q. (eds) Music Intelligence. SOMI 2023. Communications in Computer and Information Science, vol 2007. Springer, Singapore. https://doi.org/10.1007/978-981-97-0576-4_10

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-0576-4_10

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0575-7

  • Online ISBN: 978-981-97-0576-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation