Chinese Chorales Dataset: A High-Quality Music Dataset for Score Generation

Peng, Yongjie; Zhang, Lei; Wang, Zhenyu

doi:10.1007/978-981-97-0576-4_10

Yongjie Peng¹⁰,
Lei Zhang¹¹ &
Zhenyu Wang¹⁰

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2007))

Included in the following conference series:

Summit on Music Intelligence

114 Accesses

Abstract

For a long time, the JSB Chorales Dataset has served as the benchmark for choral composition generation, with numerous models and algorithms achieving remarkable results on this dataset, which is designed to generate Bach-style choral music. However, when we aim to tackle the task of generating Chinese vocal choral compositions, we encounter a lack of suitable Chinese music datasets for this purpose. The Chinese Chorales Dataset presented in this paper is a high-quality collection of Chinese choral music, comprising 125 Chinese choral songs stored in MusicXML format, divided into 441 musical segments. This dataset has been professionally crafted to meet the needs of Chinese composers seeking to create high-quality choral compositions. We also provide a compressed .npz file version containing pitch, fermata, tempo, and chord information, split into training, validation, and test sets. Additionally, we conducted multiple experiments on this dataset to validate the effectiveness of the information contained within. For access to the dataset and usage details, please visit https://github.com/123654ad/Chinese-Chorales-Dataset/tree/main.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Softcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

MUSIB: musical score inpainting benchmark

Article Open access 05 May 2023

MUSICNTWRK: Data Tools for Music Theory, Analysis and Composition

A Transformer Based Pitch Sequence Autoencoder with MIDI Augmentation

References

Boulanger-Lewandowski, N., Bengio, Y., Vincent, P.: Modeling temporal dependencies in high-dimensional sequences: application to polyphonic music generation and transcription. ar**v preprint ar**v:1206.6392 (2012)
Chen, K., Zhang, W., Dubnov, S., **a, G., Li, W.: The effect of explicit structure encoding of deep neural networks for symbolic music generation. In: 2019 International Workshop on Multilayer Music Representation and Processing (MMRP), pp. 77–84. IEEE (2019)
Google Scholar
Cuthbert, M.S., Ariza, C.T.: music21: a toolkit for computer-aided musicology and symbolic music data. In: Proceedings of the 11th International Society for Music Information Retrieval Conference, ISMIR 2010, Utrecht, Netherlands, 9–13 August 2010. DBLP (2010)
Google Scholar
Elowsson, A., Friberg, A.: Algorithmic composition of popular music. In: The 12th International Conference on Music Perception and Cognition and The 8th Triennial Conference of the European Society for The Cognitive Sciences of Music, pp. 276–285 (2012)
Google Scholar
Gardner, J., Simon, I., Manilow, E., Hawthorne, C., Engel, J.: Mt3: multi-task multitrack music transcription. ar**v preprint ar**v:2111.03017 (2021)
Hadjeres, G., Pachet, F., Nielsen, F.: DeepBach: a steerable model for Bach chorales generation. In: International Conference on Machine Learning, pp. 1362–1371. PMLR (2017)
Google Scholar
Hernandez-Olivan, C., Beltran, J.R.: Music composition with deep learning: a review. In: Advances in Speech and Music Technology: Computational Aspects and Applications, pp. 25–50 (2022)
Google Scholar
Hernandez-Olivan, C., Puyuelo, J.A., Beltran, J.R.: Subjective evaluation of deep learning models for symbolic music composition. ar**v preprint ar**v:2203.14641 (2022)
Hernandez-Olivan, C., Zay Pinilla, I., Hernandez-Lopez, C., Beltran, J.R.: A comparison of deep learning methods for timbre analysis in polyphonic automatic music transcription. Electronics 10(7), 810 (2021)
Article Google Scholar
Ji, S., Luo, J., Yang, X.: A comprehensive survey on deep music generation: multi-level representations, algorithms, evaluations, and future directions. ar**v preprint ar**v:2011.06801 (2020)
Liang, F.T., Gotham, M., Johnson, M., Shotton, J.: Automatic stylistic composition of Bach chorales with deep LSTM. In: ISMIR, pp. 449–456 (2017)
Google Scholar
Manilow, E., Wichern, G., Seetharaman, P., Le Roux, J.: Cutting music source separation some Slakh: a dataset to study the impact of training data quality and quantity. In: 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 45–49. IEEE (2019)
Google Scholar
Peracha, O.: Improving polyphonic music models with feature-rich encoding. ar**v preprint ar**v:1911.11775 (2019)
Peracha, O.: JS fake chorales: a synthetic dataset of polyphonic music with human annotation. ar**v preprint ar**v:2107.10388 (2021)
Raffel, C.: Learning-based methods for comparing sequences, with applications to audio-to-MIDI alignment and matching. Doctoral dissertation (2016)
Google Scholar
Su, L.: Attend to chords: improving harmonic analysis of symbolic music using transformer-based models (2021)
Google Scholar
Wang, Y., et al.: Opencpop: a high-quality open source Chinese popular song corpus for singing voice synthesis. ar**v preprint ar**v:2201.07429 (2022)
Wang, Z., et al.: POP909: a pop-song dataset for music arrangement generation. ar**v preprint ar**v:2008.07142 (2020)
Wu, S., Li, X., Sun, M.: Chord-conditioned melody harmonization with controllable harmonicity. In: ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1–5. IEEE (2023)
Google Scholar
Zhou, J., Zhu, H., Wang, X.: Choir transformer: generating polyphonic music with relative attention on transformer. ar**v preprint ar**v:2308.02531 (2023)

Download references

Author information

Authors and Affiliations

The School of Control and Computer Engineering, North China Electric Power University, Bei**g, China
Yongjie Peng & Zhenyu Wang
Bei**g National Day School-Longyue Experimental Middle School, Bei**g, China
Lei Zhang

Authors

Yongjie Peng
View author publications
You can also search for this author in PubMed Google Scholar
Lei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhenyu Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhenyu Wang .

Editor information

Editors and Affiliations

Central Conservatory of Music, Bei**g, China
**aobing Li
**’an Jiaotong University, **’an, China
**aohong Guan
Zhengzhou University, Zhengzhou, China
Yun Tie
Central Conservatory of Music, Bei**g, China
**nran Zhang
Central Conservatory of Music, Bei**g, China
Qingwen Zhou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Peng, Y., Zhang, L., Wang, Z. (2024). Chinese Chorales Dataset: A High-Quality Music Dataset for Score Generation. In: Li, X., Guan, X., Tie, Y., Zhang, X., Zhou, Q. (eds) Music Intelligence. SOMI 2023. Communications in Computer and Information Science, vol 2007. Springer, Singapore. https://doi.org/10.1007/978-981-97-0576-4_10

Download citation

DOI: https://doi.org/10.1007/978-981-97-0576-4_10
Published: 04 February 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0575-7
Online ISBN: 978-981-97-0576-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Chinese Chorales Dataset: A High-Quality Music Dataset for Score Generation

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

MUSIB: musical score inpainting benchmark

MUSICNTWRK: Data Tools for Music Theory, Analysis and Composition

A Transformer Based Pitch Sequence Autoencoder with MIDI Augmentation

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Chinese Chorales Dataset: A High-Quality Music Dataset for Score Generation

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

MUSIB: musical score inpainting benchmark

MUSICNTWRK: Data Tools for Music Theory, Analysis and Composition

A Transformer Based Pitch Sequence Autoencoder with MIDI Augmentation

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation