Introduction

Intangible cultural heritage (ICH) is a crucial component of human culture, representing the accumulation of human civilization and wisdom. In 2003, UNESCO adopted the Convention for the Safeguarding of the ICH, which greatly contributed to the protection and record of ICH in the world [1]. After two decades of continuous exploration and efforts, ICH protection work has achieved fruitful results, including a series of word-level protection items, such as Chinese Kunqu, Spanish Flamenco, and Indian Kutiyattam. As of December 2023, the UNESCO Representative List of the ICH of Humanity included 730 heritage projects from 145 different countries [2]. Minnan nursery rhymes (MNRs), created as songs for children, are recognized as an artistic form within the “oral traditions and expressions” domain of UNESCO’s 2003 Convention for the Safeguarding of the ICH [1]. MNRs are categorized into this domain as they are traditional forms of oral literature, usually specific to a culture, region, or language group. As a crucial part of cultural traditions, MNRs are transmitted from generation to generation, typically through informal ways such as family members, educational settings, community events, or various media platforms.

MNRs encompass a rich and enduring legacy that spans from the Tang Dynasty to the contemporary era. Local historical classics from the Tang Dynasty (618–907) documented a nursery rhyme called “Yue guang guang,” which celebrated the beauty of moonlight [3, 40,41,42], as well as internet sources. The MNRs edited by experts in the field of Minnan culture ensure the accuracy and reliability of the data. Moreover, internet sources can provide access to a broader range of MNRs, which helps explore variations, interpretations, and even uncover lesser-known MNRs that might not be included in edited collections. After manually selecting and removing duplicates, we have obtained a total of 617 traditional MNRs and 289 modern MNRs. The MNRs are categorize into different content types by domain experts, and Table 1 shows five of them.

Table 1 The dataset of MNRs

Methodology

Overall framework

As shown in Fig. 1, our approach comprises three main parts: data pre-processing, lyrical term analysis, and complex network analysis. In the data pre-processing step, we sequentially process the lyrics of MNRs through text segmentation and the TF-IDF (term frequency-inverse document frequenc) method. Subsequently, we utilize techniques such as word cloud generation and information entropy to analyze the characteristics of lyrical terms. Finally, in the complex network analysis phase, we define various measures, including network characteristic indicators, degree distribution, core-periphery analysis, and community detection.

Fig. 1
figure 1

Research framework

Data pre-processing

We outline tailored processing steps for the lyrics of MNRs, as specific characteristics exist in the lyrical texts, including a mixture of written Chinese (both simplified and traditional) and written Hokkien (a written form of the Minnan language).

  1. (1)

    Language Standardization: This step standardizes the MNRs dataset by converting all lyric texts into simplified Chinese, which ensures consistency and ease of processing.

  2. (2)

    Removing Irrelevant Characters: By removing irrelevant characters such as special characters and punctuation, the lyrical text is streamlined, and further steps can focus solely on elements contributing to understanding the lyrics.

  3. (3)

    Custom Dictionary: Traditional text segmentation methods might not accurately handle non-Chinese texts, particularly those in the Minnan language. Therefore, we create an integrated Chinese-Minnan dictionary to allow for more accurate tokenization.

  4. (4)

    Lyrics Segmentation and Post-processing: The LTP (Language Technology Platform) is employed to segment the lyrics into meaningful units. To enhance the accuracy of the segmented lyrical terms, the post-processing, like filtering out noise, irrelevant tokens, and correcting segmentation errors, is applied to refine the segmented tokenization.

  5. (5)

    Keyword Extraction: By calculating the TF-IDF values, the critical terms of lyrics are identified based on the obtained TF-IDF values.

Lyrical term analysis method

Word clouds

After the pre-processing algorithm TF-IDF on assessing the significance of terms in section 4.2, the word cloud technology is applied to discern main themes or trends of the MNRs’ lyrics based on the obtained TF-IDF values. Subsequently, these lyrical terms are visually represented in a cloud diagram, with their sizes adjusted proportionally to their significance.

Emotion entropy

A song typically conveys either positive or negative emotions. Instead of discussing the emotional tendency of a rhyme, we delve into the emotional preference of words extracted in 4.2. The emotion category of a word is usually context-dependent, and it may have the opposite emotion category in different contexts [43, 44]. Furthermore, in the MNRs, many lyrical words are not included in traditional categories of emotions, and they do not express direct positive or negative feelings, like the negative word “anger” and the positive word “amusement”. In this study, the emotions of lyrical words are determined by deep-learning technology and experts. Specifically, a supervised-learning based skip-gram model is employed to train word vectors based on large-scale text. Triples (a word/phrase, sentence in which word/phrase is located, sentiment category of the word/phrase) are used as the training samples of the model. The training set is obtained by integrating the NTUSD,Footnote 1 HowNetFootnote 2 dictionaries, and experts-designed dictionaries. The emotion category of lyrical words are automatically predicted by the trained skip-gram model. Crucially, the predicted sentiment categories are then verified by domain experts, ensuring the credibility of the predictions.

To tackle the issue of a lyrical word potentially having contrasting emotional classes in different lyrical sentences, we define the concept of “emotion entropy” to describe the emotional preference of a word. Specifically, emotion entropy quantifies the degree to which a word is associated with a specific (pure) emotion-positive or negative-instead of an ambiguous emotional connotation [45]. We define two possible emotional states of a word \((x)\) as positive (\(x_{1}\)) and negative (\(x_{2}\)), and \(p(x_1)\) is the probability that the word \(x\) conveys a positive emotion. In contrast, \(p(x_2)\) is the probability that the word conveys a negative emotion. Therefore, the emotion entropy \(H(x)\) of the word \((x)\) could be defined as formula 1 by using the Shannon entropy formula [46]. It is noticeable that the emotions (positive or negative) of some words are labeled by a deep neural network model (Long short-term memory neural network) and validated by experts.

$$\begin{aligned} H(x) = -p(x_1) \log _2 p(x_1) - p(x_2) \log _2 p(x_2) \end{aligned}$$
(1)

As shown in formula 1, a higher value of emotion entropy indicates that the emotional connotation of a word is more uncertain or unpredictable, suggesting it is less emotionally pure. Conversely, a word with a lower value indicates a more precise emotional connotation or higher “emotion purity,” so the probability of this word conveying either positive or negative emotion is more heavily inclined toward one particular emotion.

Complex network analysis method

We introduce the construction of MNR networks in section 4.4.1 and provide measures of complex networks in section 4.4.2.

The construction of MNR networks

In this section, we construct a Minnan nursery rhyme network (abbreviated as MNR network) by representing lyrical keywords as network nodes and capturing the co-occurrence relationships between nodes as edges. The processes involved in constructing and analyzing the MNR networks are listed as follows: (1) Building the Vocabulary: A vocabulary is created, comprising all unique terms extracted in section 4.2. This step aims to eliminate duplicates and organize the vocabulary in a structured manner for constructing a co-occurrence matrix.

(2) Co-occurrence matrix generation: The co-occurrence matrix is constructed based on the frequency of occurrence of each keyword pair within the lyrics. Each row and column of the matrix corresponds to a unique keyword from the vocabulary. The value at the intersection of a row and column represents the frequency with which the corresponding keywords co-occur within the nursery rhymes.

(3) MNR network construction: Each unique keyword is assigned as a node, and the co-occurrence relationships between keywords are depicted as edges. Whenever two keywords appear together in a nursery rhyme, they are regarded as having a co-occurring relation. The weight of an edge is determined by the frequency of their co-occurrence.

The definition of network measures

Overall network characteristic indicators

We explore the overall network characteristics of the MNR networks by indicators: network density, average clustering coefficient, and average path length. Analyzing these indicators allows us to discern patterns, trends, and emergent properties within the network, enriching our understanding of linguistic and cultural contexts within MNRs. We assume \(N\) and \(E\) denote the number of nodes and edges for simplifying the following definitions.

A. Network density Network density is to measure the level of connection density among nodes, with higher values indicating denser connections [47]. It can be quantified by the equation 2, where \(\rho\) represents the network density.

$$\begin{aligned} \rho =\frac{2E}{N(N-1)} \end{aligned}$$
(2)

B. Average clustering coefficient The average clustering coefficient quantifies the tendency of nodes to form clusters or groups within a network, with values ranging from 0 to 1 [48]. A higher coefficient signifies closer connections between nodes and a more clustered network. This indicator is commonly used to analyze network community structure, evolution processes, and randomness. The definition of the average clustering coefficient \(C\) is provided in the formula 3, where the \(c_i\) represents the clustering coefficient of node \(i\).

$$\begin{aligned} C = \frac{1}{N} \sum _{i=1}^{N} c_i \end{aligned}$$
(3)

C. Average path length The average path length of networks is employed to estimate the average efficiency of information delivery in a network [49]. As defined in the formula 4, the average path length \(L\) is calculated as the average number of shortest path lengths along the shortest paths for all possible pairs of network nodes. The symbol \(d_{ij}\) in the formula 4 denotes the shortest path length between node \(i\) and node \(j\).

$$\begin{aligned} L = \frac{1}{N(N-1)/2} \sum _{i \ne j} d_{ij} \end{aligned}$$
(4)
Individual network characteristic indicators

In network theory, individual network characteristic indicators provide valuable information about individual nodes and their roles within a network [50], allowing for a better understanding of the behavior and functionality of nodes. In the following measures, we denote the adjacency matrix and the weight matrix of a network as \(\textbf{A}\) and \(\textbf{W}\).

A. Degree centrality The degree of a node refers to the number of connections that a node has in a network, where a higher value indicates its prominence and influence within the network [51,52,53]. The node degree is defined as the formula 5. The element \(A_{ij}\) of the adjacency matrix \(\textbf{A}\) is defined as 0 if there is no link between node \(i\) and node \(j\), and \(A_{ij} = 1\) if a link exists between them.

$$\begin{aligned} D_i = \sum _{j=1}^{N} A_{ij} \end{aligned}$$
(5)

B. Weighted degree centrality The weighted degree is a variant of the degree centrality [51,52,53], as defined in formula 6. It takes into account the weights associated with the edges between nodes, which \(W_{i}\) is the weighted degree of the node \(i\), and \(\omega _{ij}\) represents the weight of the node between \(i\) and \(j\).

$$\begin{aligned} W_i = \sum _{j=1}^{N} \omega _{ij} \end{aligned}$$
(6)

C. Betweenness centrality The betweenness centrality measures the extent to which a node lies on paths between other nodes in a network. Nodes with high betweenness centrality act as crucial bridges or connectors, facilitating the flow of information and resources between different parts of the network [51,52,53]. The formula 7 shows how to calculate betweenness centrality, where \(N_{st}\) represents the number of shortest paths from node \(s\) to \(t\), and \(N_{st}(i)\) denotes the number of shortest paths passing through node \(i\).

$$\begin{aligned} S_i = \sum _{s \ne i \ne t} \frac{N_{st}(i)}{N_{st}} \end{aligned}$$
(7)

D. Closeness centrality Closeness centrality detects how close a node is to all other nodes in the network [51,52,53]. The closeness centrality of a node in formula 8 is quantified as the average of the shortest path length from the node to other nodes in the network. Nodes with high closeness centrality tend to have shorter average path lengths to other nodes, allowing for efficient information dissemination and quick access to resources. The variable \(B_{i}\) represents closeness centrality, where \(N\) is the total number of nodes and \(\sum _{j=1}^N d_{ij}\) represents the distance from node \(i\) to all other nodes.

$$\begin{aligned} B_i = \frac{N-1}{\sum _{j=1}^N d_{ij}} \end{aligned}$$
(8)

E. Eigenvector centrality Eigenvector centrality assesses a node’s importance within a network not solely by its direct connections but also by the importance of its neighboring nodes [54, 55]. Consequently, nodes with high eigenvector centrality are usually linked to other important and influential nodes. This measure is especially useful for identifying influential nodes that are influential in the network, even if they do not have many direct connections. We assume that \(M_{i}\) represents the eigenvector centrality of node \(i\), and it is defined as follows,

$$\begin{aligned} M_{i} = \frac{1}{\lambda }\sum \limits _{j = 1}^{N} {A_{ij} x_{j} } \end{aligned}$$
(9)

where \(\lambda\) is the eigenvalue eigenvector associated with \(M_{i}\).

Degree distribution

In complex networks, degree distribution is viewed as a critical metric for characterizing network structure. It is the probability distribution of node degrees over the entire network. The power-law characteristic of degree distribution has attracted significant academic interest due to its prevalence across a wide array of natural and artificial systems, such as urban traffic [56], earthquake magnitudes [57], and power outage extents [58]. When the degree distribution of a network follows a power-law relationship, the probability P(k) of nodes with degree k is characterized by \(P(k) \propto k^{-\gamma }\) as defined in formula 10, where the \({\gamma }\) is the power-law exponent [59]. As shown in formula 10, the inverse relationship indicated by the negative exponent \({(-\gamma )}\) implies that there are many nodes with a small number of connections and only a few nodes with a large number of connections.

$$\begin{aligned} P(k) = k^{-\gamma } \end{aligned}$$
(10)

By taking the logarithm of both sides of the formula 10, we derive the relationship as \(\log (P(k)) = -\gamma \log (k) + r\) presented in formula 11. Here, the constant \(r\) represents any multiplicative factors that might have been present in the original equation before taking the logarithmic transformation.

$$\begin{aligned} \log (P(k)) = -\gamma \log (k) + r \end{aligned}$$
(11)
Core-periphery structure

The core-periphery structure of complex networks typically consists of a densely connected core and a sparsely connected periphery [60]. The “core” consists of a highly interconnected group of nodes, which often play crucial roles in the network’s function and dynamics. The “periphery” contains nodes with fewer connections, typically connected to the rest of the network through the core nodes. Other nodes within the network belong to the set of “semi-periphery”. This core-periphery structure can influence the robustness of a network, the efficiency of information or resource transfer, and the overall behavior of the network.

In a network, it is assumed that the coreness value of node \(i\) is \(C_{i}\). The overall average coreness of the network is represented by \(C_{mean}\), and the standard deviation of the network coreness is denoted as \(C_{sd}\). As shown in formula 12, a node \(i\) belonging to one of the three types “core”, “periphery,” or “semi-periphery” is defined as its coreness value \(C_i\) relative to the network’s average coreness \(C_{mean}\) and standard deviation \(C_{sd}\).

$$\begin{aligned} i \in {\left\{ \begin{array}{ll} \begin{aligned} &{}\text {core node} &{}&{} \text {if } C_{i} \ge C_{\text {mean}} + C_{\text {sd}}, \\ &{}\text {semi-periphery node} &{}&{} \text {if } C_{\text {mean}} + C_{\text {sd}}> C_{i} > |C_{\text {mean}} - C_{\text {sd}} |, \\ &{}\text {periphery node} &{}&{} \text {if } C_{i} \le |C_{mean} - C_{sd} |. \end{aligned} \end{array}\right. } \end{aligned}$$
(12)
Community structure

Community structure in complex networks is characterized by organizing nodes into clusters. The connections within each cluster are dense, while connections are sparser between clusters. The Louvain algorithm [61], as one of the most popular community detection algorithms, is to find communities in a network by optimizing the modularity score, which quantifies the strength of the division of a network into communities. Higher modularity values indicate a more robust partitioning of the network into communities. The modularity of the given partition of a network is defined as formula 13, where \(\gamma\) is the resolution parameter, and the \(\sigma (c_i,c_j)\) is 1 if nodes i and j are in the same community else 0 [62].

$$\begin{aligned} Q = \frac{1}{2m} \sum _{ij} \left( A_{ij} - \gamma \frac{k_ik_j}{2m}\right) \delta (c_i,c_j) \end{aligned}$$
(13)

Results

Basic information statistics

Analysis of word cloud

The word cloud diagram is a visualization technique that summarizes the textual content by presenting high-frequency words prominently [Folk culture

As shown in Fig. 5, the community, characterized by red-colored nodes, encapsulates the distinctive imagery of traditional folk culture in the Minnan region using terms such as “Goddess Mazu (妈祖),” “Buddha (佛祖),” “Guanyin (观音),” “dragon boat (龙船),” “the 7th day of the lunar month, often associated with particular festivals (初七),” “matchmaking (做媒),” and “lighting lamps (点灯).” Traditional nursery rhythms related to these terms usually convey the praying and blessing emotions and describe the related elements of the local culture. For example, the lyrics of a notable traditional nursery rhyme “Swallows Fly (燕仔飞)” go as follows: “swallows fly (燕仔飞), has the dragon boat come yet (龙船遘抑未)?... a wooden comb combs the bride’s hair (岭兜一支柴梳好梳头), both the comb and hair are shiny (梳也光,篦也光)....” This nursery rhyme is widespread in the Minnan region, particularly in the Hua’an county of Zhangzhou, and it vividly portrays scenes to reflect the marriage customs involving the terms “dragon boat” and “combing the bride’s hair.” Another famous rhyme “** (去逛街)” and “Singing cat (猫咪唱歌), the lyrics are respectively go as follows: “Going shop** is so fun, buying shirts, trousers, and bowls and chopsticks (去逛街,真有趣,买衫买裤买碗筷)...” and “the cat is so amusing, wanting to learn to sing do re mi, singing many times, but it always comes out as mi mi mi (猫咪猫咪真有趣,想要学唱 do re mi,唱来唱去,怎么都是 mi mi mi)...” Overall, the traditional MNRs incorporate diverse emotions of Minnan generations, and mainly describe the negative sentiments in the years of hardship. With China’s economic growth and modernization, the modern MNRs reflect societal changes and often aim to capture the joyful aspects of children’s lives.

Daily life and natural scenery

As shown in Fig. 5b, the yellow-color community and dark blue-color community symbolize the daily life of children and the natural scenes in children’s experiences, respectively, in the modern MNR network. The genres of many word nodes overlap in the two communities, i.e., a variety of words can be relevant to both daily life and natural environments. Such an overlap is caused by an interconnection between children’s everyday experiences and their perception or interaction with nature. Words like “singing (唱歌),” hide-and-seek (捉迷藏),” ” drawing (画画),” “body (身体)” and “washed up bright and clean (白白)” in the daily-life community reflect everyday activities and concepts in children’s routines, possibly instilling values of personal hygiene, politeness, respect for others, and appreciation of the natural world.

Within the dark blue-colored community of the modern MNR network, terms such as “garden (花园),” “beach (海边),” “sea waves (海浪),” ”landscape (风景),” and “rural (田园)” represent the theme of natural scenes. For example, the modern nursery rhyme “** on the beautiful bed. (月娘月光光,起厝田**。爱食三色糖,爱睏水眠床。)” The second version contains similar thematic content: “Bright moonlight, rising up in the center of the fields. River snails transform into water jars, and paper boxes transform into beds (月娘月光光, 起厝田**,田螺做水缸,纸盒做眠床。)” In contrast, the lyrics of another variant differ considerably from the above two nursery rhymes. This version portrays the “Bright Moon (月娘月光光)” as follows: “Bright moonlight, an old man in the garden plans to cultivate new crops. Yet, his efforts are met with frustration. The onions fail to sprout, the tea plants remain flowerless, and the melons develop without seeds. This series of agricultural endeavors leaves the old man exasperated to the point of despair (月娘月光光,老公仔伫菜园,菜园掘松松,老公仔欲种葱,葱无芽。欲种茶,茶无花。欲种瓜,瓜无子。老公仔气甲欲死).”

Values of MNRs

The MNRs encompass a wide range of topics, including societal issues, historical figures and events, local folklore, and natural elements, reflecting various aspects of life and culture of the Minnan region. These MNRs will foster children’s social interaction, cooperation, and teamwork by participating in group recitations or singing sessions. The rhythmic patterns, repetition, and melodic tones of MNRs contribute to the language development of children. Therefore, MNRs serve as a significant education value for children’s education from an early age.

In addition, by incorporating elements of local dialects, societal norms, folklore, and historical events, MNRs serve as a living repository of cultural knowledge. As children learn and pass on these rhymes, they create connections with their cultural knowledge. So, MNRs also serve as an invaluable tool for the preservation of Minnan cultural heritage because cultural knowledge is reinforced and inherited through these interactions.

The popularity of MNRs is not only within the Minnan and Taiwanese regions but also extends to Chaoshan and Leizhou in Guangdong province, Hainan Island, **yang and Yuhuan in Zhejiang province. Moreover, the dissemination of MNRs transcends geographical boundaries into Southeast Asian regions owing to people’s migration. The shared elements, such as proverbs, stories, and values, that are found within the MNRs create a sense of belonging among Minnan-speaking communities. For communities dispersed across various regions, these shared cultural practices serve as an essential link connecting the Minnan people at home and abroad, significantly contributing to the cultural identity and cohesion among its members.

Limitation

Given the long history of Minnan nursery rhymes, the data gathered in this study remains relatively sparse. It is necessary to conduct more comprehensive and in-depth fieldwork to collect MNRs.

As the lyrics of MNRs comprise numerous non-Mandarin characters and Minnan proverbs, the traditional word segmentation method employed in this study may still result in inaccurate outcomes. It is noticeable that the obtained segmentation may potentially influence the structure of MNR networks. Consequently, it is essential to design a more sophisticated approach for extracting and analyzing the semantic meaning of lyrics while also considering rhyme patterns. The tailored segmentation approach would align with the unique characteristics of the Minnan language, ensuring further precise analysis and interpretation.

In this study, we employ the conventional approach of community detection to analyze the MNR network. It is worth highlighting that the integration of advanced techniques for community detection, such as machine learning and graph neural network-driven methods, may potentially unveil novel insights and provide a deeper understanding of the symbolic imagery embedded within MNRs.

Conclusion and future research

This study aims to examine the unique characteristics and patterns of Minnan nursery rhymes through text mining methods, including text segmentation, TF-IDF, and complex network theory. Specifically, we first collected traditional MNRs edited by experts and modern MNRs sourced from the internet. Subsequently, we preprocessed and segmented the lyrics using our designed Chinese-Minnan dictionary. We further extracted vital terms using the TF-IDF method and constructed separate networks for traditional and modern MNRs, with the terms serving as nodes and the co-occurrence relationships forming the links.

The analysis of lyrical terms revealed that a limited number of key terms had a high frequency of occurrence and conveyed specific emotions in both traditional and modern MNRs. Additionally, we employed network indicators to evaluate the structure of the constructed MNR networks. The findings demonstrated that both the traditional and modern MNR networks exhibited overall sparsity, internal cohesion, and strong community characteristics. When examining node centrality, the highly central nodes in the traditional MNR network reflected unique elements associated with idyllic depictions of life, folks, values, and distinctive cultural themes in the Minnan region. Conversely, important nodes in the modern MNR network are primarily related to positive emotional words. Our further investigation included analyzing the degree distribution of the traditional and modern MNR networks, and both of them followed a power-law distribution. We also explored the core-periphery structure of the MNR networks. The core terms within the traditional MNR network represent folkloric and food culture, as well as emotional expressions specific to the Minnan region. In contrast, the core of the modern MNR network did not relate to Minnan culture and was primarily focused on conveying joyful emotions among children. Lastly, we conducted community detection on both the traditional and modern MNR networks, resulting in five communities for each. In the traditional MNR network, these communities represent family relationships, folk culture, food culture, animal imagery, and emotional expressions, aligning with the values and beliefs of the Minnan people in historical periods. Conversely, the modern MNR network is also comprised of five communities, but with differing imagery symbols, including family relationships, animal imagery, emotional expressions, daily life, and natural scenery. These changes could be attributed to shifts in people’s lifestyles, economic development, and the impact of cultural globalization.

The exploration of Minnan nursery rhymes through the complex network analysis unveiled intriguing research directions for future work. We plan to compare the characteristics of Minnan nursery rhymes with nursery rhymes from other regions in our future research. Such a comparative study can help deepen our understanding of the diversity and uniqueness of nursery rhymes across different regions, contributing to the preservation of cultural heritage. Moreover, constructing a large-scale and comprehensive multimodal knowledge graph to encompass MNRs will be a promising direction for future research. This research has the potential to address hidden knowledge embedded in MNRs and promote interdisciplinary research and cross-cultural understanding in the fields of linguistics, cultural studies, and digital humanities. Another interesting direction for future research is the impact of Minnan nursery rhymes on children’s development and language acquisition. This study may involve empirical research, such as conducting experiments or surveys, to examine how exposure to Minnan nursery rhymes influences aspects such as language proficiency, cognitive development, and cultural awareness of children.