1 Introduction

During the COVID-19 pandemic, production economics and operational management are called upon to support decision-making processes. In this burgeoning literature, Ivanov and Dolgui (2020) unveiled unforeseen and unprecedented fragilities in supply chains. They scrutinized the existing operational research studies dealing with disruption propagation and structural dynamics in supply chains. Dasaklis et al. (2012) predicted that outbreaks of epidemics will account for a great number of deaths. They assessed that effective control of an epidemic outbreak calls for a rapid response: “Available resources such as essential medical supplies and well-trained personnel need to be deployed rapidly and to be managed in conjunction with available information and financial resources in order to contain the epidemic before it reaches uncontrollable or disastrous proportions.” Their latter prediction became sadly very true in 2020.

Vaccines are vital to human survival, health and development, and are hot topics of common concerns for governments, experts, and scholars around the world (CDC 1999). When infectious diseases strike, in addition to providing timely assistance to infected patients, the development of vaccines has become critically important. From a domestic perspective in China, the Changchun Changsheng Vaccine Event in 2018 caused a sensation throughout the country; The number of problematic vaccines investigated was as high as 650,000, of which 247,359 have been already vaccinated, and this involved 215,184 children. The novel coronavirus pneumonia (COVID-19), which was noticed in January 2020 in Wuhan, China, as of May 2020, went outbreak and has become a worldwide pandemic, forcing the World Health Organization (WHO) to declare it as a most severe world public health crisis. Public health experts come to a consensus on the urgent needs of vaccine-type preventive biological products to save people’s life. Therefore, the development and safety of vaccines have been the primary issue in fighting infectious diseases, preventing sudden health emergencies, and managing sudden life-threatening public health events.

Major international infectious diseases in recent years include Severe Acute Respiratory Syndrome (SARS), Ebola Virus Disease, Middle East Respiratory Syndrome (MERS-CoV), and the most recent Novel Coronavirus Pneumonia (NCP or COVID-19). According to WHO, as of August 16, 2003, the number of patients with SARS worldwide was 8,096, the number of deaths was 774, and the mortality rate was 9.6% (WHO 2020f). From 2012 until January 15, 2020, the total number of laboratory-confirmed MERS infection cases reported globally to WHO was 2,506, with 862 associated deaths (WHO 2020e). The Ebola virus has considerable epidemic potential. For example, the scale of the West African epidemic in 2013–2016 was unprecedented, with more than 28,000 confirmed cases and 11,000 deaths. For the Ebola infectious disease incidents in Africa, as of February 11, 2020, a total of 3432 Ebola cases were reported, including 3309 confirmed cases and 123 suspected cases, of which 2253 died, with a mortality rate of 66% (WHO 2020c).

At present, as of May 19, 2022, COVID-19 in China has caused 1,879,192 confirmed cases and 15,713 deaths (WHO COVID-19 Dashboard), and more severe outbreaks happen worldwide out of control, as detailed in the next paragraph. The main transmission routes are respiratory droplets and contact transmission, which are highly contagious. People of all ages may be infected, mainly adults, and the elderly and frail people seem to be more likely to be infected (WHO 2020b).

According to WHO, as of May 19, 2022, the total number of infected people by COVID-19 is over 520million, and total deaths are over 6 million (6,270,232), still rapidly growing. The breakdown of these numbers by countries are as follows: China (infected count: 1,879,192, Deaths: 15,713), United States (over 81 million, 992,667), India (over 43 million, 524,293), England (over 22 millon, 177,580), Italy (over 17 millon, 165,494), Spain (over 12 millon, 105,642), France (28 millon, 144,302), Brazil (30 millon, 664,987), Iran (over 7 millon, 141,236).

Infectious diseases in China also include hand-foot-mouth disease, H1N1 influenza, and so on. From 2008 to 2009, a total of 1,644,480 cases of hand, foot, and mouth disease were reported in China, 14,977 patients were severe, and 479 people died. Among them, infants and young children were mainly under five years old. The infection routes of hand, foot and mouth diseases (HFMD) include digestive tract, respiratory tract and contact transmission (Zhu et al. 2011). On March 31, 2010, more than 127,000 confirmed cases of H1N1 influenza were reported in 31 provinces across the country, including 800 deaths. On June 11, 2010, WHO stated that 18,156 people worldwide have died since the outbreak of H1N1 influenza a year ago. The transmission of H1N1 influenza is mainly through the respiratory tract, but it can also be transmitted through contact with infected pigs or their feces and the surrounding polluted environment. The susceptible population is young adults aged 20–45 (Vaillant et al. 2009).

The spread of infectious diseases is rapid, broad, and fatal, and it is crucial to carry out epidemic prevention and control promptly. Hence, speeding up the development of vaccines and ensuring safety and VE is the key to confront global virus threats.

At present, relevant studies of vaccination mainly focused on the following aspects: First, the necessity and importance of vaccination. As pointed out by WHO in 2018, current vaccination alone can prevent 2–3 million deaths each year (WHO 2020a). Ventola estimated that the average mortality rate from preventable diseases due to vaccination has decreased by 97.8% (Ventola 2016).

The second is the study of vaccine storage and transportation conditions. The transportation, storage, and other aspects of the vaccine require very strictly restricted temperature and other requirements. When the ambient temperature of a vaccine is outside a moderate range, the VE can be weakened or even completely lost. Patients mistakenly believe that they received a routine vaccine while the vaccine they received is invalid, causing a more significant increase in the probability of catching a disease (WHO 2004). Generally, high temperature has a tremendous negative impact on the effectiveness of almost all vaccines but also is the fact that too low temperature can inactivate freeze-sensitive vaccines such as tetanus toxoid, Haemophilus influenza b (Hib), and inactivated poliovirus (WHO 2020g). Therefore, kee** the vaccines at the optimal temperature range at all times during transportation is critical to maintaining VE (Yakum et al. 2015). In Brazil, the vaccine has to be delivered through the Amazon jungle and a long distance to reach remote communities, along with poor and difficult transportation conditions, causing the overall VE to be reduced (WHO 2020d). According to the statistics, a considerable proportion of vaccines in the course of transportation lose their effectiveness partially or even wholly because of the exposure to excessively high or sub-zero temperatures (Das et al. 2019). A study (2007–2014) showed that vaccines were exposed to freezing temperatures during storage (29.7%) and transportation (28.1%) in low- and middle-income countries (Hanson et al. 2017). Kinfe et al. believed that, in some develo** countries, although vaccination coverage has been increased, due to old-fashioned equipment or newly developed equipment that cannot be upgraded in time (due to the poor economic conditions), the recurrence rate of vaccine-preventable diseases remains high (Kinfe et al. 2019). Kumru found that the requirements for storage and transportation are high (Kumru et al. 2014). Proper, standardized, and scientific storage is the core condition for ensuring vaccine safety.

The third is the study of vaccination effect and influencing factors. As pointed out by Phillips et al. (2017), Brison and LeTallec (2017), in low-income and middle-income countries, many children do not always develop immunity after vaccination. At the same time, inadequate preparation of medical facilities and sanitation facilities and severe cold chain breaks are important factors affecting willingness and vaccination effectiveness. Huang et al., by understanding the positive rate of HBsAb in lymphoma patients after receiving different doses of hepatitis B vaccine, evaluated the immune effect of different doses of Hepatitis B vaccine and analyzed the influencing factors (Zhuang and Wang 2020). Lymphoma patients can get a better immune effect if they receive a high-dose hepatitis B vaccine. Rupprecht and Salahuddin pointed out that the preparation technology of rabies vaccine has been significantly advanced, and the protection rate for rabies vaccine has reached a higher level (about 90%) (Rupprecht and Salahuddin 2019). Still, the occurrence of rabies has not yet been eliminated. Yang et al. investigated the primary immune effect and influencing factors of the hepatitis B vaccine by understanding the response of the 5ug recombinant hepatitis B vaccine in adults (Yang et al. 2010) and K2 algorithm (Cooper and Herskovits 1992), BN structure learning and parameter learning are performed to establish a mixed Bayesian evaluation of VE network model.

The remainder of the paper is organized as follows. In Sect. 2, a risk assessment index system for VE is constructed, a BN model combined with an interpretative structural model method is proposed, and a BN structure learning is performed. In addition, a BN parameter is estimated based on a MAP algorithm (Heckerman et al. 1995). In Sect. 3, we mainly conduct an empirical study and analyze the results and verify the feasibility of the model through stability, effectiveness analysis, and comparison with other methods. Section 4 draws the conclusion, provides political implications, discusses research limitations, and outlines future research directions. The overall framework of the paper is illustrated in Fig. 1.

Fig. 1
figure 1

Framework and flowchart of the paper

2 Theory construction and model building

2.1 Establishing risk index system for VE

This article mainly studies the influenza vaccine, which belongs to the second-class vaccine in China, namely the self-paid vaccine. Figure 2 shows the approval status of influenza vaccines issued by the China National Institutes for Food and Drug Control (NIFDC) in the past 3 years. In 2017–2019, a total of nine vaccine manufacturers produced influenza vaccines. Among them, there was no influenza vaccine listed in Taizhou **dike in 2017. In 2018, the Shanghai Institute of Biology and Bei**g Kexing, Changchun Changsheng, and Taizhou **dike were not shown in the list of influenza vaccines. In 2019, Shanghai Institute of Biology, Dalian Yalifeng, Tian** Zhongyi Anke, and Changchun Changsheng were not listed. In 2017, 2018, and 2019, the numbers of influenza vaccines listed in China each were respectively 29.907 million, 16.124 million, and 20.217 million (NIFDC 2020).

Fig. 2
figure 2

Data Sources: https://www.nifdc.org.cn/nifdc/

Approval of China’s influenza vaccine market.

According to the reports by the United States Centers for Disease Control and Prevention (CDC), the flu season has caused 22–31 million infections and 12,000 to 30,000 deaths in the US. from 2019 to 2020. China’s population is four times that of the United States, and the impact of influenza on the people is even more significant. According to statistics from the US CDC, from 2019 to 2020, American influenza vaccine manufacturers predicted that they would provide up to 162 million to 169 million doses of influenza vaccines to the market. From 2019 to 2020, of the 1.395 billion people in China, only some 20 million doses were administered to a small percentage of the population; In contrast, the US has about 327 million people and 169 million doses were administered. The two countires’s vaccination rates were 2% and 51% respectively. Compared to the US, the flu vaccination rate in China has a large gap.

Figure 3 shows the distribution of influenza vaccine manufacturers that supply Hunan Province. In recent years, as there are no vaccine manufacturers located in Hunan Province, influenza vaccines have to be transported from various originsthrough a cold chain. Since the temperature of the vaccine has a significant impact on VE, and many factors affect VE during the transportation process, it is vital to conduct a VE risk assessment to ensure VE

Fig. 3
figure 3

Distribution of influenza vaccines from origins to Hunan Province

Risk assessment indicators for VE are selected by incorporating the following two aspects: (1) Research results from the existing literature, preliminary design indicators from the perspective of cold chain transportation, reports and case studies as well as relevant analyses from the practice, and comprehensively refined risk factors from qualitative and quantitative studies (Head et al. 2019; Phillips et al. 2017; Yang et al. 2016); and (2) Building a risk assessment index system for VE from four dimensions (a through d in Table 1 below), by referring to historical literature and expert opinions from Hunan provincial CDC (Rogers et al. 2010; Tostanoski and Jewell 2017).

Table 1 Construction of VE risk warning indicators

This indicator system is developed to aim at the vaccine production workshop, cold chain storage, cold chain transportation, Hunan provincial CDC storage, vaccination site storage, and other links. In this system, the risk factors are initially classified, possible risk factors included in each link are analyzed, and a risk assessment index system for VE is constructed. The system consists of eleven indicators, and the assessment criteria of each indicator are referred to as the opinions of experts from the Hunan provincial CDC. Among them, the evaluation of VE is obtained through testing and calculation, as shown in Table 2. The establishment of this indicator system focuses on external factors, namely from the perspective of the vaccine transport chain. The eleven indicators are set up to refine the risk factors and improve the accuracy of the assessment.

Table 2 Valuation of V.E

2.2 BN structure learning with hybrid ISM method

BN structure learning can be designed to incorporate expert experience or learn from data, but directly acquiring knowledge from experts can be time-consuming and error-prone. In the case of a large number of variables in a BN, it is often difficult for domain experts to describe the structural dependencies and independence between variables. On the other hand, for areas where human experts have limited knowledge but sufficient cases and sufficient data available, through search-scoring algorithms, the BN structure can be learned from the data. However, it is challenging to automatically discover the structure from the data because there are many possible structures, which would require massive high-quality data for rigorous detection.

As China does not yet have a complete vaccine traceability system, through the cooperation with the Hunan provincial CDC, staff from its subordinate departments have collected influenza data from nine influenza vaccine manufacturers and vaccination sites across the country from 2017 to 2019, including data on vaccine delivery and vaccination. Data on the preconceived risk assessment indicators were collected. Currently, influenza vaccines that have been marketed internationally include influenza inactivated vaccines and live attenuated influenza vaccines. The influenza vaccines that have been sold in China are both trivalent inactivated vaccines and tetravalent inactivated vaccines. Trivalent vaccines are administered for infants over six months, children, and adults to prevent influenza caused by type A H1N1, type A H3N2, and type B Victoria. The tetravalent component is one more than the trivalent vaccine, that is, the trivalent vaccine prevents influenza caused by the Yamagata type B, and the tetravalent is used for children over three years old and adults. In this experiment, the composition of the vaccine is not involved, so no distinction is made between trivalent and tetravalent vaccines, and they are collectively referred to as influenza vaccines.

These data include:

  1. 1.

    Ranking of vaccine manufacturers’ cities by GDP in all cities of their province (last year).

  2. 2.

    Comprehensive quality of the refrigerated truck driver.

  3. 3.

    Refrigerator compartment temperature records of production and transportation link.

  4. 4.

    Storage room temperature records of the Hunan provincial CDC storage link.

  5. 5.

    Comprehensive quality of storage guards of the Hunan provincial CDC storage link.

  6. 6.

    Refrigerator compartment temperature records of vaccine delivery link.

  7. 7.

    Comprehensive quality of vaccination nurses at vaccination sites.

  8. 8.

    Temperature record in the storage room of vaccination sites.

  9. 9.

    Ranking of vaccination sites’ cities by GDP in all cities of their province (Last year).

  10. 10.

    Temperature on the day of inoculation.

  11. 11.

    Evaluation of VE (Zapata 2017).

In the above list, data of the first 11 nodes are obtained through field surveys, and the last node, the assessment of VE, needs to be tested. The usual methods for testing VE include the negative test design method (Foppa et al. 2013; Jackson and Nelson 2013) and the antibody titer test method is commonly used to evaluate the potency and VE (Ganz et al. 1978). This study selected the antibody titer test method. The explanation of how to evaluate the effectiveness after vaccination is as follows:

One vaccine is randomly selected from each batch of vaccines as an experimental sample. The sampling procedure is used to ensure that the samples are representative and with required sample sizes for the study (estimated to be 400). We selected some participants from those who came to the community hospital to receive the flu vaccine and who also were willing to cooperate with our research and sign a consent form, stating that “after seeking the consent of the vaccine vaccinators, they are required to return to the community hospital for an antibody titer test one month after the vaccination.” Since it takes 2–4 weeks for the influenza vaccine to produce protective antibodies, performing an antibody titer test one month after vaccination is the best time. Results of the antibody titer test are closely related to VE

In experimental research, there are often some “non-processing factors” or confounding factors, which will affect the research results, such as gender and age. Confounding factors are among the essential concepts in observational and controlled studies (Pearl et al. 1999). In epidemiological observation research, confounding is a core concept in causal inference, and it is the leading cause of false correlation (no causality but still related). It must be controlled during causal inference. The randomized experiment is the most straightforward and effective method for evaluating causality. Through randomization, the distribution of known and unknown confounding factors can be balanced in the training group and the test group, which means that the randomized experiment is adopted to eliminate known and unknown confounding factors’ results in average causal effects (Pourhoseingholi et al. 2012).

A total of 402 sets of experimental data were collected in this study. First, the data was preprocessed to check for outliers. In general, we tend to ignore outliers when building models. This is not a wise practice, as outliers shift data and reduce accuracy. An outlier is an observation that goes well beyond the overall pattern in the sample. The causes of outliers can be divided into two categories: human error and natural error. Human errors, such as errors caused during data collection, recording, or entry, can cause outliers in the data. Because the data in this study were collected through professional surveys and artificially discretized, human errors may exist. Sorting the data table is a simple but effective way to highlight outliers. Just sort the data table for each variable and look for unusually high or low values. As shown in Fig. 4, the sample data set has been sorted in ascending order, and the highest value is significantly different from the other values. Although this method cannot quantify, the abnormality of outliers, abnormally high or low values is easy to find. Besides, visualize the data through a box plot, which can also detect outliers intuitively, as shown in Fig. 4. There are four abnormal points in the figure, the values are: a1 = 11, c1 = 22, c2 = 30, and d4 = 11. These outliers correspond to errors caused by the researcher’s input process. Deleting abnormal observations can also be done by using other statistical models to predict the “true” observations occupied by outliers and then adopt the predicted values of these true values in the statistical analysis.

Fig. 4
figure 4

Outlier data detection based on a boxplot

In the process of preprocessing the data, it was found that there were several missing data. Besides, the abnormal values were treated as missing data and were made up through missing data techniques and the Rubin’s EM algorithm. In the end, there were a total of six missing data. Therefore, an EM algorithm was employed to handle the missing data. The EM algorithm is iterative. Each iteration can ensure that the likelihood function value increases and converges to a local maximum. The basic principles and implementation steps of the EM algorithm were analyzed, the missing values are interpolated, and a complete experimental data set was established at the end. Compared with directly excluding missing data for experiments, the introduction of the EM algorithm to fill missing data does not need to discard any data, making the data more comprehensive and the accuracy of the experimental results higher.

Selecting the “train_test_split” method of the “Scikit-Learn library” of Python software, we randomly divided the 402 sets of data into training data and test data: Using the cross-validation method to determine the ratio of the training set to the test set to approximately 3: 1, resulting in 302 cases in the set training data and 100 cases in the set of test data. The cross-validation accuracy of the BN model on the training set is shown in Table 3. “Train_test_split” is a commonly used function in cross-validation, and its function is to randomly select train data and test data from the sample in proportion. Then the feature extraction and discretization of the data were carried out, and finally, the data is used for structure learning and parameter learning. Matlab software was used to test the stability of the model using the ten-fold cross-validation method, as shown in Table 3.

Table 3 Accuracy and mean of ten-fold cross-validation method

2.2.1 Step 1: Domain experts establish pairwise interaction/influence relationships between variables

The 302 sets of training set data were used as the data for structure learning and parameter learning. The K2 algorithm is selected to build the network structure. Still, because the K2 algorithm can be susceptible to the order of variables in the initial setting, the ISM method is adopted to obtain a reasonable initial node ordering for the K2 algorithm. An influence graph is deduced, which represents the direct relationship between any two nodes. To prevent experts from judging the unlikely relationship, the following assumptions are made: (1) Factors in each link affect each other; (2) Factory shipment may directly affect Hunan provincial CDC storage and VE; (3) Hunan provincial CDC storage Links may directly affect vaccine delivery, and VE; (4) Vaccine distribution links may directly affect vaccination sites, and VE; (5) Vaccine site storage links may directly affect VE After designing the list of possible relationships in advance, the variable relationships were reduced to 52 relationships, as shown in Fig. 5, where “3 × 2” means that there are 6 (= 3 × 2) one-to-one direct relationships between the three factors, and similar for notation “2 × 1” and so on. Two groups of domain experts were invited to judge these 52 relationships, discuss the differences and reach consensus, and build a Structural Self-Interaction Matrix (SSIM) to capture the relationship between variables.

Fig. 5
figure 5

Providing the relationship between reference variables

2.2.2 Step 2: Establish a structural self-interaction matrix \(SSIM = [r_{ij} ]_{n \times n}\), and for each pair of variables \(K \in (1, \ldots ,n)\) and \(X_{j}\) in the set \(K \in (1, \ldots ,n)\), express the direct relationship between them by \(K \in (1, \ldots ,n)\)

$$ r_{ij} = \left\{ {\begin{array}{*{20}c} 1 \\ 0 \\ \end{array} \begin{array}{*{20}c} {X_{i} {\text{ and }}X_{j} {\text{ are directly related}}} \\ {{ }X_{i} {\text{ and }}X_{j} {\text{ are not directly related}}} \\ \end{array} } \right. $$
(1)

Therefore, based on the judgment of domain experts, the following SSIM is established.

2.2.3 Step 3: Compute the reachability matrix M by using the SSIM

From the matrix M, one can determine if a connection path between two variables exists. M is computed by \(K \in (1, \ldots ,n)\), where \({1} \le k \le n\) represents the length of the longest path of the SSIM.

figure a

2.2.4 Step 4: Divide M into different hierarchical structure influence diagrams

To decompose the reachability matrix M, we first defined the reachability set R(Xi), containing the variables that can be reached from Xi (including Xi), the prior set Q(Xi) consisting of the variables from which Xi can be reached (including Xi), and the reachable set and the preceding intersection set A = R(Xi) ∩ Q(Xi). The level extraction is performed according to the condition R(**) = A. The top variables are extracted first, which is the final goal of the system. After separating the top element, the other elements are extracted in turn until the bottom element is extracted. All variables are divided into nine levels in order:

\(L_{1} = \left\{ e \right\}\),\(L_{2} = \left\{ {d_{4} } \right\}\),\(L_{3} = \left\{ {d_{3} } \right\}\),\(L_{{4}} = \left\{ {c_{2} ,d_{2} } \right\}\),\(L_{{5}} = \left\{ {d_{1} } \right\}\),\(L_{{6}} = \left\{ {b_{2} ,c_{1} } \right\}\),\(L_{7} = \left\{ {b_{1} } \right\}\),\(L_{8} = \left\{ {a_{{2}} ,a_{3} } \right\}\), \(L_{{9}} = \left\{ {a_{1} } \right\}\).

2.2.5 Step 5: Draw a directed acyclic graph based on the direct influence relationship

The formed ISM is shown in Fig. 6a and from which the initial ordering of the BN nodes can be obtained:

  1. 1.

    Ranking of vaccine manufacturers’ cities by GDP in all cities of their province (Last year) a1.

  2. 2.

    Comprehensive quality of refrigerated truck driver a2 and Refrigerated truck temperature a3.

  3. 3.

    Storage room temperature records of the Hunan provincial CDC storage link b1.

  4. 4.

    Comprehensive quality of storage guards b2 and Personal quality of distribution staff of the vaccine distribution link c1.

  5. 5.

    Comprehensive quality of vaccination nurses at the storage point of the vaccination site d1.

  6. 6.

    Temperature of the refrigerated truck in the vaccine distribution process c2 and Storage room temperature of the inoculation site d2.

  7. 7.

    Ranking of vaccination sites’ cities by GDP in all cities of their province (Last year) d3.

  8. 8.

    Temperature on the day of the inoculation d4.

  9. 9.

    VE e.

Fig. 6
figure 6

Risk of vaccination: a ISM Model for VE Risk, b BN Structure for VE Risk

The maximum number of parent nodes of each node is 11, which is the maximum number of parent nodes of the lowest node.

2.2.6 Step 6: Learn BN structure from data

The ordering information of the nodes is used as the input of the initial sequence in the K2 algorithm. The Matlab BNT toolbox is used to learn the BN structure. The final ISM-K2 BN structure is obtained by learning the 302-training data, as shown in Fig. 6b. Observing Fig. 6a and b, a new arrow appears in (b), which also indicates that there is a difference between the expert experience and the actual case. The initial structure can be supplemented and improved through sufficient case data learning. Compared with the manual construction of BN that rely solely on the opinions of consulting experts, the ISM-K2 structure learning method combines expert experience and historical data information, adding a lot of supplementary information to the final network model, which helps to improve the accuracy of the model reasoning.

2.3 BN parameter learning based on map algorithm

Based on the network structure in Fig. 6b and the 302 training cases (represented by Dr), the MAP algorithm (Zhou et al. 2016) in software GeNIe 2.4 was used to perform parameter learning for determining the following conditional probability: \(\theta_{ijk}\), where Pa(Xi) represents the parent node of node i, and \(\theta_{ijk}\) denotes the conditional probability given that Xi takes the kth value \(X_{i}^{k}\) and Pa(Xi) takes the jth state \(Pa(X_{i} )^{j}\). Using the training set De, the valuation \(\theta_{ijk}\) of \(\theta_{ijk}\) can be calculated by the following formula:

$$ \theta^{ * }_{ijk} = \frac{{N_{ijk} + \alpha_{ijk} }}{{N_{ij} + \alpha_{ij} }}, $$
(2)

where \(\theta_{ijk}\) counts cases, \(\theta_{ijk}\) in De takes the kth value and its parent node takes the jth value, that is: \(N_{ij} = \sum\limits_{k = 1}^{{r_{i} }} {N_{ijk} }\).

Let \(\alpha_{ij} = \sum\limits_{k = 1}^{{r_{i} }} {\alpha_{ijk} }\), where \(\theta_{ijk}\) is a hyper-parameter reflecting the expert’s guess on the virtual data count of the parameter \(\theta_{ijk}\). If not, enough judgment is obtained from experts in related fields, \(\alpha_{ijk} = 1\) or \(\alpha_{ijk} = \frac{1}{{r_{i} q_{i} }}\) is generally used [35], where ri represents the number of values of Xi and qi counts the parent nodes of Xi. Because of the lack of inputs from experts, \(\alpha_{ijk} = \frac{1}{{r_{i} q_{i} }}\) is taken in this study.

The final BN model is given in Fig. 7. “d1_1, c1_1” in Fig. 7 refer to “d11, c11” in Table 3, and so on. Each node stores a conditional probability table (CPT). Due to the inconvenience of displaying multiple nodes in the structure, only the CPT of temperature d2 in the storage room at the inoculation site is shown (Table 4).

Fig. 7
figure 7

BN for VE Risk

Table 4 CPT of storage room temperature at the inoculation site

Among nodes, if the score of comprehensive quality d1 of the nurses at the inoculation site is 2, and the score of personal quality c1 of the distribution personnel at the distribution link is 1, then the probability that the storage room temperature at the inoculation site is qualified is 0.82.

3 Analysis of empirical results and discussions

The test set of 100 cases was applied to verify the model. Factors other than the storage room temperature record (d2) and VE (e) were regarded as known evidence, and they were used as test results. Then a consistency analysis between the prediction results of the BN model from these two nodes and the data in the records is performed. The confusion matrix is shown in Fig. 8a and b summarizes the difference between the predictions from the BN model and the data recorded in the test set. The elements on the diagonal line indicate the number of cases in which the forecast is consistent with the records, and the other elements tell the number of inconsistent cases. The light-yellow color shows a higher degree of difference. In general, 90% and 92% of the predicted data of the two nodes are consistent with the records, and most of the inconsistencies are similar, so they can be considered to be partially consistent.

Fig. 8
figure 8

These are the two figures of the confusion matrix: a confusion matrix for temperature record (d2) of inoculation point storage room; b confusion matrix for VE (e)

3.1 Stability analysis

BP neural network is a commonly used algorithm in the field of security risk early warning and assessment. Still, it is susceptible to the initial weight, causing the results of multiple pieces of training not stable. In contrast, the BN has exceptionally high stability. Once the network structure and CPT are determined, the operation results will not change. The experiment was repeated eight times, and the vaccine efficacy e was taken as a prediction object. The experimental results are shown in Table 5.

Table 5 Comparison of experimental results between BN and B.P. neural network

In Table 5, the average accuracy rates of the BP neural network and BN are 89.9% and 90%, respectively. Although the difference is only 0.1%, the results of the eight experiments using the BP neural network method show instability, that is, the accuracy rate of each experiment is different. The highest accuracy is 93.5%, and the lowest is 86.7% giving a big difference between the levels of accuracy. Therefore, after analyzing the stability of the model, we found that the stability of the BN is significantly better than that of the B.P. neural network.

3.2 Effectiveness analysis

To evaluate the ISM-K2 structure learning method in the developed the BN in this study, Firstly, the hill climbing algorithm was compared with the ISM method in structure learning and it was found that the Bayesian network using ISM was more accurate. Then the maximum likelihood estimation (MLE) method was compared with the MAP method in parametric learning, and it was seen that the MAP algorithm yielded conclusions that were closer to reality. Two other models were compared: a BN based on ISM structure learning only and a BN for structural learning using the GA-K2 algorithm. The results of the two methods are shown in Fig. 9. Among them, compared with the ISM-K2 BN, the ISM BN lacks multiple directional connections. That is, the network information is missing. The GA-K2 BN uses a genetic algorithm as a fitness function to obtain the initial order of the nodes and then employs the K2 algorithm structure to learn the BN structure. Therefore, this approach can lead to many abnormal and counterintuitive relationships, such as the arrow from the comprehensive quality of refrigerated truck driver (a2) to the temperature on the day of inoculation (d4). Five experiments were conducted, and the inferred objects were the personal qualities of the distribution personnel (c1), the temperature record of the storage room at the inoculation site (d2), the GDP ranking of the city where the inoculation site was located (d3), the temperature on the day of inoculation (d4), and the temperature record of refrigerated truck (c2). The experimental results are shown in Table 6.

Fig. 9
figure 9

Comparison of Single Structure BN for VE Risks: a description of BNISM; b description of BNGA-K2

Table 6 Comparison of experimental results between BNISM and BNGA-K2

The ISM BN model has a relatively good prediction effect on the GDP ranking of the city where the inoculation site is located (d3). The overall prediction effect of the GA-K2 BN model is better than that of the ISM model. However, compared with the ISM-K2 BN method used in this study, their accuracy rates are relatively low, and the structure of the GA-K2 BN model also has an unconventional directing relationship.

Therefore, the model produced by the ISM-K2 BN has better training and reasoning performance. This method is suitable for BN modeling with relatively fewer data and relatively little domain expert knowledge.

The data was sampled eight times, with 200 sets of random data taken as a group each time. In addition to the neural network models of the original manuscript, two more methods, decision tree methods and naive Bayesian network are slected for the comparison with the proposed Bayesian networks. Figure 10 in the revised manuscript illustrates the accuracy of the results obtained from these four methods. The horizontal axis of the graph indicates the number of trials and the vertical axis shows the difference in accuracy from 100%. The neural network is occasionally able to produce very accurate results, but has the disadvantage of unstable and wabbly results, which need to be optimised. The accuracy of the naive Bayesian network does not compete with the results obtained from the Bayesian network method due to the improvement in this paper. Moreover, the Bayesian network is more interpretable. We compared the experimental results of the proposed Bayesian Network method with the decision tree model, which is also interpretable, and found that the Bayesian network is more accurate, averaging 0.5% than the decision tree model.

Fig. 10
figure 10

Accuracy gap comparison of VE prediction

3.3 Probabilistic reasoning

Probabilistic reasoning is to use the values of some nodes in the BN (called evidence) to infer the posterior distribution of unknown nodes (called observation nodes). The BN has a bidirectional reasoning function: Reason about the probability of the resulting child nodes from the parent nodes as the cause (prediction reasoning), and infer the probability of parent nodes by using the information of their child nodes (diagnostic reasoning).

Perform analysis of the control variables in the BN model (Fig. 7). First, the predictive reasoning is performed: Assume that the values of the three variables in the production and transportation link are all 1, that is, the GDP ranking of the vaccine manufacturers’ cities last year accounted for the top 30% of the province’s GDP rankings, the comprehensive quality of refrigerated truck drivers is good, and the temperature of refrigerated truck compartments is fully qualified. The effect of changes in VE values is small. Conversely, if all the three variables of the factory transportation link take the value 3, that is, all of them fail, VE will decrease by 5%. Assume that the values of the four variables in the Hunan provincial CDC storage and vaccine delivery link are 3, that is, the storage room temperature exceeds 8℃ for more than half an hour, the comprehensive quality of storage guards is lower, personal qualities of delivery staff of vaccine delivery link is lower, and the refrigerator compartment temperature of vaccine delivery link exceeds 8 ℃, for more than half an hour, VE will reduce by 6%. Assume that the values of all factors in the storage at the vaccination site are all 3, that is, the comprehensive quality of the vaccination nurses is lower, the temperature in the storage room of vaccination sites exceeds 8 ℃ for more than half an hour, the city where the vaccination site is located ranking the last 40% of the province’s GDP in the last year, and the highest temperature on the day of inoculation is above 30 ℃, VE will decrease by 10%. This shows that the last three links have a greater impact on VE than the first link, and it is recommended that the regulatory authorities pay more attention to strengthening the supervision of these three links.

Then we can perform diagnostic reasoning. Assuming that the antibody titer of the vaccine is zero after one month of vaccination, that is, the test result is unsatisfactory, it then can be inferred that the probability of the highest temperature above 30 ℃ on the day of vaccination is 73%, the probability that the temperature of the refrigerated truck compartment during the distribution process exceeds 8 ℃ for more than half an hour is 19%, the probability of a lower overall quality of refrigerated truck drivers in the production and transportation was 44%, and the probability of lower comprehensive quality of storage guards is 13%. Studies have shown that in addition to the storage temperature of the vaccine, the daily temperature of the vaccine injection, the economic level of the vaccine production site, and the quality of staff handling vaccines all have varying degrees of impact on VE Therefore, the investigation can be performed according to the order of probability to quickly determine the risk factors for vaccine failure, to carry out fast and accurate accountability.

Surprisingly, when performing the single-variable control, no matter what value of node a1 of the GDP ranking of the vaccine manufacturers’ cities is set, the probability of the VE node does not change, indicating that there is no direct correlation between the GDP ranking of the vaccine manufacturers ‘cities and the VE If the value of the temperature on the day of vaccination is set to 3, which means the maximum daily temperature is above 30 ℃, the probability of VE will decrease by 3%. If the value of the comprehensive quality of staff handling vaccines (a2, b2, c1, d1) is all set to 1, the probability of VE will increase by 6%, which indicates that the comprehensive quality of staff handling vaccines has a significant impact on VE

Different from the existing studies that only focused on the temperature of the cold chain of vaccines (Pourhoseingholi et al. 2012; Das et al. 2019), this study incorporates the economic level of vaccination sites’ cities, the temperature on the day of injection, the comprehensive quality of staff handling vaccines, and scores the total quality quantification into one overall measure, which has not been proposed in previous studies. Although our experiment is a randomized experiment, which theoretically eliminates the influence of confounding factors, due to the consideration of experimental rigor, there may be other non-measurable factors. However, from the results of our experiments, BN has excellent interpretability and accuracy in the analysis of VE and inference.

Further, if the BN model is used for diagnostic reasoning, it can quickly and effectively track the adverse events of ineffective vaccines, which will help improve VE and reduce the spread of infectious diseases.

4 Conclusion

This paper designs a BN model that requires only limited historical data and expert questionnaires to perform more accurate VE analysis and empirical analysis. In this study, we focused on the influenza vaccine in the second-class vaccine. Combined with historical literature and domain expert experience, and because of the impact of vaccine storage temperature on VE, eleven possible risk factors, existing in the four main stages of the cold chain transportation links, were analyzed.

In cooperation with the Hunan Provincial CDC, through data collection and data discretization, the ISM-K2 BN model was adopted to conduct a risk assessment of VE Findings help to clarify the cold chain distribution responsibility of vaccines and accurately strengthen the requirements of each link in “Transportation-Storage-Distribution-Inoculation” separately.

The study can provide individual assistance for the guarantee and evaluation of VE, and serve as a specific reference for strengthening China’s vaccine quality and safety supervision value. The BN model, combined with the ISM method, provides a practical reasoning approach for analyzing vaccine-related data obtained from the supply chain process, and in practice, serves as a prediction strategy for vaccine effectiveness risk.

As for the limitation of the study, for one thing, some factors that affect VE, such as unknown manufacturers’ qualifications, the individual physique of the vaccine recipients, were not included in the models. For another, information is nonadditive, and due to the rarity of events, identification of point probabilities or probability distributions to characterize parameter uncertainty would be subject to degrees of imprecision. In such cases, we will express the assessment of an event in the form of interval or imprecise probabilities using Dempster-Shafer’s belief function and Choquet capacity. These aspects need to be further improved in future research.