Background

Atypical porcine pestivirus (APPV) belongs to the genus Pestivirus in the family Flaviviridae and is a novel, highly differentiated pestivirus that was first identified in pigs in the USA through metagenomic sequencing in 2015 [1]. APPV was classified as Pestivirus K by the International Committee on Taxonomy of Viruses (ICTV) in 2018 [2]. The clinical presentation of pigs infected with APPV is characterized by congenital tremor (CT) type A-II in piglets [3], while adult pigs may become viral carriers and shedders [4]. It is not surprising that APPVs are present and have become a major threat in China, which is an important country for pig farming and trade [5].

APPV is a highly variable single-stranded RNA virus, and its genome is approximately 11.0 kb in size and comprises a single open reading frame (ORF) flanked by untranslated regions (UTRs) at the 5’- and 3’-ends. The ORF encodes a continuous polyprotein, which is processed into 12 mature proteins, including four structural proteins (C, Erns, E1, and E2) and eight nonstructural proteins (Npro, P7, NS2, NS3, NS4A, NS4B, NS5A, and NS5B) [1]. The NS3 gene has been shown to be highly conserved in Chinese strains of APPV, while the NS5A, Npro and Erns genes are highly variable [6]. All of the Chinese strains can be classified into 3 genotypes (clades) and 5 subgenotypes (subclades) (1.2 and 1.4–1.7) within genotype 1 [6]. The genomic presence of APPV has been detected in pigs from southern China, including Guangdong, Guangxi, Guizhou, Jiangxi, Yunnan, Anhui and other provinces [

Fig. 2
figure 2

Phylogenetic analysis of Chinese APPV strains. Phylogenetic trees based on the nucleotide sequences of the complete polyprotein CDS (A) and the NS5A gene (B) were constructed by the neighbor-joining (NJ) method with 1,000 bootstrap replicates in MEGA11 software. The APPV-SDHY-2022 strain reported in this study is indicated with a red dot

Table 3 Homology analysis of NS5A nucleotide sequence within clades or subclades (%)

Recombination analysis

To further explore the genetic evolution of APPV, potential recombination events were identified using Recombination Detection Program version 4 (RDP4) and then examined using SimPlot version 3.5.1. Among all available APPV strains, 8 strains (GD-DH01-2018, GD-BZ01-2018, JX-JM01-2018A01, GD2, GD-HJ-2017.04, GD-LN-2017.04, GD-CT4, and GD-MH01-2018) had potential genetic recombination events. Although NGS of APPV-SDHY-2022 confirmed recombination events of JX-JM01-2018A01 and GD-HJ-2017.04 by RDP4 (see Additional file 4: Table s4), no obvious genetic recombination in APPV-SDHY-2022 strains was observed by SimPlot software in this study (Fig. 3).

Fig. 3
figure 3

Recombination analysis of the complete genomes of the APPV-SDHY-2022 strain from Shandong Province. Potential recombination events were identified using Recombination Detection Program 4 (RDP4) and then examined using similarity plots and bootstrap analysis in Simplot 3.5.1. The major and minor parents were JX-JM01-2018A01 and GD-HJ-2017.04, respectively

Amino acid sequence analysis

Amino acid sequences of individual viral proteins of all the Chinese APPV strains were analyzed. No amino acid insertions or deletions were found in the APPV-SDHY-2022 strain. The amino acid sequences of the individual proteins were compared to identify those that differentiate Clade II from Clade I and Clade III, and 20 unique amino acids were found in Clade II strains (Fig. 4), among which, most sites were distributed on NS5A(7H,16A,69Q,131Q,152M,189I,280A,397F,437A) and NS5B(77V,139P,193P,231K,274A), and the remaining sites were on Npro (85D,120E), C(90K), Erns(91K,139Y) and NS3(30T). Interestingly, the amino acids at these unique sites were identical between Clade I and Clade III strains, demonstrating that it is possible to determine the type of strain by measuring these specific amino acids alone.

Fig. 4
figure 4

The unique amino acids found in Clade II APPV strains. Amino acid sequences of viral proteins were aligned with reference strains using MEGA11 and BioEdit software

Glycosylation analysis

In this study, putative N-glycosylation sites in the three important glycoproteins, Erns, E1, and E2, in Chinese APPV strains were also predicted. APPV-SDHY-2022, along with most of the strains in Clade II, is heavily glycosylated, with a total of ten N-glycosylation sites (N104 in the E1 protein; N12, N26, N43, N64, and N99 in the Erns protein; N51,N64,N103, and N127 in the E2 protein) (Fig. 5). All the Chinese APPV strains had a conserved putative N-glycosylation site at N104 with a consensus N-I-T motif in the E1 protein. The putative N-glycosylation sites in the Erns and E2 proteins differed greatly among strains in different subclades, and 9 patterns of putative N-glycosylation sites were observed in E2 proteins, including N51 + N64 + N103, N64 + N103, N51 + N64 + N103 + N141, N51 + N64 + N127 + N103 + N141, N51 + N64 + N103 + N127, N64 + N103 + N127, N51 + N127, N51 + N64, N64 (Fig. 5). Among the N-glycosylation sites of E2 proteins, a putative site at N64 was highly conserved.

Fig. 5
figure 5

Putative N-glycosylation sites of Erns, E1 and E2 proteins. The putative N-glycosylation sites within the Erns, E1 and E2 sequences of Chinese APPV strains were predicted according to a glycosylation analysis algorithm, and are shown as a blue shaded box

Antigen prediction

To analyze the effect of glycosylation sites on the antigenicity of the E2 protein, the antigenic index was determined by the Jameson-Wolf method in this study, and the results showed that aa positions at 1 ~ 9, 15 ~ 28, 34 ~ 44, 49 ~ 55, 62 ~ 82, 118 ~ 130, 136 ~ 158, 174 ~ 184, 188 ~ 196 and 200 ~ 205 of the E2 protein were the potential immunodominant regions. A comparison of the antigenic index within Chinese strains with and without a specific putative site showed that the putative N-glycosylation site at N51 had a negative effect on the antigenicity of the corresponding region (Fig. 6).

Fig. 6
figure 6

Antigenicity prediction for the E2 protein. The Jameson-Wolf algorithm, which combines secondary structure information with backbone flexibility to predict surface accessibility, was used to determine the predicted antigenic index, with a threshold value of 1.7. The putative N-glycosylation sites within the E2 sequences of Chinese APPV strains are shown as a blue arrow. Representative strains from different Clades/subclades or patterns of putative N-glycosylation sites were included, and the strains in each subclade with different patterns of putative N-glycosylation sites are underlined

To further analyze the effect of glycosylation sites on conformational epitopes of the E2 protein, BepiPred-3.0 was used to predict B-cell conformational epitopes. The results showed that the 15 most likely B-cell conformational epitope residues varied among different Clades/subclades or patterns of N-glycosylation sites, and 39E, 70R, 173R, 190K, and 191N were conserved residues among all Chinese strains (Table 4) (see also the graphical representations of the predicted epitopes in Fig. 7).

Table 4 Prediction of potential B-cell conformational epitopes from E2 protein sequence
Fig. 7
figure 7

Conformational B-cell epitope prediction for the E2 protein. The potential B-cell conformational epitopes of the E2 protein in APPV Chinese strains were predicted by BepiPred-3.0, and the residues with scores above the threshold (default value is 0.1512) are predicted to be part of an epitope and colored in yellow on the graph (where Y-axes depict BepiPred-3.0 epitope scores and X-axes protein sequence positions). Shown is the graphical output of B-cell discontinuous epitope predictions for the E2 protein with APPV-SDHY-2022 as an example