Introduction

Sonography has become an essential component of the diagnostic evaluation for pediatric appendicitis because it is noninvasive and does not expose the patient to ionizing radiation. When the entire appendix is visualized, primary factors intrinsic to the appendix itself such as compressibility, appendiceal wall thickness and appendiceal diameter are incorporated into the diagnostic interpretation of appendicitis. However, the frequency of incomplete and/or non-visualization of the appendix varies widely depending on operator experience and patient factors, and is sometimes as high as 60% [1,2,3,4]. Clinical scoring systems and laboratory data, such as white blood cell (WBC) count or absolute neutrophil count (ANC), can also be integrated into decision making, but these confer limited diagnostic yield [5]. Occasionally, the sonographic diagnosis of appendicitis remains equivocal despite visualization of an appendix. Secondary sonographic findings such as free fluid, echogenic fat, regional hyperemia or an obstructing or extruded appendicolith may assist in the diagnosis [6,7,8,9]. Similarly, the absence of these secondary sonographic findings may be sufficient to exclude the diagnosis of appendicitis [10,11,12]. Structured ultrasound (US) evaluation for secondary findings, in combination with sonographer training aimed at searching for these findings, has been shown to provide more meaningful US interpretations [13], and standardized reporting has been shown to decrease reliance on computed tomography (CT) as well as admissions for observation [14, 15]. When the appendix is visualized on US, but appendicitis cannot be confirmed, this may lead to additional cross-sectional imaging, transfer to centers with available pediatric surgeons, or hospitalization for clinical monitoring. Because US is the primary imaging modality in the evaluation of pediatric appendicitis, it is important to optimize its application through the awareness and detection of secondary signs.

Many studies have evaluated the addition of clinical judgment, laboratory values or combinations of secondary US findings in the diagnostic evaluation for appendicitis. To our knowledge, there are no pediatric studies that have evaluated the effect of age on the test performance of secondary US findings of appendicitis. Younger children tend to have less reliable signs and symptoms and present at later stages as evidenced by a higher frequency of perforation [16]. Therefore, more advanced disease may allow for greater reliance on secondary sonographic characteristics with or without visualization of an appendix.

Our goal was to determine the age-related test characteristics and prevalence of secondary sonographic findings in the diagnosis of pediatric appendicitis. We hypothesized that secondary sonographic findings may be more diagnostically important in evaluating younger children with suspected appendicitis. We sought to evaluate the test characteristics of secondary sonographic findings overall as well as in cases when the appendix was not visualized or was incompletely seen.

Materials and methods

Study design and setting

We performed a retrospective review of medical records of pediatric patients younger than 19 years of age who presented to a single urban academic pediatric emergency department between November 2015 and October 2018 and who had US ordered to evaluate suspected appendicitis. In general, patients are evaluated by utilizing the institutional clinical pathway for appendicitis, which involves a screening of complete blood cell count and calculation of a Pediatric Appendicitis Score (PAS). For patients with a PAS greater than or equal to four, the pathway suggests a sonogram as the initial diagnostic modality, although this is ultimately left up to clinician discretion [17]. The emergency department has approximately 60,000 patient visits annually. The institutional review board of our center approved this study.

Study population

Participants were identified by a previously developed radiology database implemented within the department of radiology in the beginning of November 2015, with the goal of standardizing the terminology used to describe sonographer observations when performing sonograms for suspected appendicitis. All patients who had US ordered for evaluation of appendicitis between Nov. 1, 2015, and Oct. 31, 2018, in the emergency department were included in the study. The questionnaire was completed by the sonographer performing the US at the time of the exam. All sonograms were performed under the supervision of an attending radiologist, though the attending did not necessarily personally scan the patient. The questionnaire was available to, but was not systematically reviewed or edited by, the faculty radiologist after sonography completion. A majority of sonograms (about 70%) were scanned by five exclusively pediatric sonographers, highly trained by pediatric radiologists with a special interest in sonography. This group of sonographers had an experience range of between 3 and 15 years, with a median of 8.8 years.

Patients 19 years of age and older and patients with a history of relevant abdominal surgery (such as appendectomy or ventriculoperitoneal shunt placement) were excluded from study participation. In addition, four patients were excluded as the final outcome of appendicitis could not be confirmed.

Methods

Data were abstracted electronically from the radiology data tool and uploaded to a Research Electronic Data Capture (REDCap) database. The following data elements were abstracted: demographics (age and sex), date of visit, sonography data (exam date, sonographer name, quality of sonographic window) and primary US findings including visualization of the appendix, the appendiceal tip, maximal appendiceal diameter and compressibility. Visualization of the appendix was defined as visualization of part or all of the appendix or a candidate appendix (a blind-ending tubular structure in the expected anatomical region) as deemed by the interpreting radiologist (Figs. 1 and 2). Templated descriptors mutually agreed upon between the Department of Radiology, the Department of Surgery and the Division of Emergency Medicine before the creation of the radiology data tool were used to standardize the description of positive, negative and equivocal studies.

Fig. 1
figure 1

Transverse ultrasound image shows a normal appendix (cursors) in a 5-year-old girl

Fig. 2
figure 2

Normal appendix in a 16-year-old boy. Longitudinal ultrasound image shows a blind-ending appendiceal tip (arrowheads) with free fluid (asterisk)

Recorded secondary findings of appendicitis included free fluid, regional hyperemia, appendicolith and echogenic fat (Figs. 3, 4, 5 and 6). Free fluid was defined as any volume of fluid in the pelvis, echo-free or echogenic, outside of the bowel lumen, either adjacent to or remote from the right lower quadrant/traditional appendix location. Hyperemia was defined as the subjective appearance of hypervascularity as compared to adjacent muscle (typically the psoas muscle or anterior abdominal wall musculature) when a color/power Doppler window is applied and gain/sensitivity settings are optimized. Appendicolith was defined as an echogenic structure, casting a posterior acoustic shadow, with a convex, near-field margin, either within or outside the lumen of the candidate appendix. Echogenic fat was defined as a complete rind of homogeneously bright soft tissue, without a gap, surrounding the perimeter of the candidate appendix, or as subjectively increased echogenicity of the right lower quadrant as compared to left lower quadrant fat. For each element, the findings were recorded as present, absent or maybe. Free text comments could be added. For the purposes of this analysis, if the secondary finding was recorded as maybe or no comment, it was assumed to be present or absent, respectively. Additional variables such as ethnicity, race, weight, height (when available), duration of symptoms (hours), WBC count, ANC, US imaging impression, additional cross-sectional imaging impressions (when obtained) and position of the appendix (retrocecal/not retrocecal) were manually extracted by record review.

Fig. 3
figure 3

Acute appendicitis in a 10-year-old boy. Longitudinal ultrasound image shows free fluid (FF, asterisk) adjacent to a loop of fluid-filled bowel

Fig. 4
figure 4

Acute appendicitis in an 18-year-old woman. Transverse ultrasound image shows the appendix (arrowheads) surrounded by prominent echogenic fat (EF)

Fig. 5
figure 5

Acute appendicitis in an 18-year-old woman. Longitudinal power Doppler ultrasound image shows the appendix (arrowheads) with hyperemia of the appendiceal wall and the surrounding echogenic fat

Fig. 6
figure 6

Acute appendicitis in a 15-year-old girl. Longitudinal ultrasound image shows an inflamed appendiceal tip (solid arrowheads) with an appendicolith (arrow) casting a sharp acoustic shadow (open arrowhead)

Patient management variables such as operative versus nonoperative management were also obtained. If the patient underwent an operation, pathology was reviewed and the case was divided into the following categories: (1) no appendicitis, (2) mild/early/tip/mucosal infiltrate only, (3) moderate/full thickness inflammation or (4) severe/gangrene/perforation/necrosis. If the patient did not have an operation, the case was delineated as follows: (1) no appendicitis, discharged from the emergency department; (2) no appendicitis, if hospitalized for serial exams and discharged in improved condition or with an alternative diagnosis; or (3) imaged-confirmed appendicitis, hospitalized, antibiotics only, discharged. If the patient was discharged from the emergency department, the institution’s electronic medical record was reviewed for any follow-up visits within 14 days of the initial US. Although every patient received a follow-up chart review, if no additional outpatient visits or emergency department visits were noted in the chart, this was recorded as unconfirmed follow-up.

Outcomes

Our primary outcome was a diagnosis of appendicitis (either based on surgical pathology or perforation or abscess based on US or additional cross-sectional imaging). Perforation was classified by either (1) a well-defined fluid collection in the region of the appendix on US or additional cross-sectional imaging or (2) visualized perforation during the operation or on gross pathology. The diagnosis was further classified into uncomplicated versus complicated appendicitis (abscess at the time of diagnosis or within 14 days, prolonged hospitalization >2 calendar days at the initial visit or readmission within 14 days, parenteral antibiotics >2 days in the hospital or at home). Our primary analysis of interest was the test performance of secondary sonographic findings to detect the final diagnosis of appendicitis, overall and stratified by age. The age groups (<6 years, 6 to <11 years and 11 to <19 years) were determined a priori to attempt to primarily divide into common developmental categories (i.e. young child, school-age, adolescence), which have been previously described in the literature given age-dependent clinical presentations of appendicitis [18, 19], and secondarily to maximize statistical power with relatively similar sample sizes between groups.

Data analysis

Since this was a retrospective review of a previously collected data instrument, we aimed to include the entire population of patients during the study period to provide maximal precision. Descriptive statistics were used to characterize the demographic and clinical features of the study population. Test characteristics with 95% confidence intervals (CIs) were calculated for primary and secondary US findings, including sensitivity, specificity, negative predictive value, positive predictive value, overall accuracy and the area under the receiver operating characteristic curve, both in the total sample and within strata of age group. Risk difference calculations with 95% CIs were performed to determine the significance to compare test characteristics between age groups. A subgroup analysis was also conducted to evaluate the test characteristics of secondary US findings among the patients in whom the appendix was described as not visualized. There are many definitions of equivocal or nondiagnostic US in the literature [6, 20]; however, non-visualization of the appendix was believed to allow for the analysis of the impact of secondary sonographic findings in the absence of any primary findings. All statistical analyses were performed utilizing STATA (version 14.1; StataCorp, College Station, TX) and SPSS (version 23; IBM, Armonk, NY).

Results

Characteristics of study subjects

During the study period, 1,219 patients underwent a sonogram for appendicitis that included tabulation of discrete findings by a pediatric sonographer on an electronic radiology questionnaire. Fifty-one patients were excluded due to age and 17 patients were excluded due to a history of relevant abdominal surgery. Four patients with presumed uncomplicated appendicitis did not have operative care or imaging evidence of perforation so they were excluded due the uncertainty surrounding an official diagnosis per the study definition of appendicitis, leaving a total of 1,147 patients in the study. Of these patients, only 50/1,147 patients (4.4%) had a no comment designation with regard to compressibility (when the appendix was visualized) or the presence or absence of an appendicolith. The majority of patients were ages 11 to younger than 19 years (48.6% [558/1,147]). Of the 1,147 patients, 357 (31.1%) were referred from an outside facility, though all, including the 59.4% (212/357) who had previous sonograms, had onsite sonograms once in our emergency department. The summary statistics of this population are presented in Table 1.

Table 1 Patient characteristics (n=1,147)

Main results

Prevalence and test characteristics of primary sonographic findings are presented in Online Supplementary Material 1. The appendix was visualized completely or partially in 82.4% (945/1,147) of cases. Twenty-five pediatric sonographers performed at least one US examination (mean: 46, range: 1,322) with five sonographers who tended to work evening and night shifts performing 68% (781/1,147) of the scans.

Of the 1,147 patients included in the study, 431 (37.6%) had an ultimate diagnosis of appendicitis, of which 131 (30.4%) were subclassified as complicated appendicitis (Table 2). Cases of complicated appendicitis were much more common in the youngest age group (64.1% [25/39]) than in the middle age group (35.5% [61/172]) or the oldest age group (20.5% [45/220]) (P<0.001 and P<0.001, respectively). The overall frequency of negative appendectomy was 5.0% (20/400).

Table 2 Final patient outcome

Secondary sonographic findings within the study population are presented in Table 3. Free fluid was most commonly seen in 63.4% (727/1,147) of cases and an appendicolith was least commonly seen in 18.0% (206/1,147) of cases. Echogenic fat had the highest test performance overall (sensitivity=94.2%, specificity=91.5%) and across all age groups. Free fluid had the lowest test performance overall (sensitivity=74.0%, specificity=43.0%) and across all age groups. Observation of an appendicolith was more predictive of appendicitis in the youngest age group (<6 years) when compared to the middle age group (6 to <11 years, P<0.001) and to the oldest age group (11 to <19 years; P<0.001). Free fluid was more predictive of appendicitis in the middle age group versus the oldest age group (P=0.023). Test performance of the secondary sonographic characteristics among those without perforation is available in Online Supplementary Material 2. Overall, in the study population, there were no statistically significant differences in the age-stratified test performance of the individual secondary findings for a diagnosis of pediatric appendicitis except in the scenarios described above.

Table 3 Age-stratified test characteristics of secondary sonographic findings in the prediction of appendicitis

Visualization rates of the appendix did not vary significantly between age groups. Within the subset of 202 cases where the appendix was described as not visualized on US (Table 4), there were only 16 cases of appendicitis. The prevalence of appendicitis within this subset (7.9%) was lower than the overall prevalence of appendicitis within the study group (37.6%). Free fluid was the most sensitive (93.8% [95% CI: 69.8, 99.8]) and appendicolith was the most specific (97.9% [95% CI: 94.6, 99.4]) predictor for appendicitis in this group. Of the 78 cases with absence of secondary sonographic findings, none was diagnosed with appendicitis. Thirteen of the 16 cases of appendicitis had 2 or more secondary sonographic findings of appendicitis, and as the number of secondary findings increased, the likelihood of appendicitis increased (Online Supplementary Material 3).

Table 4 Test characteristics of secondary sonographic findings in the prediction of appendicitis when the appendix is described as not visualized

Discussion

Our study investigated the age-stratified test performance of secondary sonographic findings in the prediction of pediatric appendicitis. The prevalence of secondary findings did not vary significantly between the different age groups. Echogenic right lower quadrant fat was the most predictive secondary sonographic finding for appendicitis across all age groups, which confirms similar findings in other studies of pediatric [21,22,23] and adult appendicitis [24]. The sonographic visualization of an appendicolith was more accurate for the diagnosis of appendicitis in the youngest age group as compared to the older two age groups. This may bear some relation to the higher frequency of complicated appendicitis (64.1% [25/39]) in this age group, which has also been previously reported in the literature [16, 25,26,27].

The appendix was partially or completely visualized in 82.4% (945/1,147) of cases, with similar visualization frequencies across all age groups (81.2% [453/558] to 84.2% [351/417]). Although non-visualization of the appendix is not the only criterion for nondiagnostic US, we evaluated this particular subset of patients to analyze the impact of secondary sonographic findings in the absence of primary findings such as appendiceal diameter or compressibility. Seventy-eight of the 202 non-visualized cases had no secondary sonographic findings and were not diagnosed with appendicitis, suggesting that, in the absence of secondary findings, one could potentially rule out appendicitis when the appendix is not visualized. Similarly, in a recent study by Wiersma et al. [10], of the 20% (41/212) of study patients with non-visualization of the appendix and no evidence of secondary signs, none developed acute appendicitis. In the remaining non-visualized cases, appendicolith, echogenic fat or hyperemia were highly specific; therefore, if the pretest probability of appendicitis is high based on other historical, physical and laboratory findings, these findings may push one to pursue additional cross-sectional imaging. Nevertheless, the sample of non-visualized cases is small and the study is likely underpowered to make these conclusions.

Structured US evaluation for appendicitis and associated secondary sonographic findings as well as standardized reporting of these findings have previously led to improved visualization rates and less utilization of additional cross sectional imaging [13,14,15]. Our study had a visualization rate that was relatively high as compared to rates reported in the literature [1, 4]. This study was performed at an academic institution with sonographers and radiologists familiar with pediatric patients and secondary sonographic findings, so further study should be directed at dissemination of structured US evaluation and recognition of these findings to referring facilities.

Our study has several limitations. First, although the data were prospectively collected during each US examination, this is a retrospective study. Therefore, we are limited to the data and the images collected on the initial US studies. Although a standardized radiology tool for describing US findings was used, we did not review the images from each US to ensure standardized application of these descriptors. This was a sonographer-generated database, and although the questionnaire was available in the electronic medical record, it was not systematically edited or reviewed. While about 70% of the sonograms were performed by 5 dedicated pediatric sonographers with a median of 8.8 years of experience at our facility, it is possible the radiologists’ interpretation of secondary findings may have differed from the sonographers. Although this is acknowledged as a limitation, it increasingly reflects the workflow of busy imaging departments where direct radiologist scanning of all patients with concern for appendicitis is not routine.

Second, for purposes of analysis of secondary findings, we assumed that maybe equaled present and no comment equaled absent (which could indicate no evaluation was performed or the finding was not seen). This could potentially under- or overestimate the effect of each secondary finding; however, a designation of no comment never exceeded more than 2.1% (24/1,147) of the study population for each secondary finding.

Third, the population is highly subject to selection bias. Out of the study population, 31.1% (357/1,147) were referred into the study site due to concern for appendicitis, and 59.4% (212/357) of those referred patients had a previous abdominal US. Given that the pretest probability of appendicitis is higher in this subset of patients, the sonographers and radiologists may have been inclined to spend more time attempting to visualize the appendix and/or secondary findings. However, the sonographers were not necessarily aware of historical details, perhaps minimizing this potential bias.

Fourth, we were limited to our institutional chart review for follow-up purposes. Although every patient had a follow-up chart review, 28.6% (205/716) of patients had no additional outpatient or emergency department visits within our institution and were assumed to not have appendicitis within 14 days. It is possible that a limited number of these patients returned to a different institution and were diagnosed with appendicitis.

Finally, this is a single-center study staffed with pediatric sonographers and pediatric radiologists available 24 h a day, which is not universal at many institutions. The visualization frequencies of the appendix (82.4% [945/1,147] in our study population) and secondary findings are much higher than those reported at non-pediatric institutions and community hospitals. Nevertheless, in the literature, appendix visualization at some institutions is reported as high as 91.7% [28]. Furthermore, 5 of 25 sonographers (the evening and night shift team) performed 68% (781/1,147) of the right lower quadrant scans, further skewing the institution-specific bias. Therefore, the impact of these secondary findings on the ultimate diagnosis of pediatric appendicitis may not be easily extrapolated to other settings.

Conclusion

Secondary sonographic findings are valuable factors in the diagnosis of appendicitis in pediatric patients, and appear to be strong predictors even in the absence of direct visualization of the appendix. In the majority of secondary sonographic findings, there is no statistically significant difference in the age-stratified test performance for pediatric appendicitis. However, the observation of an appendicolith is a more accurate predictor of appendicitis in the youngest of patients.