Abstract
Characterization of live biotherapeutic product (LBP) batches typically includes a measurement of viability, such as colony forming units (CFU). However, strain-specific CFU enumeration assays can be complicated by the presence of multiple organisms in a single product with similar growth requirements. To overcome specific challenges associated with obtaining strain-specific CFU values from multi-strain mixtures, we developed a method combining mass spectrometry-based colony identification with a traditional CFU assay. This method was assessed using defined consortia made from up to eight bacterial strains. Among four replicate batches of an eight-strain mixture, observed values differed from expected values by less than 0.4 log10 CFU among all strains measured (range of differences, -0.318 to + 0.267). The average difference between observed and expected values was + 0.0308 log10 CFU, with 95% limits of agreement from -0.347 to 0.408 (Bland–Altman analysis). To estimate precision, a single batch of eight-strain mixture was assayed in triplicate by three different users, for a total of nine measurements. Pooled standard deviation values ranged from 0.067 to 0.195 log10 CFU for the eight strains measured, and user averages did not differ significantly. Leveraging emerging mass-spectrometry-based colony identification tools, a novel method for simultaneous enumeration and identification of viable bacteria from mixed-strain consortia was developed and tested. This study demonstrates the potential for this approach to generate accurate and consistent measurements of up to eight bacterial strains simultaneously and may provide a flexible platform for future refinements and modifications.
Key points
• Enumeration of live biotherapeutics is essential for product quality and safety.
• Conventional CFU counting may not differentiate between strains in microbial products.
• This approach was developed for direct enumeration of mixed bacterial strains simultaneously.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Live biotherapeutic products (LBPs) are products containing one or more live microorganism(s) as active substances that are intended to be used for the treatment, prevention, or cure of medical conditions (US Food and Drug Administration 2016, Dreher-Lesnick et al. 2017). In the United States, LBPs are categorized as biological products and, therefore, require development of meaningful tests for identity, purity, and potency of specific drug substances and drug products in their final dosage form. It is expected that the therapeutic effects of LBPs will rely on activity of living organisms delivered in the final dosage. As such, assays designed to measure the total viable organisms in the final product formulation (e.g., colony forming units (CFU) per dose) are often included as a measurement of product potency (Dreher-Lesnick et al. 2017, Pot et al. 2021). However, this relatively straight-forward measurement can be complicated by the presence of multiple bacterial strains in an LBP, especially if they share similar cultivation requirements and/or phenotypic traits.
In situations where selective culture is not practical or appropriate, obtaining CFU counts for multiple bacterial strains from a single mixture grown on the same medium requires identification of individual colonies in sufficient numbers to accurately determine viable amounts of each bacterial strain. Identifying hundreds of individual colonies using conventional methods like metabolic testing, colony PCR, or 16S rRNA gene sequencing can be complex, time consuming, and prohibitively expensive (Cook et al. 2003; Lao et al. 2022). However, within the field of laboratory diagnostic testing, several commercially available platforms now provide rapid and accurate microbial identification from direct analysis of colony material using matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) technology. MALDI-TOF MS methods have been developed and implemented for identification of various bacterial, fungal, and archaeal targets isolated from numerous sources including clinical specimens, food and dairy products, and environmental samples (Singhal et al. 2015; Gong et al.
The number of colonies picked (i.e., the sampling depth) varied between 115 and 442 colonies per run, depending on the experiment (Table S3). To ensure the purity of overnight cultures and establish a reference measurement for comparison purposes (expected CFU), the OD-adjusted overnight strain cultures were enumerated simultaneously with the mixtures, and CFU values were adjusted according to the dilution factor of each strain in its respective mixture. At least two colonies from each set of reference plates were also selected for MALDI-TOF MS analysis to confirm species-level identities.
Colony identification via MALDI-TOF MS
A manufacturer-supplied protocol (extended direct transfer method) was used to prepare sampled colonies for MALDI-TOF MS analysis. Briefly, fresh colony material was transferred onto an MBT Biotarget 96 (Bruker Daltonics) target site using a sterile wooden toothpick, overlaid with 1 μL of 70% formic acid (Sigma Aldrich) in HPLC grade water (Sigma Aldrich), and allowed to air dry at room temperature. Once dry, 1 μL of α-Cyano-4-hydroxycinnamic acid (HCCA) matrix solution (10 mg/ml in Bruker Standard Solvent, Sigma Aldrich) was applied and allowed to dry once more before loading sample targets into the MALDI-TOF MS instrument for analysis. The Bruker bacterial test standard (Bruker Daltonics) was included on each Biotarget 96 chip for automated instrument calibration and quality control, per manufacturer recommendations.
Sample mass spectra were acquired on a Bruker MALDI Biotyper (MBT) Smart MALDI-TOF MS instrument (Bruker Daltonics, Billerica, MA, USA) set to detect a molecular mass range of 2 to 20 kDa with a laser frequency of 200 Hz. Bruker flexControl (v 3.4) and MBT Compass (v 4.1) software programs were used for automated instrument control and data acquisition, respectively. Species-level sample identities were determined by the MTB software searching against the Bruker BDAL MSP library database (RUO v. 9). Samples with Biotyper ID scores ≥ 1.70 were accepted for identification, while ID scores < 1.70 were considered unreliable and those samples were excluded from analyses. This cutoff score has been used previously, in conjunction with the formic acid overlay method, for identification of anaerobic bacterial isolates (Hsu and Burnham 2014).
Data analysis
Expected and observed CFU values were calculated manually in Excel (v. 2108) spreadsheet software and log10 transformed prior to analysis. Statistical analyses and data visualization were performed in GraphPad Prism statistical software (v. 9.4.1). Differences of paired log10 CFUs were tested parametrically using a paired t-test, with normality of the differences confirmed using the Anderson–Darling normality test. Correlation and unweighted linear regression (least square) analyses were conducted to examine the linear relationship between observed and expected log10 CFU values, using an extra sum-of-squares F-test to test for significant deviation of slope from the null expectation. Further, agreement analysis was performed using Bland–Altman analysis to calculate the bias and limits of agreement between observed and expected values. A significance threshold of 0.05 was used for all significance testing, and all reported p-values are 2-tailed unless otherwise noted in the text. One replicate data pair of B. intestinalis was removed from the eight-strain mixture analysis due to a technical error leading to loss of the expected CFU value for this data pair. This point was removed from the graphs and all further statistical analysis.
Results
Preliminary testing of four-strain consortia
To gage the ability of this protocol to discriminate between closely related bacteria, a mixture containing four different Bacteroides species (B. thetaiotaomicron, B. finegoldii, B. intestinalis, and B. stercoris) in approximately equal amounts was enumerated in duplicate. Strong concordance was observed when comparing expected CFU values (axenic enumeration) with the observed CFU values from the mixture (Fig. 1A). Based on enumerations of the individual strain cultures, the expected log10 CFU values ranged from 6.34 to 7.06, with an average value of 6.78 ± 0.284 (SD), while observed log10 CFU values ranged from 6.47 to 7.03, with an average of 6.75 ± 0.204 (SD). The average difference between paired log10 CFU values was 0.0376 ± 0.124 (SD) and not significantly different from zero (p = 0.420, paired t-test).
To determine whether broad differences in surface structure of bacterial strains (i.e., Gram reaction) would impact enumeration fidelity, a mixture of four Gram-positive bacteria (Bif. breve, Bif. catenulatum, L. ruminis, and R. gnavus) was evaluated in a similar fashion to the Bacteroides mixture. The expected log10 CFU values for this mixture had a wider range from 5.15 to 6.93, with an average value of 6.19 ± 0.573 (SD), while the observed log10 CFU values similarly ranged from 5.36 to 6.71 with an average of 6.23 ± 0.588 (SD). The average difference between observed and expected log10 CFU values (Fig. 1B) was 0.045 ± 0.262 (SD) and was not statistically significant (p = 0.639, paired t-test). Despite the wider range of CFU concentrations in this mixture, concordance with expected CFU values was similar to that of the first experimental mixture. Notably, colony sampling effort was increased to account for the greater range of concentrations (Table S3).
Testing eight-strain consortium and agreement analysis
For the primary evaluation, an eight-strain consortium was constructed from the component strains of the previous consortia. This mixture consisted of both Gram-positive and Gram-negative organisms representing three distinct bacterial phyla. A total of four batch replicates were tested for assessment of agreement between expected and observed enumeration values for the eight constituent strains. Expected log10 CFU values ranged from 6.03 to 7.44 with an overall average of 6.74 ± 0.346, while observed log10 CFU values ranged from 5.88 to 7.33, with an average value of 6.71 ± 0.357. The average difference between observed and expected log10 CFU values was 0.0306 ± 0.193, and this difference was not statistically significant (p = 0.384, paired t-test). Observed and expected values were strongly correlated (Pearson r = 0.850), and linear a regression analysis found the slope of best fit was not significantly different from 1 (p = 0.228, sum-of-squares F-test; Fig. 2B).
A Bland–Altman analysis was performed to describe overall agreement between observed and expected log10 CFU values for this eight-strain mixture, plotting the differences (observed minus expected) of paired values against their averages (Fig. 2C). The estimated method bias (i.e., the mean of the paired differences between observed and expected values) was 0.0306, with 95% limits of agreement ranging from -0.347 to 0.408. The distribution of paired differences was consistent over the range of average CFU values, and no evidence of proportional bias in the log-transformed data, as the paired differences were normally distributed about the mean throughout the range of average values.
Measurement precision study
To evaluate the repeatability of our experimental measurements, a single batch of eight-strain mixture was made in YCFAC media with 12.5% glycerol (final concentration) and divided into aliquots which were stored at -80 °C. For testing, frozen aliquots were thawed to room temperature under anaerobic conditions before being serially diluted and plated for enumeration and colony identification following the previous mixture enumeration protocol. Three different lab users performed the enumeration protocol three times, with each user replicate performed on a different day, for a total of nine replicate measurements (Fig. 3). All users were trained to run the MALDI-TOF MS for colony identification but had varied levels of experience with the instrument ranging from infrequent or sporadic use to frequent, almost daily use. All eight component strains were recovered and enumerated in each of nine replicate enumeration runs. Average log10 CFU values for the eight stains ranged from 5.26 to 6.57, and pooled standard deviation values ranging from 0.067 to 0.193 (Table 1). There was no significant variation between users (p = 0.775, 2-way ANOVA).
Discussion
The motivation for this study was to develop and test a novel method for direct enumeration of multiple bacterial strains from a defined consortium simultaneously. This protocol uses MALDI-TOF MS-based rapid colony identification to determine the proportional abundances of each constituent strain and multiplying these values by the total CFU present in the sample to determine absolute CFU values. In addition to finding strong positive correlations between the experimental and reference methods (Fig. 2B), no evidence of systematic or proportional biases was found in the Bland–Altman agreement analysis. We further demonstrated the repeatability and consistency of this enumeration method across nine replicate measurements of a single batch of eight-strain mixture, finding only modest differences in standard deviation values for the eight component strains.
Advantages of the approach developed in this study include the relatively low cost per sample and rapid turn-around times of MALDI-TOF MS colony identification methods (Dhiman et al. 2011; Singhal et al. 2015). In our lab, manual preparation of 96-sample MS target chips, including colony picking, deposition, and drying time of formic acid and matrix solutions, took ~ 40 to 60 min, depending on the user’s level of experience. This approach also quantifies viable organisms in terms of CFU, which remains the most widely recognized measure of live microbials (US Food and Drug Administration 2016, US Food and Drug Administration 2018). Potential barriers to implementation could include high start-up and maintenance costs of MALDI-TOF MS instrumentation as well as limitations in available reference libraries (Haider et al. 2023). While multiple studies have reported overall cost savings of MALDI-TOF MS use in routine clinical testing over traditional microbiological methods (Gaillot et al. 2011; Tran et al. 2015), cost effectiveness would likely depend on the rate of use and unit costs of alternative identification methods for the target organisms. While this study utilized bacterial strains with species-level MSPs already present in the Bruker BDAL MSP library, specific reference spectra can be generated for novel strains or species of interest and grouped into customized libraries (Hou et al. 2019). MALDI-TOF MS characterization might also serve as part of the comprehensive strain identity and characterization requirements for microorganisms used as active substances in LBPs (Paquet et al. 2021).
MALDI-TOF MS methods are also potentially amenable to automation through use of robotic colony picking and sample preparation equipment. Automation of sample preparation can reduce the amounts of consumable materials and hands-on time required for sample preparation and may also improve consistency and repeatability of colony identification results (Chudejova et al. 2017; Heestermans et al. 2022). In a previous validation study examining over 500 clinical isolates, Chudejova et al. (2017) found automated sample preparation using the MALDI Colonyst robot improved overall identification scores and reduced error rates among both bacterial and fungal isolates when compared with manual sample deposition methods. While our study utilized only one MALDI-TOF MS instrument for MS-based colony identification, other instruments may perform at comparable levels (Pence et al. 2014; Levesque et al. 2015; Porte et al. 2017; Brown-Elliott et al. 2019; Park et al. 2021).
One limitation of this study is that we chose not to conduct statistical power analyses for determining appropriate sampling depths prior to experimental testing, nor was the sampling depth strictly controlled between experimental replicates (Table S3). Except for the precision study experiments, replicate plates from mixed enumeration studies were sampled to completion rather than to a pre-specified number of colonies. Various considerations would factor into calculation of the appropriate sample sizes needed for an assay of this format, such as the intended purpose of the test (e.g., product release, process control, and stability testing), the specific composition of a microbial consortium or LBP, and the precision and/or accuracy required under the applicable regulatory framework (US Food and Drug Administration 2016, Dreher-Lesnick et al. 2017). Meaningful power or sample size calculations for this method would need to account for multiple potential sources of sampling error related to the reported CFU values and composition of the product being tested.
Despite limitations in scope and scale, this study demonstrates the feasibility of a newly developed method to simultaneously generate accurate and repeatable CFU values for up to eight bacterial strains from a defined consortium. Accurate identification and quantification of living microbial components in food and/or drug products are essential parts of product characterization for dosing optimization, clinical trial design, and general assessments of product safety, potency, and stability. As a proof-of-concept, this study should help inform future efforts to develop accurate and repeatable CFU enumeration protocols related to LBPs, probiotic supplements, and food products containing multiple live microbial strains.