Log in

Pareto-efficient designs for multi- and mixed-level supersaturated designs

  • Original Paper
  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

Supersaturated designs are used in science and engineering to efficiently explore a large number of factors with a limited number of runs. It is not uncommon in engineering to consider a few, if not all, factors at more than two levels. Multi- and mixed-level supersaturated designs may, therefore, be handy. While the two-level supersaturated designs are widely studied, the literature on multi- and mixed-level designs is still scarce. A recent paper establishes that the group LASSO should be preferred as an analysis method because it can retain the natural group structure of multi- and mixed-level designs. A few optimality criteria for such designs also exist in the literature. These criteria typically aim to find designs that maximize average pairwise orthogonality. However, the literature lacks guidance on the better or ‘right’ optimality criteria from a screening perspective. In addition, the existing optimal designs are often balanced and are rarely available. We propose two new optimality criteria based on the large-sample properties of group LASSO. Our criteria fill the gap in the literature by providing design selection criteria that are directly related to the preferred analysis method. We then construct Pareto-efficient designs on the two new criteria and demonstrate that (a) our optimality criteria can be used to order existing optimal designs on their screening performance, (b) the Pareto-efficient designs are often better than or as good as the existing optimal designs, and (c) the Pareto-efficient designs can be constructed using a coordinate exchange algorithm and are, therefore, available for any choice of the number of runs, factors, and levels. A repository of three- and four-level designs with the number of runs between 8 and 16 is also provided.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Akhtar, Y., Zhang, F., Colbourn, C.J., Stufken, J., Syrotiuk, V.R.: Scalable level-wise screening experiments using locating arrays. J. Qual. Technol. (2023). https://doi.org/10.1080/00224065.2023.2220973

    Article  Google Scholar 

  • Bach, F.R.: Consistency of the group lasso and multiple kernel learning. J. Mach. Learn. Res. 9(6), 1179–1225 (2008)

    MathSciNet  Google Scholar 

  • Booth, K.H.V., Cox, D.R.: Some systematic supersaturated designs. Technometrics 4(4), 489–495 (1962)

    Article  MathSciNet  Google Scholar 

  • Box, G.E.P.: Discussion of the papers of Satterthwaite and Budne. Technometrics 1(1), 174–180 (1959)

    Google Scholar 

  • Candès, E.J., Tao, T.: Decoding by linear programming. IEEE Trans. Inf. Theory 51(12), 4203–4215 (2005)

    Article  MathSciNet  Google Scholar 

  • Candès, E.J., Tao, T.: The Dantzig selector: statistical estimation when p is much larger than n. Ann. Stat. 35(6), 2313–2351 (2007)

    MathSciNet  Google Scholar 

  • Cao, Y., Smucker, B.J., Robinson, T.J.: A hybrid elitist pareto-based coordinate exchange algorithm for constructing multi-criteria optimal experimental designs. Stat. Comput. 27(2), 423–437 (2017)

    Article  MathSciNet  Google Scholar 

  • Draguljić, D., Woods, D.C., Dean, A.M., Lewis, S.M., Vine, A.J.E.: Screening strategies in the presence of interactions. Technometrics 56(1), 1–15 (2014)

    Article  MathSciNet  Google Scholar 

  • Eldar, Y.C., Mishali, M.: Robust recovery of signals from a structured union of subspaces. IEEE Trans. Inf. Theory 55(11), 5302–5316 (2009)

    Article  MathSciNet  Google Scholar 

  • Fan, J., Li, R.: Variable selection via non-concave penalized likelihood and its oracle properties. J. Am. Stat. Assoc. 96(456), 1348–1360 (2001)

    Article  Google Scholar 

  • Fang, K.-T., Lin, D.K., Ma, C.-X.: On the construction of multi-level supersaturated designs. J. Stat. Plan. Inference 86(1), 239–252 (2000)

    Article  MathSciNet  Google Scholar 

  • Fang, K.T., Ge, G., Liu, M.Q.: Uniform supersaturated design and its construction. Sci. China Ser. A Math. 45, 1080–1088 (2002)

    Article  MathSciNet  Google Scholar 

  • Fang, K.T., Lin, D.K.J., Liu, M.Q.: Optimal mixed-level supersaturated design. Metrika 58, 279–291 (2003)

    Article  MathSciNet  Google Scholar 

  • Fang, K.T., Ge, G., Liu, M.Q., Qin, H.: Combinatorial constructions for optimal supersaturated designs. Discret. Math. 279(1–3), 191–202 (2004)

    Article  MathSciNet  Google Scholar 

  • Fang, K.T., Ge, G., Liu, M.Q.: Construction of optimal supersaturated designs by the packing method. Sci. China Ser. A Math. 47, 128–143 (2004)

    Article  MathSciNet  Google Scholar 

  • Georgiou, S., Koukouvinos, C.: Multi-level \(k\)-circulant supersaturated designs. Metrika 64, 209–220 (2006)

    Article  MathSciNet  Google Scholar 

  • Hedayat, A.S., Sloane, N.J.A., Stufken, J.: Orthogonal Arrays: Theory and Applications. Springer Series in Statistics, Springer, New York (1999)

    Book  Google Scholar 

  • Kauffman, K.J., Dorkin, J.R., Yang, J.H., Heartlein, M.W., DeRosa, F., Mir, F.F., Fenton, O.S., Anderson, D.G.: Optimization of lipid nanoparticle formulations for mRNA delivery in vivo with fractional factorial and definitive screening designs. Nano Lett. 15(11), 7300–7306 (2015)

    Article  Google Scholar 

  • Lu, X., Hu, W., Zheng, Y.: A systematical procedure in the construction of multi-level supersaturated design. J. Stat. Plan. Inference 115(1), 287–310 (2003)

    Article  MathSciNet  Google Scholar 

  • Phoa, F.K., Pan, Y.H., Xu, H.: Analysis of supersaturated designs via the Dantzig selector. J. Stat. Plan. Inference 139(7), 2362–2372 (2009)

    Article  MathSciNet  Google Scholar 

  • Satterthwaite, F.E.: Random balance experimentation. Technometrics 1(2), 111–137 (1959)

    Article  MathSciNet  Google Scholar 

  • Schoen, E.D., Eendebak, P.T., Nguyen, M.V.: Complete enumeration of pure-level and mixed-level orthogonal arrays. J. Comb. Des. 18(2), 123–140 (2010)

    Article  MathSciNet  Google Scholar 

  • Schoen, E.D., Eendebak, P.T., Vazquez, A.R., Goos, P.: Systematic enumeration of definitive screening designs. Stat. Comput. 32(6), 109 (2022)

    Article  MathSciNet  Google Scholar 

  • Simon, N., Tibshirani, R.: Standardization and the group LASSO penalty. Stat. Sin. 22(3), 983 (2012)

    Article  MathSciNet  Google Scholar 

  • Singh, R.: Best practices for multi- and mixed-level supersaturated designs. J. Qual. Technol. (2023). https://doi.org/10.1080/00224065.2023.2259022

    Article  Google Scholar 

  • Singh, R., Stufken, J.: Selection of two-level supersaturated designs for main effects models. Technometrics 65(1), 96–104 (2023). https://doi.org/10.1080/00401706.2022.2102080

    Article  MathSciNet  Google Scholar 

  • Stallrich, J., Young, K., Weese, M., Smucker, B., Edwards, D.: Optimal supersaturated designs for lasso sign recovery (2023)

  • Sun, F., Lin, D.K.J., Liu, M.Q.: On construction of optimal mixed-level supersaturated designs. Ann. Stat. 39(2), 1310–1333 (2011)

  • Tai, M., Ly, A., Leung, I., Nayar, G.: Efficient high-throughput biological process characterization: definitive screening design with the Ambr250 bioreactor system. Biotechnol. Prog. 31(5), 1388–1395 (2015)

    Article  Google Scholar 

  • Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 58(1), 267–288 (1996)

    MathSciNet  Google Scholar 

  • Wainwright, M.J.: High-Dimensional Statistics: A Non-asymptotic Viewpoint, vol. 48. Cambridge university Press, Cambridge (2019)

  • Weese, M.L., Stallrich, J.W., Smucker, B.J., Edwards, D.J.: Strategies for supersaturated screening: group orthogonal and constrained Var(\(s\)) designs. Technometrics 63(4), 443–455 (2022)

    Article  MathSciNet  Google Scholar 

  • Xu, H., Wu, C.F.J.: Construction of optimal multi-level supersaturated designs. Ann. Stat. 33(6), 2811–2836 (2005)

    Article  MathSciNet  Google Scholar 

  • Yamada, S., Lin, D.K.J.: Three-level supersaturated designs. Stat. Probab. Lett. 45(1), 31–39 (1999)

    Article  MathSciNet  Google Scholar 

  • Yamada, S., Ikebe, Y.T., Hashiguchi, H., Niki, N.: Construction of three-level supersaturated design. J. Stat. Plan. Inference 81(1), 183–193 (1999)

    Article  MathSciNet  Google Scholar 

  • Yang, Y., Zou, H., Bhatnagar, S.: Gglasso: Group Lasso penalized learning using a unified BMD algorithm (2020). R package version 1.5. https://CRAN.R-project.org/package=gglasso

  • Yuan, M., Lin, Y.: Model selection and estimation in regression with grouped variables. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 68(1), 49–67 (2006)

    Article  MathSciNet  Google Scholar 

  • Zhao, P., Yu, B.: On model selection consistency of Lasso. J. Mach. Learn. Res. 7, 2541–2563 (2006)

    MathSciNet  Google Scholar 

Download references

Acknowledgements

The authors were partially supported by the AMS–Simons Travel Grant 2023.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to coneptualization, analysis, and writing of this manuscript.

Corresponding author

Correspondence to Rakhi Singh.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Appendix A The EPCEA algorithm

Appendix A The EPCEA algorithm

The modified EPCEA algorithm is provided in Algorithm 2. But first, we explain some notations and describe an algorithm that will be used in Algorithm 2. Also, note that every time the coordinate exchange operator is invoked, it makes all possible coordinate exchanges. In particular it adds +1 (mod \(v_j\)), +2 (mod \(v_j\)),..., + \((v_j-1)\) (mod \(v_j\)) to the original level in the jth factor. Recall that the values of our criteria in Sect. 3 are dependent on the chosen sets from all possible \({m \atopwithdelims ()k}\) sets. Let u be the maximum number of sets that are computationally feasible. We define \(S_k = \min (u, {m \atopwithdelims ()k})\) to denote the sets to be used for computing criteria values for k. Define \(S^u = (S_2, \ldots ,S_{\lceil n/3 \rceil })\) for the collection of sets used.

Algorithm 1
figure a

Steps of EPCEA

Algorithm 2
figure b

Modified EPCEA

For the results in this paper, we set \(u=1000\), \(ndes = 2\), \(ntry = 500\), and the two starting designs are randomly created.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Singh, R. Pareto-efficient designs for multi- and mixed-level supersaturated designs. Stat Comput 34, 38 (2024). https://doi.org/10.1007/s11222-023-10354-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11222-023-10354-9

Keywords

Navigation