Abstract
Supersaturated designs are used in science and engineering to efficiently explore a large number of factors with a limited number of runs. It is not uncommon in engineering to consider a few, if not all, factors at more than two levels. Multi- and mixed-level supersaturated designs may, therefore, be handy. While the two-level supersaturated designs are widely studied, the literature on multi- and mixed-level designs is still scarce. A recent paper establishes that the group LASSO should be preferred as an analysis method because it can retain the natural group structure of multi- and mixed-level designs. A few optimality criteria for such designs also exist in the literature. These criteria typically aim to find designs that maximize average pairwise orthogonality. However, the literature lacks guidance on the better or ‘right’ optimality criteria from a screening perspective. In addition, the existing optimal designs are often balanced and are rarely available. We propose two new optimality criteria based on the large-sample properties of group LASSO. Our criteria fill the gap in the literature by providing design selection criteria that are directly related to the preferred analysis method. We then construct Pareto-efficient designs on the two new criteria and demonstrate that (a) our optimality criteria can be used to order existing optimal designs on their screening performance, (b) the Pareto-efficient designs are often better than or as good as the existing optimal designs, and (c) the Pareto-efficient designs can be constructed using a coordinate exchange algorithm and are, therefore, available for any choice of the number of runs, factors, and levels. A repository of three- and four-level designs with the number of runs between 8 and 16 is also provided.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11222-023-10354-9/MediaObjects/11222_2023_10354_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11222-023-10354-9/MediaObjects/11222_2023_10354_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11222-023-10354-9/MediaObjects/11222_2023_10354_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11222-023-10354-9/MediaObjects/11222_2023_10354_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11222-023-10354-9/MediaObjects/11222_2023_10354_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11222-023-10354-9/MediaObjects/11222_2023_10354_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11222-023-10354-9/MediaObjects/11222_2023_10354_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11222-023-10354-9/MediaObjects/11222_2023_10354_Fig8_HTML.png)
Similar content being viewed by others
References
Akhtar, Y., Zhang, F., Colbourn, C.J., Stufken, J., Syrotiuk, V.R.: Scalable level-wise screening experiments using locating arrays. J. Qual. Technol. (2023). https://doi.org/10.1080/00224065.2023.2220973
Bach, F.R.: Consistency of the group lasso and multiple kernel learning. J. Mach. Learn. Res. 9(6), 1179–1225 (2008)
Booth, K.H.V., Cox, D.R.: Some systematic supersaturated designs. Technometrics 4(4), 489–495 (1962)
Box, G.E.P.: Discussion of the papers of Satterthwaite and Budne. Technometrics 1(1), 174–180 (1959)
Candès, E.J., Tao, T.: Decoding by linear programming. IEEE Trans. Inf. Theory 51(12), 4203–4215 (2005)
Candès, E.J., Tao, T.: The Dantzig selector: statistical estimation when p is much larger than n. Ann. Stat. 35(6), 2313–2351 (2007)
Cao, Y., Smucker, B.J., Robinson, T.J.: A hybrid elitist pareto-based coordinate exchange algorithm for constructing multi-criteria optimal experimental designs. Stat. Comput. 27(2), 423–437 (2017)
Draguljić, D., Woods, D.C., Dean, A.M., Lewis, S.M., Vine, A.J.E.: Screening strategies in the presence of interactions. Technometrics 56(1), 1–15 (2014)
Eldar, Y.C., Mishali, M.: Robust recovery of signals from a structured union of subspaces. IEEE Trans. Inf. Theory 55(11), 5302–5316 (2009)
Fan, J., Li, R.: Variable selection via non-concave penalized likelihood and its oracle properties. J. Am. Stat. Assoc. 96(456), 1348–1360 (2001)
Fang, K.-T., Lin, D.K., Ma, C.-X.: On the construction of multi-level supersaturated designs. J. Stat. Plan. Inference 86(1), 239–252 (2000)
Fang, K.T., Ge, G., Liu, M.Q.: Uniform supersaturated design and its construction. Sci. China Ser. A Math. 45, 1080–1088 (2002)
Fang, K.T., Lin, D.K.J., Liu, M.Q.: Optimal mixed-level supersaturated design. Metrika 58, 279–291 (2003)
Fang, K.T., Ge, G., Liu, M.Q., Qin, H.: Combinatorial constructions for optimal supersaturated designs. Discret. Math. 279(1–3), 191–202 (2004)
Fang, K.T., Ge, G., Liu, M.Q.: Construction of optimal supersaturated designs by the packing method. Sci. China Ser. A Math. 47, 128–143 (2004)
Georgiou, S., Koukouvinos, C.: Multi-level \(k\)-circulant supersaturated designs. Metrika 64, 209–220 (2006)
Hedayat, A.S., Sloane, N.J.A., Stufken, J.: Orthogonal Arrays: Theory and Applications. Springer Series in Statistics, Springer, New York (1999)
Kauffman, K.J., Dorkin, J.R., Yang, J.H., Heartlein, M.W., DeRosa, F., Mir, F.F., Fenton, O.S., Anderson, D.G.: Optimization of lipid nanoparticle formulations for mRNA delivery in vivo with fractional factorial and definitive screening designs. Nano Lett. 15(11), 7300–7306 (2015)
Lu, X., Hu, W., Zheng, Y.: A systematical procedure in the construction of multi-level supersaturated design. J. Stat. Plan. Inference 115(1), 287–310 (2003)
Phoa, F.K., Pan, Y.H., Xu, H.: Analysis of supersaturated designs via the Dantzig selector. J. Stat. Plan. Inference 139(7), 2362–2372 (2009)
Satterthwaite, F.E.: Random balance experimentation. Technometrics 1(2), 111–137 (1959)
Schoen, E.D., Eendebak, P.T., Nguyen, M.V.: Complete enumeration of pure-level and mixed-level orthogonal arrays. J. Comb. Des. 18(2), 123–140 (2010)
Schoen, E.D., Eendebak, P.T., Vazquez, A.R., Goos, P.: Systematic enumeration of definitive screening designs. Stat. Comput. 32(6), 109 (2022)
Simon, N., Tibshirani, R.: Standardization and the group LASSO penalty. Stat. Sin. 22(3), 983 (2012)
Singh, R.: Best practices for multi- and mixed-level supersaturated designs. J. Qual. Technol. (2023). https://doi.org/10.1080/00224065.2023.2259022
Singh, R., Stufken, J.: Selection of two-level supersaturated designs for main effects models. Technometrics 65(1), 96–104 (2023). https://doi.org/10.1080/00401706.2022.2102080
Stallrich, J., Young, K., Weese, M., Smucker, B., Edwards, D.: Optimal supersaturated designs for lasso sign recovery (2023)
Sun, F., Lin, D.K.J., Liu, M.Q.: On construction of optimal mixed-level supersaturated designs. Ann. Stat. 39(2), 1310–1333 (2011)
Tai, M., Ly, A., Leung, I., Nayar, G.: Efficient high-throughput biological process characterization: definitive screening design with the Ambr250 bioreactor system. Biotechnol. Prog. 31(5), 1388–1395 (2015)
Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 58(1), 267–288 (1996)
Wainwright, M.J.: High-Dimensional Statistics: A Non-asymptotic Viewpoint, vol. 48. Cambridge university Press, Cambridge (2019)
Weese, M.L., Stallrich, J.W., Smucker, B.J., Edwards, D.J.: Strategies for supersaturated screening: group orthogonal and constrained Var(\(s\)) designs. Technometrics 63(4), 443–455 (2022)
Xu, H., Wu, C.F.J.: Construction of optimal multi-level supersaturated designs. Ann. Stat. 33(6), 2811–2836 (2005)
Yamada, S., Lin, D.K.J.: Three-level supersaturated designs. Stat. Probab. Lett. 45(1), 31–39 (1999)
Yamada, S., Ikebe, Y.T., Hashiguchi, H., Niki, N.: Construction of three-level supersaturated design. J. Stat. Plan. Inference 81(1), 183–193 (1999)
Yang, Y., Zou, H., Bhatnagar, S.: Gglasso: Group Lasso penalized learning using a unified BMD algorithm (2020). R package version 1.5. https://CRAN.R-project.org/package=gglasso
Yuan, M., Lin, Y.: Model selection and estimation in regression with grouped variables. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 68(1), 49–67 (2006)
Zhao, P., Yu, B.: On model selection consistency of Lasso. J. Mach. Learn. Res. 7, 2541–2563 (2006)
Acknowledgements
The authors were partially supported by the AMS–Simons Travel Grant 2023.
Author information
Authors and Affiliations
Contributions
All authors contributed to coneptualization, analysis, and writing of this manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Appendix A The EPCEA algorithm
Appendix A The EPCEA algorithm
The modified EPCEA algorithm is provided in Algorithm 2. But first, we explain some notations and describe an algorithm that will be used in Algorithm 2. Also, note that every time the coordinate exchange operator is invoked, it makes all possible coordinate exchanges. In particular it adds +1 (mod \(v_j\)), +2 (mod \(v_j\)),..., + \((v_j-1)\) (mod \(v_j\)) to the original level in the jth factor. Recall that the values of our criteria in Sect. 3 are dependent on the chosen sets from all possible \({m \atopwithdelims ()k}\) sets. Let u be the maximum number of sets that are computationally feasible. We define \(S_k = \min (u, {m \atopwithdelims ()k})\) to denote the sets to be used for computing criteria values for k. Define \(S^u = (S_2, \ldots ,S_{\lceil n/3 \rceil })\) for the collection of sets used.
For the results in this paper, we set \(u=1000\), \(ndes = 2\), \(ntry = 500\), and the two starting designs are randomly created.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Singh, R. Pareto-efficient designs for multi- and mixed-level supersaturated designs. Stat Comput 34, 38 (2024). https://doi.org/10.1007/s11222-023-10354-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11222-023-10354-9