Log in

Data-driven construction of Convex Region Surrogate models

  • Published:
Optimization and Engineering Aims and scope Submit manuscript

Abstract

With the increasing trend of solving more complex and integrated optimization problems, there is a need for develo** process models that are sufficiently accurate as well as computationally efficient. In this work, we develop an algorithm for the data-driven construction of a type of surrogate model that can be formulated as a set of mixed-integer linear constraints, yet still provide good approximations of nonlinearities and nonconvexities. In such a surrogate model, which we refer to as Convex Region Surrogate (CRS), the feasible region is given by the union of convex regions in the form of polytopes, and for each region, the corresponding cost function can be approximated by a linear function. The general problem is as follows: given a set of data points in the parameter space and a scalar cost value associated with each data point, find a CRS model that approximates the feasible region and cost function indicated by the given data points. We present a two-phase algorithm to solve this problem and demonstrate its effectiveness with an extensive computational study as well as a real-world case study.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25
Fig. 26
Fig. 27

Similar content being viewed by others

Notes

  1. MATLAB version R2012a (7.14.0.739), The Mathworks Inc.

  2. GAMS version 24.2.1, GAMS Development Corporation.

References

  • Barber CB, Dobkin DP, Huhdanpaa H (1996) The quickhull algorithm for convex hulls. ACM Trans Math Softw 22(4):469–483

    Article  MathSciNet  MATH  Google Scholar 

  • Biegler LT, Lang Yd, Lin W (2014) Multi-scale optimization for process systems engineering. Comput Chem Eng 60:17–30

    Article  Google Scholar 

  • Chung PS, Jhon MS, Biegler LT (2011) Chapter 2—the holistic strategy in multi-scale modeling. Adv Chem Eng 48:59–118

    Article  Google Scholar 

  • Cozad A, Sahinidis NV, Miller DC (2014) Learning surrogate models for simulation-based optimization. AIChE J 60(6):2211–2227

    Article  Google Scholar 

  • Crowder H, Johnson EL, Padberg M (1983) Solving large-scale zero-one linear programming problems. Oper Res 31(5):803–834

    Article  MATH  Google Scholar 

  • Goyal V, Ierapetritou MG (2002) Determination of operability limits using simplicial approximation. AIChE J 48(12):2902–2909

    Article  Google Scholar 

  • Goyal V, Ierapetritou MG (2003) Framework for evaluating the feasibility/operability of nonconvex processes. AIChE J 49(5):1233–1240

    Article  Google Scholar 

  • Grossmann IE, Trespalacios F (2013) Systematic modeling of discrete-continuous optimization models through generalized disjunctive programming. AIChE J 59(9):3276–3295

    Article  Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning. Springer, New York

    Book  MATH  Google Scholar 

  • Ierapetritou MG (2001) New approach for quantifying process feasibility: convex and 1-D quasi-convex regions. AIChE J 47(6):1407–1417

    Article  Google Scholar 

  • Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recogn Lett 31(8):651–666

    Article  Google Scholar 

  • Jain AK, Duin RPW, Mao J (2000) Statistical pattern recognition: a review. IEEE Trans Pattern Anal Mach Intell 22(1):4–37

    Article  Google Scholar 

  • Karwan MH, Keblis MF (2007) Operations planning with real time pricing of a primary input. Comput Oper Res 34(3):848–867

    Article  MATH  Google Scholar 

  • Kone ER, Karwan MH (2011) Combining a new data classification technique and regression analysis to predict the Cost-To-Serve new customers. Comput Ind Eng 61(1):184–197

    Article  Google Scholar 

  • Mitchell TM (1997) Machine learning. McGraw-Hill, New York

    MATH  Google Scholar 

  • Mitra S, Grossmann IE, Pinto JM, Arora N (2012) Optimal production planning under time-sensitive electricity prices for continuous power-intensive processes. Comput Chem Eng 38:171–184

    Article  Google Scholar 

  • Queipo NV, Haftka RT, Shyy W, Goel T, Vaidyanathan R, Kevin Tucker P (2005) Surrogate-based analysis and optimization. Prog Aerosp Sci 41(1):1–28

    Article  MATH  Google Scholar 

  • Simpson TW, Peplinski JD, Koch PN, Allen JK (2001) Metamodels for computer-based engineering design: survey and recommendations. Eng Comput 17:129–150

    Article  MATH  Google Scholar 

  • Sung C, Maravelias CT (2007) An attainable region approach for production planning of multiproduct processes. AIChE J 53(5):1298–1315

    Article  Google Scholar 

  • Sung C, Maravelias CT (2009) A projection-based method for production planning of multiproduct facilities. AIChE J 55(10):2614–2630

    Article  Google Scholar 

  • Swaney RE, Grossmann IE (1985) An index for operational flexibility in chemical process design–part I: formulation and theory. AIChE J 31(4):621–630

    Article  Google Scholar 

  • Üney F, Türkay M (2006) A mixed-integer programming approach to multi-class data classification problem. Eur J Oper Res 173(3):910–920

    Article  MathSciNet  MATH  Google Scholar 

  • Wang GG, Shan S (2007) Review of metamodeling techniques in support of engineering design optimization. J Mech Des 129(4):370

    Article  MathSciNet  Google Scholar 

  • Xu G, Papageorgiou LG (2009) A mixed integer optimisation model for data classification. Comput Ind Eng 56(4):1205–1215

    Article  Google Scholar 

  • Zhang Q, Grossmann IE, Heuberger CF, Sundaramoorthy A, Pinto JM (2015a) Air separation with cryogenic energy storage: optimal scheduling considering electric energy and reserve markets. AIChE J 61(5):1547–1558

    Article  Google Scholar 

  • Zhang Q, Sundaramoorthy A, Grossmann IE, Pinto JM (2015b) A discrete-time scheduling model for continuous power-intensive process networks with various power contracts. Comput Chem Eng

Download references

Acknowledgments

The authors gratefully acknowledge the financial support from the National Science Foundation under Grant no. 1159443 and from Praxair.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ignacio E. Grossmann.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (zip 455 kb)

Appendices

Appendix 1: Alternative formulation for CRS model

Instead of expressing the feasible region of a CRS model as the union of feasible convex regions, we can also formulate it as the difference of the convex hull and the union of infeasible convex regions. In our example, as shown in Fig. 28, the infeasible convex regions are the two empty polytopes.

Fig. 28
figure 28

The CRS can also be expressed as the difference of the convex hull around all data points and the union of infeasible (empty) convex regions

A polytope can be seen as an intersection of a set of half-spaces of which each is bounded by the hyperplane containing the corresponding facet of the polytope. A point in the polytope, x, is then a solution of the set of constraints \(a_h^T x \le b_h \quad \forall h \in H\) where H is the set of half-spaces. This is commonly referred to as an H-representation of a polyhedron. By applying the H-representation to the infeasible regions, we can express a feasible point in the CRS model, x, as a solution to the following set of constraints:

$$x = \sum\limits_j \lambda_j \, v_j \quad \forall \, j \in V $$
(20a)
$$\sum\limits_{j \in V} \lambda_j = 1$$
(20b)
$$ 0 \le \lambda_j \le 1 \quad \forall \, j \in V$$
(20c)
$$a_{rh}^{{\rm T}} \, x \ge b_{rh} + {\bar{\epsilon}} - M(1-z_{rh}) \quad \forall \; r \in {\bar{R}}, \; h \in H_r$$
(20d)
$$\sum\limits_{h \in H_r} z_{rh} \ge 1 \quad \forall \; r \in {\bar{R}}$$
(20e)
$$z_{rh} \in \{0,1\} \quad \forall \; r \in {\bar{R}}, \, h \in H_r$$
(20f)

where V is the set of vertices of the convex hull, \({\bar{R}}\) is the set of infeasible convex regions, \(H_r\) is the set of half-spaces associated with region r, and \({\bar{\epsilon}}\) is a small margin parameter. Equations (20a)–(20c) state that x is a point inside the convex hull, while Eqs. (20d)–(20f) enforce that x is not a point in the interior of any of the infeasible regions. The binary variable \(z_{rh}\) is 1 if the constraint \(a_{rh}^{\text{T}} \, x \le b_{rh}\) is violated. Note that this approach could also be used to consider “holes” in the feasible region.

Formulation (20) is not entirely equivalent to Eqs. (3b)–(3g) since points on facets shared by feasible and infeasible regions will not be feasible in (20). Moreover, the big-M parameter M in (20d) typically leads to weak LP relaxations which will likely make the alternative formulation less efficient if the numbers of feasible and infeasible convex regions are similar.

Appendix 2: Data for illustrative example

For the illustrative example, we have constructed the set of 100 data points shown in Table 7. Each data point consists of a 2-dimensional parameter vector a and a cost value g. The cost values are calculated from the parameter values using two different linear correlations, for which the corresponding constants and coefficients are listed in Table 8. The cost values for the first 52 points are generated from the first linear correlation, while the cost values for the remaining points are calculated by applying the second linear correlation.

Table 7 Complete set of data for the illustrative example
Table 8 Cost constants and coefficients used in the illustrative example

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Q., Grossmann, I.E., Sundaramoorthy, A. et al. Data-driven construction of Convex Region Surrogate models. Optim Eng 17, 289–332 (2016). https://doi.org/10.1007/s11081-015-9288-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11081-015-9288-8

Keywords

Navigation