Introduction

DEA is a nonparametric mathematical method that is used to evaluate the performance of a set of decision-making units. Congestion occurs when one or more input(s) is/are increased to reduce one or more output(s) without making changes in the other inputs and outputs. Grosskopf and Fare [8] proposed an executable form and studied the related DEA models (Model: FGL) to evaluate the performance of the product. Later, Cooper, Tomston, and Trall (CTT) introduced another model for the study of congestion. Cooper [6] compared these two models using numerical examples. Cooper [4] proposed another theory in which the two-step calculation is combined in CTT. Brackets [2] suggested CTT models to examine the relationship between employment and output in order to increase employment in the manufacturing industries of China [7]. Cooper [3] studied the congestion management in Chinese industry and showed how evaluating managerial inefficiency caused an increase in the output size without reducing employment in the textile and automobile industry. The congestion in the Chinese industry was re-examined by Jahanshahloo and Khodabakhshi [9]; the proposed models create the evaluation principles for the DMUs .Tons and Saho [12] proposed a nonparametric process in order to measure the scale of production–congestion [5]. Cooper examined congestion under a stochastic DEA using chance programming. Odeck [11] discussed the impact of congestion on inputs, such as fuel consumption, along with the impact of the number of workers in the bus industry in Norway. In his article [13] such congestion is investigated with constant returns to scale output-oriented models. Nora et al. [10] presented a new method for calculating the amount of congestion.

So far, conceptual congestion has been localized so that each unit should be examined separately, which is very time-consuming and requires solving programming models. But the introduction of the congestion hyperplane is a linear criterion that calculates the congestion of each unit without solving any model. Since efficient units are the basis for introducing this hyperplane, it can always be introduced without using any loops. So the process of finding the hyperplane will be a linear and always convergent approach. There was no way to determine the congestion border until now. This congestion border makes it possible to recognize congestion from technical inefficiency. While congestion border helps to comprehensively identify congestion units and their amounts, previous methods could recognize congestion in evaluating congestion units. Sometimes, the amount of congestion in such units has been found to be zero because of nonconsideration of the convexity condition. For solving all these problems and reducing computation, the congestion hyperplane is introduced and hence congestion is not recognized as a technical inefficiency anymore. The congestion amount of congestion units is determined by the distance between points on the right-hand side of the hyperplane (right points) and the hyperplane, while threshold congestion is defined by the distance between left points and the hyperplane.

In this paper, we first introduce the methods of Cooper et al. [3] and Nora et al. [10] in “Background” section. The existence of the congestion hyperplane is ascertained by some theorems in “Proposed method” section. In “Numerical example” section, we propose a congestion hyperplane based on a linear method by using efficient units, which can separate congestion units from other DMUs. This hyperplane improves the model developed by Nora et al. [10], and the results are similar to those derived from the model of Cooper et al. [3]. Different numerical examples are selected out of two articles [3, 10] and then solved with the proposed method. Then, these values are compared with the previous methods. In the end, the results of this study are presented in “Conclusions” section.

Background

Basic concepts

BCC model

If \(T_{v} = \{ \left( {x,y} \right)| x \ge \sum\nolimits_{i = 0}^{n} {\lambda_{j} x_{j} } , y \le \sum\nolimits_{j = 1}^{n} {\lambda_{j} y_{j} } , \sum\nolimits_{j = 1}^{n} {\lambda_{j} = 1} ,\lambda_{j} \ge 0 , j = 1, \ldots ,n\}\), then to measure the relative efficiency of each DMU in the output oriented, the following model should be solved:

$$\begin{array}{*{20}l} {\text{Max}} \hfill & {\quad \varphi } \hfill \\ {{\text{s}} . {\text{t}} .} \hfill & {\quad \left( {x_{o} ,\varphi y_{o} } \right) \in T_{V} } \hfill \\ \end{array}$$

BCC model (1) is:

$$\begin{aligned} & \varphi^{*} = {\text{Max}}\;\varphi + \varepsilon \left( {\mathop \sum \limits_{r = 1}^{s} s_{r}^{ + } + \mathop \sum \limits_{i = 1}^{m} s_{i}^{ - } } \right) \\ & {\text{s}} . {\text{t}}\; \mathop \sum \limits_{j = 1}^{n} x_{ij} \lambda_{j} + s_{io}^{ - } = x_{io} , \quad i = 1,2, \ldots m \\ & \mathop \sum \limits_{j = 1}^{n} y_{rj} \lambda_{j} - s_{ro}^{ + } = \varphi_{o} y_{ro} ,\quad r = 1,2, \ldots ,s \\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j} = 1 \\ & \left( {\lambda_{j} , s_{io}^{ - } ,s_{ro}^{ + } } \right) \ge 0,\quad j = 1,2, \ldots ,n,\quad i = 1,2, \ldots ,m , \quad r = 1,2, \ldots ,s \\ \end{aligned}$$
(1)

Hyperplane

The result of solving the multiplicative models is to find the weights such as \(\left( {U^{*} ,V^{*} } \right)\) for the inputs and outputs of the unit. Given the constraints, we have a multiplicative form \(U^{t} Y_{j} - V^{t} X_{j} \le 0 ; j = 1,2, \ldots ,n\). In other words, the n hyperplanes are available with 1 gradient vector (U, V).

Congestion

There is a congestion if the decrease in one or more inputs is accompanied by an increase in one or more outputs (congestion in the input), without the other inputs and outputs getting worse or vice versa, if the increase in one or more inputs along with a decrease in one or multiple outputs without getting worse in other inputs and outputs.

Introducing the Cooper’s method

In this section, the method of Cooper et al. [3] and their definition of congestion will be explained. Consider that there are n DMUs that they have m inputs and s outputs. The \(x_{j} = \left( {x_{1j} ,x_{2j} , \ldots ,x_{mj} } \right)^{T}\) and \(y_{j} = \left( {y_{1j} ,y_{2j} , \ldots ,y_{sj} } \right)^{T}\) are the input and output vectors of \({\text{DMU}}_{j} \; j = 1,2, \ldots ,n\), respectively.

First, they solved the output-oriented BCC (Banker, Charnes, Cooper) [1] model and then the efficiency of each DMU was obtained by solving BCC model (1).

In the above method, inefficiency is a necessary condition for the presence of congestion, and they first determined inefficient units with Model (1) and finally presented the following model to calculate the amount of congestion:

$$\begin{aligned} & \varphi^{*} = \hbox{max} \;\varphi + \varepsilon \left( {\mathop \sum \limits_{r = 1}^{s} s_{r}^{ + } - \varepsilon \mathop \sum \limits_{i = 1}^{m} s_{i}^{ - c} } \right) \\ & {\text{s}} . {\text{t}}\; \mathop \sum \limits_{j = 1}^{n} x_{ij} \lambda_{j} + s_{io}^{ - c} = x_{io} ,\quad i = 1,2, \ldots m \\ & \mathop \sum \limits_{j = 1}^{n} y_{rj} \lambda_{j} - s_{ro}^{ + } = \varphi_{o} y_{ro} ,\quad r = 1,2, \ldots ,s \\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j} = 1 \\ & \left( {\lambda_{j} , s_{io}^{ - c} ,s_{ro}^{ + } } \right) \ge 0,\quad j = 1,2, \ldots ,n, \quad i = 1,2, \ldots ,m , \quad r = 1,2, \ldots ,s \\ \end{aligned}$$
(2)

Model (1) is solved for all units to identify inefficient DMUs, but Model (2) is used to determine the congestion of inefficient units. Pay attention to the input slack. In practice, the calculations should be conducted in three stages and three models must be solved to get the optimal solution as well as to determine the amount of congestion.

Cooper et al. [3] expressed the following definition to determine congestion in a DMU.

Definition 1

A DMU has congestion if and only if in an optimal solution \(\left( {\varphi^{*} ,\lambda^{*} ,S^{ + *} ,S^{ - c*} } \right)\) of Model (2), at least one of the following two conditions is occurred:

  1. 1.

    \(\varphi^{*} > 1\), and there exists at least one \(s_{i}^{ - c*} > 0\left( {1 \le i \le m} \right)\).

  2. 2.

    There should be at least one \(s_{r}^{ + *} > 0 \;\left( {1 \le r \le s} \right)\) and at least one \(s_{i}^{ - c*} > 0\;\left( {1 \le i \le m} \right)\).

Introducing Nora’s method

First, they solved Model (1) for all DMUs and denoted the optimal solution \(\left( {\varphi^{*} ,\lambda^{*} ,S^{ + *} ,S^{ - *} } \right)\) each DMU; then, they defined Set EF as follows:

$${\text{EF}} = \left\{ {j|\varphi^{*} = 1} \right\}\;{\text{or}}\;{\text{EF}} = \left\{ {{\text{BCC }}\;{\text{efficient}}\;{\text{units}}} \right\}$$

There exists at least one DMU, says DMUl in EF set, that has the highest value in its first input component compared to the first input component of the remaining DMUs of Set EF. That is to say:

$$\exists \left( {K \in {\text{EF}}} \right){\text{s}} . {\text{t}} : \forall j \left( {j \in {\text{EF}}} \right) \to x_{1K} > x_{1j}$$

They delineated \(x_{1K}\) by \(x_{1}^{*}\). For all input components (i = 1, …, m), they introduced a DMU in EF that ith input is higher than those of all other DMUs in the set and shown by \(x_{i}^{*} , i = 1,2, \ldots ,m\) similarly. Then, the defined congestion is as follows:

Definition 2

ADMU has congestion if and only if, in an optimal solution \(\left( {\varphi^{*} ,\lambda^{*} ,S^{ + *} ,S^{ - *} } \right)\) of Model (1) for this DMU, at least one of the following two conditions is occurred:

  1. 1.

    \(\varphi^{*} > 1\), and there exists at least one \(x_{io} > x_{i}^{*} \left( {1 \le i \le m} \right)\).

  2. 2.

    There is at least one \(s_{r}^{ + *} > 0 \left( {1 \le r \le s} \right)\) and at least one \(x_{io} > x_{i}^{*} \left( {1 \le i \le m} \right)\).□

where \(x_{io} > x_{i}^{*}\); they show the amount of congestion in the ith input of DMUo by \(s_{i}^{{c^{{\prime }} }}\) and it is defined as follows:

$$s_{i}^{{c^{{\prime }} }} = x_{io} - x_{i}^{*}$$

A DMU has no congestion when \(x_{io} \le x_{i}^{*}\) and \(s_{i}^{{c^{{\prime }} }} = 0\).

Proposed method

In this section, we prove the existence of the congestion hyperplane. In other words, there is a boundary that divides the possible production set (PPS) units into two sections: congestion units and units without congestion (noncongestion units). These two sets have no intersection. Assume that T is the set of congestion units defined by Cooper et al. [3] and \(T^{{\prime }}\) is the set of noncongestion units, according to this definition.

Theorem 1

T and \(T^{{\prime }}\) sets are convex, T is the set of congestion units, and \(T^{{\prime }}\) is the set of noncongestion units defined by Cooper et al. [3].

Proof

We consider two arbitrary units to prove that Set T is convex:

  1. 1.

    If both these DMUs are efficient.

Now, we show that any convex combination of these two members of Set T will belong to T. It means:

$$\forall \left( {x_{1 } ,y_{1} } \right), \left( {x_{2 } ,y_{2} } \right) \in T, \left( {\lambda x_{1} + \left( {1 - \lambda } \right)x_{2} ,\lambda y_{1} + \left( {1 - \lambda } \right)y_{2} } \right) \in T,$$

suppose that:

$$\begin{aligned} & {\text{DMU}}_{1} = \left( {x_{1 } ,y_{1} } \right) = \left( {x_{11 } ,x_{12} , \ldots ,x_{1m} ,y_{11} ,y_{12} , \ldots ,y_{1s} } \right) \\ & {\text{DMU}}_{2} = \left( {x_{2 } ,y_{2} } \right) = \left( {x_{21 } ,x_{22} , \ldots ,x_{2m} ,y_{21} ,y_{22} , \ldots ,y_{2s} } \right) \\ \end{aligned}$$

According to Definition 1 of congestion, if a DMU is efficient, then there is a component greater than zero in congestion slack and excess slack vectors. Thus, we consider that these components for \({\text{DMU}}_{1}\) are \(s_{{1i_{1} }}^{ - c*} > 0, s_{{1r_{1} }}^{ + *} > 0\), while the components for \({\text{DMU}}_{2}\) are \(s_{{2i_{2} }}^{ - c*} > 0,s_{{2r_{2} }}^{ + *} > 0\). Now, we show any convex combination of these two DMUs has congestion. In other words, there is a component greater than zero in congestion slack and the excess vectors corresponding to the convex combination of these two DMUs. The excess vectors \(s_{1}^{ + *}\), \(s_{2}^{ + *}\) are introduced as follows:

$$s_{1}^{ + *} = \left( {s_{11}^{ + *} , \ldots ,s_{{1r_{1} }}^{ + *} , \ldots , s_{1s}^{ + *} } \right)\;{\text{and}}\;s_{2}^{ + *} = \left( {s_{21}^{ + *} , \ldots ,s_{{2r_{2} }}^{ + *} , \ldots , s_{2s}^{ + *} } \right)$$

Then, one of these two cases will occur:

  1. (a)

    If the same components of excess vectors in these DMUs are greater than zero, then \(r_{1} = r_{2} = r\). Since \(s_{2r}^{ + *} > 0\), \(s_{1r}^{ + *} > 0\) and \(\lambda > 0\) so we have \(\lambda s_{1r}^{ + *} + \left( {1 - \lambda } \right)s_{1r}^{ + *} > 0\) and \(\lambda s_{1}^{ + *} + \left( {1 - \lambda } \right)s_{2}^{ + *} > 0\).

  2. (b)

    If the nonzero components of excess vectors in these DMUs are different, then \(r_{1} \ne r_{2}\):

    $$\begin{aligned} \lambda s_{1}^{ + *} + \left( {1 - \lambda } \right)s_{2}^{ + *} & = \lambda \left( {0, \ldots .,s_{{1r_{1} }}^{ + *} ,0, \ldots ..0} \right) \\ & \quad + \left( {1 - \lambda } \right)\left( {0, \ldots .,s_{{2r_{2} }}^{ + *} ,0, \ldots ..0} \right) \\ & = \left( {0, \ldots ,\lambda s_{{1r_{1} }}^{ + *} ,0, \ldots , \left( {1 - \lambda } \right)s_{{2r_{2} }}^{ + *} ,0, \ldots .,0} \right) > 0. \\ \end{aligned}$$

So if \(s_{{2r_{2} }}^{ + *} > 0\), \(s_{{1r_{2} }}^{ + *} > 0\), then \(\lambda s_{1}^{ + *} + \left( {1 - \lambda } \right)s_{2}^{ + *}\) should be greater than zero. As the same way, we can prove that if \(s_{{1i_{1} }}^{ - c*} , s_{{2i_{2} }}^{ - c*} > 0 \;\left( {1 \le i_{1} ,i_{2} \le m} \right)\), then \(\lambda s_{1}^{ - *} + \left( {1 - \lambda } \right)s_{2}^{ - *}\) should be greater than zero. Therefore, if both of these DMUs will be efficient, then T is convex. So:

  1. 2.

    If both these DMUs are inefficient, then they have a nonzero component in their congestion slack vectors. In this case (according to the previous section), the congestion slack of a convex combination of these DMUs has at least a nonzero component and hence this combination will be in Set T.

  2. 2.

    If \({\text{DMU}}_{1}\) is efficient and \({\text{DMU}}_{2}\) is inefficient, then their convex combination will be inefficient, while the congestion slack and excess slack vectors of \({\text{DMU}}_{2}\) will have at least a nonzero component like \(s_{{2i_{2} }}^{ - *} > 0,s_{{2r_{2} }}^{ + *} > 0\) and the congestion slack vector of \({\text{DMU}}_{1}\) has \(s_{{1i_{1} }}^{ - *} > 0\). Thus, based on the above proof \(\lambda s_{1}^{ - *} + \left( {1 - \lambda } \right)s_{2}^{ - *}\) will be greater than zero.

So in per case, \(\left( {\lambda x_{1} + \left( {1 - \lambda } \right)x_{2} ,\lambda y_{1} + \left( {1 - \lambda } \right)y_{2} } \right)\) has congestion and T will be convex. Similarly, it can be shown that \(T^{{\prime }}\) is convex.□

Theorem 2

T and \(T^{{\prime }}\) sets have no intersection where T is the set of congestion units and \(T^{{\prime }}\) is the set of noncongestion units defined by Cooper et al. [3].

Proof

Suppose \(T \cap T^{{\prime }} \ne \emptyset\). Thus, there is \({\text{DMU}}_{o}\) as a member of T and \(T^{\prime }\). According to the assumption, if \({\text{DMU}}_{o}\) is an efficient congestion unit, then the congestion slack and excess slack vectors have components greater than zero and this is in contradiction with \({\text{DMU}}_{o} \in T^{{\prime }}\). And also if \({\text{DMU}}_{o}\) is inefficient, we will reach a contradiction and hence this assumption is false.□

Theorem 3

If T and \(T^{{\prime }}\) are convex and \(T \cap T^{{\prime }} = \emptyset\), then there is a hyperplane that divides PPS into these sets.

Proof

Assume Set S is the difference between T and \(T^{{\prime }}\) as follows:

We first show the Set S is convex. According to the definition of the S:

$$\left\{ {\begin{array}{*{20}l} {\left( {t_{1} ,t_{2} } \right) \in S \to \left( {t_{1} ,t_{2} } \right) = \left( {X_{1} - X_{2} ,Y_{1} - Y_{2} } \right) \to \left\{ {\begin{array}{*{20}l} {\left( {X_{1} ,Y_{1} } \right) \in T} \hfill \\ {\left( {X_{2} ,Y_{2} } \right) \in T^{{\prime }} } \hfill \\ \end{array} } \right.} \hfill \\ {\left( {z_{1} ,z_{2} } \right) \in S \to \left( {z_{1} ,z_{2} } \right) = \left( {X_{3} - X_{4} ,Y_{3} - Y_{4} } \right) \to \left\{ {\begin{array}{*{20}l} {\left( {X_{3} ,Y_{3} } \right) \in T} \hfill \\ {\left( {X_{4} ,Y_{4} } \right) \in T^{{\prime }} } \hfill \\ \end{array} } \right.} \hfill \\ \end{array} } \right.$$

So:

$$\left( {\lambda t_{1} + \left( {1 - \lambda } \right)z_{1} ,\lambda t_{2} + \left( {1 - \lambda } \right)z_{2} } \right) = \left( { \lambda \left( {X_{1} - X_{2} } \right) + \left( {1 - \lambda } \right)(X_{3} - X_{4} ),\lambda \left( {Y_{1} - Y_{2} } \right) + \left( {1 - \lambda } \right)\left( {Y_{3} - Y_{4} } \right)} \right),$$

According to Theorem 1, T and \(T^{{\prime }}\) are convex so every combination of their members will be in Set S. Namely,

$$\left\{ {\begin{array}{*{20}l} {\lambda \left( {X_{1} - Y_{1} } \right) + \left( {1 - \lambda } \right)(X_{3} - Y_{3} ) \in T} \hfill \\ {\lambda (X_{2} - Y_{2} ) + \left( {1 - \lambda } \right)\left( {X_{4} - Y_{4} } \right) \in T^{{\prime }} } \hfill \\ \end{array} } \right.$$

\(\left( {\lambda t_{1} + \left( {1 - \lambda } \right)z_{1} ,\lambda t_{2} + \left( {1 - \lambda } \right)z_{2} } \right) \in S\) and S is a convex set and \(\left( {0,0} \right) \notin S\) because of \(T \cap T^{{\prime }} = \emptyset\). Therefore, there is a hyperplane relying on a convex set:

$$\exists 0 \ne P = \left( {P_{1}^{t} ,P_{2}^{t} } \right) : \left( {P_{1}^{t} ,P_{2}^{t} } \right) \left( {X,Y} \right) = 0$$

Therefore, the existence of the congestion hyperplane was proven.

The presentation of congestion hyperplane

In this section, we introduce the congestion hyperplane. For this purpose, we determine the BCC-efficient DMUs and then define the maximum value of the input components among these DMUs. The normal vector of the hyperplane is denoted by these values. This hyperplane can pass from any of the BCC-efficient DMUs.

First we solve Model (1) for \({\text{DMU}}_{j} \; \left( {j = 1,2, \ldots ,n} \right)\) and determine the optimal solution \(\left( {\varphi^{*} ,\lambda^{*} ,S^{ + *} ,S^{ - *} } \right)\). Then, Set EF is defined as follows:

$${\text{EF}} = \left\{ {j|\varphi^{*} = 1} \right\}\;{\text{or}}\;{\text{EF}} = \left\{ {{\text{BCC }}\;{\text{efficient}}\;{\text{units}}} \right\}$$

There is a DMU in Set EF (that is called DMUI), and this DMU in the first component has the highest amount than other efficient DMUs in Set EF. It is shown as follows:

$$\exists \left( {l \in EF} \right){\text{s}} . {\text{t}}\quad \forall j \left( {j \in EF} \right) \to x_{1l} > x_{1j}$$
(3)

Then, we determine \(x_{1l}\) with \(x_{1}^{ *}\). Now we solve Model (3) to find all \(x_{i}^{*} , i = 1,2, \ldots ,m\). We introduce \(E^{ *}\) as follows:

$$\forall 1 \le i \le m, \quad \exists \left( {l \in {\text{EF}}} \right) {\text{s}} . {\text{t}}: \forall j \; \left( {j \in {\text{EF}}} \right) \to x_{il} > x_{ij} \to l \in {\text{EF}}^{*}$$
(4)

In other words, a BCC-efficient DMU is the ith component of \({\text{EF}}^{ *}\), and its ith input is higher than other efficient DMUs. We show this DMU as \({\text{DMU}}_{i}^{*}\). Now, suppose \({\text{DMU}}k \in {\text{EF}}^{*}\) has the highest amount in the kth input component in Set EF and its input and output vectors are \(X_{k} = \left( {x_{1k} ,x_{2k} , \ldots ,x_{k}^{*} , \ldots ,x_{mk} } \right) \;{\text{and}} \;Y_{k} = \left( {y_{1k} ,y_{2k} , \ldots ,y_{sk} } \right)\), respectively. There is a hyperplane parallel to axis outputs that passes through DMUk. The normal vector of this hyperplane will be obtained by solving the following system of m + 2 equations and m unknowns. Consider that the input vector of DMU with the highest value of a first input component \(({\text{DMU}}_{1}^{*} )\) is \(X_{1}^{*} = \left( {x_{1}^{*} ,x_{12} , \ldots ,x_{1m} } \right),\), while \(X_{2}^{*} = \left( {x_{21} ,x_{2}^{ *} \ldots ,x_{2m} } \right)\) is input vector of a DMU with the highest value in the second input component \({\text{DMU}}_{2}^{*}\). Similarly, \(X_{m}^{*} = \left( {x_{m1} ,x_{m2} , \ldots ,x_{m}^{*} } \right)\) is the input vector of a DMU with the highest value of the mth input’s component, that is called \({\text{DMU}}_{m}^{*}\).

$$\left\{ {\begin{array}{*{20}c} {N\left( {X_{1}^{*} - X_{k}^{*} } \right) = 0} \\ {N\left( {X_{2}^{*} - X_{k}^{*} } \right) = 0} \\ \vdots \\ {N\left( {X_{k - 1}^{*} - X_{k}^{*} } \right) = 0 } \\ \vdots \\ {N\left( {X_{m}^{*} - X_{k}^{*} } \right) = 0} \\ {N.1 = 1} \\ {N > 0} \\ \end{array} } \right.$$
(5)

Suppose that the hyperplane passes through \({\text{DMU}}_{k}^{*} \in {\text{EF}}^{*}\). By using the values obtained from Model (5), this congestion hyperplane is:

$$H = \left\{ {\left( {\begin{array}{*{20}c} x \\ y \\ \end{array} } \right)|n_{1} \left( {x_{1j} - x_{1k} } \right) + \cdots + n_{m} \left( {x_{mj} - x_{mk} } \right) + 0\left( {y_{1j} - y_{1k} } \right) + \cdots + 0\left( {y_{sj} - y_{sk} } \right) = 0} \right\}$$
(6)

In other words, hyperplane equation will be as follows:

$$H = \left\{ {\left( {\begin{array}{*{20}c} x \\ y \\ \end{array} } \right)|\mathop \sum \limits_{{\begin{array}{*{20}c} {i = 1} \\ {i \ne k} \\ \end{array} }}^{m} n_{i} \left( {x_{ij} - x_{ik} } \right) + n_{k} \left( {x_{kj} - x_{k}^{*} } \right) + \mathop \sum \limits_{r = 1}^{s} 0\left( {y_{rj} - y_{rk} } \right) = 0} \right\}$$
(7)

Now, with a proposition we show a congestion unit based on Definition 1 (Cooper et al. [3]) will satisfy Constraint (8). In fact, a congestion unit at the method of Cooper et al. [3] will have congestion with the proposed model.

Theorem 4

If the following condition is satisfied, then \({\text{DMU}}_{j} \left( {j = 1,2, \ldots ,n} \right)\) has congestion:

$$\mathop \sum \limits_{{\begin{array}{*{20}c} {i = 1} \\ {i \ne k} \\ \end{array} }}^{m} n_{i} \left( {x_{ij} - x_{ik} } \right) + n_{k} \left( {x_{kj} - x_{k}^{*} } \right) > 0$$
(8)

Proof

We consider DMUo as a congestion unit. Suppose that the kth congestion slack component of DMUo is greater than zero \((s_{i}^{ - c} > 0)\). Hence, based on the method of Cooper et al. [3], there is a Constraint (a) as well:

$$\mathop \sum \limits_{j = 1}^{n} \lambda_{j}^{*} x_{kj} < x_{ko}$$
(a)

Nora et al. [10] proved that \(x_{i}^{*} = \sum\nolimits_{j = 1}^{m} {\lambda_{j}^{*} x_{ij} } , i = 1,..,m\). So we have:

$$x_{ko} - x_{k}^{*} > \mathop \sum \limits_{j = 1}^{m} \lambda_{j}^{*} x_{kj} - \mathop \sum \limits_{j = 1}^{m} \lambda_{j}^{*} x_{kj} = 0 \to x_{ko} - x_{k}^{*} > 0 , \quad i = k .$$
(b)
$$x_{io} - x_{i}^{*} \ge \mathop \sum \limits_{j = 1}^{m} \lambda_{j}^{*} x_{ij} - \mathop \sum \limits_{j = 1}^{m} \lambda_{j}^{*} x_{ij} = 0 \to x_{io} - x_{i}^{*} \ge 0 ,\quad i = 1,2, \ldots ,m, \quad i \ne k .$$
(c)

The hyperplane normal vector is greater than zero, so all of its components are positive. As a result, we have:

$$\mathop \sum \limits_{{\begin{array}{*{20}c} {i = 1} \\ {i \ne k} \\ \end{array} }}^{m} n_{i} \left( {x_{io} - x_{ik} } \right) + n_{k} \left( {x_{ko} - x_{k}^{*} } \right) > 0$$

Note that if the hyperplane passes from each efficient units of \({\text{EF}}^{*}\), the distance of DMU from the congestion hyperplane will not be changed.

Definition 3

The congestion amount on each DMU is the distance from the congestion hyperplane to this DMU. In other words, if the congestion’s amount of \({\text{DMU}}_{j}\) in ith input component is shown as \(s_{i}^{\prime \prime }\), then \(x_{ij} - s_{i}^{{\prime \prime }}\) must be on the congestion hyperplane.□

If all \(X_{i}^{*} \; \left( {i = 1,2, \ldots ,m} \right)\) should not be exclusive, then we select among them m, different \({\text{DMU}}s\). If there are not m different \({\text{DMU}}_{i}^{*} \; \left( {i = 1,2, \ldots ,m} \right)\), then there exist more than one congestion hyperplane.

Numerical example

In this section, the proposed model will test with selected numerical examples that had been studied by other papers.

Explaining example

  1. (a)

    Consider eight DMUs with one input and one output as A, B, C, D, E, F, G and H. These DMUs are shown in Fig. 1. The horizontal axis identifies the input and the vertical axis of the output.

    Fig. 1
    figure 1

    Source: Cooper et al. [3]

    Numerical example (a).

  2. (b)

    This example was solved by Cooper et al. [3] using Model (2), the results of which are provided in Table 1.

    Table 1 The results of the Cooper’s method [3]

    We now apply our proposed method to solve the same problem. Considering the efficiency of DMUs, we have Set \({\text{EF}} = \left\{ {A,B,C} \right\}\), where (2) becomes:

    $$X^{*} = X_{c} = 3 > x_{j} \left( {\forall j:\left( {j \in {\text{EF}}} \right)} \right) \to c \in {\text{EF}}^{*}$$

    . According to the proposed model, the congestion hyperplane is \(X_{j} - X^{ *} = 0\). We examine the condition of each DMU to the congestion hyperplane:

    $$\left\{ {\begin{array}{*{20}l} {A:1 - 3 = - 2 < 0} \hfill \\ {B:2 - 3 = - 1 < 0} \hfill \\ {C:3 - 3 = 0 } \hfill \\ {D:5 - 3 = 2 > 0} \hfill \\ {E:4 - 3 = 1 > 0} \hfill \\ {F:4 - 3 = 1 > 0} \hfill \\ { G = 4.5 - 3 = 1.5 > 0} \hfill \\ {H:3 - 3 = 0} \hfill \\ \end{array} } \right.$$

    . According to the results and Definition 2, the D, E, F, G units are congestion DMUs because \(X_{j} - X^{*} > 0\) and \(X_{j} - X^{ *}\) in other DMUs is less than or equal to zero and hence these DMUs have no congestion. In addition, as Table 1 shows, the points D, E, F and G in Cooper et al. [3] are also congestion DMUs. The congestion amounts are shown in Table 2.

    Table 2 The results of the proposed model
  3. (c)

    Consider the six DMUs as A, B, C, D, G and R, in Fig. 2. Each DMU has two inputs and one output. The value of output for R is y = 10, and the output of other DMUs is y = 1. The horizontal axis identifies the first input and the vertical axis of the second input.

    Fig. 2
    figure 2

    Source: Cooper et al. [3]

    Numerical example 2.

The efficiency and congestion of these DMUs in accordance with the model of Cooper et al. [3] are shown in Table 3. According to this table, EF = {A, B, R} and with Model (3):

$$\left\{ {\begin{array}{*{20}l} {\forall j\left( {j \in A} \right) x_{1R} = x_{1A} = 5 > x_{1j} } \hfill \\ {\forall j\left( {j \in A} \right) x_{2R} = x_{2B} = 5 > x_{2j} } \hfill \\ \end{array} } \right.$$

However, all \(x_{i}^{*} , i = 1,2, \ldots ,m\) are not exclusive so \(x_{1}^{*} = x_{1R}\) and \(x_{2}^{*} = x_{2B}\). Therefore, the normal vector is \(n_{1} = \frac{1}{2} , n_{2} = \frac{1}{2}\). According to Model (6), DMUR has the highest amount of output. Therefore, the congestion hyperplane will be passing through R point. The equation of congestion hyperplane will be as follows:

$$\frac{1}{2}\left( {x_{1j} - 5} \right) + \frac{1}{2}\left( {x_{2j} - 5} \right) = 0$$

The situation of all DMUs to the hyperplane is:

$$\left\{ {\begin{array}{*{20}l} {A: - 2 < 0} \hfill \\ {B: - 2 < 0} \hfill \\ {C:\frac{5}{2} > 0} \hfill \\ {D:\frac{5}{2} > 0} \hfill \\ {G:\frac{5}{2} > 0} \hfill \\ {R:0} \hfill \\ \end{array} } \right.$$

As can be seen, the points C, D and G have congestion that corresponds to the results of the model developed by Cooper et al. [3]. This result is presented in Table 3.

Table 3 The result of Cooper’s method [3]

The congestion value of Definition 3 in these DMUs states is given in Table 4:

Table 4 The result of the proposed model

As can be seen, the results are similar to Cooper et al. [3].

Applied example

In Table 5, data set for the Chinese automobile and textile industries during the period 1981–1997 is listed that is used by Cooper et al. [3]. The outputs and inputs are defined as follows:

Table 5 Input and output data from the textile and automobile industries of China (1981–1997) [3]

Y is the output measured in units of one million Renminbi, in 1991 prices; K is capital calculated in units of one million Renminbi, also in 1991 prices; and L is labor measured in units of 1000 persons.

All results obtained by using Model (1) are shown in Table 6. Now we solve this example by using our proposed method. For the textile industry, we have:

$${\text{EF}}_{\text{textile}} = \left\{ {{\text{DMU}}_{1} ,{\text{DMU}}_{3} ,{\text{DMU}}_{4} ,{\text{DMU}}_{5} ,{\text{DMU}}_{11} ,{\text{DMU}}_{14} ,{\text{DMU}}_{16} ,{\text{DMU}}_{17} } \right\}$$

By comparing the efficiency score of DMUs in the textile industry, we find that \({\text{DMU}}_{11}\) and \({\text{DMU}}_{14}\) have the maximum input at the first and second components, respectively. Thus,

$$x_{1}^{*} = x_{1 ,11} = 756 \;{\text{and}}\; x_{2}^{*} = x_{2,14} = 25.45$$

For the automobile industry, we have:

$$x_{1}^{*} = x_{1 ,17} = 197.8 \;{\text{and}}\; x_{2}^{*} = x_{2,17} = 253.96$$

The results obtained from the proposed model and Nora et al. [10] and the congestion amounts obtained based on Definition 3, \(s^{{{\prime \prime }c}}\), are shown in Table 7. As can be seen, the distance of congestion DMUs from the hyperplane is positive, while this amount is less than or equal to zero for efficient or technically inefficient DMUs. The distance between DMU and the hyperplane with P value will be shown in the following table.

Table 6 DEA results using Model (1) for the textile (T) and automobile (A) industries in China
Table 7 Congestion results of Cooper [3] and Noura [10] and the proposed method for textile (T) and automobile (A) industries

Discussions

The comparison between the methods of Cooper et al. [3] and Nora et al. [10] shows the many advantages of the proposed model that should be discussed individually. According to Definition 1, congestion will occur, while increases in one or more input(s) would cause reduction in one or more output(s) without making changes in the other inputs and outputs. In other words, if we have one input and one output, the congestion can be achieved, while increases in this input cannot gain more output. Consider Unit 2 and Unit 13 in the automotive industry (Table 5). In this example, the congestion hyperplane is defined in passing from DMU14. Based on Table 3, it is clear that the input and output vectors of DMU2 are smaller than the input and output vectors of DMU14. In fact, DMU14 has been able to obtain more output by consuming more input than DMU2. Therefore, any decrease in the output of Unit 2 depends on inefficiency rather than congestion. In other words, by comparing the inputs of Unit 2 (21.16, 412.3) to those of Unit 14, we have (21.16, 412.3) < (691, 25.45). In addition, comparing DMU14 with DMU2 shows that \(o_{{{\text{DMU}}_{14} }} = 4949.93 > 866.85 = O_{{{\text{DMU}}_{2} }}\). Similarly, in Unit 13, \(I_{{{\text{DMU}}_{13} }} = \left( {684,25.08} \right) < \left( {691,25.45} \right) = I_{{{\text{DMU}}_{14} }}\) and \(O_{{{\text{DMU}}_{13} }} = 3250.74 < 4949.93 = O_{{{\text{DMU}}_{14} }}\).

The congestion observed in Units 2 and 13, based on the model of Cooper et al. [3], is the type of inefficiency that is wrongly considered as congestion. Owing to Table 7, the results in the textile industry, and the model of Cooper et al. [3] model, \({\text{DMU}}_{15}\) has congestion. And also, based on the model of Nora et al. [10], this unit has no congestion. But it is shown that the input vector of Unit 15 has a larger component rather than the input vector of Unit 17. Notice that the congestion hyperplane is considered passing from Unit 17. It means that the congestion of this unit cannot be considered as technically inefficient. The model of Nora et al. [10] has considered zero congestion amount for a congestion unit (Unit 15) by ignoring the convexity condition. The model of Cooper et al. [3] introduces 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 units as congestion units. But by comparing the input and output vectors of these units with those of Unit 17, we find that the input and output components of these units are less than those of Unit 17. Thus, based on the model of Cooper et al. [3], the inefficiency of these units is considered wrongly as the congestion. The congestion is known as a type of inefficiency, but the model of Cooper et al. [3] measured the amount of congestion in four efficient units in such a way that the proposed model solved this problem. Therefore, in some cases, the proposed model corrects the models of Nora et al. [10] and Cooper et al. [3].

Conclusions

In this paper, we have shown the existence of the congestion hyperplane by expressing some propositions. This hyperplane can make a complete separation between congestion units and efficient and technically inefficient units. Also, the normal vector has been introduced by using BCC-efficient units and the highest consumption input components. The congestion hyperplane not only confirms the previous models, but in some cases corrects the models incapacity to introduce the congestion border, inability to distinguish among the congestion units of technical inefficient units and disregard of the congestion units due to the lack of consideration of the convexity condition. The advantages of this proposed method are the determination of congestion units, estimation of the congestion amount by using the distance of the congestion border per unit and the specification of congestion units based on the convexity condition and considerable reduction of computation. Future research could study the creation of the hyperplane on fuzzy data and integer data.