1 Introduction

Today’s warehouses face increasing stress imposed by a highly volatile customer demand and increasing customer expectations in terms of ever shorter order response and delivery times. In that respect, warehouses more and more offer same-day and next-day shipment conditions to attract and retain customers and differentiate from their competitors. Customers in the business-to-consumer (B2C) segment, in particular in e-commerce, place orders in the afternoon or evening, and expect the products to be shipped immediately so that they are delivered the next day (Yaman et al. 2012; Boysen et al. 2021; Vanheusden et al. 2021). In dense urban areas, customers even expect those products to be delivered the same day, especially grocery (Klapp et al. 2018; Voccia et al. 2019; Ulmer 2020). Same-day and next-day shipment promises also become increasingly relevant for warehouses operating in the business-to-business (B2B) segment, e.g. warehouses that supply regional warehouses, local warehouses, or retail stores, due to the tough competition among off-line and online retailers and novel services in omni-channel retailing, such as click-and-collect (Boysen et al. 2021). Click-and-collect-services offer customers to order online and pick up the products at a retail store the next day (Kim 2020). So, the supplying warehouse is expected to ship those products the same day. From the operational perspective of the warehouse, however, same-day shipment promises are challenging, especially by the end of the day when the remaining time for order fulfilment becomes tighter. Hence, offering same-day shipment has to be well-coordinated with the order fulfilment capacities available at the warehouse to ensure an efficient order fulfilment process while meeting the shipment promises.

In warehouses, customer orders typically arrive continuously during the day, and the order fulfilment process of these orders, which incorporates order picking, consolidation, packing, and preparing for shipment, also runs continuously. Then, the orders are handed over to a parcel delivery company that is responsible for the actual delivery of the parcels to the customers. The parcels are consolidated into large batches for delivery, and there is a fixed daily delivery schedule with predefined truck departure times, usually late each day (Doerr and Gue 2013; Ceven and Gue 2017). These fixed predefined deadlines impose additional challenges on the order fulfilment process in the warehouse, as missing a truck departure by only a few minutes may result in a delay to the customer of one day. So, promising same-day shipment not only includes that an order is ready for shipment any time the same day, but it has to be ready for shipment by the specific deadline (truck departure) associated with its destination.

In the pursuit of offering increasingly shorter order response and delivery times, warehouses might run the risk of over-promising their shipment services: The amount of orders requiring same-day shipment exceeds the warehouse capacities for fulfilling these orders the same day by the given truck departure, and some orders are not shipped on time. So, the resulting service level, which measures whether orders are ready by their promised due dates (van Gils et al. 2018), is low.

As a natural mitigation strategy, warehouses set a cutoff point and offer cutoff-based shipment promises. Consider an arbitrary day (today). All customers ordering until today’s cutoff point receive same-day shipment, i.e. their orders are promised to be shipped by today’s deadline. In contrast, all customers ordering after today’s cutoff point receive next-day shipment, i.e. their orders are promised to be shipped by tomorrow’s deadline. This cutoff-based pattern of shipment promises applies every day. So, the total amount of orders promised to be shipped by today’s deadline is given by the sum of orders that arrived between yesterday’s and today’s cutoff point. Figure 1 shows the situation.

Fig. 1
figure 1

Cutoff-based shipment promises

Setting an appropriate cutoff point is challenging as it concurrently affects multiple facets of the service quality perceived by the customers, such as the order response and delivery times and the service level. Setting a late cutoff point, such that same-day shipment is offered to customers until shortly before the truck departure, ensures short order response times for the customers, but comes at the expense of a high risk of not meeting the promised shipment dates, reflected in a low service level. In contrast, to ensure high service levels, the cutoff point has to be set early in the day, such that customers experience longer order response and delivery times.

In this paper, we investigate the design of cutoff-based shipment promises for deadline-oriented stochastic order fulfilment processes in order to offer competitive shipment services and meet customer expectations in terms of service quality. To measure service quality, we focus on the order response time, reflected by the cutoff point itself, and the service level, which is an important key performance indicator for service quality of order fulfilment processes in practice (van Gils et al. 2018). In line with service level measurement in inventory management (Tempelmeier 2011), there are multiple types of service level in order fulfilment: probability-based \(\alpha-\)service level (Schleyer and Gue 2012), quantity-based \(\beta-\)service level (Doerr and Gue 2013; Ceven and Gue 2017; MacCarthy et al. 2019; Mohring et al. 2020), and quantity- and time-based \(\gamma-\)service level (Mohring et al. 2020). We propose an \(\alpha-\) and \(\beta-\)cutoff service level for deadline-oriented order fulfilment processes and study how the considered type of cutoff service level affects the selection of the cutoff point. The \(\alpha-\)cutoff service level gives the probability that all orders are ready for shipment by their promised deadline. The \(\beta-\)cutoff service level is the ratio of the number of orders that are ready for shipment by their promised deadline and the total number of orders due by that deadline.

To this end, we build a multi-period stochastic model, where each period ends with a deadline upon which orders that are due for shipment by this deadline are handed over to the parcel delivery company. Throughout each such period, customer orders arrive randomly according to a time-dependent demand pattern. Customers ordering until the cutoff point receive same-day shipment, customers ordering thereafter receive next-day shipment. If the available random processing capacity is insufficient to process all orders due for shipment in the current period, these orders become backorders and are carried over for shipment in the subsequent period. We are interested in studying the interplay between the cutoff point and the cutoff service level in these systems while concurrently considering the effects of the selected type of cutoff service level as well as system parameters, such as utilisation and variability of customer demand and processing capacity.

The main contribution of this paper is three-fold:

  1. 1.

    We are the first to study the design of cutoff-based shipment promises for stochastic deadline-oriented order fulfilment processes in warehouses. We present a discrete-time Markov chain model for exact steady-state performance analysis and propose two performance measures – \(\alpha-\) and \(\beta-\)cutoff service level – for service level measurement of deadline-oriented order fulfilment processes.

  2. 2.

    We numerically show the benefit of cutoff-based shipment promises. Even with a late cutoff point, system performance already improves significantly compared to a benchmark without a cutoff point.

  3. 3.

    We provide insights on how warehouses should set the cutoff point in order to balance customer expectations in terms of service level and order response time, and how they can improve their shipment promises by selecting \(\beta-\) instead of \(\alpha-\)cutoff service level and implementing measures to reduce the utilisation and the variabilities of the order fulfilment process.

The remainder of this paper is structured as follows: Sect. 2 gives an overview of the related literature. Section 3 provides the formal problem description. We introduce our discrete-time Markov chain model in Sect. 4. The numerical analysis is presented in Sect. 5, and the implications derived from its results are given in Sect. 6. Section 7 provides concluding remarks and an outlook.

2 Literature review

The challenges of managing stochastic order fulfilment processes in warehouses have been studied extensively in the literature. There are related surveys concerning warehousing for e-commerce (Boysen et al. 2019) and brick-and-mortar retail chains (Boysen et al. 2021), as well as for the design and control of manual order picking (de Koster et al. 2007) and automated parts-to-picker systems (Boysen et al. 2023). In the following, we review the research streams on stochastic order fulfilment processes in warehouses that incorporate daily order deadlines.

The research interest primarily focuses on strategies for increasing operations efficiency. As many warehouses still operate with manual picker-to-parts systems, and manual labour accounts for a large part of the total operating expense (de Koster et al. 2007; van Gils et al. 2018), increasing order picking efficiency is a natural subject of research interest. Typically, it is assumed that the efficiency of an order picker can be increased if several orders are grouped into a batch and picked simultaneously on one tour. The critical issue is to determine the batch size that minimises the average flow time of a random order (Le-Duc and de Koster 2007). An early attempt to solve this Order Batching Problem (OBP) is presented by Chew and Tang (1999) using queueing models to compute the lower bound, upper bound, and an approximate value for the average throughput time of the first order in a batch. However, using the first order in a batch as means for the throughput time of the entire batch is a limitation. Therefore, Le-Duc and de Koster (2007) consider a two-block warehouse and compute the mean flow time for a random order in the batch. Van Nieuwenhuyse and de Koster (2009) extend this approach to multi-server picking and sorting operations, and general setup and service time distributions. Schleyer and Gue (2012) use discrete-time queueing models to compute the throughput time distribution in a one-block warehouse which allows for more detailed evaluations based on its percentiles. The aforementioned papers assume fixed time windows for batching, which means that the time interval for collecting incoming orders for batch building is fixed. This assumption is relaxed by Xu et al. (2014) who investigate the effects of variable time windows for batching on the expected throughput time.

However, focusing on reducing the average flow time may induce some unintended behaviour (e.g. invoked overtime on Friday nights to clear orders for shipment, even though no shipments were scheduled over the weekend), that provides no benefit, but results in additional costs (Doerr and Gue 2013). Hence, it is common to assign each order a due date (i.e. the respective truck departure) for batch building (Boysen et al. 2019). The Order Batching and Sequencing Problem (OBSP) aims to minimise the tardiness between the order’s due date and the actual completion time. The composition of the batches, their processing times, and the release sequence have a significant impact on whether and to which extent due dates are violated (Henn and Schmid 2013). Henn and Schmid (2013) and Menéndez et al. (2017) present metaheuristics for the OBSP to minimise total tardiness for a given set of customer orders. Chen et al. (2015) and Scholz et al. (2017) study the simultaneous optimisation of order batching, batch sequencing, and picker routing in order to minimise total tardiness of customer orders. In addition to these decisions, Zhang et al. (2016) differentiate between urgent orders and orders that can be processed later to complete the maximum number of orders in the shortest service time.

Recent research interest focuses on the mitigation of workload peaks in order fulfilment to guarantee that daily deadlines are met. Workload peaks are time periods throughout the day when the required order throughput exceeds the available picker capacity (Vanheusden et al. 2020). They are caused by customers who tend to order products in the late afternoon or evening and expect these products to be delivered the next day (Vanheusden et al. 2021), and therefore result in a high risk of missing the truck departure times. Warehouse managers tend to set the workforce levels sufficiently high to cover workload peaks at any time. However, operations efficiency can be significantly improved by flexible workforce planning combined with an accurate prediction of daily workload peaks. Van Gils et al. (2017) propose forecasting methods to predict the future workload of order fulfilment processes. Using these workload forecasts, Vanheusden et al. (2020) and Vanheusden et al. (2021) schedule the workload such that peaks are avoided throughout the day and derive the required workforce levels from these balanced schedules. Beyond flexible workforce planning, levelled order release is an appropriate approach to manage workload peaks (Mohring et al. 2020). The key idea of levelled order release is to convert the volatile workload of the order fulfilment process into a smooth and regular workload per time period that can be managed by a constant workforce level (Mohring et al. 2020). Inspired by flow-job scheduling methods known from manufacturing, Kim (2020) applies flow job-scheduling methods to effectively reduce the order fulfilment time in OEM warehouses.

The papers presented so far focus on improving order fulfilment efficiency in order to meet daily deadlines. While efficient use of resources is undeniably important, introducing a cutoff point can be beneficial as it keeps a time window remaining for fulfilling orders that arrive late in the day. To the best of our knowledge, there are only three publications investigating cutoff-based shipment promises so far. Doerr and Gue (2013) are the first to introduce a cutoff point to manage deadline-oriented order fulfilment processes. The authors propose a novel performance metric, called Next Scheduled Deadline (NSD), to better reflect the deadline-orientation of the order fulfilment process in the system performance of the warehouse. NSD measures the fraction of orders targeted for a particular deadline that are ready by that deadline. An order is targeted for a particular deadline if it arrives before the cutoff point associated with this deadline. NSD is a meaningful performance metric for the service quality perceived by customers, implicitly installs cutoff-based shipment conditions offered to customers, and motivates workers to accelerate the operating speed when it matters most, so just before the deadline. Focusing on the latter dimension of NSD, Doerr and Gue (2013) investigate how to steer the cutoff point and the percentage of NSD in order to improve worker motivation. Case study results show that the cutoff points are set later and the percentage is set lower than an intuitive policy based on the percentage of orders finished without delay would suggest. The dimension of NSD as a performance metric for service quality is discussed by Ceven and Gue (2017) and MacCarthy et al. (2019): Ceven and Gue (2017) study wave picking policies in warehouses operating against daily deadlines and derive the optimal number and timing of wave releases in terms of NSD. By this, the proportion of on-time shipments significantly increases compared to intuitive wave release policies. Applying the introduced model in a case study, the authors show that a higher capacity and an earlier cutoff point increase NSD, respectively. MacCarthy et al. (2019) investigate the order picking operations for same-day buy-online-pickup-in-store services in retail stores, where online orders are fulfilled in conventional retail stores while also serving walk-in customers. The online orders are promised to be ready for pickup in store after a specific time later the same day if they have been placed until a predefined cutoff point. The authors derive best performance frontiers for single-wave and multi-wave order picking and determine the optimal number and timing of picking waves as well as the minimum picking rate needed to guarantee a target value of NSD.

In summary, academic literature mainly focuses on efficiency improvements for order fulfilment processes (and herein especially in the order picking operations). The only papers including cutoff-based shipment promises focus on the effects of introducing a cutoff point on worker motivation (Doerr and Gue 2013) and the design of wave picking strategies in order fulfilment settings with a given cutoff point (Ceven and Gue 2017; MacCarthy et al. 2019). Based on case study analyses, these papers include some first results on the effects of the cutoff point on the system performance. However, it is currently unclear how introducing a cutoff point affects service quality and customer satisfaction in terms of service level and order response time, and what competitive cutoff-based shipment promises should look like in the B2C- and B2B-segment. This paper addresses these research gaps by providing a comprehensive analysis of the design of cutoff-based shipment promises in order to offer competitive shipment conditions and meet customer expectations.

3 Problem description

We investigate the stylised stochastic deadline-oriented order fulfilment process of a warehouse with cutoff-based shipment promises. In the following, we provide the assumptions and the formal description of the order fulfilment process and the cutoff-based shipment promises. The problem notation is summarised in Table 1 at the end of this section.

3.1 Order fulfilment

The order fulfilment process of a warehouse starts with receiving orders from the customers. Customer orders arrive continuously throughout the day and are released for order fulfilment after some preparatory administrative steps. By this, the continuous inflow of customer orders transforms into a discrete-time order income for the actual order fulfilment process. Order fulfilment in warehouses incorporates picking, consolidating, and packing products and preparing parcels for shipment (de Koster et al. 2007). These steps are organised in small batches, and discrete-time modelling of the order fulfilment process is appropriate. Hence, we consider the order fulfilment process at discrete times \(t \in {\mathbb {N}}_0\) that are integer multiples of a constant discretisation interval \(t_{\text {inc}}\), e.g. one hour. We normalise time such that the discretisation interval equals \(t_{\text {inc}}:= 1\).

Fig. 2
figure 2

Representation of the timeline and corresponding notation of our model

The order fulfilment process ends with handing over the parcels to a parcel delivery company, which consolidates the parcels into large transportation batches for delivery depending on their destinations. Parcel delivery companies have fixed delivery schedules with predefined truck departure times. Hence, order fulfilment in warehouses operates against fixed predefined deadlines (Doerr and Gue 2013; Ceven and Gue 2017). These recurring deadlines subdivide the discrete time axis \(t \in {\mathbb {N}}_0\) into operating cycles \(k\in {\mathbb {N}}_0\), e.g. days, which consists of T time periods. Operating cycle k corresponds to the time periods \(\{k T, kT+1,\ldots ,k T +T-1 \big \}\). It starts at deadline kT and ends immediately before reaching the subsequent deadline \((k+1) T\) (cf. Fig. 2). Note that order fulfilment in operating cycle k operates against deadline \((k+1)T\). For time period t, the corresponding operating cycle is

$$\begin{aligned} k = \left\lfloor \frac{t}{T} \right\rfloor , \end{aligned}$$
(1)

and the age \(\tau\) of operating cycle k at time t is

$$\begin{aligned} \tau = t{{\,\mathrm{\,mod\,}\,}}T. \end{aligned}$$
(2)

The random customer demand \(D_t\) arriving at time t gives the number of orders that are released for order fulfilment at time t. Note that the orders have a uniform size. The random variables \(D_t\), \(t \in {\mathbb {N}}_0\), are specified by a periodic demand pattern depending on the age \(\tau\) of the operating cycle:

$$\begin{aligned} D_t \sim D_{\tau }. \end{aligned}$$
(3)

\(D_\tau\), \(\tau \in \{0, 1,\dots , T-1\}\), are discrete generally-distributed random variables that specify the customer demand at age \(\tau\) of the operating cycle.

The order fulfilment process is modelled as a single-stage system covering all processing steps, e.g. picking and packing, in the warehouse. The random processing capacity \(B_t\) provided at time t determines the system’s capability of fulfilling the customer demand. It gives the number of orders that can be completely processed at time t. The random variables \(B_t\), \(t \in {\mathbb {N}}_0\), are i.i.d., specified by the discrete generally-distributed processing capacity per time period B:

$$\begin{aligned} B_t \sim B. \end{aligned}$$
(4)

We obtain the total processing capacity \(B^k\) in operating cycle k by the T-fold convolution of B as follows:

$$\begin{aligned} B^k \sim B^{\text {total}} = \otimes ^{T} B. \end{aligned}$$
(5)

Processing capacity \(B_t\) and customer demand \(D_t\) are independent of each other, and the system utilisation is smaller than one. System utilisation U is calculated as the ratio of the expected total customer demand per operating cycle and the expected total processing capacity per operating cycle as follows:

$$\begin{aligned} U=\frac{\sum _{\tau =0}^{T-1}{\mathbb {E}}[D_\tau ]}{{\mathbb {E}}[B^{\text {total}}]}. \end{aligned}$$
(6)

3.2 Shipment promises

The shipment conditions offered to a customer are time-dependent in the sense that they depend on the age of the operating cycle at the customer’s time of arrival. Consider a customer arriving at time t, and let \(k=\lfloor t/T\rfloor\) and \(\tau =t{{\,\mathrm{\,mod\,}\,}}T\). If the age \(\tau\) of operating cycle k at time t is smaller than or equal to the cutoff point \(\tau _C\), the customer receives same-cycle shipment, meaning that the order is promised to be shipped by the next deadline after t, i.e. by time \((k+1)T\). Otherwise, if the age \(\tau\) of operating cycle k exceeds the cutoff point \(\tau _C\), next-cycle shipment is offered to the customer, meaning that the order is promised to be shipped by the second deadline after t, i.e. by time \((k+2)T\). So, assuming that the operating cycle equals one day (\(T=24\) hours), same-cycle shipment corresponds to same-day shipment and delivery to the customer the next day, and next-cycle shipment is equivalent to next-day shipment and delivery to the customer the day after the next.

Based on these cutoff-based shipment promises, the customer demand arriving within operating cycle k subdivides into an order income for same-cycle shipment \(S^k\) and an order income for next-cycle shipment \(N^k\). We follow the same pattern of shipment promises every operating cycle. So, by convoluting the customer demand \(D_\tau\) arriving until/after the cutoff point \(\tau _C\), we have:

$$\begin{aligned} S^k&\sim S = \otimes _{\tau =0}^{\tau _C} D_{\tau } \end{aligned}$$
(7)
$$\begin{aligned} N^k&\sim N = \otimes _{\tau =\tau _C+1}^{T-1} D_{\tau } . \end{aligned}$$
(8)
Table 1 Problem notation

4 Markov chain model

In this section, we introduce a discrete-time Markov chain model for steady-state performance analysis of deadline-oriented order fulfilment processes with cutoff-based shipment promises. We provide the discrete-time Markov chain formulation (cf. Sect. 4.1) and derive exact formulas for multiple performance measures (cf. Sect. 4.2). Input parameters, variables, and output parameters of the Markov chain model are summarised in Table 1.

4.1 Discrete-time Markov chain

For \(k \in {\mathbb {N}}_0\), let \(X^k\) denote the state of the Markov chain at the beginning of operating cycle k, i.e. at time kT. \(X^k\) gives the number of unprocessed orders at the beginning of operating cycle k, covering all unprocessed orders that are due by the next deadline, i.e. by time \((k+1)T\), and all orders that are already too late.

The processing capacity \(B^k \sim B^\text {total}\) provided for order fulfilment in operating cycle k is initially used to process the orders \(X^k\) and the order income for same-cycle shipment \(S^k \sim S\) released in operating cycle k. Note that orders that are already too late are prioritised over the ones that are due by the next deadline. Any remaining processing capacity is used to process the order income for next-cycle shipment \(N^k \sim N\) or remains unused in case the system runs idle. Otherwise, the orders remaining unprocessed in operating cycle k are carried over to the next operating cycle \((k+1)\). Accordingly, the transitions of the Markov chain are defined as follows:

$$\begin{aligned} X^{k+1} = \left( X^k + S^k -B^k + N^k \right) ^{+}, \end{aligned}$$
(9)

where \((x)^+\) denotes \(\max (x,0)\).

Note that the Markov chain that arises from given shipment promises, i.e. a given cutoff point \(\tau _C\), has a limiting distribution since \(U<1\). Therefore, defining performance measures in terms of long-run averages is appropriate.

4.2 Performance measures

We derive several performance measures from the limiting distribution of the Markov chain. In general, any performance measure including its entire probability distribution can be derived.

In order fulfilment, the service level is the key performance indicator for measuring the service quality perceived by customers. It quantifies whether orders are ready on time by their promised due dates or not. Despite its great practical relevance, the service level has been barely investigated in the order fulfilment literature (van Gils et al. 2018). In line with service level measurement in inventory management (Tempelmeier 2011), there are also multiple types of service level in order fulfilment: probability-based \(\alpha-\)service level (Schleyer and Gue 2012), quantity-based \(\beta-\)service level (Doerr and Gue 2013; Ceven and Gue 2017; MacCarthy et al. 2019; Mohring et al. 2020), and quantity- and time-based \(\gamma-\)service level (Mohring et al. 2020).

In the following, we introduce two types of service level for deadline-oriented order fulfilment processes with cutoff-based shipment promises: \(\alpha-\) and \(\beta-\)cutoff service level. Beyond, we consider the expected number of backorders and the expected number preprocessed orders.

4.2.1 Backorders

Some orders that remain unprocessed in operating cycle k potentially become backorders by the end of operating cycle k. This includes all unprocessed orders that are due by deadline \((k+1)T\) immediately after operating cycle k or that are already too late in operating cycle k. In general, there is a random number of backorders \(M^k\) at the end of operating cycle k if the sum of the orders \(X^k\) and the order income for same-cycle shipment \(S^k\) exceeds the total processing capacity \(B^k\) in operating cycle k:

$$\begin{aligned} M^k=\left( X^k + S^k - B^k \right) ^+. \end{aligned}$$
(10)

The long-run average number of backorders per operating cycle equals

$$\begin{aligned} {\mathbb {E}}[M]=\lim _{K\rightarrow \infty }\frac{1}{K}\sum _{k=0}^{K-1}{\mathbb {E}}[M^k]. \end{aligned}$$
(11)

4.2.2 Preprocessed orders

Some orders of the order income for next-cycle shipment \(N^k\) are potentially preprocessed in operating cycle k. Hence, these orders are ready for shipment by the next deadline \((k+1)T\), although they are only due by the subsequent deadline \((k+2)T\). In general, there is a random number of preprocessed orders \(P^k\) in operating cycle k, if the total processing performance \(B^k\) exceeds the number of unprocessed orders that are due by deadline \((k+1)T\), which is given by the sum of the orders \(X^k\) and the order income for same-cycle shipment \(S^k\):

$$\begin{aligned} P^k=\min \left\{ N^k, \left( B^k - (X^k + S^k)\right) ^+ \right\} . \end{aligned}$$
(12)

The long-run average number of preprocessed orders per operating cycle equals

$$\begin{aligned} {\mathbb {E}}[P]=\lim _{K\rightarrow \infty }\frac{1}{K}\sum _{k=0}^{K-1}{\mathbb {E}}[P^k]. \end{aligned}$$
(13)

4.2.3 \(\alpha-\)cutoff service level

\(\alpha-\)service levels are probability-based service level definitions that measure the probability that all orders are completed on time (Schleyer and Gue 2012). We define the \(\alpha-\)cutoff service level in the context of deadline-oriented order fulfilment processes with cutoff-based shipment promises as the probability that all orders are ready for shipment by their promised deadline. Recall that orders for same-cycle shipment \(S^k\) arriving in operating cycle k are promised to be ready for shipment by the deadline immediately after operating cycle k, i.e. by time \((k+1)T\), and all orders for next-cycle shipment \(N^k\) are promised to be ready for shipment by the next deadline thereafter, i.e. by time \((k+2)T\).

First, consider operating cycle k in isolation: Some orders of the order backlog \(X^k\) and all orders of the order income for same-cycle shipment \(S^k\) are due by the deadline immediately after operating cycle k. If all these orders are processed in operating cycle k, no backorder occurs (\(M^k = 0\)), and the \(\alpha-\)cutoff service level \(SL_\alpha ^k\) of operating cycle k equals one. Otherwise, if at least one of these orders remains unprocessed in operating cycle k, there are some backorders \(M^k>0\), and the \(\alpha-\)cutoff service level \(SL_\alpha ^k\) of operating cycle k equals zero:

$$\begin{aligned} SL_\alpha ^k = {\left\{ \begin{array}{ll} 1 &{} \text {if } M^k = 0, \\ 0 &{} \text {if } M^k > 0. \end{array}\right. } \end{aligned}$$
(14)

Following these considerations, we obtain the \(\alpha-\)cutoff service level \(SL_\alpha\) as the long-run average of the \(\alpha-\)cutoff service level per operating cycle:

$$\begin{aligned} SL_\alpha =\lim _{K\rightarrow \infty }\frac{1}{K}\sum _{k=0}^{K-1} SL_\alpha ^k. \end{aligned}$$
(15)

4.2.4 \(\beta-\)cutoff service level

\(\beta-\)service levels are quantity-oriented service level definitions as they consider the proportion of on-time completed orders (Mohring et al. 2020). We define the \(\beta-\)cutoff service level in the context of deadline-oriented order fulfilment processes with cutoff-based shipment promises as the ratio of the number of orders that are ready for shipment by their promised deadline and the total number of orders due by that deadline.

Consider two subsequent operating cycles \((k-1)\) and k. The number of orders due by the deadline immediately after operating period k, i.e. by time \((k+1)T\), consists of the order income for next-cycle shipment \(N^{k-1}\) in operating cycle \((k-1)\) and the order income for same-cycle shipment \(S^{k}\) in operating cycle k. If the processing capacity \(B^{k-1}\) in operating cycle \((k-1)\) is insufficient to process all orders due by the deadline immediately after operating cycle \((k-1)\), there are \(M^{k-1}\) backorders (cf. (10)) at the end of operating cycle \((k-1)\), and no order for next-cycle shipment is preprocessed in operating cycle \((k-1)\). Otherwise, \(P^{k-1}\) orders (cf. (12)) for next-cycle shipment of \(N^{k-1}\) are preprocessed in operating cycle \((k-1)\), and the remaining unprocessed orders for next-cycle shipment, denoted by \(R^{k-1}\),

$$\begin{aligned} R^{k-1} = \left( N^{k-1} - \left( B^{k-1} - \left( X^{k-1} + S^{k-1} \right) \right) ^+ \right) ^{+} \end{aligned}$$

are postponed to operating cycle k. Those orders and the orders for same-cycle shipment \(S^k\) are processed in operating cycle k depending on the remaining processing capacity \((B^k -M^{k-1})^+\). Note that only this remaining processing capacity is available to process the orders due by deadline \((k+1)T\) as the potential backorders \(M^{k-1}\) are also postponed to operating cycle k, and they are prioritised over other orders. Hence, after operating cycle k, there are

$$\begin{aligned} P^{k-1}+\min \left\{ \left( B^k-M^{k-1}\right) ^+, R^{k-1} + S^k\right\} \end{aligned}$$

orders ready for shipment by their promised deadline \((k+1)T\), and the \(\beta-\)cutoff service level \(SL_\beta ^k\) of operating cycle k is given as follows:

$$\begin{aligned} SL_\beta ^k = \frac{P^{k-1}+\min \left\{ \left( B^k-M^{k-1}\right) ^+, R^{k-1} + S^k\right\} }{N^{k-1}+S^k}. \end{aligned}$$
(16)

The \(\beta-\)cutoff service level \(SL_\beta\) is calculated as the long-run average of the \(\beta-\)cutoff service level per operating cycle:

$$\begin{aligned} SL_\beta =\lim _{K\rightarrow \infty }\frac{1}{K}\sum _{k=1}^{K} SL_\beta ^k. \end{aligned}$$
(17)

In general, the \(\beta-\)cutoff service level is greater than or equal to the \(\alpha-\)cutoff service level, i.e. \(SL_\beta \ge SL_\alpha\), as \(SL_\beta ^k \ge SL_\alpha ^k\) holds in any operating cycle k. This follows immediately when comparing the values of \(SL_\beta ^k\) and \(SL_\alpha ^k\) in the different scenarios of the order fulfilment process, i.e. when no/some/all orders for next-cycle shipment are preprocessed in operating cycle \((k-1)\) and no/some backorders occur in operating cycle k. Note that \(SL_\alpha \le SL_\beta\) is intuitively clear as the \(\alpha-\)cutoff service level measures the probability that all orders are completed on time, whereas the quantity-based \(\beta-\)cutoff service level gives the proportion of on-time completed orders.

5 Numerical analysis

In this section, we conduct an extensive numerical analysis of deadline-oriented order fulfilment processes with cutoff-based shipment promises. We investigate the effects of cutoff-based shipment on the system performance (cf. Sect. 5.2), the benefit of cutoff-based shipment compared to a benchmark policy (cf. Sect. 5.3), and the decision problem of setting an appropriate cutoff point (cf. Sect. 5.4). The design of experiments and the model implementation are given in Sect. 5.1.

5.1 Design of experiments and model implementation

The numerical study covers a vast variety of warehouses operating in the B2C- (e.g. e-commerce) and B2B-segment (e.g. warehouses supplying regional warehouses, local warehouses, or retail stores). The assumptions and input parameters are summarised in Table 2.

Table 2 Input parameters of the numerical study

We consider warehouses with an operating cycle of constant length \(T = 8\) and the cutoff point \(\tau _C\) varies between 0 and 7 to investigate the full range of cutoff-based shipment promises.

The time-dependent customer demand \(D_\tau\), \(\tau \in \{0,1,\dots , 7\}\), is modelled by discretised Beta-distributed random variables with finite support \(\{0,1,\hdots , 20\}\), whose parameters derive from the expected values \({\mathbb {E}}[D_\tau ]\) and the squared coefficient of variation scv[D]. The expected customer demand \({\mathbb {E}}[D_\tau ]\), \(\tau \in \{0,1,\dots , 7\}\), is characterised by a series of monotonously increasing values reaching its peak at the cutoff point \(\tau _C\):

$$\begin{aligned} \max \left\{ {\mathbb {E}}[D_\tau ] \mid \tau \in \{0,1,\hdots ,7\} \right\}&= {\mathbb {E}}[D_{\tau _C}] \end{aligned}$$
(18)
$$\begin{aligned} \min \left\{ {\mathbb {E}}[D_\tau ] \mid \tau \in \{0,1,\hdots ,7\} \right\}&= {\left\{ \begin{array}{ll} {\mathbb {E}}[D_0] &{} \text {if } \tau _C = 7, \\ {\mathbb {E}}[D_{\tau _C+1}] &{} \text {if } \tau _C \ne 7. \end{array}\right. } \end{aligned}$$
(19)

It is common sense that the peak customer demand volume occurs at the cutoff point (Kim 2020), i.e. in the time period when same-cycle shipment is offered for the last time.Footnote 1 Note that due to the varying cutoff point \(\tau _C\), the corresponding series of expected customer demand values \({\mathbb {E}}[D_\tau ]\), \(\tau \in \{0,1,\hdots , 7\}\), varies accordingly. However, the expected total customer demand per operating cycle \(\sum _{\tau =0}^{7} {\mathbb {E}}[D_\tau ]\) is constant. scv[D] varies between 0.1 and 1.0 to model different types of stochastic customer demand. A high variability of customer demand is common for warehouses operating in the B2C-segment, whereas warehouses in the B2B-segment face a lower variability of customer demand (Boysen et al. 2021).

The processing capacity per time period B is a discretised Beta-distributed random variable with finite support \(\{0,1,\hdots , 20\}\), expected value \({\mathbb {E}}[B]\) and squared coefficient of variation scv[B]. The expected processing capacity \({\mathbb {E}}[B]\) derives from the system utilisation U and the expected total customer demand per operating cycle as follows (cf. (6)):

$$\begin{aligned} {\mathbb {E}}[B] = \frac{\sum _{\tau =0}^{T-1}{\mathbb {E}}[D_\tau ] }{T U}. \end{aligned}$$
(20)

System utilisation U varies between 0.5 and 0.95 to model different types medium- and high-utilised warehouses. Hence, the expected processing capacity \({\mathbb {E}}[B]\) varies accordingly. The squared coefficient of variation scv[B] varies between 0.1 and 1.0 to cover different types of order fulfilment processes (e.g. fully-automated, partially automated and manual warehousing systems; picker-to-parts and parts-to-picker systems) (Boysen et al. 2019, 2021).

To implement and compute our Markov model, we derive a finite upper bound \({\overline{X}}\) for its state space. Due to this finite upper bound, some orders are potentially rejected without being fulfilled. An incoming order is rejected in operating cycle k only if, by accepting this order, the current order backlog \(X^k\) would exceed the upper bound \({\overline{X}}\). We determine \({\overline{X}}\) such that the probability of rejecting an order is negligibly small. In this study, the threshold value is 0.003. The formula of the rejection probability and the required adaptions of the state transition are given in Appendix A.

In total, the study incorporates 6,930 data points, and the average computation time per data point is nine seconds.Footnote 2

5.2 Performance analysis

We first analyse the effects of cutoff-based shipment promises on the performance of deadline-oriented order fulfilment processes. Figure 3 shows the expected number of backorders \({\mathbb {E}}[M]\) and preprocessed orders \({\mathbb {E}}[P]\) as well as \(\alpha-\) and \(\beta-\)cutoff service level \(SL_\alpha\), \(SL_\beta\) depending on the cutoff point \(\tau _C\) for an exemplary order fulfilment process (\(U = 0.8\), \(scv[D] = 0.5\), \(scv[B]=0.5\)).

Fig. 3
figure 3

Performance measures \(SL_\alpha\), \(SL_\beta\), \({\mathbb {E}}[M]\), \({\mathbb {E}}[P]\) depending on cutoff point \(\tau _C\) for an exemplary order fulfilment process (\(U=0.8\), \(scv[D] = 0.5\), \(scv[B] = 0.5\))

If the cutoff point is set as early as possible, i.e. \(\tau _C = 0\), only 20% of the total customer demand per operating cycle is due by the next deadline, which occurs immediately after the current operating cycle. Hence, it is very likely that all orders are ready for shipment by their promised due date, reflected in a low number of backorders, \({\mathbb {E}}[M] = 0.4\), and high values of \(\alpha-\) and \(\beta-\)cutoff service level, \(SL_\alpha = 97\%\) and \(SL_\beta = 100\%\). Concurrently, a high proportion of the total processing capacity per operating cycle is used to already process orders for next-cycle shipment, reflected in a high number of preprocessed orders, \({\mathbb {E}}[P] = 26.5\).

By postponing the cutoff point towards the deadline, the proportion of orders for same-cycle shipment on total customer demand per operating cycle increases, and there is a growing risk of missing the promised due date for some orders. Consider a cutoff point in the middle of the operating cycle, e.g. \(\tau _C = 4\), the number of backorders increases to \({\mathbb {E}}[M] = 2.3\) and the cutoff service levels reduce to \(SL_\alpha = 83\%\) and \(SL_\beta = 98\%\).

When setting the cutoff point just one time period ahead the deadline, i.e. \(\tau _C = 7\), same-cycle shipment is offered to all customers arriving throughout the operating cycle. So, the total customer demand per operating cycle is due by the next deadline, and there are no orders for next-cycle shipment. Ensuring on-time shipment becomes very challenging in this setting, in particular when facing the peak of the customer demand at the cutoff point, as orders that arrive at the cutoff point are promised to be ready for shipment in the next time period thereafter. This is reflected in a high number of backorders, \({\mathbb {E}}[M] = 5.6\), and low cutoff service levels, \(SL_\alpha = 62\%\) and \(SL_\beta = 95\%\). Note that the number of preprocessed orders \({\mathbb {E}}[P]\) equals zero as there are no orders for next-cycle shipment.

5.3 Benefit of cutoff-based shipment promises

We evaluate the benefit of cutoff-based shipment compared to a benchmark policy, called SameCycle, which offers same-cycle shipment to all customer orders irrespective of their arrival times. SameCycle can be seen as cutoff-based shipment with a cutoff point of \(\tau _C = T-1 = 7\). For ease of simplicity, the following analysis focuses on two selected cutoff-based shipment policies: LateCutoff sets the cutoff point two time periods ahead the deadline, i.e. \(\tau _C = 6\), so that same-cycle shipment is offered only one time period less than under SameCycle. The other policy, called Medium-Cutoff, sets the cutoff point in the middle of the operating cycle, i.e. \(\tau _C = 4\).

Table 3 Benefit of cutoff-based shipment policies LateCutoff and MediumCutoff compared to benchmark policy SameCycle

Table 3 gives the cutoff service levels \(SL_\alpha\), \(SL_\beta\) and the expected number of backorders \({\mathbb {E}}[M]\) and preprocessed orders \({\mathbb {E}}[P]\) of the policies LateCutoff, MediumCutoff, and SameCycle, as well as the benefits of LateCutoff and MediumCutoff compared to SameCycle. LateCutoff achieves an average benefit of 14% in terms of \(\alpha-\)cutoff service level and 1% in terms of \(\beta-\)cutoff service level compared to SameCycle, and the number of backorders reduces on average by 19%. Applying MediumCutoff results in even higher average improvements: 38% increase of \(\alpha-\)cutoff service level, 2% increase of \(\beta-\)cutoff service level, and 46% reduction in the number of backorders. The benefit of cutoff-based shipment in terms of \(\alpha-\)cutoff service level highly depends on the utilisation U of the order fulfilment process. Consider LateCutoff, the benefit is 3% at a medium utilisation of \(U=0.6\) and 29% at a high utilisation of \(U=0.95\). Similarly, the benefit of MediumCutoff increases from 7% at \(U=0.6\) to 83% at \(U=0.95\).

These results indicate that introducing a cutoff point significantly improves system performance compared to the same-cycle shipment benchmark policy. Even with a late cutoff point, such that same-cycle shipment is offered only one time period less than under the benchmark policy, higher service levels are achieved. By shifting the cutoff point to earlier time periods of the operating cycle, system performance further improves. However, this happens at the expense of longer order response times perceived by the customers as same-cycle shipment is offered to a decreasing proportion of customers ordering early in the operating cycle, whereas an increasing proportion of customers receives next-cycle shipment.

5.4 Design of cutoff-based shipment promises

Although we showed that offering cutoff-based shipment conditions is beneficial, the question of how to actually set the cutoff point in order to offer competitive shipment services is still open. This decision problem depends on multiple aspects, such as customer expectations in terms of service quality, characteristics of the order fulfilment process, and the considered performance measure. The following analysis focuses on the performance measures \(\alpha-\) and \(\beta-\)cutoff service level as service levels are common key performance indicators of order fulfilment processes in practice (van Gils et al. 2018).

5.4.1 Effects of customer expectations

The cutoff point is negatively related to \(\alpha-\) and \(\beta-\)cutoff service level, respectively (cf. Figure 3). By postponing the cutoff point towards the deadline of the operating cycle, the proportion of orders for same-cycle shipment increases, the risk of not meeting the promised order due dates grows, and the cutoff service levels reduce. However, late cutoff points have the merit of short order response times, and both service level and order response time are important facets of the service quality perceived by customers.

These results are consistent with previous findings by Doerr and Gue (2013) and Ceven and Gue (2017) and confirm the general trade-off between high service levels and short order response times, reflected by late cutoff points, in stochastic order fulfilment settings. Given this trade-off, the cutoff point should be set such that it balances customer expectations in terms of service level and order response time.

5.4.2 Effects of type of cutoff service level

\(\alpha-\)cutoff service level is more sensitive to a variation of the cutoff point than \(\beta-\)-cutoff service level. In other words, the trade-off between service level and order response time is more substantial regarding \(\alpha-\)cutoff service level than \(\beta-\)cutoff service level. Consider e.g. the exemplary order fulfilment process in Fig. 3: For the given range of cutoff points \(\tau _C \in \{0, 1, \hdots , 7\}\), \(\alpha-\)cutoff service level is between 97% and 62% (reduction by 35 percentage points), whereas \(\beta-\)cutoff service level varies between 100% and 95% (reduction by only 5 percentage points). To ensure a service level of 95% in this setting, the cutoff point is set early in the operating cycle, i.e. \(\tau _C=1\), when referring to \(\alpha-\)cutoff service level. In contrast, regarding \(\beta-\)cutoff service level, it is possible to offer same-day shipment to all customers arriving throughout the operating cycle, i.e. \(\tau _C=7\), and still guarantee a service level of 95%. Furthermore, it is impossible to ensure a service level of 99% with cutoff-based shipment when referring to \(\alpha-\)cutoff service level, whereas this service level target is achieved when considering \(\beta-\)cutoff service level, e.g. by setting a cutoff point of \(\tau _C=3\).

These examples illustrate that depending on the underlying type of cutoff service level, cutoff points are set differently, and even the range of achievable service level targets may vary. This is due to the fact that the \(\beta-\)cutoff service level is always greater than or equal to the \(\alpha-\)cutoff service level in a given setting (cf. Sect. 4.2).

5.4.3 Effects of system characteristics

Fig. 4
figure 4

Effects of simultaneous variation of utilisation U, variability of processing capacity scv[B], and variability of customer demand scv[D] on the relationship between \(\alpha-\)cutoff service level \(SL_\alpha\) and cutoff point \(\tau _C\)

Fig. 5
figure 5

Effects of simultaneous variation of utilisation U, variability of processing capacity scv[B], and variability of customer demand scv[D] on the relationship between \(\beta-\)cutoff service level \(SL_\beta\) and cutoff point \(\tau _C\)

The characteristics of the order fulfilment process are specified by the system parameters utilisation U, variability of processing capacity scv[B], and variability of customer demand scv[D]. Figures 4 and 5 illustrate the effects of simultaneous variation of these system parameters on the trade-off between service level and order response time, reflected by the cutoff point. They show \(\alpha-\)cutoff service level \(SL_\alpha\) respective \(\beta-\)cutoff service level \(SL_\beta\) depending on the cutoff point \(\tau _C\) for selected values of utilisation (\(U=0.6, 0.7, 0.8, 0.9, 0.95\)) in order fulfilment processes with low (\(scv[\cdot ]=0.1\)), medium (\(scv[\cdot ]=0.5\)), and high (\(scv[\cdot ]=1.0\)) variability of customer demand scv[D] and processing capacity scv[B], respectively. Pairwise comparisons of selected graphs and curves illustrate the effects of any isolated or simultaneous variation of the system parameters on \(\alpha-\) and \(\beta-\)cutoff service level.

We find that all system parameters are negatively related to \(\alpha-\) and \(\beta-\)cutoff service level, respectively. Hence, an increase in each system parameter negatively affects the trade-off between service level and order response time. For example, consider an order fulfilment process with \(scv[B]=scv[D]=0.5\) (cf. 2nd graph in 2nd row of Fig. 4), given a utilisation of \(U=0.8\), the \(\alpha-\)cutoff service level varies between 96.8% and 61.8% depending on the selected cutoff point. At a higher utilisation of \(U=0.9\), \(\alpha-\)cutoff service level is between 80.8% and 35.6%. Hence, to ensure a service level target of \(SL_\alpha = 80\%\), it is sufficient to set a cutoff point of \(\tau _C=4\) in the setting with \(U = 0.8\), but given the higher utilisation of \(U=0.9\), an early cutoff point of \(\tau _C =0\) is required. Furthermore, in the setting with \(U=0.9\), it is impossible to ensure any service level target greater than 80% with cutoff-based shipment. Similar effects occur when increasing the variabilities of processing capacity or customer demand, respectively.

However, the system parameters differ in terms of the magnitude of their effects: Utilisation U is the major impact factor, the variabilities of processing capacity scv[B] and customer demand scv[D] are minor impact factors. For example, consider an order fulfilment process with \(U=0.8\) and \(scv[B]=scv[D]=1.0\) (cf. 3rd graph in 3rd row of Fig. 4) that ensures a service level target of \(SL_\alpha = 80\%\) by setting an early cutoff point of \(\tau _C = 1\). When halving the variability of customer demand (\(scv[D]=0.5\)) or processing capacity (\(scv[B]=0.5\)), the service level target is achieved with later cutoff points of \(\tau _C=2\) respective \(\tau _C=3\). In contrast, when reducing system utilisation by only 25% to \(U=0.6\), it is possible to guarantee the service level target with a late cutoff point of \(\tau _C=7\), i.e. same-cycle shipment is offered to all customers arriving throughout the operating cycle.

These results indicate that the characteristics of the order fulfilment process, especially its utilisation, affect the purposive selection of the cutoff point and limit the range of achievable service level targets. Conversely, measures that reduce system utilisation and variabilities of processing capacity and customer demand provide potentials to offer a later cutoff point at a given service level target or ensure a higher service level at a given cutoff point.

6 Implications

Fig. 6
figure 6

Cutoff-based shipment promises for B2B- and B2C-warehouses

Offering cutoff-based shipment conditions is an effective measure to improve system performance in deadline-oriented order fulfilment processes. Cutoff service levels and related performance measures of a warehouse improve significantly by introducing a cutoff point compared to a benchmark policy that offers same-cycle shipment to all customers. However, effective management of stochastic order fulfilment processes with deadlines is still challenging for warehouse managers. First, setting an appropriate cutoff point is non-trivial due to the trade-off between high service levels and short order response times. Second, this decision problem is affected by the considered type of cutoff service level. Third, there are several control levers to improve the offered shipment conditions. We discuss these implications for warehouse managers in the following.

When setting the cutoff point, warehouse managers must balance customer expectations in terms of service level and order response time. The ideal shipment conditions would be that warehouses offer same-cycle shipment to customers that order late during the operating cycle (i.e. setting a late cutoff point), while at the same time achieving a high cutoff service level (cf. Fig. 6). Since these shipment conditions are usually not feasible, warehouse managers should concentrate on the primary influence on customers’ perceived service quality. For time-sensitive customers (e.g. customers in competitive e-commerce), the perceived service quality is mainly driven by short order response times and service level is of secondary importance (Yaman et al. 2012; Boysen et al. 2019). Hence, to attract these customers, warehouses operating in the B2C-segment should focus on setting the cutoff point as late as possible while still guaranteeing an acceptable cutoff service level (cf. Fig. 6). In contrast, the time pressure for warehouses operating in the B2B-segment is lower (Boysen et al. 2021). The customers are reliability-sensitive, meaning that the perceived service quality is mainly driven by the service level and they do not mind early cutoff points as long as a cutoff service level close to 100% is guaranteed. Hence, warehouses in the B2B-segment should set the cutoff point only so late such that they can still guarantee a high cutoff service level (cf. Fig. 6).

Warehouse managers find it easier to ensure later cutoff points at a given service level target or higher cutoff service levels at a given cutoff point when referring to \(\beta-\) instead of \(\alpha-\)cutoff service level. Therefore, from the warehouse perspective, the quantity-based \(\beta\)-cutoff service level is preferable to the probability-based \(\alpha-\)cutoff service level. For warehouses operating in the B2C-segment, it is straightforward to use \(\beta-\)cutoff service level to communicate their shipment promises (e.g. on the order website). Furthermore, their customers are time-sensitive (as mentioned above), so the service level is only secondary for the perceived service quality. In the B2B-segment, warehouse managers should carefully select and transparently communicate their service level measurement to the customers as they usually enter long-term service contracts that are legally binding to guarantee a certain service level and include penalties in case of not meeting the service level target. Compared to the B2C-segment, warehouse managers find it more difficult to negotiate \(\beta-\)cutoff service level into service contracts, as long-term commercial customers tend to prefer \(\alpha-\)cutoff service level. This is since commercial customers place multiple orders during an operating cycle (e.g. a company running multiple retail stores that are supplied from the same warehouse) and therefore expect that all orders are shipped on time.

Warehouse managers should exploit internal control levers to improve the offered shipment conditions. First, efforts can be made to increase the process stability of the order fulfilment process (e.g. by using tools of lean management and standardisation) to decrease the variability of the processing capacity. Accordingly, reducing process variability from a high to a medium (low) value in a B2B-warehouse operating with an early cutoff point (\(\tau _C = 2\)) increases \(\alpha-\)cutoff service level by 2.8% (4.8%). Second, investments in additional resource capacity (e.g. manpower, machines, tools) reduce system utilisation which has enormous potential for improving the offered shipment conditions. By reducing system utilisation from 95% to 90% (80%), B2C-warehouses operating with a target \(\beta-\)cutoff service level of 95% may postpone the cutoff point by 3 h (6 h) towards the deadline. In B2B-warehouses that operate with an early cutoff point (\(\tau _C = 2\)), reducing utilisation from 95% to 90% (80%) increases \(\alpha-\)cutoff service level by 6.7% (14.8%). Third, warehouses operating in the B2B-segment may have the opportunity to get access to customers’ demand forecast data (Boysen et al. 2021), which reduces the uncertainties in terms of customers’ purchasing behaviour and decreases the variability of customer demand. Our numerical results suggest that reducing the variability of customer demand from a medium to a low value increases \(\alpha-\)cutoff service level by 3.3% and \(\beta-\)service level by 0.6%.

7 Concluding remarks and outlook

In this paper, we studied the design of cutoff-based shipment promises for deadline-oriented stochastic order fulfilment processes in warehouses. We introduced a discrete-time Markov chain model for deadline-oriented order fulfilment processes with cutoff-based shipment promises, time-dependent generally-distributed customer demand, and generally-distributed processing capacity. Customers ordering until the cutoff point receive same-cycle shipment, i.e. their orders are promised to be ready for shipment by the next truck departure, whereas customers ordering after the cutoff point receive next-cycle shipment, i.e. their orders are promised to ready for shipment by the truck departure after the next. Orders that are not completed on time become backorders and are carried over to the next period. The model enables an exact steady-state performance analysis for these systems based on the performance measures backorders, preprocessed orders, \(\alpha-\) and \(\beta-\)cutoff service level. By this, we are the first to study how cutoff-based shipment promises affect the system performance of stochastic deadline-oriented order fulfilment processes in warehouses.

Based on a comprehensive numerical study, we investigated the benefit of cutoff-based shipment promises as well as the decision problem of setting an appropriate cutoff point. Our results show that introducing cutoff-based shipment conditions significantly improves the cutoff service level. Compared with a benchmark policy, \(\alpha-\) and \(\beta-\)cutoff service level improve on average by 14% and 1%, respectively. However, due to the trade-off between high service levels and short order response times, setting an appropriate cutoff point is a managerial challenge. We conclude that B2C-warehouses serving time-sensitive customers should focus on setting the cutoff point as late as possible while still ensuring an acceptable service level. B2B-warehouses serve reliability-sensitive customers, so the cutoff point should only be set so late such that they can still ensure a high service level. In general, warehouses should prefer \(\beta-\) over \(\alpha-\)cutoff service level as they are able to ensure later cutoff points at a given service level target or higher service levels at a given cutoff point. To improve the offered shipment conditions, warehouse managers may implement measures to reduce the utilisation, increase process stability of the order fulfilment process, or reduce the uncertainty of the customer demand.

The approach of cutoff-based shipment promises studied in this paper offers a single shipment option to every customer. Interesting future research directions are settings which offer multiple shipment options concurrently, e.g. same-day and next-day shipment, such that the customers can select their preferred option. In these settings, it is furthermore common to charge different fees for the shipment options to incentive the customers to choose the shipment option that is favourable from the operational perspective of the warehouse. Beyond, future research can be conducted on studying the effects of cutoff-based shipment promises on general operational decisions in a warehouse, such as capacity planning.