1 Introduction

Transient convection diffusion optimal control problems are widely used to model some engineering problems, for example, air pollution problem [1], [2] and waste water treatment [3]. In recent years the numerical approximations of this kind of problems form a hot topic, and many works are contributed to develo** effective numerical methods and algorithms. For stabilization methods, we refer to [4]–[7] and for discontinuous Galerkin methods, we refer to [8], [9]. For more literature, one can refer to the references cited therein.

It is well known that the solutions to convection diffusion problems may have boundary layers with small widths where their gradients change rapidly. Therefore, only using the stable methods to solve convection diffusion optimal control problems is generally not enough. One approach to improve the quality of a numerical solution is to exploit special mesh which is locally refined near the boundary layers, for example, Shishkin-type mesh or adaptive mesh. Note that a priori knowledge of the locations of the boundary layers is necessary to construct Shishkin-type mesh. Using adaptive mesh to resolve the boundary layers seems to be more natural. As we know the key problem of the adaptive finite element method is the a posteriori error estimate. Compared with a posteriori error estimates for stationary convection diffusion optimal control problems (see, [7], [10]–[12]), the works devoted to a posteriori error estimates for transient convection diffusion optimal control problems are much fewer. In [13] the authors discuss adaptive characteristic finite element approximation of transient convection diffusion optimal control problems with a general diffusion coefficient, where a posteriori error estimates in L 2 (0,T; L 2 (Ω)) norm are derived by dual argument skill for the state and adjoint state variables.

The primary interest of this paper is to derive a posteriori error estimates for the following transient convection diffusion optimal control problem with dominance convection:

min u U a d J(y,u)= 1 2 Ω T ( y ( x , t ) y d ( x , t ) ) 2 dxdt+ γ 2 Ω T u 2 (x,t)dxdt
(1.1)

subject to

{ y t + β y + α y ε y = f + u , ( x , t ) Ω T = Ω × ( 0 , T ) , y ( x , t ) = 0 , ( x , t ) Γ T = Ω × ( 0 , T ) , y ( x , 0 ) = y 0 ( x ) , ( x , t ) Ω .
(1.2)

The details will be specified in the next section.

In order to improve the quality of the numerical solutions, the continuous interior penalty Galerkin method (CIP Galerkin method) is used to solve the state equation (1.2). This method was firstly proposed in [14]. In [7], [15] the CIP Galerkin method was used to approximate stationary convection diffusion optimal control problems, where a posteriori error estimates in L 2 (Ω) and energy norm were derived. In [16] the CIP Galerkin method combined with Crank-Nicolson scheme was used to solve transient convection diffusion optimal control problems without constraints and a priori error estimates were deduced.

In the present paper, we apply the CIP Galerkin method combined with the backward Euler method to solve control constrained transient convection diffusion optimal control problems (1.1)-(1.2), where the control is discretized by the implicit discretization method developed in [17], and the state is approximated by piecewise linear finite element space. Due to the existence of boundary layer or interior layer for the state and adjoint state as well as limited regularity of control variable, we derive a posteriori error estimates for the state and adjoint state, which can be utilized to guide the mesh refinements in the adaptive algorithm. In contrast to [13], here we use the elliptic reconstruction technique developed in [18] for parabolic problems instead of dual argument skill to deduce the a posterior error estimates for the state and adjoint state. By use of this technique we can take full advantage of the well-established a posteriori error estimates for stationary convection diffusion optimal control problems in [7], [15] to derive the a posterior error estimate for transient convection diffusion optimal control problems.

The paper is organized as follows. In Section 2 we describe the continuous interior penalty Galerkin scheme for the constrained optimal control problem. In Section 3 a posteriori error estimates are derived. Finally, we briefly summarize the method used, results obtained and possible future extensions and challenges.

Throughout this paper C>0 denotes a generic constant independent of mesh parameters and may be different at different occurrence. We use the expression ab to stand for aCb.

2 The CIP Galerkin approximation scheme

2.1 Problems formulation

Consider the following transient convection diffusion optimal control problems:

min u U a d J(y,u)
(2.1)

subject to

{ y t + β y + α y ε y = f + u , ( x , t ) Ω T , y ( x , t ) = 0 , ( x , t ) Γ T , y ( x , 0 ) = y 0 ( x ) , x Ω .
(2.2)

Here Ω is a bounded domain in R 2 with boundary Ω. f L 2 ( Ω T ) and y 0 (x) H 0 1 (Ω) is the initial value. U a d ={u L 2 ( Ω T ):au(x,t)b a.e. in  Ω T } is a bounded convex set with two constants satisfying a<b. α>0 is the reaction coefficient, 0<ε1 is a small diffusion coefficient, and β ( W 1 , ( Ω ) ) 2 is a velocity field. We assume that the following coercivity condition holds:

α 1 2 β α 0 >0.

To consider the CIP Galerkin approximation of the above optimal control problem, we first derive a weak formulation for the state equation. Let A(,) be the bilinear form given by

A(y,w)=(εy,w)+(βy,w)+(αy,w),y,w H 0 1 (Ω).
(2.3)

It is easy to check

A(y,y) y 2 ,
(2.4)

where

y = ( ε y 0 , Ω 2 + α 0 y 0 , Ω 2 ) 1 / 2 .

Then the weak formulation of state equation (2.2) reads as

( y t , w ) +A(y,w)=(f+u,w),w H 0 1 (Ω).

The variational formulation of optimal control problem (2.1)-(2.2) then can be written as

min u U a d J(y,u)
(2.5)

subject to

( y t , w ) +A(y,w)=(f+u,w),w H 0 1 (Ω).
(2.6)

The existence and uniqueness of solutions to (2.5)-(2.6) can be guaranteed by the theory in [19]. Moreover, by using the Lagrange functional, the first-order necessary (also sufficient here) optimality condition of (2.5)-(2.6) can be characterized by

{ ( y t , w ) + A ( y , w ) = ( f + u , w ) , w H 0 1 ( Ω ) , ( z t , w ) + A ( w , z ) = ( y y d , w ) , w H 0 1 ( Ω ) , ( γ u + z , v u ) 0 , v U a d .
(2.7)

From the second equation in (2.7), we have that the adjoint state z satisfies transient convection diffusion equations with the strong form

{ z t ε z ( β z ) + α z = y y d , ( x , t ) Ω T , z ( x , t ) = 0 , ( x , t ) Γ T , z ( x , T ) = 0 , x Ω .
(2.8)

In contrast to the state equation, the velocity field of the adjoint equation is −β .

By the pointwise projection on U a d ,

P U a d : L 2 ( Ω T ) U a d , P U a d (v)=max ( a , min ( v ( x , t ) , b ) ) ,
(2.9)

the optimal condition in (2.7) simplifies to

u= P U a d ( 1 γ v ) .
(2.10)

2.2 Semi-discrete discretization

Let T h be a regular triangulation of Ω, so that Ω ¯ = K T h K ¯ . Let h K denote the diameter of the element K. Associated with T h is a finite dimensional subspace W h of C( Ω ¯ ) H 0 1 (Ω), consisting of piecewise linear polynomials.

To control the convective derivative of the discrete solution sufficiently, a symmetric stabilization form S (see, e.g., [14]) on W h × W h was introduced as follows:

S( v h , w h )=σ E E h E h E 2 [ v h n][ w h n]ds,

where σ>0 is the stabilization parameter. E h denotes the collection of interior edges of the elements in T h . h E is the size of the edge E. [q] denotes the jump of q across E for E E h defined by

[ q ( x ) ] = lim s 0 + ( q ( x + s n ) q ( x s n ) ) ,

with n being the outward unit normal.

Using the above stabilization form, a semi-discrete CIP Galerkin approximation of optimal control problem (2.1)-(2.2) is defined by

min u h U a d J( y h , u h )
(2.11)

subject to

{ ( y h t , w h ) + A ( y h , w h ) + S ( y h , w h ) = ( f + u h , w h ) , w h W h , y h ( 0 ) = y 0 h W h .
(2.12)

Here the control variable was approximated by variational discrete concept (see [17]). u h in general is not a finite element function associated with the space mesh T h .

By standard argument it can be shown that the following first-order optimality condition holds:

{ ( y h t , w h ) + A ( y h , w h ) + S ( y h , w h ) = ( f + u h , w h ) , w h W h , ( z h t , q h ) + A ( q h , z h ) + S ( q h , z h ) = ( y h y d , q h ) , q h W h , ( γ u h + z h , v h u h ) 0 , v h U a d , y h ( 0 ) = y 0 h , z h ( T ) = 0 .
(2.13)

By the pointwise projection operator P U a d , we have

u h = P U a d ( 1 γ z h ) .
(2.14)

2.3 Fully discrete scheme

To define a fully discrete scheme, we introduce a time partition. Let 0= t 0 < t 1 << t N 1 < t N =T be a time grid with τ n = t n t n 1 , n=1,2,,N. Set I n =( t n 1 , t n ].

Using variational discretization concept, the fully discrete CIP Galerkin scheme for (2.1)-(2.2) reads as follows:

min u h n U a d J h ( y h n , u h n ) = 1 2 n = 1 N τ n ( y h n y d n 0 , Ω 2 + γ u h n 0 , Ω 2 )
(2.15)

subject to

{ ( y h n y h n 1 τ n , w h ) + A ( y h n , w h ) + S ( y h n , w h ) = ( f n + u h n , w h ) , w h W h , y h 0 = y 0 h W h .
(2.16)

Similar to a semi-discrete scheme, we can derive the discrete first-order optimality condition:

{ ( y h n y h n 1 τ n , w h ) + A ( y h n , w h ) + S ( y h n , w h ) = ( f n + u h n , w h ) , w h W h , ( z h n 1 z h n τ n , q h ) + A ( q h , z h n 1 ) + S ( q h , z h n 1 ) = ( y h n y d n , q h ) , q h W h , ( γ u h n + z h n 1 , v h u h n ) 0 , v h U a d , y h 0 = y 0 h , z h N = 0 , n = 1 , 2 , , N .
(2.17)

Again by the pointwise projection operator P U a d , we obtain

u h n = P U a d ( 1 γ z h n 1 ) .

We can see that u h n is a piecewise constant function in time.

For n=1,2,,N, let

Y h | ( t n 1 , t n ] = l n ( t ) y h n + l n 1 ( t ) y h n 1 , Z h | ( t n 1 , t n ] = l n ( t ) z h n + l n 1 ( t ) z h n 1 , U h | ( t n 1 , t n ] = u h n ,

where

l n (t)= t t n 1 τ n , l n 1 (t)= t n t τ n .

For ψC(0,T; L 2 (Ω)), t( t n 1 , t n ], we set ψ ˆ =ψ(x, t n ), ψ ¯ =ψ(x, t n 1 ). Note that

Y h t = y h n y h n 1 τ n , Z h t = z h n z h n 1 τ n

for t( t n 1 , t n ]. Then the above optimality conditions can be rewritten as

{ ( Y h t , w h ) + A ( Y ˆ h , w h ) + S ( Y ˆ h , w h ) = ( f n + U h , w h ) , w h W h , ( Z h t , q h ) + A ( q h , Z ¯ h ) + S ( q h , Z ¯ h ) = ( Y ˆ h y d n , q h ) , q h W h , ( γ U h + Z ¯ h , v h U h ) 0 , v h U a d .
(2.18)

3 A posteriori error estimates

The objective of this section is to derive a posteriori error estimates for the state, adjoint state and control.

3.1 The estimate for control

To obtain the estimate for control, we introduce an auxiliary problem. For given U h , let (y( U h ),z( U h )) be a solution of the following system:

{ ( y ( U h ) t , w ) + A ( y ( U h ) , w ) = ( f + U h , w ) , w H 0 1 ( Ω ) , y ( U h ) ( x , 0 ) = y 0 ( x ) , x Ω , ( z ( U h ) t , q ) + A ( q , z ( U h ) ) = ( y ( U h ) y d , q ) , q H 0 1 ( Ω ) , z ( U h ) ( x , T ) = 0 , x Ω .
(3.1)

Lemma 3.1

Let(y,z,u)and( Y h , Z h , U h )be the solutions of (2.7) and (2.18), respectively. Then the following estimate

u U h L 2 ( 0 , T ; L 2 ( Ω ) ) C ( z ( U h ) Z h L 2 ( 0 , T ; L 2 ( Ω ) ) + Z h Z ¯ h L 2 ( 0 , T ; L 2 ( Ω ) ) )

holds.

Proof

It follows from (2.7) and (2.18) that

γ u U h L 2 ( 0 , T ; L 2 ( Ω ) ) 2 = 0 T Ω γ u ( u U h ) 0 T Ω γ U h ( u U h ) 0 T Ω z ( U h u ) 0 T Ω γ U h ( u U h ) = 0 T Ω ( z Z ¯ h ) ( U h u ) 0 T Ω ( γ U h + Z ¯ h ) ( u U h ) = 0 T Ω ( z z ( U h ) ) ( U h u ) + 0 T Ω ( z ( U h ) Z ¯ h ) ( U h u ) 0 T Ω ( γ U h + Z ¯ h ) ( u U h ) 0 T Ω ( z z ( U h ) ) ( U h u ) + 0 T Ω ( z ( U h ) Z ¯ h ) ( U h u ) .

Here the last inequality was fulfilled due to the implicit discretization of the control variable.

By (2.7) and (3.1) we have

0 T Ω ( z z ( U h ) ) ( u U h ) = 0 T Ω ( y y ( U h ) ) t ( z z ( U h ) ) + 0 T A ( y y ( U h ) , z z ( U h ) ) = 0 T Ω ( z z ( U h ) ) t ( y y ( U h ) ) + 0 T A ( y y ( U h ) , z z ( U h ) ) = 0 T Ω ( y y ( U h ) ) 2 0 ,

where y( U h )(0)= y 0 (x) and z( U h )(T)=0 was used. Thus we arrive at

γ u U h L 2 ( 0 , T ; L 2 ( Ω ) ) 2 0 T Ω ( z ( U h ) Z ¯ h ) ( U h u ) C ( δ ) z ( U h ) Z h L 2 ( 0 , T ; L 2 ( Ω ) ) 2 + C ( δ ) Z h Z ¯ h L 2 ( 0 , T ; L 2 ( Ω ) ) 2 + C δ u U h L 2 ( 0 , T ; L 2 ( Ω ) ) 2 .

Choosing δ= γ 2 C yields the theorem result. □

3.2 The estimate for the state and adjoint state

In this section we shall adopt the elliptic reconstruction technique proposed in [18], [20] to derive a posteriori error estimates for the state and adjoint state.

To this end we first introduce the following elliptic reconstruction definitions for state and adjoint state.

Definition 3.2

For n=1,2,,N, we define the elliptic reconstruction ν n H 0 1 (Ω) and ω n 1 H 0 1 (Ω) satisfying the following elliptic problems:

A ( ν n , w ) = ( f n + u h n y h n y h n 1 τ n , w ) ,w H 0 1 (Ω)
(3.2)

and

A ( q , ω n 1 ) = ( y h n y d n + z h n z h n 1 τ n , q ) ,q H 0 1 (Ω).
(3.3)

Noticing that the CIP Galerkin approximation of ν n can be defined as

A ( ν h n , w h ) +S ( ν h n , w h ) = ( f n + u h n y h n y h n 1 τ n , w h ) , w h W h n .

Then we have

A ( y h n ν h n , w h ) +S ( y h n ν h n , w h ) =0,

which implies y h n = ν h n . We can observe a similar property for the CIP Galerkin approximation of ω n 1 .

Using the above convention, we define ν(t) and ω(t) as

ν(t)= l n (t) ν n + l n 1 (t) ν n 1

and

ω(t)= l n (t) ω n + l n 1 (t) ω n 1

for t I n and n[1,N]. We decompose the error as follows:

y( U h ) Y h (t)=y( U h )ν(t) ( Y h ( t ) ν ( t ) ) := ρ y ξ y

and

z( U h ) Z h (t)=z( U h )ω(t) ( Z h ( t ) ω ( t ) ) := ρ z ξ z .

Nextly we shall derive the estimates of ρ and ξ. For simplicity, we introduce the following notations:

R K , y n = f n + u h n Y h t β y h n α y h n , R E , y n = [ y h n n ] , R K , z n = y h n y d n + Z h t + ( β z h n 1 ) α z h n 1 , R E , z n = [ z h n 1 n ] , ¯ t φ n = φ n φ n 1 τ n , ¯ t 2 φ n = ¯ t φ n ¯ t φ n 1 τ n .

Let A h s and A h a be the discrete operators associated with the state and adjoint state, which are defined by the following for v W h :

A h s v , w h = A ( v , w h ) + S ( v , w h ) , w h W h , A h a v , w h = A ( w h , v ) + S ( w h , v ) , w h W h .

The time error estimators are characterized by

θ y , n = { ( ¯ t f n + ¯ t u h n ¯ t 2 y h n ) τ n , 2 n N , f 1 + u h 1 ¯ t y h 1 A h s y h 0 , n = 1

and

θ z , n = { ( ¯ t y h n ¯ t y d n + ¯ t 2 z h n 1 ) τ n , 1 n N 1 , y h N y d N + ¯ t z h N A h a z h N , n = N .

Moreover, let α K =min{ α 0 1 2 , ε 1 2 h K }, α E =min{ α 0 1 2 , ε 1 2 h E }.

By the standard techniques used in a posteriori error estimate for stationary convection diffusion optimal control problems [7], [15], we obtain the following results.

Lemma 3.3

Let ν n and y h n be the solutions to (3.2) and (2.16). Then the following a posteriori error estimate holds:

ξ y n η y , n ,

where

η y , n 2 = K T h α K 2 R K , y n 0 , K 2 + E E h ε 1 2 α E ε R E , y n 0 , E 2 + E E h α E 2 h E R E , y n 0 , E 2 .

Lemma 3.4

Let ω n 1 and z h n 1 be the solutions to (3.3) and (2.17). Then the following a posteriori error estimate holds:

ξ z n 1 η z , n ,

where

η z , n 2 = K T h α K 2 R K , z n 0 , K 2 + E E h ε 1 2 α E ε R E , z n 0 , E 2 + E E h α E 2 h E R E , z n 0 , E 2 .

In the following we shall deduce the estimates of ρ y and ρ z . By (3.1) and Definition 3.2 we can derive the following error equations for ρ y and ρ z .

Lemma 3.5

Givent I n , we deduce

( ρ y t , ψ ) + A ( ρ y , ψ ) = ( ξ y t , ψ ) + A ( ν n ν ( t ) , ψ ) + ( f f n , ψ ) , ψ H 0 1 ( Ω )
(3.4)

and

( ρ z t , ψ ) + A ( ρ z , ψ ) = ( ξ z t , ψ ) + A ( ψ , ω n 1 ω ( t ) ) + ( y d y d n , ψ ) + ( y ( U h ) y h n , ψ ) , ψ H 0 1 ( Ω ) .
(3.5)

Proof

Note that

( y ( U h ) t , ψ ) +A ( y ( U h ) , ψ ) =(f+ U h ,ψ),ψ H 0 1 (Ω)

and

A ( ν n , ψ ) = ( f n + U h Y h t , ψ ) ,ψ H 0 1 (Ω).

Then we have

( ρ y t , ψ ) + A ( ρ y , ψ ) = ( y ( U h ) t , ψ ) + A ( y ( U h ) , ψ ) ( ν ( t ) t , ψ ) A ( ν ( t ) , ψ ) = ( f + U h , ψ ) + ( ξ y t , ψ ) ( Y h t , ψ ) A ( ν ( t ) , ψ ) = ( f n + U h Y h t , ψ ) + ( ξ y t , ψ ) + ( f f n , ψ ) A ( ν ( t ) , ψ ) = ( ξ y t , ψ ) + ( f f n , ψ ) + A ( ν n ν ( t ) , ψ ) .

Similarly we can deduce the error equation for ρ z . □

Before deriving the estimates for ρ y and ρ z , we first introduce the following lemma with respect to a Clément-type interpolation operator. The proof can be found in [21], [22].

Lemma 3.6

Let I h be a quasi-interpolation operator of Clément type. The following estimates hold for all elements K, all faces E and all functionsv H 0 1 (Ω):

v I h v 0 , K α K v , N ( K ) , v I h v 0 , E ε 1 4 α E 1 2 v , N ( E ) , I h v , K v , N ( K ) ,

whereN(K)andN(E)denote the union of all elements that share at least one point with K and E.

Then we arrive at the following.

Lemma 3.7

The following estimate holds:

max t [ 0 , T ] ρ y 2 + 0 T ρ y 2 ρ y ( 0 ) 2 + ( 1 2 n = 1 N τ n θ y , n ) 2 + ( n = 1 N I n f ( t ) f n ) 2 + [ n = 1 N I n ( K T h α K ¯ t R K , y n 0 , K + E E h α E 1 2 ε 1 4 ε ¯ t R E , y n 0 , E + E E h α E h E 1 2 ¯ t R E , y n 0 , E ) ] 2 .

Proof

Setting ψ= ρ y in (3.4) leads to

1 2 d d t ρ y ( t ) 2 + A ( ρ y , ρ y ) = ( ξ y t , ρ y ) + A ( ν n ν ( t ) , ρ y ) + ( f f n , ρ y ) .

Integrating in time from 0 to T gives

1 2 ( ρ y ( T ) 2 ρ y ( 0 ) 2 ) + 0 T A ( ρ y , ρ y ) = 0 T ( ξ y t , ρ y ) + 0 T A ( ν n ν ( t ) , ρ y ) + 0 T ( f f n , ρ y ) 0 T | ( ξ y t , ρ y ) | + 0 T | A ( ν n ν ( t ) , ρ y ) | + 0 T | ( f f n , ρ y ) | .

Assume that

ρ y ( t m ) = max t [ 0 , T ] ρ y .

Again integrating in time from 0 to t m results in

1 2 ( ρ y ( t m ) 2 ρ y ( 0 ) 2 ) + 0 t m A ( ρ y , ρ y ) = 0 t m ( ξ y t , ρ y ) + 0 t m A ( ν n ν ( t ) , ρ y ) + 0 t m ( f f n , ρ y ) 0 T | A ( ν n ν ( t ) , ρ y ) | + 0 T | ( f f n , ρ y ) | + 0 T | ( ξ y t , ρ y ) | .

Combining the above two inequalities yields

1 2 ( ρ y ( t m ) 2 ρ y ( 0 ) 2 ) + 0 T A ( ρ y , ρ y ) 2 0 T | A ( ν n ν ( t ) , ρ y ) | + 2 0 T | ( f f n , ρ y ) | + 2 0 T | ( ξ y t , ρ y ) | : = 2 i = 1 3 T i .
(3.6)

In the following we shall derive the estimates of T i . By the definition of the elliptic reconstruction, we can bound T 1 as follows:

T 1 = n = 1 N I n l n 1 ( t ) | ( θ y , n , ρ y ) | n = 1 N I n l n 1 ( t ) θ y , n ρ y 1 2 n = 1 N τ n θ y , n ρ y ( t m ) ,

which implies

T 1 C(δ) ( 1 2 n = 1 N τ n θ y , n ) 2 +δ ρ y ( t m ) 2

with an arbitrarily positive constant δ.

For the second term T 2 , we can bound it as follows:

T 2 n = 1 N I n f ( t ) f n ρ y ρ y ( t m ) n = 1 N I n f ( t ) f n C ( δ ) ( n = 1 N I n f ( t ) f n ) 2 + δ ρ y ( t m ) 2 .

Now it remains to estimate T 3 . Note that

T 3 = n = 1 N I n τ n 1 | ( ν n ν n 1 y h n + y h n 1 , ρ y ) | .

This term can be estimated by the techniques used in a posterior error estimates for the stationary problem. To this end we introduce an auxiliary problem

{ ε ϕ ( β ϕ ) + α ϕ = ρ y , in  Ω , ϕ = 0 , on  Ω .
(3.7)

For the above auxiliary problem, the following stability estimates (see, e.g., [23]) hold:

ε 3 2 ϕ 2 + ε 1 2 ϕ 1 +ϕC ρ y .
(3.8)

Using the above auxiliary problem, we have

( ν n ν n 1 y h n + y h n 1 , ρ y ) =A ( ν n ν n 1 y h n + y h n 1 , ϕ ) .

By the definitions of ν n and y n n , we can deduce

( ν n ν n 1 y h n + y h n 1 , ρ y ) = ( f n + u h n ¯ t y h n f n 1 u h n 1 + ¯ t y h n 1 , ϕ ) A ( y h n y h n 1 , ϕ I h ϕ ) A ( y h n y h n 1 , I h ϕ ) S ( y h n y h n 1 , I h ϕ ) + S ( y h n y h n 1 , I h ϕ ) = ( f n + u h n ¯ t y h n f n 1 u h n 1 + ¯ t y h n 1 , ϕ I h ϕ ) A ( y h n y h n 1 , ϕ I h ϕ ) + S ( y h n y h n 1 , I h ϕ ) ,

where I h ϕ denotes the Clément interpolation of ϕ. Further, we have

( ν n ν n 1 y h n + y h n 1 , ρ y ) = τ n K T h K ( ¯ t f n + ¯ t u h n ¯ t 2 y h n ¯ t ( β y h n + α y h n ) , ϕ I h ϕ ) + τ n E E h E [ ε ¯ t y h n ] ( I h ϕ ϕ ) d s + τ n S ( ¯ t y h , I h ϕ ) τ n K T h ¯ t R K , y n 0 , K ϕ I h ϕ 0 , K + τ n E E h ε ¯ t R E , y n 0 , E I h ϕ ϕ 0 , E + τ n σ E E h h E 2 ¯ t R E , y n 0 , E [ n ( I h ϕ ) ] 0 , E .

Note that

[ ( I h ϕ ) ] 0 , E C h E 1 2 ( I h ϕ ) 0 , N ( E ) ,

and

[ ( I h ϕ ) ] 0 , E C h E 3 2 I h ϕ 0 , N ( E ) .

Then we derive

τ n E E h h E 2 ¯ t R E , y n 0 , E [ n ( I h ϕ ) ] 0 , E C τ n E E h h E 3 2 ¯ t R E , y n 0 , E ( I h ϕ ) 0 , N ( E )

or

τ n E E h h E 2 ¯ t R E , y n 0 , E [ n ( I h ϕ 1 ) ] 0 , E C τ n E E h h E 1 2 ¯ t R E , y n 0 , E I h ϕ 0 , N ( E ) .

This implies

τ n E E h h E 2 ¯ t R E , y n 0 , E [ n ( I h ϕ 1 ) ] 0 , E C τ n E E h α E h E 1 2 ¯ t R E , y n 0 , E I h ϕ , N ( E ) .

It follows from Lemma 3.6 and (3.8) that

( w n w n 1 y h n + y h n 1 , ρ y ) C τ n ( K T h α K ¯ t R K , y n 0 , K + E E h α E 1 2 ε 1 4 ε ¯ t R E , y n 0 , E + E E h α E h E 1 2 ¯ t R E , y n 0 , E ) ρ y 0 , Ω .
(3.9)

Thus we arrive at

T 3 C n = 1 N I n ( K T h α K ¯ t R K , y n 0 , K + E E h α E 1 2 ε 1 4 ε ¯ t R E , y n 0 , E + E E h α E h E 1 2 ¯ t R E , y n 0 , E ) ρ y ( t m ) C ( δ ) [ n = 1 N I n ( K T h α K ¯ t R K , y n 0 , K + E E h α E 1 2 ε 1 4 ε ¯ t R E , y n 0 , E + E E h α E h E 1 2 ¯ t R E , y n 0 , E ) ] 2 + C δ ρ y ( t m ) 2 .

Inserting the estimates of T 1 , T 2 and T 3 into (3.6) and setting δ small enough leads to the theorem results. □

Theorem 3.8

Lety( U h )and Y h be the solutions of (2.7) and (2.13), respectively. Then the following estimate holds:

( 0 T y ( U h ) Y h 2 ) 1 2 ρ y ( 0 ) + 1 2 n = 1 N τ n θ y , n + n = 1 N I n f ( t ) f n + n = 1 N I n ( K T h α K ¯ t R K , y n 0 , K + E E h α E 1 2 ε 1 4 ε ¯ t R E , y n 0 , E + E E h α E h E 1 2 ¯ t R E , y n 0 , E ) + ( 1 3 n = 1 N τ n ( ξ y n 2 + ξ y n 1 2 ) ) 1 2 .
(3.10)

Proof

Note that

0 T y ( U h ) Y h 2 0 T ρ y 2 + 0 T ξ y 2 0 T ρ y 2 + 0 T ξ y 2 0 T ρ y 2 + 1 3 τ n n = 1 N ( ξ y n 2 + ξ y n 1 2 ) .

Then by Lemmas 3.3 and 3.7 we can deduce the estimates of 0 T y ( U h ) Y h 2 . □

Remark 3.9

Note that

y ( U h ) Y h ρ y + ξ y .
(3.11)

The second term is the elliptic reconstruction error, which can bounded as follows for t I n :

ξ y = l n ξ y n + l n 1 ξ y n 1 max ( ξ y n , ξ y n 1 ) .
(3.12)

Then

y ( U h ) Y h L ( 0 , T ; L 2 ( Ω ) ) max t ρ y ( t ) + max t ξ y max t ρ y ( t ) + max n ξ y n .

Combining Lemmas 3.3 and 3.7, we can deduce

y ( U h ) Y h L ( 0 , T ; L 2 ( Ω ) ) ρ y ( 0 ) + 1 2 n = 1 N τ n θ y , n + n = 1 N I n f ( t ) f n + n = 1 N τ n I n ( K T h α K ¯ t R K , y n 0 , K + E E h α E 1 2 ε 1 4 ε ¯ t R E , y n 0 , E + E E h α E h E 1 2 ¯ t R E , y n 0 , E ) + max n [ 0 , N ] η y , n .
(3.13)

Now we turn our attention to estimate z( U h ) Z h . The argument skills are similar to those used in the estimate of y( U h ) Y h . Therefore we just sketch the proof.

Setting ψ= ρ z in (3.5) leads to

1 2 d d t ρ z ( t ) 2 + A ( ρ z , ρ z ) = ( ξ z t , ρ z ) + A ( ρ z , ω n 1 ω ( t ) ) + ( y d y d n , ρ z ) + ( y ( U h ) y h n , ρ z ) .

Let

max t [ 0 , T ] ρ z = ρ z ( t n ) .

Then integrating the above equation from t n to T and 0 to T, respectively, leads to

1 2 ( ρ z ( t n ) 2 ρ z ( T ) 2 ) + 0 T A ( ρ z , ρ z ) 2 0 T | ( ξ z t , ρ z ) | + 2 0 T | A ( ρ z , ω n 1 ω ) | + 2 0 T | ( y d y d n , ρ z ) | + 2 0 T | ( y ( U h ) y h n , ρ z ) | .

In an analogous way to Lemma 3.7, we can derive the estimate for ρ z .

Lemma 3.10

We have

max t [ 0 , T ] ρ z 2 + 0 T ρ z 2 ρ z ( T ) 2 + ( 1 2 n = 1 N τ n θ z , n ) 2 + ( n = 1 N I n y d y d n ) 2 + [ n = 1 N I n ( K T h α K ¯ t R K , z n 0 , K + E E h α E ε 1 4 ε ¯ t R E , z n 0 , E + E E h α E h E 1 2 ¯ t R E , z n 0 , E ) ] 2 + ( n = 1 N I n Y h y h n ) 2 + y ( U h ) Y h L 2 ( 0 , T ; L 2 ( Ω ) ) 2 .

Collecting Lemmas 3.4 and 3.10 and using similar arguments to Theorem 3.8 yields the following.

Theorem 3.11

Letz( U h )and Z h be the solutions of (2.7) and (2.13), respectively. Then the following estimates hold:

( 0 T z ( U h ) Z h 2 ) 1 2 ρ z ( T ) + 1 2 n = 1 N τ n θ z , n + n = 1 N I n y d y d n + n = 1 N I n Y h y h n + [ n = 1 N I n K T h α K ¯ t R K , z n 0 , K + E E h α E ε 1 4 ε ¯ t R E , z n 0 , E + E E h α E h E 1 2 ¯ t R E , z n 0 , E ] + ( 1 3 n = 1 N τ n ( ξ z n 2 + ξ z n 1 2 ) ) 1 2 + y ( U h ) Y h L 2 ( 0 , T ; L 2 ( Ω ) ) .

Remark 3.12

Similar to Remark 3.9, we can also derive the posteriori error estimates of

z ( U h ) Z h L ( 0 , T ; L 2 ( Ω ) ) ρ z ( T ) + 1 2 n = 1 N τ n θ z , n + n = 1 N I n y d y d n + n = 1 N I n Y h y h n + [ n = 1 N I n K T h α K ¯ t R K , z n 0 , K + E E h α E ε 1 4 ε ¯ t R E , z n 0 , E + E E h α E h E 1 2 ¯ t R E , z n 0 , E ] + max n [ 0 , N ] η z , n + y ( U h ) Y h L 2 ( 0 , T ; L 2 ( Ω ) ) .

3.3 The main results

By (2.7) and (3.1) we can also derive

0 T y y ( U h ) 2 u U h L 2 ( 0 , T ; L 2 ( Ω ) ) 2
(3.14)

and

0 T z z ( U h ) 2 u U h L 2 ( 0 , T ; L 2 ( Ω ) ) 2 .
(3.15)

Therefore, combining Lemma 3.1, Theorems 3.8, 3.11, (3.14) and (3.15), we can deduce the following estimates.

Theorem 3.13

Let(y,p,u)and( Y h , Z h , U h )be the solutions of (2.7) and (2.13), respectively. Then the following estimate

u U h L 2 ( 0 , T ; L 2 ( Ω ) ) + ( 0 T y Y h 2 ) 1 2 + ( 0 T z Z h 2 ) 1 2 η ˜ y + η ˜ z

holds, where

η ˜ y = ρ y ( 0 ) + 1 2 n = 1 N τ n θ y , n + n = 1 N I n f ( t ) f n + n = 1 N I n ( K T h α K ¯ t R K , y n 0 , K + E E h α E 1 2 ε 1 4 ε ¯ t R E , y n 0 , E + E E h α E h E 1 2 ¯ t R E , y n 0 , E ) + n = 1 N I n Y h y h n + ( 1 3 n = 1 N τ n ( ξ y n 2 + ξ y n 1 2 ) ) 1 2

and

η ˜ z = ρ z ( T ) + 1 2 n = 1 N τ n θ z , n + n = 1 N I n y d y d n + Z h Z ¯ h L 2 ( 0 , T ; L 2 ( Ω ) ) + [ n = 1 N I n K T h α K ¯ t R K , z n 0 , K + E E h α E ε 1 4 ε ¯ t R E , z n 0 , E + E E h α E h E 1 2 ¯ t R E , z n 0 , E ] + ( 1 3 n = 1 N τ n ( ξ z n 2 + ξ z n 1 2 ) ) 1 2 .

Remark 3.14

It follows from (2.7) and (3.1) that

y y ( U h ) L ( 0 , T ; L 2 ( Ω ) ) u U h L 2 ( 0 , T ; L 2 ( Ω ) )

and

z z ( U h ) L ( 0 , T ; L 2 ( Ω ) ) u U h L 2 ( 0 , T ; L 2 ( Ω ) )

Using the above estimate and Lemma 3.1, we can derive the posteriori error estimates of u U h L 2 ( 0 , T ; L 2 ( Ω ) ) + y Y h L ( 0 , T ; L 2 ( Ω ) ) + z Z h L ( 0 , T ; L 2 ( Ω ) ) .

4 Conclusion

In this paper a posteriori error estimates were established for time-dependent convection diffusion optimal control problems by the elliptic reconstruction technique. By introducing the elliptic reconstruction, we can take full advantage of the well-established a posteriori error estimates for stationary convection diffusion optimal control problems. There are still many issues needed to be addressed, such as optimal control problems with state constraints and pointwisely imposed control problems. The applications of our approach to these settings will be postponed to our future work.