1 Introduction

Since the seminal papers of [16, 18] on dynamics model in marketing, a considerable amount of work has been devoted to problems of optimal advertising, both in monopolistic and competitive settings, and both in deterministic and stochastic environments (see [6] for a review of the existing work until the 1990’s).

Various extensions of the basic setting of [16, 18] have been studied. For the stochastic case, we recall, among the various papers on the subject, [12, 14, 15, 17].

Our purpose here is to start exploring a family of models that combine two important features which may arise in such problems and which have not yet been satisfactorily treated in the current theory of optimal control.

On one side we account, as in [7, 8], for the presence of delay effects, in particular the fact that the advertising spending affects the goodwill with some delay, the so-called carryover effect (see e.g. [6, 8, 13] and the references therein).

On the other side, and more crucially, we take into account the fact that the agents maximizing their profit/utility from advertising are embedded in an environment where other agents act and where the actions of those other agents influence their own outcome (see e.g. [15] for a specific case of such a situation). To model such interaction among maximizing agents, one typically resorts to game theory. However, cases like this, where the number of agents can be quite large (in particular if we think of web advertising), are very difficult to treat in an N-agents game setting. A way to make such a problem tractable but still meaningful is to resort to what is called mean-field theory. The idea is the following: assume that the agents are homogeneous (i.e. displaying the same state equations and the same objective functionals) and let their number tend to infinity. The resulting limit problem is in general more tractable, and, under certain conditions, its equilibria are a good approximation of those of the N-agents game (see e.g. the books [2] for an extensive survey on the topic).

For the above reason, we think it is interesting, both from the mathematical and economic side, to consider the optimal advertising investment problem with delay of [7, 8] in the case when, in the state equation and in the objective, one adds a mean field term depending on the law of the state variable (the goodwill), which takes into account the presence of other agents.

There are two main ways of looking at the problem when such mean field terms are present. One (which falls into the class of Mean Field Games (MFG), see e.g. [2, Ch. 1], and which is not our goal here) is to look at the Nash equilibria where each agent takes the distribution of the state variables of the others as given. The other one, which we follow here, is to take a cooperative game point of view: a planner optimizes the average profit of each agent, which means that we fall into the family of so-called “Mean Field Control” (MFC) problems (or “control of McKean–Vlasov dynamics”). We believe that both viewpoints are interesting from the economic side and challenging from the mathematical side. In particular, the one we adopt here (Mean Field Control) can be seen as a benchmark (a first best) against which to compare, subsequently, the non-cooperative Mean Field Game case, as is typically done in game theory (see e.g. [1]). It can also be seen as the case of a big selling company (which acts as the central planner) with many shops in the territory whose local advertising policies interact.

The simultaneous presence of the carryover effect and of the “Mean Field Control” terms makes the problem belong to the family of infinite dimensional control of McKean–Vlasov dynamics: a family of problems that are very difficult in general and whose study started only very recently (see [3]).

Here we consider, as a first step, a simple version of the problem that displays a linear state equation, mean field terms depending only on the first moments, and an objective functional whose integrand (the running objective) is separated in the state and the control. We develop the infinite dimensional setting in this case. Moreover, we show that, in the special subcase when the running objective is linear in the state and quadratic in the control, we can solve the problem. This is done through the study of a suitable auxiliary problem whose HJB equation can be explicitly solved (see Sect. 4 below) and whose optimal feedback control can be found through an infinite dimensional Verification Theorem (see Sect. 4.3 below).

The paper is organized as follows.

  • In Sect. 2, we formulate the optimal advertising problem as an optimal control problem for stochastic delay differential equations with mean field terms and delay in the control. Moreover, using the fact that the mean field terms depend only on the first moments, we introduce an auxiliary problem without mean field terms but with a “mean” constraint on the control (see (2.13)).

  • In Sect. 3, the above “not mean field” auxiliary non-Markovian optimization problem is “lifted” to an infinite dimensional Markovian control problem, still with a “mean” constraint on the control (see (3.7)).

  • In Sect. 4, we show how to solve the original problem in the special case when the optimal controls of the original and auxiliary problems are deterministic. We explain the strategy in Sect. 4.1, proving Proposition 4.1. Then we consider a suitable Linear Quadratic (LQ) case. In Sect. 4.2, we solve the associated HJB equation, while, in Sect. 4.3, we find, through a verification theorem, the solution of the auxiliary LQ problem. Finally, in Sect. 4.4, we show that we can use Proposition 4.1 to also get the solution of the original LQ problem.

2 Formulation of the problem

We call X(t) the stock of advertising goodwill (at time \(t \in [0,T]\)) of a given product. We assume that the dynamics of \(X(\cdot )\) is given by the following controlled stochastic delay differential equation (SDDE), where u models the intensity of advertising spending:

$$\begin{aligned} {\left\{ \begin{array}{ll} dX(t) = \left[ a_0 X(t) +a_1\mathbb {E}[ X(t)] + b_0 u(t) + \int _{-d}^0b_1(\xi )u(t+\xi ) d\xi \right] dt\\ \quad + \sigma dW(t)&{} \forall t\in [0,T] \\ X(0)=x \\ u(\xi )=\delta (\xi )&{}\forall \xi \in [-d,0] \end{array}\right. } \end{aligned}$$
(2.1)

where the Brownian motion W is defined on a filtered probability space \((\Omega ,\mathcal {F},\mathbb {F}=(\mathcal {F}_t)_{t\ge 0},\mathbb {P})\), with \((\Omega ,\mathcal {F},\mathbb {P})\) being complete and \(\mathbb {F}\) being the augmentation of the filtration generated by W, and where, for a given closed interval \(U\subset \mathbb {R}\), the control strategy u belongs to \(\mathcal {U}:=L^2_\mathcal {P}(\Omega \times [0,T];U)\), the space of U-valued square integrable progressively measurable processes. The last line in (2.1) must be read as prescribing the extension of u to \([-d,T]\) by means of \(\delta \).

Here the control space and the state space are both equal to the set \(\mathbb {R}\) of real numbers. Regarding the coefficients and the initial data, we assume the following conditions are verified:

Assumption 2.1

   

  1. (i)

    \(a_0,a_1\in \mathbb {R}\);

  2. (ii)

    \(b_0 \ge 0\);

  3. (iii)

    \(b_1(\cdot ) \in L^2([-d,0];\mathbb {R}^+)\);

  4. (iv)

    \(\delta (\cdot )\in L^2([-d,0];U)\).

Here \(a_0\) and \(a_1\) are constant factors reflecting the goodwill changes in the absence of advertising, \(b_0\) is a constant advertising effectiveness factor, and \(b_1(\cdot )\) is the density function of the time lag between the advertising expenditure u and the corresponding effect on the goodwill level. Moreover, x is the level of goodwill at the beginning of the advertising campaign, and \(\delta (\cdot )\) is the history of the advertising expenditure before time zero (one can assume \(\delta (\cdot )=0\), for instance).
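As a purely illustrative complement (all numerical values and the choices of \(b_1\), u, \(\delta \) below are hypothetical, not taken from the model's analysis), the dynamics (2.1) can be simulated with an Euler–Maruyama scheme, approximating the mean field term \(\mathbb {E}[X(t)]\) by the empirical average over a system of Monte Carlo particles and the carryover integral by a Riemann sum:

```python
import math
import random

random.seed(0)

# Hypothetical parameters, for illustration only
a0, a1 = -0.5, 0.2      # own and mean-field drift coefficients
b0, sigma = 1.0, 0.3    # instantaneous advertising effectiveness, noise
d, T, dt = 0.5, 2.0, 0.01
n_lag = int(d / dt)     # grid points in the delay window [-d, 0]
N = 2000                # Monte Carlo particles approximating E[X(t)]

b1 = lambda xi: max(0.0, 1.0 + xi / d)  # hypothetical carryover density
u = lambda t: 1.0                       # constant open-loop spending
delta = lambda xi: 0.0                  # no spending before time zero

def u_ext(t):
    # extension of u to [-d, T] by delta, as in the last line of (2.1)
    return delta(t) if t < 0 else u(t)

x0 = 1.0
X = [x0] * N

for k in range(int(T / dt)):
    t = k * dt
    # Riemann sum for int_{-d}^0 b1(xi) u(t + xi) dxi
    carry = dt * sum(b1(-d + j * dt) * u_ext(t - d + j * dt)
                     for j in range(n_lag))
    mean_X = sum(X) / N  # empirical proxy for E[X(t)]
    X = [x + (a0 * x + a1 * mean_X + b0 * u(t) + carry) * dt
         + sigma * math.sqrt(dt) * random.gauss(0.0, 1.0)
         for x in X]

print(sum(X) / N)  # approximation of E[X(T)]
```

The empirical mean of the particle system approximates the deterministic flow of \(\mathbb {E}[X(t)]\), which is the quantity the mean field term feeds back into each path.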

Notice that under Assumption 2.1 there exists a unique strong solution to the following SDDE starting at time \(t\in [0,T)\):

$$\begin{aligned} {\left\{ \begin{array}{ll} dX(s) = \left[ a_0 X(s) +a_1\mathbb {E}[ X(s)] + b_0 u(s) + \int _{-d}^0b_1(\xi )u(s+\xi ) d\xi \right] ds\\ \quad + \sigma dW(s)&{} \forall s\in [t,T] \\ X(t)=x\\ u({t+\xi })=\delta (\xi )&{}\forall \xi \in [-d,0] \end{array}\right. } \end{aligned}$$
(2.2)

We denote such a solution by \(X^{t,x,u}\). It belongs to \( L^2_\mathcal {P}(\Omega \times [0,T];\mathbb {R})\). In what follows, without loss of generality, we always work with a continuous version of \(X^{t,x,u}\).

The objective functional to be maximized is defined as

$$\begin{aligned} J(t,x;u(\cdot ))&= \mathbb {E}\left[ \int _t^T e^{-r(s-t)} f \left( s, X^{t,x,u}(s),\mathbb {E} \left[ X^{t,x,u}(s) \right] ,u(s),\mathbb {E} \left[ u(s) \right] \right) ds\right. \nonumber \\&\quad \left. +e^{-r(T-t)} g \left( X^{t,x,u}(T), \mathbb {E}\left[ X^{t,x,u}(T) \right] \right) \right] \end{aligned}$$
(2.3)

where the functions \( f:[0,T]\times \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times \mathbb {R} \rightarrow \mathbb {R}\) and \( g:\mathbb {R}\times \mathbb {R} \rightarrow \mathbb {R}\) are assumed to satisfy the following Assumption 2.2.

Assumption 2.2

  1. (i)

    The functions f, g are measurable.

  2. (ii)

    There exist \(N>0,{\ell }>0, \theta >1\) such that

    $$\begin{aligned} f(t,x,m ,u,z ) + g(x,m ) \le N(1+|x|+|m |+|u|+|z |)-{\ell }(|u|+|z |)^\theta , \end{aligned}$$

    for all \(t\in [0,T]\), \(x,m ,u,z \in \mathbb {R}\).

  3. (iii)

    f, g are locally uniformly continuous in (x, m), uniformly with respect to (t, u, z), meaning that for every \(R>0\) there exists a modulus of continuity \(\texttt{w}_R:\mathbb {R}^+\rightarrow \mathbb {R}^+\) such that

    $$\begin{aligned}&\sup _{\begin{array}{c} t\in [0,T]\\ u\in \mathbb {R},z \in \mathbb {R} \end{array}}|f(t,x,m ,u,z )-f(t,x',m ',u,z )|+|g(x,m )-g(x',m ')|\\&\quad \le \texttt{w}_R (|x-x'|+|m -m '| ) \end{aligned}$$

    for all real numbers \(x,m,x',m '\) such that \(|x|\vee |m |\vee |x'|\vee |m '|\le R\).

Under Assumptions 2.1 and 2.2, the reward functional J in (2.3) is well-defined for any \((t,x;u(\cdot ))\in [0,T]\times \mathbb {R}\times \mathcal {U}\).

We also define the value function \( \overline{V}\) for this problem as follows:

$$\begin{aligned} \overline{V}(t,x) = \sup _{u \in \mathcal {U}} J(t,x;u ), \end{aligned}$$
(2.4)

for \((t,x)\in [0,T]\times \mathbb {R}\). We shall say that \(u^* \in \mathcal {U}\) is an optimal control strategy if it is such that

$$\begin{aligned} \overline{V}(t,x)=J(t,x;u^* ). \end{aligned}$$

Our main aim here is to find such optimal control strategies.

We now take into account the controlled ordinary delay differential equation (ODDE)

$$\begin{aligned} {\left\{ \begin{array}{ll} dM(s) = \left( (a_0+a_1) M(s) + b_0 z(s) + \int _{-d}^0b_1(\xi )z(s+\xi ) d\xi \right) ds &{} \forall s\in [t,T] \\ M(t)=m\\ z({t+\xi })=\delta (\xi )&{}\forall \xi \in [-d,0] \end{array}\right. } \end{aligned}$$
(2.5)

where \(m\in \mathbb {R}\) and \(z\in L^2([0,T],\mathbb {R})\) is extended to \([-d,0]\) by \(\delta \), as expressed by the last line in (2.5). We denote by \(M^{t,m,z}\) the unique strong solution to (2.5). It is straightforward to check the relationship

$$\begin{aligned} M^{t,m,z}=\mathbb {E} \left[ X^{t,m,u} \right] \; \textrm{whenever} \; z(s)=\mathbb {E}[u(s)]\; \textrm{for}\; s\in [t,T]. \end{aligned}$$
(2.6)
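For completeness: (2.6) can be verified by taking expectations in (2.2). The stochastic integral has zero mean, so, setting \(m(s):=\mathbb {E}[X^{t,x,u}(s)]\) and using Fubini's theorem to exchange expectation and the delay integral,

```latex
dm(s) = \Big[ a_0\, m(s) + a_1\, m(s) + b_0\, \mathbb{E}[u(s)]
      + \int_{-d}^0 b_1(\xi)\, \mathbb{E}[u(s+\xi)]\, d\xi \Big]\, ds ,
\qquad m(t) = x,
```

which is exactly (2.5) with \(z(s)=\mathbb {E}[u(s)]\) (the extension of z by \(\delta \) matches the deterministic past of u), so uniqueness for (2.5) yields (2.6).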

Property (2.6) suggests that we can couple the two systems (2.2) and (2.5) as follows. We set

$$\begin{aligned} A_0:=\begin{bmatrix} a_0&{}a_1\\ 0&{}a_0+a_1 \end{bmatrix} \end{aligned}$$
(2.7)

and introduce, for \(\tilde{x}\in \mathbb {R}^2\) and with

$$\begin{aligned} \tilde{u}=(u,z)\in \tilde{\mathcal {U}}:=L^2_\mathcal {P} \left( \Omega \times [0,T];\mathbb {R} \right) \times L^2 \left( [0,T];\mathbb {R} \right) ,\quad \tilde{\sigma }= (\sigma ,0), \end{aligned}$$
(2.8)

the process \(\tilde{X}^{t,\tilde{x},\tilde{u}}\) as the unique strong solution of the controlled SDDE

$$\begin{aligned} {\left\{ \begin{array}{ll} d\tilde{X}(s)= \left( A_0 \tilde{X}(s)+b_0 \tilde{u}(s) + \int _{-d}^0 b_1(\xi ) \tilde{u}(s+\xi ) d\xi \right) ds + \tilde{\sigma } dW(s)&{} \forall s\in (t,T]\\ \tilde{X}(t)=\tilde{x}\\ \tilde{u}({t+\xi })= \left( \delta (\xi ),\delta (\xi ) \right) &{}\forall \xi \in [-d,0] \end{array}\right. } \end{aligned}$$
(2.9)

Then, by (2.2), (2.5), and (2.9), we immediately have

$$\begin{aligned} \left( X^{t,x,u},M^{t,x,z} \right) = \tilde{X}^{t,(x,x),\tilde{u}} \quad \textrm{if} \ z(s)=\mathbb {E}[u(s)]\ \textrm{for}\ s\in [t,T]. \end{aligned}$$
(2.10)

Property (2.10) states that the process \(X^{t,x,u}\) can be seen as the first component of a two-dimensional process driven by an SDDE whose coefficients do not involve any dependence on the law.

Thanks to (2.10), we can rephrase the original control problem as follows. We define, for \(t\in [0,T],\tilde{x}\in \mathbb {R}^2\), and for

$$\begin{aligned} \tilde{u}=(u,z)\in \tilde{\mathcal {U}}:=L^2_\mathcal {P} \left( \Omega \times [0,T];\mathbb {R} \right) \times L^2 \left( [0,T];\mathbb {R} \right) , \end{aligned}$$

the functional

$$\begin{aligned} \tilde{J}(t,\tilde{x};\tilde{u}):=\mathbb {E}\left[ \int _t^T e^{-r(s-t)} f \left( s, \tilde{X}^{t,\tilde{x},\tilde{u}}(s) ,\tilde{u}(s) \right) ds+ e^{-r(T-t)} g \left( \tilde{X}^{t,\tilde{x},\tilde{u}}(T) \right) \right] , \end{aligned}$$
(2.11)

where, with a slight abuse of notation, we identify

$$\begin{aligned} f(t,(x,m),(u,z))=f(t,x,m,u,z),\quad g((x,m))=g(x,m). \end{aligned}$$
(2.12)

Then, by (2.3), (2.4), (2.10), and (2.11), it follows that

$$\begin{aligned} \overline{V}(t,x)=\sup \left\{ \tilde{J}(t,(x,x);\tilde{u}):{ \tilde{u}\in \tilde{\mathcal {U}}, \; \textrm{and}\; } z(s)=\mathbb {E}[u(s)]\; s\in [t,T] \right\} . \end{aligned}$$
(2.13)

3 Carryover effect of advertising: reformulation of the problem in infinite dimension

To recast the SDDE (2.9) as an abstract stochastic differential equation on a suitable Hilbert space, we use the approach introduced first by [19] in the deterministic case and then extended in [8] to the stochastic case (see also [5], [6, paragraph 2.6.8.2], [10, 11], and [12], where the case of an unbounded control operator is considered). We reformulate Eq. (2.9) as an abstract stochastic differential equation in the following Hilbert space H

$$\begin{aligned} H:=\mathbb {R}^2\times L^2([-d,0],\mathbb {R}^2). \end{aligned}$$

If \(y\in H\), we denote by \(y_0\) the projection of y onto \(\mathbb {R}^2\) and by \(y_1\) the projection of y onto \(L^2([-d,0],\mathbb {R}^2)\). Hence \(y=(y_0,y_1)\). The inner product in H is induced by its factors, meaning

$$\begin{aligned} \langle y,y'\rangle :=\langle y_0,y'_0 \rangle _{\mathbb {R}^2} + \int _{-d}^0 \langle y_1(\xi ),y_1'(\xi ) \rangle _{\mathbb {R}^2} d\xi \quad \forall y,y'\in H. \end{aligned}$$

In particular, the induced norm is

$$\begin{aligned} |y| = \left( |y_0|_{\mathbb {R}^2}^2 + \int _{-d}^0 |y_1(\xi )|_{\mathbb {R}^2}^2 d\xi \right) ^{1/2}\quad \forall y\in H. \end{aligned}$$

Recalling (2.7), we define \(A:\mathcal {D}(A)\subset H\rightarrow H\) by

$$\begin{aligned} { Ay:=\left( A_0y_0,-\dot{y}_1 \right) } \end{aligned}$$

where the domain \(\mathcal {D}(A)\) is

$$\begin{aligned} \mathcal {D}(A)= \left\{ y\in H:y_1\in W^{1,2}([-d,0],\mathbb {R}^2), \; y_1(-d)=0 \right\} . \end{aligned}$$

The adjoint \(A^*:\mathcal {D}(A^*)\subset H\rightarrow H\) of A is given by

$$\begin{aligned} { A^*y:=\left( A_0^*y_0,\dot{y}_1 \right) } \end{aligned}$$

with

$$\begin{aligned} \mathcal {D}(A^*)= \left\{ y\in H:y_1\in W^{1,2}([-d,0],\mathbb {R}^2), \; y_1(0)=y_0 \right\} . \end{aligned}$$

The operator A generates a \(C_0\)-semigroup \(\{e^{tA}\}_{t\in \mathbb {R}^+}\) on H, where

$$\begin{aligned} e^{tA}y= \left( e^{tA_0}y_0+\int _{-d}^0 \textbf{1}_{[-t,0]}(s)\,e^{(t+s)A_0}y_1(s)ds,\; y_1(\cdot -t)\textbf{1}_{[-d+t,0]}(\cdot ) \right) \quad \forall y\in H, \end{aligned}$$

whereas the \(C_0\)-semigroup \(\{e^{tA^*}\}_{t\in \mathbb {R}^+}\) generated by \(A^*\) is given by

$$\begin{aligned} e^{tA^*}y= \left( e^{tA^*_0}y_0, e^{(\cdot +t)A^*_0}y_0\textbf{1}_{[-t,0]}(\cdot ) + y_1(\cdot +t)\textbf{1}_{[-d,-t]}(\cdot ) \right) \quad \forall y\in H, \end{aligned}$$

where \(A^*_0\) is the adjoint of \(A_0\).
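As a finite-dimensional sanity check (the values of \(a_0\), \(a_1\), t below are illustrative): at the level of the matrix \(A_0\) in (2.7), the adjoint relation between the two semigroups reduces to \(e^{tA_0^*}=(e^{tA_0})^*\); moreover, since \(A_0\) is triangular, one can check that \(e^{tA_0}\) has diagonal entries \(e^{a_0t}\) and \(e^{(a_0+a_1)t}\) and off-diagonal entry \(e^{(a_0+a_1)t}-e^{a_0t}\). A minimal numerical check via a truncated exponential series:

```python
import math

a0, a1, t = -0.5, 0.2, 1.3  # illustrative values

def mat_mul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def expm(M, terms=60):
    # Taylor series for the matrix exponential (fine for small 2x2 matrices)
    result = [[1.0, 0.0], [0.0, 1.0]]
    power = [[1.0, 0.0], [0.0, 1.0]]
    for n in range(1, terms):
        power = mat_mul([[p / n for p in row] for row in power], M)
        result = [[result[i][j] + power[i][j] for j in range(2)]
                  for i in range(2)]
    return result

A0 = [[a0 * t, a1 * t], [0.0, (a0 + a1) * t]]           # t * A_0, cf. (2.7)
A0T = [[A0[j][i] for j in range(2)] for i in range(2)]  # t * A_0^*

E = expm(A0)     # e^{t A_0}
ET = expm(A0T)   # e^{t A_0^*}

# e^{t A_0^*} must equal the transpose of e^{t A_0}
err = max(abs(ET[i][j] - E[j][i]) for i in range(2) for j in range(2))
print(err)
```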

We then introduce the noise operator \(G:\mathbb {R}\rightarrow H\) defined by

$$\begin{aligned} Gx:=\left( (\sigma x, 0),0 \right) \quad \forall x\in \mathbb {R}, \end{aligned}$$

and the control operator \(B:\mathbb {R}^2\rightarrow H\) defined by

$$\begin{aligned} By_0=(b_0y_0,b_1(\cdot )y_0)\quad \forall y_0\in \mathbb {R}^2. \end{aligned}$$

The adjoint \(B^*:H\rightarrow \mathbb {R}^2\) of B is given by

$$\begin{aligned} B^*y= b_0y_0+\int _{-d}^0b_1(\xi )y_1(\xi )d\xi \quad \forall y\in H. \end{aligned}$$
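Indeed, the formula for \(B^*\) follows directly from the definition of the inner product on H: for \(y_0'\in \mathbb {R}^2\) and \(y\in H\),

```latex
\langle By_0',\, y\rangle_H
  = \langle b_0 y_0',\, y_0\rangle_{\mathbb{R}^2}
    + \int_{-d}^0 \langle b_1(\xi)\, y_0',\, y_1(\xi)\rangle_{\mathbb{R}^2}\, d\xi
  = \Big\langle y_0',\, b_0 y_0 + \int_{-d}^0 b_1(\xi)\, y_1(\xi)\, d\xi \Big\rangle_{\mathbb{R}^2}
  = \langle y_0',\, B^*y\rangle_{\mathbb{R}^2}.
```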

We now introduce the abstract stochastic differential equation on H

$$\begin{aligned} {\left\{ \begin{array}{ll} dY(s)= \left( AY(s)+B \tilde{u}(s) \right) ds+GdW(s)&{} s\in (t,T]\\ Y(t)=y\\ \tilde{u}(t+\xi )= \left( \delta (\xi ),\delta (\xi ) \right) &{} \forall \xi \in [-d,0] \end{array}\right. } \end{aligned}$$
(3.1)

with \(t\in [0,T), y\in H, \tilde{u}\in \mathcal {U}\times \mathcal {U}\). Denote by \(Y^{t,y,\tilde{u}}\) the mild solution to (3.1), i.e., the pathwise continuous process in \(L^2_\mathcal {P}(\Omega \times [0,T];H)\) given by the variation of constants formula:

$$\begin{aligned} Y^{t,y,\tilde{u}}(s) = e^{(s-t)A}y + \int _t^s e^{(s-r)A}B\tilde{u}(r) dr+ \int _t^s e^{(s-r)A}G dW(r),\quad \forall s\in [t,T]. \end{aligned}$$
(3.2)

Similarly to what is done in [7], if the space of admissible controls is restricted to \( \tilde{\mathcal {U}}\), one can show that (3.1) is equivalent to (2.9), in the sense that

$$\begin{aligned} Y^{t,y,\tilde{u}}_0(s)= \tilde{X}^{t,y_0,\tilde{u}}(s),\quad s\in [t,T], \end{aligned}$$
(3.3)

for every \(t\in [0,T),\tilde{u}\in \tilde{\mathcal {U}}\), and for every \(y=(y_0,y_1)\in H\) with

$$\begin{aligned} y_1(\xi )= \left( \int _{-d}^\xi b_1(\zeta )\delta (\zeta -\xi )d\zeta , \int _{-d}^\xi b_1(\zeta )\delta (\zeta -\xi )d\zeta \right) \quad \forall \xi \in [-d,0]. \end{aligned}$$
(3.4)

A further equivalence is obtained by combining (2.10) with (3.3)–(3.4), which provide

$$\begin{aligned} Y^{t,y,\tilde{u}}_0(s)= \left( X^{t,x,u}(s),M^{t,x,z}(s) \right) \quad \textrm{if}\; y_0=(x,x),\; y_1\;\text {is as in } (3.4),\; \text {and}\; z(s)=\mathbb {E}[u(s)]\;\textrm{for}\; s\in [t,T]. \end{aligned}$$
(3.5)

Thanks to equivalence (3.5), we can rephrase the original control problem as follows. For \(t\in [0,T],y\in H, \tilde{u}\in \mathcal {U}\times \mathcal {U}\), define the functional (recall (2.12))

$$\begin{aligned} \mathcal {J}(t,y;\tilde{u}):=\mathbb {E}\left[ \int _t^T e^{-r(s-t)} f \left( s, Y^{t,y,\tilde{u}}_0(s) ,\tilde{u}(s) \right) ds+ e^{-r(T-t)} g \left( Y^{t,y,\tilde{u}}_0(T) \right) \right] \end{aligned}$$
(3.6)

Then, by (2.11), (2.13), (3.3), and (3.4), it follows that

$$\begin{aligned} \overline{V}(t,x)=\sup \big \{ \mathcal {J}(t,y;\tilde{u}):y_0=(x,x),\; y_1\; \text {is as in } (3.4),\; \tilde{u}\in \tilde{\mathcal {U}},\; \textrm{and}\; z(s)=\mathbb {E}[u(s)]\; \forall s\in [t,T] \big \}. \end{aligned}$$
(3.7)

4 Solution of the original problem in a special Linear Quadratic (LQ) case

4.1 The strategy of solution through a suitable HJB equation

Following (3.7) above we introduce the function

$$\begin{aligned} \mathcal {V}:[0,T]\times H\rightarrow \mathbb {R} \end{aligned}$$

defined by

$$\begin{aligned} \mathcal {V}(t,y):=\sup \big \{ \mathcal {J}(t,y,\tilde{u}):\, \tilde{u}\in \tilde{\mathcal {U}},\; z(s)=\mathbb {E}[u(s)]\; \forall s\in [t,T] \big \}. \end{aligned}$$

Notice that, by (3.7), we have

$$\begin{aligned} \overline{V}(t,x) = \mathcal {V}(t,y)\quad \textrm{if}\; y_0=(x,x)\; \text {and}\; y_1\;\text {is as in } (3.4). \end{aligned}$$
(4.1)

The problem with the above constraint \(z(s)=\mathbb {E}[u(s)]\), for \(s\in [t,T]\), is that it does not allow us to apply the Dynamic Programming approach directly to derive the HJB equation. For this reason, instead of optimizing over \(\mathcal {U}\) under the constraint \(z(s)=\mathbb {E}[u(s)]\), \(s\in [t,T]\), we consider a different problem, in which the optimization is performed over \(\mathcal {U}\times \mathcal {U}\) under the constraint \(z(s)=u(s)\), \(s\in [t,T]\), hence considering the following value function

$$\begin{aligned} V(t,y) :=\sup \big \{ \mathcal {J}(t,y,\tilde{u}):\, \tilde{u}=(u,z)\in \mathcal {U}\times \mathcal {U},\; \textrm{and}\; u=z \big \}. \end{aligned}$$
(4.2)

In general, we do not know if and how this function is related to \(\mathcal {V}\) (and consequently to our goal \(\overline{V}\)). However, it is clear from the constraints involved that, if for both problems V and \(\mathcal {V}\) the supremum is attained over the set of deterministic controls, meaning

$$\begin{aligned} \mathcal {V}(t,y)&=\text {(to prove)}= \sup \big \{ \mathcal {J}(t,y,\tilde{u}):\tilde{u}=(u,z)\in \mathcal {U}\times \mathcal {U},\; \textrm{and}\; u=z\; \textrm{deterministic} \big \}\end{aligned}$$
(4.3a)
$$\begin{aligned} V(t,y)&=\text {(to prove)}= \sup \big \{ \mathcal {J}(t,y,\tilde{u}):\tilde{u}=(u,z)\in \mathcal {U}\times \mathcal {U},\; \textrm{and}\; u=z\; \textrm{deterministic} \big \}, \end{aligned}$$
(4.3b)

then finding the deterministic optimal controls for \(\mathcal {V}\) is equivalent to finding them for V. For future reference, we restate this observation in the following proposition.

Proposition 4.1

Let \(t\in [0,T]\) and \(y\in H\). If (4.3a) and (4.3b) hold true, then a deterministic control \(\tilde{u}^*=(u^*,u^*)\in \mathcal {U}\times \mathcal {U}\) is optimal for \(\mathcal {V}\) if and only if it is optimal for V.

The HJB equation associated with the optimal control problem related to V is the following:

$$\begin{aligned} {\left\{ \begin{array}{ll} v _t(t,y)+\frac{1}{2}\mathop {\textrm{Tr}}\nolimits Q\nabla ^2 v (t,y)+\langle Ay,\nabla v (t,y) \rangle \\ \quad +H_0(t,y,\nabla v (t,y)) -r v (t,y) =0&{} \forall (t,y)\in (0,T)\times H\\ v (T,y)=g(y_0) &{} \forall y\in H \end{array}\right. } \end{aligned}$$
(4.4)

where \(Q=G^*G\) and the Hamiltonian function is defined as

$$\begin{aligned} H_0(t,y,p) :=\sup _{\tilde{u}\in \textbf{D}} H_{CV}(t,y,\tilde{u},p)= \sup _{\tilde{u}\in \textbf{D}} \big \{ f(t,y_0,\tilde{u})+\langle B\tilde{u},p\rangle \big \}, \end{aligned}$$

with \(H_{CV}\) denoting the current value Hamiltonian function, and \(\textbf{D}\) being the diagonal in \(U\times U\), meaning \(\textbf{D}= \left\{ (u,u):u\in U \right\} \). Notice that \(H_0(t,y,p)\) depends on p only through \(B^*p\). Indeed, if we define

$$\begin{aligned} H(t,y,q) :=\sup _{\tilde{u}\in \textbf{D}} \big \{ f(t,y_0,\tilde{u})+\langle \tilde{u},q\rangle \big \}, \end{aligned}$$
(4.5)

we get \(H_0(t,y,p)=H(t,y,B^*p)\). Then (4.4) can be rewritten as

$$\begin{aligned} {\left\{ \begin{array}{ll} v _t(t,y)+\frac{1}{2}\mathop {\textrm{Tr}}\nolimits Q\nabla ^2 v (t,y)+\langle Ay,\nabla v (t,y) \rangle \\ \quad +H(t,y,B^*\nabla v (t,y)) -r v (t,y) =0&{} \forall (t,y)\in (0,T)\times H\\ v (T,y)=g(y_0) &{} \forall y\in H \end{array}\right. } \end{aligned}$$
(4.6)

Notice that, in the above Eqs. (4.4) and (4.6), the gradient inside the Hamiltonian H is indeed a pair of directional derivatives, since it acts only through the operator \(B^*\), whose image lies in \(\mathbb {R}^2\).

In the next subsections we specify f and g, and we show that with such a choice (4.3a) and (4.3b) are verified.

4.2 Explicit solution of the HJB equation in the auxiliary LQ case

In this section we specify the general model with

$$\begin{aligned} f(t,x,m ,u,z )&= \alpha _0 x-\alpha _1 m -\beta _0u-\gamma _0 u^2-\beta _1 z -\gamma _1z^2\nonumber \\ g(x,m )&= \lambda _0 x-\lambda _1 m \end{aligned}$$
(4.7)

for \((x,m,u,z)\in \mathbb {R}^4\), where

  1. (i)

    \(\alpha _0,\alpha _1,\beta _0,\beta _1,\lambda _0, \lambda _1\in \mathbb {R}\);

  2. (ii)

    \(\gamma _0>0, \gamma _1>0\).

We also set \(U=\mathbb {R}\). Notice that Assumption 2.2 is satisfied. Moreover, denoting \(\tilde{\alpha }=(\alpha _0,-\alpha _1)\), \(\tilde{\beta }=(\beta _0,\beta _1)\), and recalling (2.12), we have, for \(q\in \mathbb {R}^2\),

$$\begin{aligned} u^*(q)&:=\mathop {\mathrm {arg\,max}}\limits _{u\in U} \big \{ \langle \tilde{\alpha }, y_0\rangle +\langle q-\tilde{\beta },(1,1)\rangle u -(\gamma _0+\gamma _1)u^2 \big \}\nonumber \\&=\frac{\langle q-\tilde{\beta },(1,1)\rangle }{2(\gamma _0+\gamma _1)}, \end{aligned}$$
(4.8)

which entails, by considering the definition of H given in (4.5),

$$\begin{aligned} H(t,y,q)=\frac{ \left( \langle q-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)}+\langle \tilde{\alpha },y_0\rangle \end{aligned}$$

and then the HJB equation (4.4) reads as

$$\begin{aligned} {\left\{ \begin{array}{ll} v _t(t,y)+\frac{1}{2}\mathop {\textrm{Tr}}\nolimits Q\nabla ^2 v (t,y)+\langle Ay,\nabla v (t,y) \rangle \\ \quad + \frac{ \left( \langle B^*\nabla v (t,y)-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)}+\langle \tilde{\alpha },y_0\rangle -r v (t,y) =0&{} \forall (t,y)\in (0,T)\times H\\ v (T,y)=\langle \tilde{\lambda },y_0\rangle &{} \forall y\in H \end{array}\right. } \end{aligned}$$
(4.9)

where \(\tilde{\lambda }=(\lambda _0,-\lambda _1)\).
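For the reader's convenience, the maximizer in (4.8) and the resulting form of H above follow by completing the square: on the diagonal \(u=z\), writing \(c:=\langle q-\tilde{\beta },(1,1)\rangle \) and \(\gamma :=\gamma _0+\gamma _1\), the quantity maximized in (4.5) is

```latex
\langle\tilde{\alpha}, y_0\rangle + c\,u - \gamma\, u^2
  = \langle\tilde{\alpha}, y_0\rangle + \frac{c^2}{4\gamma}
    - \gamma\Big(u - \frac{c}{2\gamma}\Big)^2 ,
```

so the supremum over \(u\in \mathbb {R}\) is attained at \(u^*=c/(2\gamma )\) and equals \(\langle \tilde{\alpha },y_0\rangle + c^2/(4\gamma )\).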

We look for solutions of (4.9) of the following form

$$\begin{aligned} v (t,y)=\langle a(t),y\rangle +b(t) \end{aligned}$$
(4.10)

with \(a:[0,T]\rightarrow H\) and \(b:[0,T]\rightarrow \mathbb {R}\) to be determined. The final condition in (4.9) holds true for (4.10) only if

$$\begin{aligned} a(T)=(\tilde{\lambda },0),\quad b(T)=0. \end{aligned}$$
(4.11)

Moreover, if v is of the form (4.10), (4.9) reads as

$$\begin{aligned} \langle \dot{a}(t),y\rangle +\dot{b}(t)+\langle y, A^*a(t)\rangle + \frac{ \left( \langle B^*a(t)-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)}+\langle \tilde{\alpha },y_0\rangle -r\langle a(t),y\rangle -rb(t)=0 \end{aligned}$$
(4.12)

The previous Eq. (4.12) has to be understood in a mild sense, to be specified in what follows, since we cannot guarantee that \(a(t)\in \mathcal{D}(A^*)\) for all t. Indeed, by (4.11), \(a(T)\notin \mathcal{D}(A^*)\).

Equation (4.12) can be split into two equations by separating the terms containing y from the remaining ones, namely

$$\begin{aligned} \langle \dot{a}(t),y\rangle +\langle y, A^*a(t)\rangle +\langle \tilde{\alpha },y_0\rangle -r\langle a(t),y\rangle =0 \end{aligned}$$
(4.13)

and

$$\begin{aligned} \dot{b}(t) + \frac{ \left( \langle B^*a(t)-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)} -rb(t)=0. \end{aligned}$$
(4.14)

Taking into account that (4.13) must hold for all \(y\in H\), and combining (4.13) and (4.14) with the final conditions (4.11), we obtain two separate equations, one for a and one for b, namely

$$\begin{aligned} {\left\{ \begin{array}{ll} \dot{a}(t) + A^*a(t) +(\tilde{\alpha },0) -r a(t)=0&{} t\in [0,T)\\ a(T)=(\tilde{\lambda },0) \end{array}\right. } \end{aligned}$$
(4.15)

and

$$\begin{aligned} {\left\{ \begin{array}{ll} \dot{b}(t) + \frac{ \left( \langle B^*a(t)-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)} -rb(t)=0&{} t\in [0,T)\\ b(T)=0&{} \end{array}\right. } \end{aligned}$$
(4.16)

We solve (4.15), which turns out to be an abstract evolution equation in H, in the mild sense, getting

$$\begin{aligned} a(t) =e^{(T-t)(A^*-r)}(\tilde{\lambda },0) +\int _t^T e^{(s-t)(A^*-r)} (\tilde{\alpha },0) ds. \end{aligned}$$
(4.17)

Consequently we can write the solution to (4.16)

$$\begin{aligned} b(t) =\int _t^T e^{-r(s-t)} \frac{ \left( \langle B^*a(s)-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)} ds, \end{aligned}$$
(4.18)

where a is given by (4.17).
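As a quick numerical consistency check (with a hypothetical integrand h standing in for the Hamiltonian term in (4.16)): for any continuous h, the function \(b(t)=\int _t^T e^{-r(s-t)}h(s)\,ds\) satisfies \(\dot{b}(t)+h(t)-rb(t)=0\) with \(b(T)=0\), which is precisely the structure of (4.16). A trapezoidal-rule and finite-difference verification:

```python
import math

r, T = 0.4, 2.0
h = lambda s: math.sin(s) ** 2 + 1.0  # hypothetical integrand

def b(t, n=4000):
    # composite trapezoidal rule for int_t^T e^{-r(s-t)} h(s) ds
    dt = (T - t) / n
    total = 0.5 * (h(t) + math.exp(-r * (T - t)) * h(T))
    for j in range(1, n):
        s = t + j * dt
        total += math.exp(-r * (s - t)) * h(s)
    return total * dt

t0, eps = 0.7, 1e-4
b_dot = (b(t0 + eps) - b(t0 - eps)) / (2 * eps)  # central difference
residual = b_dot - (r * b(t0) - h(t0))           # should vanish
print(abs(residual))
```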

So far we have found a solution v to the HJB equation (4.9) whose candidate optimal feedback is deterministic. In the next subsection we prove that this candidate is indeed optimal and that \(v=V\). We also prove that the optimal feedback control associated with the problem for \(\mathcal {V}\) is deterministic. This allows us to apply Proposition 4.1, thus finding the optimal strategies for the initial problem in the linear quadratic case.

4.3 Fundamental identity and verification theorem in the auxiliary LQ case

The aim of this subsection is to provide a verification theorem and the existence of optimal feedback controls for the linear quadratic problem for V introduced in the previous subsection. This, in particular, will imply that the solution in (4.10), with a and b given respectively by (4.17) and (4.18), coincides with the value function of our optimal control problem V defined in (4.2).

The main tool needed to obtain the desired results is an identity [often called the “fundamental identity”, see Eq. (4.19)] satisfied by the solutions of the HJB equation. Since the solution (4.10) is not smooth enough (it is not differentiable with respect to t, due to the presence of \(A^*\) in a, given by (4.17)), we need to perform an approximation procedure, thanks to which Ito’s formula can be applied. Finally, we pass to the limit and obtain the needed “fundamental identity”.

Proposition 4.2

Let Assumption 2.1 hold, and let v be as in (4.10), with a and b given respectively by (4.17) and (4.18), so that v solves the HJB equation (4.9). Then, for every \(t\in [0,T]\), \(y\in H\), and \(\tilde{u}=(u,z)\in \mathcal {U}\times \mathcal {U}\) with \(u=z\), we have the fundamental identity

$$\begin{aligned} v(t,y)&= \mathcal {J}(t,y;\tilde{u}) +\mathbb {E} \left[ \int _t^T e^{-r(s-t)} \left( \frac{ \left( \langle B^*\nabla v (s,Y ^{t,y,\tilde{u}}(s))-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)} \right. \right. \nonumber \\&\quad \left. \left. +\langle \tilde{\alpha },(Y ^{t,y,\tilde{u}})_0(s)\rangle - H_{CV}\left( s,Y^{t,y,\tilde{u}}(s),\tilde{u}(s),\nabla v(s,Y^{t,y,\tilde{u}}(s))\right) \right) ds \right] . \end{aligned}$$
(4.19)

Proof

Let \(t\in [0,T)\), \(y\in H\), and \(\tilde{u}=(u,z)\in \mathcal {U}\times \mathcal {U}\) with \(u=z\). We would like to apply Ito’s formula to the process \( \left\{ e^{-rs}v(s,Y^{t,y,\tilde{u}}(s)) \right\} _{s\in [t,T]}\), but we cannot, because \(Y^{t,y,\tilde{u}}\) is a mild solution (the integrals in (3.2) are convolutions with a \(C_0\)-semigroup) and not a strong solution of (3.1); moreover, v is not differentiable in t, since \((\tilde{\lambda },0)\not \in \mathcal{D}(A^*)\). We therefore approximate \(Y^{t,y,\tilde{u}}\) by means of the Yosida approximations (see also [10, Proposition 5.1]). For \(k_0\in \mathbb {N}\) large enough, the operator \(k-A\), \(k\ge k_0\), is full-range and invertible, with continuous inverse, and \(k(k-A)^{-1}A\) can be extended to a continuous operator on H. Define, for \(k\ge k_0\), the operator on H

$$\begin{aligned} A_k:=k(k-A)^{-1}A. \end{aligned}$$

It is well known that, as \(k\rightarrow \infty \), \(e^{tA_k}y'\rightarrow e^{tA}y'\) in H, uniformly for \(t\in [0,T]\) and for \(y'\) on compact sets of H. Since \(A_k\) is continuous, there exists a unique strong solution \(Y^{t,y,\tilde{u}}_k\) to the SDE on H

$$\begin{aligned} {\left\{ \begin{array}{ll} dY_k(s)= \left( A_kY_k(s)+B \tilde{u}(s) \right) ds+GdW(s)&{} s\in (t,T]\\ Y_k(t)=y\\ \tilde{u}(t+\xi )= \left( \delta (\xi ),\delta (\xi ) \right) &{}\forall \xi \in [-d,0] \end{array}\right. } \end{aligned}$$
(4.20)

By taking into account (3.2) together with the same formula with \(A_k\) in place of A, and by recalling the convergence \(e^{\cdot A_k}\rightarrow e^{\cdot A}\) mentioned above, one can easily show that

$$\begin{aligned} Y_k^{t,y,\tilde{u}}\rightarrow Y^{t,y,\tilde{u}} \; \textrm{in} \; L_\mathcal {P}^{2}(\Omega \times [0,T];H) \; \textrm{as}\; k\rightarrow \infty . \end{aligned}$$
(4.21)
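The mechanism behind (4.21) is easiest to visualize in finite dimensions, where A is a matrix, \(A_k=k(k-A)^{-1}A\) is again a matrix, and \(e^{tA_k}\rightarrow e^{tA}\) can be observed directly. A toy check with a hypothetical \(2\times 2\) stand-in for A (all values illustrative):

```python
import math

a0, a1, t = -0.5, 0.2, 1.0
A = [[a0, a1], [0.0, a0 + a1]]  # 2x2 stand-in generator

def mat_mul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inv2(M):
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    return [[M[1][1] / det, -M[0][1] / det],
            [-M[1][0] / det, M[0][0] / det]]

def expm(M, terms=80):
    # truncated exponential series, adequate for these small matrices
    result = [[1.0, 0.0], [0.0, 1.0]]
    power = [[1.0, 0.0], [0.0, 1.0]]
    for n in range(1, terms):
        power = mat_mul([[p / n for p in row] for row in power], M)
        result = [[result[i][j] + power[i][j] for j in range(2)]
                  for i in range(2)]
    return result

def yosida(k):
    # A_k = k (kI - A)^{-1} A
    kI_minus_A = [[k - A[0][0], -A[0][1]], [-A[1][0], k - A[1][1]]]
    return mat_mul([[k * v for v in row] for row in inv2(kI_minus_A)], A)

eA = expm([[t * v for v in row] for row in A])
errs = []
for k in (5, 50, 500):
    eAk = expm([[t * v for v in row] for row in yosida(k)])
    errs.append(max(abs(eAk[i][j] - eA[i][j])
                    for i in range(2) for j in range(2)))

print(errs)  # approximation error shrinks as k grows
```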

We now take into consideration the HJB

$$\begin{aligned} {\left\{ \begin{array}{ll} v _t(t,y)+\frac{1}{2}\mathop {\textrm{Tr}}\nolimits Q\nabla ^2 v (t,y)+\langle A_ky,\nabla v (t,y) \rangle \\ \quad + \frac{ \left( \langle B^*\nabla v (t,y)-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)}+\langle \tilde{\alpha },y_0\rangle -r v (t,y) =0&{} \forall (t,y)\in (0,T)\times H \\ v (T,y)=\langle \tilde{\lambda },y_0\rangle &{}\forall y\in H. \end{array}\right. } \end{aligned}$$
(4.22)

As argued for (4.9), a solution of (4.22) is given by

$$\begin{aligned} v^{(k)} (t,y)=\langle a_k(t),y\rangle +b_k(t) \end{aligned}$$
(4.23)

where

$$\begin{aligned} a_k(t) =e^{(T-t)(A_k^*-r)}(\tilde{\lambda },0) +\int _t^T e^{(s-t)(A_k^*-r)} (\tilde{\alpha },0) ds \end{aligned}$$
(4.24)

and

$$\begin{aligned} b_k(t) =\int _t^T \frac{1}{2}e^{-r(s-t)} \frac{ \left( \langle B^*a_k(s)-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)} ds. \end{aligned}$$
(4.25)
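To see where (4.24) comes from, one may substitute the affine ansatz (4.23) into (4.22): since \(\nabla v^{(k)}=a_k(t)\) and \(\nabla ^2 v^{(k)}=0\) (so the trace term vanishes), and writing \(\langle \tilde{\alpha },y_0\rangle =\langle (\tilde{\alpha },0),y\rangle \), the terms linear in y give a backward linear ODE for \(a_k\). A sketch of this computation:

```latex
% Terms of (4.22) linear in y, after inserting v^{(k)}(t,y) = <a_k(t),y> + b_k(t):
%   <a_k'(t),y> + <A_k y, a_k(t)> + <(\tilde\alpha,0),y> - r<a_k(t),y> = 0
%   for all y in H, i.e.
\[
  a_k'(t) = \bigl(r - A_k^*\bigr)\,a_k(t) - (\tilde{\alpha},0),
  \qquad a_k(T) = (\tilde{\lambda},0),
\]
% and variation of constants gives exactly (4.24); the terms independent of y
% determine b_k analogously, with terminal condition b_k(T) = 0.
```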

Since \(A_k^*\in L(H)\), we have \(a_k\in C^{1}([0,T];H)\) and \(b_k\in C^{1}([0,T];\mathbb {R})\). So we can apply Itô's formula to \( \left\{ e^{-r(s-t)}v^{(k)}(s, Y_k^{t,y,\tilde{u}}(s)) \right\} _{s\in [t,T]}\), obtaining:

$$\begin{aligned}&e^{-r(T-t)}\mathbb {E}\left[ v^{(k)}(T,Y_k^{t,y,\tilde{u}}(T)) \right] - v^{(k)}(t,y) \\&\quad =\mathbb {E}\bigg [ \int _t^Te^{-r(s-t)}\bigg ( v^{(k)}_t(s,Y_k^{t,y,\tilde{u}}(s))-rv^{(k)}(s,Y_k^{t,y,\tilde{u}}(s)) \\&\qquad + {1\over 2}\mathop {\textrm{Tr}}\nolimits \left[ Q \nabla ^2v^{(k)}(s,Y_k^{t,y,\tilde{u}}(s)) \right] \\&\qquad + \langle A_kY_k^{t,y,\tilde{u}}(s), \nabla v^{(k)} (s,Y_k^{t,y,\tilde{u}}(s)) \rangle +\langle B\tilde{u}(s),\nabla v^{(k)}(s,Y_k^{t,y,\tilde{u}}(s))\rangle \bigg ) ds \bigg ]. \end{aligned}$$

Since \(v^{(k)}\) is a solution to Eq. (4.22), we get

$$\begin{aligned}&e^{-r(T-t)} \mathbb {E}\left[ \langle \tilde{\lambda }, \left( Y_k^{t,y,\tilde{u}}\right) _0(T)\rangle \right] - v^{(k)}(t,y)\nonumber \\&\quad =\mathbb {E}\int _t^T \left[ e^{-r(s-t)}\left( -\frac{ \left( \langle B^*\nabla v^{(k)} (s,Y_k^{t,y,\tilde{u}}(s))-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)}- \langle \tilde{\alpha },(Y_k^{t,y,\tilde{u}})_0(s)\rangle \right. \right. \nonumber \\&\qquad + \left\langle B\tilde{u}(s),\nabla v^{(k)}(s,Y_k^{t,y,\tilde{u}}(s))\right\rangle \bigg )ds \bigg ]. \end{aligned}$$
(4.26)

We then let \(k\rightarrow \infty \) in (4.26). Recalling the convergence \(e^{\cdot A_k}\rightarrow e^{\cdot A}\) mentioned above, we first notice that

$$\begin{aligned} a_k\rightarrow a \; \text {in} \; H \; \text {and} \; b_k\rightarrow b \; \text {in} \; \mathbb {R},\; \text {uniformly on}\; [0,T],\; \text {as}\; k\rightarrow \infty . \end{aligned}$$
(4.27)

Then (4.26), (4.27), and (4.21) entail

$$\begin{aligned}&e^{-r(T-t)} \mathbb {E}\left[ \langle \tilde{\lambda }, \left( Y ^{t,y,\tilde{u}}\right) _0(T)\rangle \right] - v(t,y)\nonumber \\&\quad =\mathbb {E}\int _t^T \left[ e^{-r(s-t)}\left( -\frac{ \left( \langle B^*\nabla v (s,Y ^{t,y,\tilde{u}}(s))-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)}- \langle \tilde{\alpha },(Y ^{t,y,\tilde{u}})_0(s)\rangle \right. \right. \nonumber \\&\qquad + \left\langle B\tilde{u}(s),\nabla v(s,Y ^{t,y,\tilde{u}}(s))\right\rangle \bigg )ds \bigg ], \end{aligned}$$
(4.28)

or

$$\begin{aligned} v(t,y)&= e^{-r(T-t)} \mathbb {E}\left[ \langle \tilde{\lambda }, \left( Y ^{t,y,\tilde{u}}\right) _0(T)\rangle \right] \\&\quad +\mathbb {E} \left[ \int _t^T e^{-r(s-t)} \left( \frac{ \left( \langle B^*\nabla v (s,Y ^{t,y,\tilde{u}}(s))-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)}+ \langle \tilde{\alpha },(Y ^{t,y,\tilde{u}})_0(s)\rangle \right. \right. \\&\quad - \left\langle B\tilde{u}(s),\nabla v(s,Y ^{t,y,\tilde{u}}(s))\right\rangle \bigg ) ds \bigg ]. \end{aligned}$$

Finally, adding and subtracting

$$\begin{aligned} \mathbb {E} \left[ \int _t^T e^{-r(s-t)} \left( \langle \tilde{\beta },\tilde{u}(s)\rangle + \left\langle \begin{bmatrix} \gamma _0&{}0\\ 0&{}\gamma _1 \end{bmatrix}\tilde{u}(s),\tilde{u}(s)\right\rangle \right) ds \right] \end{aligned}$$

we get

$$\begin{aligned} v(t,y)&= \mathcal {J}(t,y;\tilde{u}) +\mathbb {E} \left[ \int _t^T e^{-r(s-t)} \left( \frac{ \left( \langle B^*\nabla v (s,Y ^{t,y,\tilde{u}}(s))-\tilde{\beta },(1,1)\rangle \right) ^2}{4(\gamma _0+\gamma _1)}\right. \right. \\&\quad + \langle \tilde{\alpha },(Y ^{t,y,\tilde{u}})_0(s)\rangle - H_{CV}(s,B^*\nabla v(s,Y^{t,y,\tilde{u}}(s)),\tilde{u}(s)) \bigg ) ds \bigg ]. \end{aligned}$$

\(\square \)

We can now prove a verification theorem, i.e. a sufficient condition for optimality given in terms of the solution v of the HJB equation.

Theorem 4.3

Let Assumption 2.1 hold true. Let v be as in (4.10), with a and b given respectively by (4.17) and (4.18), the solution to the HJB equation (4.9). Then the following hold.

  1. (i)

    For all \((t,y)\in [0,T]\times H\) we have \(v(t,y) \ge V(t,y)\), where V is the value function defined in (4.2).

  2. (ii)

    Let \(t\in [0,T],y\in H\). If \(u^*\) is as in (4.8), and if \(\tilde{u}^*(s):=(u^*(B^*a(s)),u^*(B^*a(s)))\), \(s\in [t,T]\), then the pair \((\tilde{u}^*,Y^{t,y,\tilde{u}^*})\) is optimal for the control problem (4.2), and \(V(t,y)=v(t,y)=\mathcal {J}(t,y;\tilde{u}^*)\).

Proof

The first statement follows directly from (4.19), by the positivity of the integrand. Concerning the second statement, we immediately see that, when \(\tilde{u}=\tilde{u}^*\), (4.19) becomes \(v(t,y)=\mathcal {J}(t,y;\tilde{u}^*)\). Since we know that, for any admissible control \(\tilde{u}=(u,z)\in \mathcal {U}\times \mathcal {U}\) with \(u=z\),

$$\begin{aligned} \mathcal {J}(t,y;\tilde{u})\le V(t,y) \le v(t,y), \end{aligned}$$

the claim immediately follows. \(\square \)

4.4 Equivalence with the original problem in the LQ case

To find the solution of the original problem in the LQ case we need to apply Proposition 4.1, i.e. to prove that the optimal control of the original LQ problem is deterministic. This is the subject of the next proposition.

Proposition 4.4

Condition (4.3a) is verified.

Proof

Let \(t\in [0,T]\), \(y\in H\). Let \(\tilde{u}=(u,z)\in \mathcal {U}\times \mathcal {U}\), with \(z(s)=\mathbb {E}[u(s)]\) for \(s\in [t,T]\). Let \(\tilde{u}_\mathbb {E}:= (\mathbb {E}[u],z)\). Then

$$\begin{aligned} \tilde{u}_\mathbb {E}\in \big \{ \tilde{u}=(u,z)\in \mathcal {U}\times \mathcal {U}:\; u=z\; \text {deterministic} \big \}. \end{aligned}$$

Notice, by (3.2), that

$$\begin{aligned} \mathbb {E} \left[ Y^{t,y,\tilde{u}} \right] = \mathbb {E} \left[ Y^{t,y,\tilde{u}_\mathbb {E}} \right] . \end{aligned}$$
(4.29)

Then

$$\begin{aligned} \mathcal {J}(t,y;\tilde{u})&= \mathbb {E}\bigg [ \int _t^T e^{-r(s-t)} \left( \langle \tilde{\alpha },Y^{t,y,\tilde{u}}_0(s) \rangle -\langle \tilde{\beta },\tilde{u}(s)\rangle - \left\langle \begin{bmatrix} \gamma _0&{}0\\ 0&{}\gamma _1 \end{bmatrix}\tilde{u}(s),\tilde{u}(s)\right\rangle \right) ds\\&\quad +\langle \tilde{\lambda },(Y^{t,y,\tilde{u}})_0(T)\rangle \bigg ]\\&= \mathbb {E}\bigg [ \int _t^T e^{-r(s-t)} \left( \langle \tilde{\alpha },Y^{t,y,\tilde{u}_\mathbb {E}}_0(s) \rangle -\langle \tilde{\beta },\tilde{u}_\mathbb {E}(s)\rangle - \left\langle \begin{bmatrix} \gamma _0&{}0\\ 0&{}\gamma _1 \end{bmatrix}\tilde{u}(s),\tilde{u}(s)\right\rangle \right) ds\\&\quad +\langle \tilde{\lambda },(Y^{t,y,\tilde{u}_\mathbb {E}})_0(T)\rangle \bigg ]\\&\le \text {(by Jensen's inequality)}\\&\le \mathbb {E}\bigg [ \int _t^T e^{-r(s-t)} \left( \langle \tilde{\alpha },Y^{t,y,\tilde{u}_\mathbb {E}}_0(s) \rangle -\langle \tilde{\beta },\tilde{u}_\mathbb {E}(s)\rangle - \left\langle \begin{bmatrix} \gamma _0&{}0\\ 0&{}\gamma _1 \end{bmatrix}\tilde{u}_\mathbb {E}(s),\tilde{u}_\mathbb {E}(s)\right\rangle \right) ds\\&\quad +\langle \tilde{\lambda },(Y^{t,y,\tilde{u}_\mathbb {E}})_0(T)\rangle \bigg ], \end{aligned}$$

which implies (4.3a). \(\square \)
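The key step above is Jensen's inequality for the convex quadratic cost: \(\mathbb {E}\langle \Gamma u,u\rangle \ge \langle \Gamma \mathbb {E}[u],\mathbb {E}[u]\rangle \) for \(\Gamma =\textrm{diag}(\gamma _0,\gamma _1)\succeq 0\), so replacing u by its mean can only decrease the subtracted cost term. A minimal numerical check, with arbitrary sample values standing in for the random control:

```python
import numpy as np

rng = np.random.default_rng(0)
Gamma = np.diag([2.0, 3.0])  # stands in for diag(gamma_0, gamma_1), PSD

# 10_000 samples of a hypothetical R^2-valued control u(s)
u = rng.normal(loc=[1.0, -0.5], scale=1.0, size=(10_000, 2))

def quad(x):
    """Quadratic cost <Gamma x, x>, vectorized over leading axes."""
    return np.einsum('...i,ij,...j->...', x, Gamma, x)

mean_of_quad = quad(u).mean()        # E[<Gamma u, u>]
quad_of_mean = quad(u.mean(axis=0))  # <Gamma E[u], E[u]>

# Jensen: the cost of the averaged control never exceeds the average cost
assert quad_of_mean <= mean_of_quad
```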

Corollary 4.5

Let \(f,g\) be as in (4.7). Let \(t\in [0,T]\), \(x\in \mathbb {R}\). If \(u^*\) is as in (4.8), with \((x,x)\) in place of \(y_0\), then \(u^*(B^*a(s))\) is optimal for \(\overline{V}(t,x)\).

Proof

The statement is a straightforward consequence of (4.1), Proposition 4.1, and Theorem 4.3. \(\square \)