1 Introduction

Understanding price dynamics in the housing market is crucial both for private entities, such as real estate developers, but also for policy-makers, who can thus shape effective housing policies. The need for a proper understanding of the mechanisms of the housing market also stems from its huge impact on a country's economy, as was recognised in the recent global financial crisis.

One aspect of housing market dynamics is the phenomenon of price convergence or divergence. In the literature, a number of arguments can be found for the possibility of the convergence of sales/rental prices between spatially separated residential markets. In particular, as Apergis et al. (2015) point out, price convergence may be a consequence of the so-called ripple effect, which postulates the easy transfer of price shocks between property markets, due to existing population, labour market and financial capital mobility. It should be stressed that the occurrence of the ripple effect in the housing market has been empirically confirmed by studies conducted by Tomal (2020a) and Brzezicka et al. (2019), among others.

Conversely, however, assets in the housing market are highly heterogeneous and immobile (Małkowska et al. 2019), which may encourage price divergence. It is, therefore, difficult to expect that all local/regional housing markets in a given area will converge to a single equilibrium. A much more likely phenomenon is the occurrence of convergence clubs, i.e. the existence of several steady states to which prices are heading in the long term. The identification of price convergence clubs in the housing market is possible using a method developed by Phillips and Sul (2007, 2009). The empirical research to date in this area is not very extensive, and to the best of the author's knowledge only several such analyses have been developed. In particular, studies on the presence of convergence clubs in the residential market have been conducted in the USA (Kim and Rous 2012; Apergis and Payne 2012, 2019, 2020; Montañés and Olmos 2013), China (Meng et al. 2010), who analysed the stochastic convergence of housing rents between cities in Turkey.

Such a significant omission of the rental market in the analyses of the convergence phenomenon seems unjustified. This is due to the fact that it is the rental market, compared to home-ownership, that is currently develo** very fast all over the world, in response to increasing property prices and the problem of affordability. In addition, as Czerniak and Rubaszek (2018) and Rubaszek and Rubio (2020) state, a well-developed rental market contributes to the reduction of fluctuations in the entire housing segment, as well as to macroeconomic stability. Therefore, the first objective of this article is to examine the processes of convergence of residential rents across Polish provincial capitals. This goal will be pursued in two stages. The first will assess the occurrence of overall convergence across the studied cities. In this case, four types of convergence will be examined, i.e. β-, σ-, stochastic and relative convergence. For the latter, the research method will be the Phillips and Sul (2007, 2009) approach, which will also be used in the second stage of the research aimed at testing for cluster convergence. The results of the study in the field of cluster convergence will enable the identification of clubs, in which rents tend to move towards a club-specific steady state in the long term.

Given that long-term price behaviour is the subject of the study, it should be noted that the calculations using the methodology outlined by Phillips and Sul (2007, 2009) should not be based on raw time series data but on an extracted trend component. Surveys to date have analysed convergence in the housing market based on only one selected filtration technique. The vast majority of papers are dominated by the Hodrick and Prescott (1997) filter, with the exception of a study conducted by Holmes et al. (2019), who used the Hamilton (2018) filter. The second objective of this study was formulated on this basis and is to compare the club convergence analysis estimates obtained based on various time series filtering techniques. In particular, the comparison will include an analysis of the structure of the estimated convergence clubs, as well as the dynamics of their establishment over subsequent 3-year periods. In addition, using a logit model, the drivers for the formation of the convergence clubs will be checked.

As mentioned above, currently, the rental market is becoming an extremely important part of the real estate sector and complementary to home-ownership in Poland as well (Tomal 2020b). It is, therefore, important to analyse the relationship between sales and the rental housing market. Consequently, the third objective of this analysis is to compare the results of the identified convergence clubs in the rental market with those found in the sales market. The implementation of this goal is also important from the point of view of formulating potential recommendations for policy-makers regarding the development of the residential rental market in Poland.

In summary, this article aims to answer the following research questions:

  1. Q1.

    Do residential rents across the surveyed rental markets converge to one steady state in the long term? If not, can convergence clubs be identified among the cities studied?

  2. Q2.

    In the case of absence of overall convergence in the Polish rental market, do club convergence analysis estimates differ depending on the time series filtering technique? Is the number of convergence clubs stable over time? Is the estimated structure of convergence clubs the same as in the case of the residential sales market?

  3. Q3.

    In the case of absence of overall convergence in the Polish rental market, which factors drive the formation of the identified convergence clubs?

This study contributes to the existing literature in several ways. Firstly, this is the first paper that identifies convergence clubs in the residential rental market. Secondly, this work is unique because of the use of several different filtering methods to extract the trend component from the data. In particular, the Hodrick and Prescott (1997), Butterworth (1930), Christiano and Fitzgerald (2003) filters, as well as the very recently developed boosted Hodrick–Prescott filter developed by Phillips and Shi (2020) will be used. Finally, a completely new approach outlined by Kong et al. (2019) will be employed to assess the occurrence of σ-convergence.

The rest of the article is organised as follows. Section 2 describes the data and provides a detailed overview of the convergence test methods. Section 3 contains the results of the analysis and their discussion. The final Sect. 4 presents the main conclusions of the study, as well as recommendations for local and national policy-makers concerning the development of the rental market, particularly in the areas of urban land-use planning policy, housing legislation and public–private partnerships.

2 Methodology and data

2.1 Data and study area

The data used in this study were obtained from the National Bank of Poland (https://www.nbp.pl/). At the moment, this is the most detailed source of information on housing rents in Poland. It should also be emphasised that the analysed data concern only the secondary market, as the primary residential rental market in Poland practically does not exist.

The time series used in this study covers the period 2013q1–2019q1 and represents the average monthly nominal housing rent per square meter in each Polish provincial capital.Footnote 1 Two cities, however, i.e. Opole and Bydgoszcz, were omitted from the study due to lack of data for the entire analysed time frame. Figure 1 presents the dynamics of rental prices in the surveyed cities, on the basis of which it can be concluded that, by far, the highest rents are in the Polish capital, i.e. Warsaw. Other cities, which differ significantly from the rest in terms of rental price levels, are Cracow, Wroclaw and Gdansk.

Fig. 1
figure 1

Temporal dynamics of average monthly housing rents (PLN/m2) in the cities

It should be noted that Fig. 1 shows raw data, which have not been processed in any way. During convergence tests, a seasonally adjusted time series containing information on housing rentals in natural log form will be used. In addition, in order to perform the convergence test using the log t regression method, a trend component will be extracted from the time series described above, which is detailed in the next subsection.

2.2 Testing for overall convergence

In order to examine the overall convergence of housing rents in Polish provincial capitals, four research methods will be used to assess the occurrence of β-, σ-, stochastic, and relative convergence. In the context of this article, β-convergence assumes that cities with low residential rents at the beginning of the analysed period are, at the same time, characterised by a high average rental price growth rate. This type of convergence can be examined using the following equation:

$$\frac{{\ln \left( {\frac{{p_{iT} }}{{p_{i0} }}} \right)}}{T} = \alpha + \beta \ln p_{i0} + \varepsilon_{i}$$
(1)

where \(p_{iT}\) is the average rental price in city \(i \left( {i = 1, \ldots ,N} \right)\) at time \(T \left( {t = 1, \ldots ,T} \right)\), \(p_{i0}\) denotes the initial average housing rent in city \(i\), \(\alpha\) represents the constant term, \(\varepsilon_{i}\) is the error term, and \(\beta\) is the coefficient. If, following the estimation of the above model, the fitted slope coefficient (\(\hat{\beta }\)) is statistically significant and has a negative value, then the phenomenon of unconditional convergence between cities can be said to exist. Analysing convergence by means of Eq. (1) examines its absolute version. If the above equation is supplemented by a set of covariates, conditional convergence is tested. It should be noted, however, that β-convergence has several disadvantages. First of all, the fitted slope coefficient is biased due to the endogeneity problem (Bai et al. 2019). Secondly, Eq. (1) is cross-sectional and does not take into account price changes over time. Finally, β-convergence does not provide information on whether or not there is a process of price equalisation between locations. In order to assess the occurrence of the latter phenomenon, the concept of σ-convergence should be used. It should also be stressed that there is a relationship between β- and σ-convergence in the sense that the former is necessary for the latter to appear (Young et al. 2008).

Until now, there has been no robust econometric framework to study σ-convergence. Recently, however, Kong et al. (2019) developed a very powerful tool for studying this type of convergence (hereinafter referred to as the KPS approach). This new methodology uses the following trend regression to verify the occurrence of σ-convergence:

$$S_{t} = \alpha + \phi t + \varepsilon_{t}$$
(2)

where \(S_{t} = \frac{1}{N}\mathop \sum \nolimits_{i = 1}^{N} \left( {\ln p_{it} - \overline{\ln p}_{t} } \right)^{2}\) and denotes the cross-sectional variance of log level rental prices at time \(t\), \(\overline{{{\text{ln}}p}}_{t}\) is the cross-sectional mean of log level rental prices at time \(t\), \(\varepsilon_{t}\) is the error term, and \(\phi\) represents the slope coefficient. In the KPS method, σ-convergence is present if the fitted slope coefficient \(\hat{\phi }\) is negative and statistically significant. The assessment of the significance of \(\hat{\phi }\) is based on a comparison of the estimated robust t-statistic \(\left( {t_{{\hat{\phi }}} } \right)\) to a critical value of ± 1.65. In particular, if \(t_{{\hat{\phi }}} < - 1.65\) there is convergence; however, if \(t_{{\hat{\phi }}} > 1.65\), there is divergence. The advantage of the KPS approach is that it allows for: (1) misspecification of Eq. (2); and (2) serially correlated and heteroskedastic residuals. It should also be noted that for the empirical application of the method described above, the lag truncation parameter (\(L\)) should be determined.Footnote 2 Following Kong et al. (2019), the calculations in this study will be made for \(L = \left\{ {1,3,5,7} \right\}\).

Another type of convergence that will be examined in this article is the so-called stochastic convergence, which in a panel setting can be defined as (Próchniak and Witkowski 2016):

$$\mathop {\lim }\limits_{k \to \infty } \left( {{\text{ln}}p_{i,t + k} - \overline{{{\text{ln}}p}}_{t + k} |I_{t} } \right) = 0$$
(3)

where \(I_{t}\) denotes the set of information at time \(t\), while the rest of the markings are as above. In order to examine empirically whether there is stochastic convergence, the stationarity of the following time series should be checked:

$$y_{it} = {\text{ln}}p_{it} - \overline{{{\text{ln}}p}}_{t} = {\text{ln}}\left( {\frac{{p_{it} }}{{\overline{p}_{t} }}} \right)$$
(4)

It should be stressed that in the data used in this article cross-sectional dependence was identified using the Pesaran (2004) and Breusch and Pagan (1980) tests; therefore, the selected unit root test must be resistant to this type of phenomenon. On this basis, this study will apply the cross-sectionally augmented DF (CADF) test outlined by Pesaran (2007a). The idea of the above method is to extend the DF test with a cross-sectional average of: (1) the variable defined above with a time lag of one period, as well as (2) its first difference. Taking this into account, the CADF regressionFootnote 3 in the context of this article is as follows:

$$\Delta y_{it} = \alpha_{i} + \rho_{i} y_{i,t - 1} + \varphi_{i} \overline{y}_{t - 1} + \vartheta_{i} \Delta \overline{y}_{t} + \varepsilon_{it}$$
(5)

where \(\rho_{i}\), \(\varphi_{i}\), \(\vartheta_{i}\) are parameters to be estimated for each studied city separately. Testing for the unit root consists in checking the statistical significance of the fitted parameter for a lagged variable. In particular, if the estimated t-statistic for \(\hat{\rho }_{i}\) (\(t_{{\hat{p}_{i} }}\)—known also as CADFi) is less than the critical value, the unit root hypothesis is rejected, which means that stochastic convergence takes place. Using the calculated values of CADFi, it is also possible to check the panel stationarity as a whole. For this purpose, the CIPS (cross-sectionally augmented Im, Pesaran and Shin) statistic, as an arithmetic mean of CADFi, must be calculated. Critical values for the CADF and CIPS tests are presented for the different significance levels in Pesaran (2007a). It should also be noted that Eq. (5) is able to indicate whether there is a process of equalisation of rental prices between a given city and the Polish average to a constant value or to zero. For the latter to occur, both the estimated t-statistic for \(\hat{\rho }_{i}\) must be less than the critical value and the fitted parameter \(\hat{\alpha }_{i}\) must be statistically insignificant.

Finally, the concept of relative convergence introduced by Phillips and Sul (2007, 2009) will also be used to assess overall convergence in this study. Generally speaking, the two time series converge relatively over time if their ratio is heading towards unity in the long term. The approach outlined by Phillips and Sul (2007, 2009) (hereinafter referred to as the PS approach) is widely used in practice due to its advantages. First of all, the PS approach takes into account the heterogeneity of the examined units and its change over time. Secondly, the method does not depend on assumptions regarding trend stability or stochastic non-stationarity in the panel common component. Thirdly, this method overcomes the problem of the biased convergence parameter appearing at β-convergence. Fourthly, the PS approach is treated as an asymptotic cointegration test without suffering from the small sample problems.Footnote 4 Fifthly, the analysis is based on a general form of a nonlinear time-varying factor model. Finally, in the absence of overall convergence between the units analysed, the PS approach allows the identification of convergence clubs (Apergis and Cooray 2014).

The starting point of the relative convergence examination using the PS approach in the context of this study is the following decomposition of the panel of average rental prices:

$${\text{ln}}p_{it} = \delta_{it} \mu_{t}$$
(6)

where \(\delta_{it}\) is the changing over time idiosyncratic component measuring the relative share of city \(i\) at time \(t\) in \(\mu_{t}\), i.e. in a common trend component. Further, \(\delta_{it}\) can be modelled in a semiparametric form as:

$$\delta_{it} = \delta_{i} + \sigma_{i} \xi_{it} \log \left( t \right)^{ - 1} t^{ - \alpha }$$
(7)

where \(\xi_{it} \sim iid\left( {0,1} \right)\) across \(i\), \(\alpha\) is a parameter indicating the speed of convergence, \(\sigma_{i}\) is a scale parameter, \(\log \left( t \right)\) denotes a slowly varying function, and \(\delta_{i}\) is fixed. On this basis, the null hypothesis of convergence can be presented as follows:

$$\begin{array}{*{20}l} {H_{0} : \delta_{i} = \delta \;{\text{and}}\; \alpha \ge 0} \hfill & {\quad {\text{for convergence in rates}}} \hfill \\ {H_{0} : \delta_{i} = \delta \;{\text{and}}\;\alpha \ge 1} \hfill & {\quad {\text{for convergence in levels}}} \hfill \\ \end{array}$$
(8)

whereas the alternative hypothesis of divergence is:

$$H_{A} : \delta_{i} \ne \delta \;{\text{for}}\;{\text{all}}\; i\; {\text{or}}\; \alpha < 0$$
(9)

The first stage for the empirical verification of the hypotheses set out above is to estimate the cross-sectional variance according to the following formula:

$$H_{t} = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left( {h_{it} - 1} \right)^{2} ,\;h_{it} = \frac{{\ln p_{it} }}{{N^{ - 1} \mathop \sum \nolimits_{i = 1}^{N} \ln p_{it} }}$$
(10)

where \(h_{it}\) is the transition path of city i at time t, which represents the ratio of log level rental price in city i and panel average at time t. In case \(h_{it} \to 1\) for all i, as \(t \to \infty\), or when \(H_{t} \to 0\), as \(t \to \infty\), there is a convergence process. The next stage of the convergence study according to the PS approach is the estimation of the so-called log t regression, which is given by formulaFootnote 5:

$$\log \left( {\frac{{H_{1} }}{{H_{t} }}} \right) - 2\log \left( {\log \left( t \right)} \right) = a + b\log \left( t \right) + \varepsilon_{t}$$
(11)

where \(t = \left[ {rT} \right],\left[ {rT} \right] + 1, \ldots ,T\) for \(r \in \left[ {0.2,0.3} \right]\) and \({b}\) is the regression parameter. In particular, if \(\hat{b}\) is significantly smaller than zero, there is a process of divergence, i.e. all cities do not share a common trend in the long run. Conversely, in the case of \(\hat{b} \ge 2\), there is a convergence in levels, which means that the rental prices from the surveyed cities aim at one common value. Finally, if \(2 > \hat{b} \ge 0\), there is growth convergence, which indicates that housing rent growth rates across cities tend to converge. The significance of parameter \(\hat{b}\) in the PS approach is usually assessed at a level of 0.05, for which the critical value is − 1.65. An important element of the log t regression is also the selection of the sample fraction (r) to be excluded from the estimation. For a short time series, i.e. when \(T \le 50\), the setting \(r = 0.3\) is suggested (Phillips and Sul 2007).Footnote 6

In the context of the analysis of rental price convergence before applying the PS approach, the cycle component should be removed from each time series because it improves the finite sample power and size of the test (Phillips and Sul 2007). When the subject of the analysis is a raw rental price time series, Eq. (6) concerning panel data decomposition is as follows:

$${\text{ln}}p_{it} = \delta_{it} \mu_{t} + \tau_{it}$$
(12)

where \(\tau_{it}\) is a business cycle component for city i at time t.

In this article, four filtering techniques were used to extract the trend component from the data. The first is the well-known Hodrick and Prescott (HP) (1997) filter, for which, due to the quarterly nature of the data, a tuning parameter (\(\lambda\)) equal to 1600 was set. The above filter, however, has been criticised in recent years, among other things because of the problem of selecting the optimal value of the \(\lambda\) parameter, which has a significant impact on the result of the filtration. As noted by Phillips and Shi (2020), if \(\lambda\) is too large, the HP fitted trend creates a residual trend, which pollutes the cyclical component. If \(\lambda\) is too small, however, the fitted trend is too flexible and imitates short-term fluctuations. Therefore, Phillips and Shi (2020) developed a modification of the standard HP filter, which they called boosted HP filter. This method is based on checking whether the cyclical component contains elements of the trend after using the standard HP filter. If so, the HP filter is reapplied to it to remove leftover trend residuals. When a new cyclic component is also characterised by trending behaviour, the whole process is repeated and continues until the trend elements are completely removed. As a criterion for stop** the iteration process, the ADF test can be used, which examines after which time the cyclic component is stationary (assumed significance level 0.05). In addition to the HP and boosted HP methods, the Butterworth (1930) and Christiano and Fitzgerald (CF) (2003) filters were used in this study.

2.3 Testing for cluster convergence

It should be noted that in the absence of overall convergence across the studied cities, it is possible to identify so-called convergence clubs, in which the rental prices of members of a given cluster head towards a club-specific steady state. The PS approach described above will be used to assess the occurrence of convergence clubs.Footnote 7 Under this method, the identification of city clusters takes place in the following steps (Du 2017):

  1. 1.

    arranging the cities in the panel in decreasing order according to housing rents in the last period;

  2. 2.

    forming a core group of cities \(k^{*}\) on the basis of the log t test maximising \(t_{k}\), subject to \(t_{k} > - 1.65\), starting with the k highest individuals in the panel, where \(N > k \ge 2\);

  3. 3.

    adding one city to the core group formation at a time and running the log t regression. If the obtained t-statistic is above the critical value (c) equal to 0 (for small T), include the city in the core group. Repeat this procedure for the rest of the cities. At the end, run the log t test for the extended core group of cities and make sure that the calculated t-statistic is greater than − 1.65. If so, the first convergence club is created. If not, repeat step 3 but firstly raise the critical value c;

  4. 4.

    for the remaining cities, forming a potential new convergence club and running the log t regression. If the obtained t-statistic is above − 1.65, end the whole clustering procedure. Otherwise, repeat steps 1–3 to check if the remaining subgroup of cities can be further subdivided into convergence clubs;

  5. 5.

    running a club merging procedure to check if the convergence clubs created in steps 1–4 can be combined. In this paper, the club merging will be conducted subsequently. According to this approach, the log t test is run for all subsequent pairs of the initial clubs. If the calculated t-statistic for a pair indicates that there is a convergence, merge the clubs from this pair into one club.

3 Results

3.1 Overall convergence estimates

The first stage of the empirical study was the assessment of overall convergence based on the β-convergence concept. The results of Eq. (1) revealed a negative slope coefficient value (\(\hat{\beta } = {-}\,0.0059\)). The convergence parameter, however, did not turn out to be significant even at the 0.10 level. Therefore, taking into account the whole panel, it is not possible to identify the occurrence of a phenomenon in which cities with low initial rental prices are simultaneously characterised by high average growth rate over the analysed period. Detailed estimates of the β-convergence test are presented in Fig. 2, from which it can be concluded that lack of β-convergence is mainly caused by cities such as Katowice and Wroclaw, which are characterised by a very similar initial price but a diametrically different growth rate. Figure 2 also provides the information that, for the surveyed cities, the average rental price growth rate ranges from approx. 0.2% to almost 2% in the case of the city of Lodz, which at the same time has almost the lowest initial rental price level.

Fig. 2
figure 2

Visualisation of β-convergence test estimates

Then, the occurrence of σ-convergence was checked using the KPS approach. Estimates obtained using the above-mentioned method, similarly to β-convergence, indicated the lack of σ-convergence in the examined period (Table 1). This means that rental prices across the studied cities do not show a tendency to equalise. When analysing the cross-sectional variance of housing rents in Fig. 3, however, it can be seen that from the second quarter of 2016, the variance of rental prices is decreasing. Therefore, the σ-convergence study was also conducted in two sub-periods: (1) 2013q1–2016q1; and (2) 2016q2–2019q1. The results showed that for the first 3 years there is a process of quite strong divergence, while for the second sub-period a convergence of rental prices can be observed (independently of the lag truncation parameter). Such a result may be related to the housing sales prices, which by 2016 were stagnant or slightly increasing, but after that time began to grow dramatically. This may have resulted in an increased demand for renting in relation to home-ownership, which could have accelerated the convergence process, as rent is a much more flexible form of housing and, therefore, prices can be "transferred" much quicker between locations.

Table 1 The KPS test results for σ-convergence
Fig. 3
figure 3

Visualisation of σ-convergence test estimates

Another surveyed type of convergence in the Polish residential rental market was stochastic convergence. The results of this analysis are presented in Table 2. In particular, the CADF unit root test estimates indicate that only four out of the 14 examined cities stochastically converge to the average rental price in Poland. Moreover, the verification of the statistical significance of parameter \(\hat{\alpha }_{i}\) shows that rental prices in Katowice and Lodz are heading towards exactly the same level as the panel average. Conversely, between rents in Cracow and Bialystok and the Polish average rental price, there is a convergence to nonzero constant (positive for Cracow and negative for Bialystok).

Table 2 Estimates of stochastic convergence using the CIPS and CADF unit root tests

The final method to study the overall convergence of housing rents across Polish provincial capitals was the log t regression. As with previous tests, the results obtained with the PS approach also indicated that the residential rental markets surveyed do not share the common trend in the long term, as shown by the \(t_{{\hat{b}}}\) values, which regardless of the data filtering method, are well below the critical value of − 1.65 (Table 3). Moreover, on the basis of transition paths analysis (Fig. 4a–d), a very strong heterogeneity can be observed among the analysed cities both in the cross-sectional and temporal dimensions. In particular, in the case of data filtering with the help of the HP, boosted HP and Butterworth filters, until 2016, it is clear that rental prices tend to diverge. In contrast, in the subsequent period, transition paths are beginning to strive for unity. Similarly, the cross-sectional variance marked in Fig. 4a–d also decreases from 2016 onwards, which confirms the convergence phenomenon. It should be noted that the above conclusions are consistent with those drawn from the KPS test. Furthermore, it can be noted that filtering time series of rental prices using the CF method largely fails to get rid of the cyclical component, which may be due to the fact that this filter is optimal only for data that are generated by pure random walk, while for other time series describing, for example, inflation, interest rates or unemployment this method is nearly optimalFootnote 8 (Christiano and Fitzgerald 2003).

Table 3 Log t regression test estimates for the whole sample depending on the filtering technique used to extract the trend component of time series
Fig. 4
figure 4

Transition paths and cross-sectional variance for the cities calculated based on the trend component of time series extracted using a the Hodrick–Prescott filter, b the boosted Hodrick–Prescott filter, c the Butterworth filter and d the Christiano–Fitzgerald filter

It should be stressed that according to the log t test, overall convergence of rental prices does not exist for different test periods as well. In particular, Fig. 5 presents t-statistics of the log t regression for subsequent 3-year periods. It can be seen that all lines in Fig. 5 are significantly below the critical value. It can only be pointed out that the t-statistic estimates dynamics is very similar for data filtration according to the HP and Butterworth filters.Footnote 9

Fig. 5
figure 5

The value of the t-statistic from the log t regression estimates in subsequent 3-year periods depending on the data filtering method

Summarising the overall convergence analysis, it should be stressed that all research methods used in this study rejected the possibility of its occurrence across the Polish residential rental markets, which is in line with other analyses studying convergence or differentiation in the Polish real estate market; see, for example, Żelazowski (2019), Gnat (2016), Dittmann (2014), Wiśniewski and Brzezicka (2020), Tomal (2019a), Tomal and Gumieniak (2020).

It seems that lack of overall convergence in the rental market is to be expected. Firstly, as mentioned earlier, housing goods are heterogeneous and immobile. Moreover, lack of convergence of rental prices among Polish cities results also from the exceptionally uneven and poorly developed rental market in Poland. In particular, according to Eurostat (2020) and OECD (2020), the share of households renting on market terms hovers around 5%, and most of the supply of housing on the rental market is offered in Cracow, Warsaw, Wroclaw and Gdansk. In addition, prices on the long-term rental market in Poland are very much distorted by short-term rentals, which until now have not been regulated to any extent. Moreover, the rental market in Polish cities is influenced by the number of students, which causes the number of units for long-term rent to decrease. In addition, the so-called institutional housing lease market, which is very well developed in other European countries, is hardly functioning in Poland. The specificity of the Polish residential rental market described above fully justifies the lack of overall convergence and, also, gives rise to the assumption that individual cities may form convergence clubs where rental prices are heading towards a club-specific steady state.

3.2 Cluster convergence estimates

Table 4 presents preliminary results of the allocation of the examined cities to convergence clubs. In the case of data filtering with the HP and Butterworth filters, the divergent cases are also visible where rental prices tend to move towards a city-specific steady state. When analysing in detail the division of the surveyed cities into clusters, it can be observed that in the case of cities such as Gdansk, Wroclaw, Warsaw, Szczecin and Poznan there are no differences between club clustering procedure estimates. For the remaining cities, depending on the filter used, we can see quite a significant variation in terms of belonging to a convergence club.

Table 4 Club convergence analysis estimates: initial classification

After the initial assignment of cities to convergence clubs, it was checked whether some city clusters could be merged. In particular, as shown in Table 4, this is only possible for the initial clubs 2 and 3 (\(t_{{\hat{b}}} > - \,1.65\)) when data are filtered with the HP and CF filters. After the merging procedure, the final classification of cities to convergence clubs is presented in Table 5. The final results of the cluster convergence analysis reveal that for eight out of the 14 examined cities, i.e. Wroclaw, Warsaw, Gdansk, Szczecin, Rzeszow, Poznan, Lublin and Katowice, the obtained allocation does not change depending on the data filtering method. To the contrary, small differences in assignment are observed for the city of Zielona Gora. For the rest of the cities, there are quite large discrepancies in terms of club membership. Furthermore, from Table 5 it can be concluded that the most similar club convergence analysis estimates are observed when applying the HP, Butterworth and CF filters (9–12 compatible matches). The results of the cluster procedure based on the boosted HP filter, however, are only in line with those obtained from other data filtering methods in the case of eight cities. Moreover, this filter makes by far for the largest fragmentation of the Polish rental market by creating as many as four convergence clubs. Cluster convergence estimates obtained from data extraction with the boosted HP filter, however, seem by far to be the most reliable. In particular, only in this case is the city of Cracow assigned to club 1, which would be expected, as Cracow has the second largest residential rental market in Poland after Warsaw. Moreover, created clubs 3 and 4 seem to be exceptionally accurate, which is confirmed in Fig. 2, showing that rental prices in these cities are very similar in terms of initial rent and its growth rate. It should also be stressed that all identified convergence clubs are characterised only by convergence in rates, i.e. the cities in individual clubs do not tend to reach a certain price level, but share a common growth path in the long term.

Table 5 Club convergence analysis estimates: final classification

Finally, Table 5 provides the information that the convergence processes in the rental market, compared to those in the sales secondary housing market, differ quite significantly. This may be due, as already mentioned above, to the very underdeveloped nature of the long-term residential rental market in Poland, which is additionally very much destabilised by short-term housing rents.

The geographical diversity of convergence clubs and divergent cases on the Polish residential rental market is very interesting. In particular, as can be seen in Fig. 6a–d, the allocation of individual cities to convergence clubs does not depend on their location. For example, cities included in the first clubs are located in opposite parts of Poland. On this basis, it can be concluded that spatial factors may not play a major role in the formation of convergence clubs.

Fig. 6
figure 6

Spatial distribution of convergence clubs: in the Polish rental housing market identified based on the trend component of time series extracted using a the Hodrick–Prescott filter, b the boosted Hodrick–Prescott filter, c the Butterworth filter and d the Christiano–Fitzgerald filter; e in the Polish sales secondary housing market according to estimates by Tomal (2019a, b)

Another aspect worth examining during the cluster convergence analysis is transition paths for the clubs. As can be seen in Fig. 7, the identified clubs are unlikely to have a trend towards convergence. The only exceptions are the transition paths for clubs 2 and 3 in the case of data filtering with the boosted HP filter, where it can be assumed that in the following years these clubs may be merged and create a new city cluster.

Fig. 7
figure 7

Transition paths for the clubs calculated based on the trend component of time series extracted using a the Hodrick–Prescott filter, b the boosted Hodrick–Prescott filter, c the Butterworth filter and d the Christiano–Fitzgerald filter

Finally, a dynamic club convergence analysis revealed that there was a slight decrease in the number of steady states in the surveyed lease market over the subsequent 3-year periods (Fig. 8). Furthermore, it can be seen that the time series obtained from the HP and Butterworth filters produce a very consistent number of steady states in the examined periods, which fluctuates around ten. The situation is also similar for the boosted HP and CF filters, but the number of long-term equilibria hovers around six.Footnote 10 It can also be concluded from Fig. 8 that using the boosted HP filter, the changes in the number of steady states in the subsequent 3-year periods are the highest compared to other filters. The latter result may indicate that the boosted HP filter best reflects the dynamics of change in the real estate market in question, which is undoubtedly a great advantage when analysing rental or sales prices.

Fig. 8
figure 8

Number of steady states (convergence clubs plus divergent cases) in subsequent 3-year periods depending on the data filtering method

Summarising the results of cluster convergence testing on the Polish residential rental market, one can state that this market is highly fragmented, and the estimates obtained vary depending on the data filtering method. In particular, the club convergence analyses based on the trend components extracted using the HP, Butterworth and CF filters produce quite similar results (between 9 and 12 matches out of 14 cities in the whole analysed period). On the other hand, substantially different from the others, the estimates obtained from data filtered using the boosted HP method seem to be the most reliable.

3.3 Robustness of cluster convergence estimates

In order to test the robustness of the results obtained in club convergence analysis, the approach outlined by Blanco et al. (2016) was applied. Specifically, it involves testing for the presence of stochastic convergence in the estimated convergence clubs. In other words, for each city in a given club, it is checked whether rental prices stochastically converge to the club average. Given that the speed of convergence in the defined clubs using the PS approach is small (for all cases \(\hat{b} < 2\)) and indicates convergence in growth rates, this robustness test will be performed in two specifications. The first one examines the stationarity of the variable \(y_{itc} = {\text{ln}}p_{itc} - \overline{{{\text{ln}}p}}_{tc}\) where \({\text{ln}}p_{itc}\) denotes the log level rental price in city \(i\) from club \(c\) at time \(t\) while \(\overline{{{\text{ln}}p}}_{tc}\) refers to the average log level rental price at time \(t\) in club \(c\). The stationarity analysis of \(y_{itc}\) allows to check whether rental price differentials between a given city and the club average tend towards zero or a nonzero constant. In the second robustness specification, a time series of the form \(\Delta y_{itc} \)Footnote 11 is analysed to test whether growth rates are converging over time among members of a given convergence club. In the framework of Eq. (5) convergence in growth rates occurs when the estimated t-statistic for \(\hat{\rho }_{i}\) is less than the critical value and simultaneously the fitted parameter \(\hat{\alpha }_{i}\) is statistically insignificant.

Table 6 presents the results of the robustness tests conducted. Based on them, it can be concluded that there is no absolute price convergence in the identified convergence clubs, which is in line with the generated values of the parameter \(b\) under the PS approach. Moreover, the test of stationarity of the variable \(\Delta y_{itc}\) mostly confirmed the results obtained in the PS analysis showing convergence in the rates of change of rental prices. However, some differences can also be identified between the estimates generated by the PS approach and those by this robustness test. However, this applies only to the clustering based on the trend components obtained through the HP and Butterworth filters. For example, when analysing the estimates in Table 5 it can be seen that the cities of Kielce and Olsztyn are divergent units and are not part of any convergence club. However, the clusterings based on data filtered using the boosted HP and CF methods indicate that rental prices in the above-mentioned urban agglomerations form their own two-element convergence club. This fact has been confirmed by the robustness test carried out, which revealed that the growth rates of rental prices are equalising between these cities (see Table 6).

Table 6 Robustness tests results

The robustness tests carried out, however, did not indicate which clustering is definitely the most reliable, i.e. whether the one obtained on the basis of data filtered with the boosted HP method or the one where the CF filter was applied. Therefore, in order to finally determine the most accurate division, the following formula was usedFootnote 12:

$$CP = \frac{{\mathop \sum \nolimits_{c = 1}^{C} \mathop \sum \nolimits_{i = 1}^{{N^{c} }} \mathop \sum \nolimits_{t = 1}^{T} \left| {\Delta y_{itc} } \right| }}{{\left( {T - 1} \right)\mathop \sum \nolimits_{c = 1}^{C} N^{c} }}$$
(13)

where \(N^{c}\) denotes the number of cities in a convergence club \(c\), while \(\left| {\Delta y_{itc} } \right|\) should be interpreted as the absolute value of the difference between the growth rate in the city \(i\) from club \(c\) at time \(t\) and the average growth rate in club \(c\) at time \(t\). The lower the CP value, the smaller, on average, the difference between the rental price growth rates in the cities and the average values in the clubs to which these cities belong, and consequently the estimated clubs are characterised by greater homogeneity.

The results indicated that the most accurate clustering was obtained by the PS approach, in which the boosted HP filter was used to extract the trend components of the time series. In particular, for this partitioning, the CP value was 0.0324, while for the clustering obtained when filtering the data with the CF method it was already 0.0339.Footnote 13 Taking into account the robustness tests carried out and the results presented in Sect. 3.2, it should be concluded that the most reliable assignment of the examined cities to convergence clubs was proposed by the PS analysis, which used the trend components extracted using the boosted HP filter.

3.4 What drives club convergence formation in the Polish rental housing market?

In order to check which factors, drive the formation of convergence clubs in the Polish residential rental market, a logit model was used, in which the dependent variable takes the value of 1 if two cities from a pair are in the same club, and 0 otherwise. It should be noted that in the case of the dependent variable defined in this way, the number of observations is 91 in the model, as the number of unique city pairs created is equal to \(\frac{{N\left( {N - 1} \right)}}{2} = \frac{{14\left( {14 - 1} \right)}}{2} = 91.\)

When examining aggregated prices in the housing market, as in this article, the explanatory variables are usually selected in the context of their impact on housing supply and demand. In terms of demand, the price determinants analysed are usually of demographic, economic and social nature. Conversely, supply variables describe, among others, the size of the housing stock. In this study, in order to select the covariates that may influence the formation of housing rental convergence clubs, the current scientific literature has been analysed, in which meso- and macro-determinants of sales housing prices on the Polish real estate market have been examined. Unfortunately, there are no empirical analyses explaining housing rental prices in Poland. Some studies, however, can be found at the micro-level, which concern tenants' preferences on local rental markets (see, for example, Gawlik et al. 2017).

The literature review indicated that the key predictors on the demand and supply side may be variables describing the level of unemployment (Leszczyński and Olszewski 2017; Tomal 2019a, b, 2021; Cellmer et al. 2020), wages (Cellmer et al. 2020), housing stock (Kokot 2018; Matysiak and Olszewski 2019; Tomal 2019b), population size and density (Tomal 2019a), city area (Tomal 2019a) and sense of security (Źróbek et al. 2015). The list of explanatory variables was supplemented with additional covariates characterising the specificity of the residential rental market in Poland, i.e. relating to tourist traffic (measured by total overnight stays) and the number of students in a given city. Moreover, the model contains regressors determining the distance between the surveyed cities and housing sales prices, in order to take into account spatial dependencies and links with the residential sales market, respectively. It should be stressed that since the dependent variable examines whether any two cities belong to the same convergence club, each regressor apart from the distance variable is defined as the absolute difference of the values for the cities in the pair.

The estimation results are presented in Table 7. In particular, models 1, 2, 3 and 4 were based on club convergence analysis estimates obtained from data filtered with the HP, boosted HP, Butterworth and CF methods, respectively. On the basis of the results obtained, it can be concluded that by far the best model in terms of goodness of fit is model 2 where the value of pseudo-R2 is 0.27 and is substantially higher than in the other models. Also, regarding the significance and direction of the impact of the average marginal effects, this model seems to be most in line with expectations. In particular, taking into account the factors affecting housing supply, it can be noted that the smaller the absolute difference in the number of dwellings and the area of a city in two cities the more likely it is that these cities will belong to the same convergence club. On the other hand, in terms of demand determinants, three variables are significant in model 2: population size, number of students and unemployment rate. For the latter two, the estimates of the average marginal effects are as expected, i.e. cities with similar unemployment rates and the number of students are more likely to belong to one convergence club. However, in the context of the population size variable, the situation is reversed. This may be due to the fact that in Poland the high level of development of the housing market is often not derived from the high population in a given city. For example, Cracow and Lodz, which are among the largest cities in Poland in terms of population, at the same time have opposite level of housing market development (see Fig. 1). The last variable that significantly shapes the formation of convergence clubs describes the distance between cities. The estimated parameter in model 2 indicates that the greater the distance, the greater the chance that any two cities will be in the same cluster. Paradoxically, taking into account the specificity of the Polish housing market, this is a desirable result. In particular, in Poland, the main urban agglomerations driving the development of the market under study are Cracow, Warsaw, Gdansk and Wroclaw. Analysing the location in space of these cities, one may notice that these local governments are located on opposite sides of Poland (see Fig. 6), and therefore, the estimated parameter for the distance variable seems plausible.

Table 7 Logit model estimates: drivers of convergence club formation

Examining the other estimated models (1, 3 and 4), it can be concluded that they have a rather low pseudo-R2 value, and moreover, the generated parameters for them are mostly insignificant as well as inconsistent with expectations; this applies, for example, to the parameter for the covariate describing the unemployment rate in model 4. Summarising the estimation results, one can notice that they fully confirmed the conclusions presented in Sects. 3.23.3 indicating that the most optimal tool for filtering the data under the PS approach is the boosted HP filter.

4 Conclusions

In this article, an attempt was made to examine comprehensively the convergence of housing rental prices across Polish provincial capitals. In particular, this study aimed at answering the following research questions:

  1. Q1.

    Do residential rents across the surveyed rental markets converge to one steady state in the long term? If not, can convergence clubs be identified among the cities studied?

  2. Q2.

    In the case of absence of overall convergence in the Polish rental market, do club convergence analysis estimates differ depending on the time series filtering technique? Is the number of convergence clubs stable over time? Is the estimated structure of convergence clubs the same as in the case of the residential sales market?

  3. Q3.

    In the case of absence of overall convergence in the Polish rental market, which factors drive the formation of identified convergence clubs?

Referring to question Q1, it should be stated that there is no overall convergence in the Polish residential rental market. It is possible, however, to identify convergence clubs where rental prices tend to move towards a club-specific steady state. In the scope of question Q2, club convergence analysis estimates indicate that, depending on the data filtering method, the division of cities into clusters differs significantly. Also, the dynamic analysis of steady states indicates that the number of long-term equilibria changes not only over time but also with the time series filtering technique. Moreover, on the basis of the revealed fragmentation of the residential rental market in Poland into convergence clubs, it can be concluded that it is not the same as that of the residential sales market. In the context of the final research question (Q3), a study using the logit model shows that cities with very similar unemployment rates, housing stocks, city areas and student numbers are very likely to be in the same convergence club.

This study obviously has some limitations. The first is the short time frame of the study, which is due to data availability. Another limitation is the lack of information for a larger number of localities, but this does not affect the results of the analysis to a large extent, as residential rents in Poland are concentrated mainly in major cities.

It should also be noted that this survey has revealed that the residential rental market in Poland is very fragmented. This may be a consequence of the underdeveloped nature of rental housing. Long-term residential leasing in Poland is also quite strongly destabilised by short-term renting, mainly for tourists, as well as by student tenancy. Therefore, state and local policy-makers should focus on the development of long-term renting as a supplement to home-ownership. This is necessary because of the huge housing shortages in Poland reaching up to three million units, as well as because of the increasing sales prices of housing, which exacerbates the residential affordability problem. One of the possible actions towards the development of the residential rental market in Poland is the promotion by policy-makers of institutional investments, for example, through land-use planning policy actions. In this context, as in the UK (see National Planning Policy Framework 2019), local politicians may consider introducing a mechanism for allocating certain locations to build-to-rent investment projects. A second possibility for politicians to promote tenancy is joint ventures of a public–private nature, in which a public entity could be a donor of land for future investments. Unfortunately, in Poland, the level of such activities is extremely low, and in the area of housing almost non-existent (Węgrzyn 2014; Tomal and Nalepka 2020). It should also be noted that renting in relation to purchasing is very often seen as an unprofitable form of providing a good such as housing (Ullah and Sepasgozar 2020). Therefore, policy-makers in Poland, through legislative actions, should solve the problem of artificially rising rents in the long-term market, which is a result of the growing stock of dwellings allocated for short-term tenancy.

Future research analysing the convergence processes in the Polish residential rental market should examine how the COVID-19 pandemic affected the behaviour of rental prices between the examined locations. It may be assumed that due to the lockdown of the entire economy in Poland, which was introduced several times, the processes responsible for the formation of the ripple effect in the residential market have weakened considerably. In particular, this may affect the rental market to a much greater extent than the sales market, which is more dependent on population, labour market and financial capital mobility.