Abstract
The Random Forest algorithm (RFA) is used to predict the approximate final box-office revenue of a movie in the Taiwanese film market. The results show that the RFA has stable capabilities to predict the final box-office revenue of a movie during its theatrical period with an 80% overall accuracy. Two other machine learning algorithms, i.e., the Support Vector Machine and the Logistic Regression algorithms, are applied for comparison with the RFA. We find that the RFA still achieves the highest overall accuracy of prediction in our experiment. Additionally, we applied an unsupervised machine learning method to distinguish each group in the box office revenue categories in the classification problem. Also, the feature importance analysis indicates that word-of-mouth plays a vital role in theatrical revenue determination. Our findings imply several crucial suggestions for film distributors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ainslie, A., Drèze, X., Zufryden, F.: Modeling movie life cycles and market share. Mark. Sci. 24(3), 508–517 (2005)
Asur, S., Huberman, B.A.: Predicting the future with social media. In: Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology-Volume 01 (2010)
Baek, H., Oh, S., Yang, H.-D., Ahn, J.: Electronic word-of-mouth, box office revenue and social media. Electron. Commer. Res. Appl. 22, 13–23 (2017)
Basuroy, S., Chatterjee, S., Ravid, S.A.: How critical are critical reviews? The box office effects of film critics, star power, and budgets. J. Mark. 67(4), 103–117 (2003)
Basuroy, S., Desai, K.K., Talukdar, D.: An empirical investigation of signaling in the motion picture industry. J. Mark. Res. 43(2), 287–295 (2006)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Brewer, S.M., Kelley, J.M., Jozefowicz, J.J.: A blueprint for success in the US film industry. Appl. Econ. 41(5), 589–606 (2009)
Chintagunta, P.K., Gopinath, S., Venkataraman, S.: The effects of online user reviews on movie box office performance: accounting for sequential rollout and aggregation across local markets. Mark. Sci. 29(5), 944–957 (2010)
De Vany, A., Walls, W.D.: Uncertainty in the movie industry: does star power reduce the terror of the box office? J. Cult. Econ. 23(4), 285–318 (1999)
Delen, D., Sharda, R.: Predicting the financial success of Hollywood movies using an information fusion approach. Indus. Eng. J. 21(1), 30–37 (2010)
Delen, D., Sharda, R., Kumar, P.: Movie forecast Guru: a web-based DSS for Hollywood managers. Decis. Support Syst. 43(4), 1151–1170 (2007)
Dellarocas, C., Zhang, X.M., Awad, N.F.: Exploring the value of online product reviews in forecasting sales: the case of motion pictures. J. Interact. Mark. 21(4), 23–45 (2007)
Dhar, T., Sun, G., Weinberg, C.B.: The long-term box office performance of sequel movies. Mark. Lett. 23(1), 13–29 (2012)
Du, J., Xu, H., Huang, X.: Box office prediction based on microblog. Expert Syst. Appl. 41(4), 1680–1689 (2014)
Duan, W., Gu, B., Whinston, A.B.: Do online reviews matter?—an empirical investigation of panel data. Decis. Support Syst. 45(4), 1007–1016 (2008)
Duan, W., Gu, B., Whinston, A.B.: The dynamics of online word-of-mouth and product sales—an empirical investigation of the movie industry. J. Retail. 84(2), 233–242 (2008)
Elberse, A., Eliashberg, J.: Demand and supply dynamics for sequentially released products in international markets: the case of motion pictures. Mark. Sci. 22(3), 329–354 (2003)
Eliashberg, J., Shugan, S.M.: Film critics: influencers or predictors? J. Mark. 61(2), 68–78 (1997)
Ghiassi, M., Lio, D., Moon, B.: Pre-production forecasting of movie revenues with a dynamic artificial neural network. Expert Syst. Appl. 42(6), 3176–3193 (2015)
Gong, J.J., Van der Stede, W.A., Mark Young, S.: Real options in the motion picture industry: evidence from film marketing and sequels. Contemp. Account. Res. 28(5), 1438–1466 (2011)
Hennig-Thurau, T., Houston, M.B., Walsh, G.: Determinants of motion picture box office and profitability: an interrelationship approach. RMS 1(1), 65–92 (2007)
Hur, M., Kang, P., Cho, S.: Box-office forecasting based on sentiments of movie reviews and Independent subspace method. Inf. Sci. 372, 608–624 (2016)
Kim, T., Hong, J., Kang, P.: Box office forecasting using machine learning algorithms based on SNS data. Int. J. Forecast. 31(2), 364–390 (2015)
Kohonen, T.: Self-organized formation of topologically correct feature maps. Biol. Cybern. 43(1), 59–69 (1982)
Lee, K., Park, J., Kim, I., Choi, Y.: Predicting movie success with machine learning techniques: ways to improve accuracy. Inf. Syst. Front. 20(3), 577–588 (2018)
Lee, K.J., Chang, W.: Bayesian belief network for box-office performance: a case study on Korean movies. Expert Syst. Appl. 36(1), 280–291 (2009)
Liao, Y., Peng, Y., Shi, S., Shi, V., Yu, X.: Early box office prediction in China’s film market based on a stacking fusion model. Ann. Oper. Res. 308, 1–18 (2020)
Lin, W., Wu, Z., Lin, L., Wen, A., Li, J.: An ensemble random forest algorithm for insurance big data analysis. IEEE Access 5, 16568–16575 (2017)
Litman, B.R.: Predicting success of theatrical movies: an empirical study. J. Pop. Cult. 16(4), 159–175 (1983)
Litman, B.R., Kohl, L.S.: Predicting financial success of motion pictures: the’80s experience. J. Media Econ. 2(2), 35–50 (1989)
Liu, Y.: Word of mouth for movies: its dynamics and impact on box office revenue. J. Mark. 70(3), 74–89 (2006)
Markonis, Y., Strnad, F.: Representation of European hydroclimatic patterns with self-organizing maps. Holocene 30(8), 1155–1162 (2020)
Marshall, P., Dockendorff, M., Ibáñez, S.: A forecasting system for movie attendance. J. Bus. Res. 66(10), 1800–1806 (2013)
Moon, S., Bergey, P.K., Iacobucci, D.: Dynamic effects among movie ratings, movie revenues, and viewer satisfaction. J. Mark. 74(1), 108–121 (2010)
MPAA. A comprehensive analysis and survey of the theatrical and home entertainment market environment (Theme) for 2017 - THEME REPORT (2017). https://www.mpaa.org/wp-content/uploads/2018/04/MPAA-THEME-Report-2017_Final.pdf
Nanda, T., Sahoo, B., Chatterjee, C.: Enhancing the applicability of Kohonen self-organizing map (KSOM) estimator for gap-filling in hydrometeorological timeseries data. J. Hydrol. 549, 133–147 (2017)
Neelamegham, R., Chintagunta, P.: A Bayesian model to forecast new product performance in domestic and international markets. Mark. Sci. 18(2), 115–136 (1999)
Panaligan, R., Chen, A.: Quantifying movie magic with google search. Google Whitepaper—Industry Perspectives+ User Insights (2013)
Prag, J., Casavant, J.: An empirical study of the determinants of revenues and marketing expenditures in the motion picture industry. J. Cult. Econ. 18(3), 217–235 (1994)
Ravid, S.A.: Information, blockbusters, and stars: a study of the film industry. J. Bus. 72(4), 463–492 (1999)
Sawhney, M.S., Eliashberg, J.: A parsimonious model for forecasting gross box-office revenues of motion pictures. Mark. Sci. 15(2), 113–131 (1996)
Schmidt, C.R., Rey, S.J., Skupin, A.: Effects of irregular topology in spherical self-organizing maps. Int. Reg. Sci. Rev. 34(2), 215–229 (2011)
Sharda, R., Delen, D.: Predicting box-office success of motion pictures with neural networks. Expert Syst. Appl. 30(2), 243–254 (2006)
Sun, J., Zhong, G., Dong, J., Saeeda, H., Zhang, Q.: Cooperative profit random forests with application in ocean front recognition. IEEE Access 5, 1398–1408 (2017)
Wang, F., Zhang, Y., Li, X., Zhu, H.: Why do moviegoers go to the theater? The role of prerelease media publicity and online word of mouth in driving movie going behavior. J. Interact. Advert. 11(1), 50–62 (2010)
Wang, W., **u, J., Yang, Z., Liu, C.: A deep learning model for predicting movie box office based on deep belief network. In: Tan, Y., Shi, Y., Tang, Q. (eds.) ICSI 2018. LNCS, vol. 10942, pp. 530–541. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93818-9_51
Wu, Q., Ye, Y., Liu, Y., Ng, M.K.: SNP selection and classification of genome-wide SNP data using stratified sampling random forests. IEEE Trans. Nanobiosci. 11(3), 216–227 (2012)
Ye, Y., Wu, Q., Huang, J.Z., Ng, M.K., Li, X.: Stratified sampling for feature subspace selection in random forests for high dimensional data. Pattern Recogn. 46(3), 769–787 (2013)
Zhang, L., Luo, J., Yang, S.: Forecasting box office revenue of movies with BP neural network. Expert Syst. Appl. 36(3), 6580–6587 (2009)
Zhang, W., Skiena, S.: Improving movie gross prediction through news analysis. In: 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendices
Appendix
Self-Organizing Maps Algorithm Pseudocode for Box Office Revenue in Taiwan
![figure a](http://media.springernature.com/lw685/springer-static/image/chp%3A10.1007%2F978-3-031-35763-3_49/MediaObjects/605573_1_En_49_Figa_HTML.png)
![figure b](http://media.springernature.com/lw685/springer-static/image/chp%3A10.1007%2F978-3-031-35763-3_49/MediaObjects/605573_1_En_49_Figb_HTML.png)
Self-Organizing Maps Clustering Results and Illustration of Training Error
Case 1: Topology: 10*10, neighborhood radius: 10 \(\sqrt{2}\), shrink learning rate: 0.9; iteration: 100.
![figure c](http://media.springernature.com/lw685/springer-static/image/chp%3A10.1007%2F978-3-031-35763-3_49/MediaObjects/605573_1_En_49_Figc_HTML.png)
Case 2: Topology: 10*10, neighborhood radius: 10 \(\sqrt{2}\), shrink learning rate: 0.01; iteration: 100.
![figure d](http://media.springernature.com/lw685/springer-static/image/chp%3A10.1007%2F978-3-031-35763-3_49/MediaObjects/605573_1_En_49_Figd_HTML.png)
Case 3: Topology: 8*8, neighborhood radius: 8 \(\sqrt{2}\), shrink learning rate: 0.9; iteration: 100.
![figure e](http://media.springernature.com/lw685/springer-static/image/chp%3A10.1007%2F978-3-031-35763-3_49/MediaObjects/605573_1_En_49_Fige_HTML.png)
Case 4: Topology: 8*8, neighborhood radius: 8 \(\sqrt{2}\), shrink learning rate: 0.01; iteration: 100.
![figure f](http://media.springernature.com/lw685/springer-static/image/chp%3A10.1007%2F978-3-031-35763-3_49/MediaObjects/605573_1_En_49_Figf_HTML.png)
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Lu, SH., Wang, HJ., Nguyen, A.T. (2024). Machine Learning Applications on Box-Office Revenue Forecasting: The Taiwanese Film Market Case Study. In: Ngoc Thach, N., Kreinovich, V., Ha, D.T., Trung, N.D. (eds) Optimal Transport Statistics for Economics and Related Topics. Studies in Systems, Decision and Control, vol 483. Springer, Cham. https://doi.org/10.1007/978-3-031-35763-3_49
Download citation
DOI: https://doi.org/10.1007/978-3-031-35763-3_49
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-35762-6
Online ISBN: 978-3-031-35763-3
eBook Packages: EngineeringEngineering (R0)