A Multimodal Machine Learning Framework for Box Office Prediction Using CNN-Based Trailer Analysis and Random Forest Models

Authors

  • Xinyuan Li
  • Zeyu Yang
  • Lei Yue
  • Tianyu Zhao

DOI:

https://doi.org/10.54691/18pwh024

Keywords:

Machine learning, convolutional neural network, random forest, data mining, computer vision, sentiment analysis, ensemble learning, multimodal analysis, prediction models, feature extraction.

Abstract

This study presents a comprehensive multimodal machine learning framework for box office prediction in the Chinese film market. We develop an integrated approach combining multiple computational techniques: time series analysis for market trend visualization, matrix heatmap analysis for actor-genre correlation mapping, convolutional neural network (CNN) for quantitative sentiment analysis of movie trailers through image processing, and random forest ensemble learning for final prediction. Our methodology employs web crawling technology to collect comprehensive datasets from major film platforms. The CNN model achieves 98.21% accuracy in classifying emotional tones from trailer frames, while the random forest model demonstrates robust performance with MAPE of 10.7% on validation data. We validate our framework by predicting box office performance for three films scheduled for 2025 release, achieving relative errors below 20%. This research contributes to the intersection of computer vision, machine learning, and entertainment industry analytics, providing a scalable solution for revenue forecasting in digital media markets.

Downloads

Download data is not yet available.

References

[1] X. Hu, Y. Guo, M. Feng, et al., "Time Series Model Prediction and Analysis of Malaria Cases in Jiangsu Province from 1980 to 2023," Chinese Journal of Frontier Health and Quarantine, vol. 47, no. 6, pp. 582-587, 2024.

[2] C. Deng, J. Song, R. Sun, et al., "Visual Analysis System for Cigarette Market Data Based on Heatmap," Tobacco Science & Technology, vol. 49, no. 12, pp. 91-97, 2016.

[3] C. Yan and C. Wang, "Development and Application of Convolutional Neural Network Models," Journal of Frontiers of Computer Science and Technology, vol. 15, no. 1, pp. 27-46, 2021.

[4] X. Ding, "Research on the Application of BP Neural Networks and Convolutional Neural Networks in Text Recognition," Ph.D. dissertation, Huazhong University of Science and Technology, Wuhan, China, 2017.

[5] K. Fang, J. Wu, J. Zhu, et al., "A Review of Random Forest Methods," Statistics & Information Forum, vol. 26, no. 3, pp. 32-38, 2011.

Downloads

Published

2025-10-29

Issue

Section

Articles

How to Cite

Li, Xinyuan, Zeyu Yang, Lei Yue, and Tianyu Zhao. 2025. “A Multimodal Machine Learning Framework for Box Office Prediction Using CNN-Based Trailer Analysis and Random Forest Models”. Scientific Journal of Intelligent Systems Research 7 (10): 127-39. https://doi.org/10.54691/18pwh024.