A Multimodal Machine Learning Framework for Box Office Prediction Using CNN-Based Trailer Analysis and Random Forest Models
DOI:
https://doi.org/10.54691/18pwh024Keywords:
Machine learning, convolutional neural network, random forest, data mining, computer vision, sentiment analysis, ensemble learning, multimodal analysis, prediction models, feature extraction.Abstract
This study presents a comprehensive multimodal machine learning framework for box office prediction in the Chinese film market. We develop an integrated approach combining multiple computational techniques: time series analysis for market trend visualization, matrix heatmap analysis for actor-genre correlation mapping, convolutional neural network (CNN) for quantitative sentiment analysis of movie trailers through image processing, and random forest ensemble learning for final prediction. Our methodology employs web crawling technology to collect comprehensive datasets from major film platforms. The CNN model achieves 98.21% accuracy in classifying emotional tones from trailer frames, while the random forest model demonstrates robust performance with MAPE of 10.7% on validation data. We validate our framework by predicting box office performance for three films scheduled for 2025 release, achieving relative errors below 20%. This research contributes to the intersection of computer vision, machine learning, and entertainment industry analytics, providing a scalable solution for revenue forecasting in digital media markets.
Downloads
References
[1] X. Hu, Y. Guo, M. Feng, et al., "Time Series Model Prediction and Analysis of Malaria Cases in Jiangsu Province from 1980 to 2023," Chinese Journal of Frontier Health and Quarantine, vol. 47, no. 6, pp. 582-587, 2024.
[2] C. Deng, J. Song, R. Sun, et al., "Visual Analysis System for Cigarette Market Data Based on Heatmap," Tobacco Science & Technology, vol. 49, no. 12, pp. 91-97, 2016.
[3] C. Yan and C. Wang, "Development and Application of Convolutional Neural Network Models," Journal of Frontiers of Computer Science and Technology, vol. 15, no. 1, pp. 27-46, 2021.
[4] X. Ding, "Research on the Application of BP Neural Networks and Convolutional Neural Networks in Text Recognition," Ph.D. dissertation, Huazhong University of Science and Technology, Wuhan, China, 2017.
[5] K. Fang, J. Wu, J. Zhu, et al., "A Review of Random Forest Methods," Statistics & Information Forum, vol. 26, no. 3, pp. 32-38, 2011.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Scientific Journal of Intelligent Systems Research

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.




