Predicting Defensive Formation Effectiveness in Football Using Random Forest and LSTM Models
DOI:
https://doi.org/10.56127/ijst.v4i3.2271Keywords:
Football Analytics , Defensive Strategy, Random Forest, LSTMAbstract
Defensive strategies are fundamental to football success, yet the evaluation of formation effectiveness often remains subjective. This study proposes a data-driven approach to predict the most effective defensive formations by integrating machine learning models. Using tracking-derived features from 150 professional European matches (2018–2023), Random Forest (RF) and Long Short-Term Memory (LSTM) models were applied to assess defensive outcomes. The results indicate that the 5-3-2 formation consistently achieved the highest predicted defensive success across direct, wing, and central attacks, followed by 4-4-2, while the 4-3-3 formation exhibited the weakest defensive stability. RF identified key static features such as line height, block width, and compactness, while LSTM captured temporal dynamics of coordinated player movements, yielding superior predictive performance. This study concludes that combining interpretable ensemble models with sequence-based neural networks offers a robust framework for tactical analysis. The findings provide actionable insights for coaches and analysts, supporting evidence-based decision-making in optimizing defensive strategies in modern football.
References
Atta Mills, E. F. E., Deng, Z., Zhong, Z., et al. (2024). Data driven prediction of soccer outcomes using enhanced machine and deep learning techniques. Journal of Big Data, 11, Article 170. https://doi.org/10.1186/s40537-024-01008-2 Journal of Big Data
Bunker, R., & Susnjak, T. (2019). The application of machine learning techniques for predicting results in team sport: A review [Preprint]. arXiv. https://doi.org/10.48550/arXiv.1912.11762 arXiv+1
Elstak, I. (2024). A case study on player selection and team formation in football with machine learning. Turkish Journal of Electrical Engineering & Computer Sciences, 29(3). https://doi.org/10.3906/elk-2308-49 Tandfonline
Forcher, L., Beckmann, T., Wohak, O., Romeike, C., Graf, F., & Altmann, S. (2023). Prediction of defensive success in elite soccer using machine learning: Tactical analysis of defensive play using tracking data and explainable AI. Science and Medicine in Football, 0(0), 1–16. https://doi.org/10.1080/24733938.2023.2239766 PMC
Moya, D., Tipantuña, C., Villa, G., Calderón-Hinojosa, X., Rivadeneira, B., & Álvarez, R. (2025). Machine learning applied to professional football: Performance improvement and results prediction. Machine Learning and Knowledge Extraction, 7(3), 85. https://doi.org/10.3390/make7030085 MDPI
Narizuka, T., & Yamazaki, Y. (2019). Clustering algorithm for formations in football games. Scientific Reports, 9, Article 13172. https://doi.org/10.1038/s41598-019-48623-5 PMC+1
Teixeira, J. E., et al. (2025). Mapping football tactical behavior and collective dynamics using artificial intelligence: A systematic review. Sports and Active Living. https://doi.org/10.3389/fspor.2025.1569155 Frontiers+2PMC+2
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Nurdiyanto Yusuf

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.













