Football Analytics is a comprehensive project designed to collect, analyze, and visualize performance data for football teams and players during the Serie A 2017/18 season. The project uses a sophisticated database structure and machine learning models to provide insights into match events, player actions, and team performance.
The goal of this project is to build a system that allows for detailed analysis of football events, focusing on Serie A 2017/18 matches. By utilizing Kaggle's Soccer Match Event Dataset and additional data from FBref via web scraping, this project offers:
- Data Preprocessing: Cleaning and transforming raw football data.
- Database Design: Creating a robust database to store match events, player stats, and more.
- Analysis: Detailed analysis of events like passes, shots, and actions during the match.
- Visualization: Interactive visualizations of player movements, actions, and performance metrics.
- Languages: Python
- Libraries: Pandas, NumPy, Scikit-learn, mplsoccer, Electron
- Database: MongoDB (NoSQL)
- Data Sources: Kaggle, FBref
- Machine Learning Models: Not specified in the current document
- Visualizations: mplsoccer for football-specific data visualizations
Football_Analytics/
βββ code/ β Source code and implementation files
β
βββ dataset.r β Compressed dataset (ZIP file, includes all relevant datasets)
β
βββ docs/ β Documentation
β βββ Football_Analytics.pptx β PowerPoint presentation about the project
β βββ Football_Analytics_RAD.pdf β Requirement Analysis Document
β βββ Project_Documentation_Football_Analytics.pdf β Final project documentation
β
βββ README.md β Project documentation (this file)
-
Clone the repository:
git clone https://github.com/Marco210210/Football-Analytics.git
-
Download the dataset from Kaggle or via the provided link in the
dataset/directory. The dataset is not included in this repository due to size limitations. -
Explore the code in the
code/directory for the data preprocessing and analysis scripts. -
The
Football_Analytics.pptxfile provides an overview of the project, while the PDF documents indocs/offer detailed documentation and requirement analysis.
- Arcangeli Giovanni
- Ciancio Vittorio
- Di Maio Marco
This project is licensed under the CC BY-NC-SA 4.0 License
You may share and adapt this work for non-commercial purposes only, as long as you give appropriate credit and distribute your contributions under the same license.
For commercial use, explicit permission from the authors is required.
