This repository contains a data analysis and statistical modeling project focused on social, economic, and environmental datasets.
The project aims to explore multiple variables and build statistical models (mainly linear regressions) to understand correlations, predict behaviors, and identify significant patterns across different territorial indicators.
dataset/β Folder containing the 10.csvdatasets used in the analysis.docs/- Project Description (PDF) β Project guidelines and specifications.
- Final Report (PDF) β Detailed report including statistical models, interpretations, and results.
codice_analisi_gruppo9.Rβ Complete R script for data loading, statistical analysis, modeling, and visualization.presentazione_gruppo_9.pptxβ Final presentation summarizing key findings and conclusions.
- R (statistical programming language)
- RStudio (recommended IDE)
- Main R libraries:
ggplot2,corrplot,stats,car,psych, among others.
- Clone or download this repository.
- Open
codice_analisi_gruppo9.Rwith RStudio. - Ensure the
dataset/folder is present in the working directory. - Run the script to reproduce the data analysis, statistical models, and visualizations.
The datasets are already cleaned and ready for analysis.
- Multivariate data analysis from real-world social and environmental datasets.
- Construction and comparison of linear regression models.
- Assessment of model performance using statistical metrics.
- Data visualization to support interpretation and insights.
- Alessandro Ambrosone
- Ciancio Vittorio
- Marco Di Maio
- Antonio Giorgio
This project is licensed under the CC BY-NC-SA 4.0 License
You may share and adapt this work for non-commercial purposes only, as long as you give appropriate credit and distribute your contributions under the same license.
For commercial use, explicit permission from the authors is required.
