MSc Supervised Learning - Course Repository

Master's Degree in Data Science - University of Milano-Bicocca

A comprehensive collection of assignments, lecture notes, and projects from the Supervised Learning course

📚 Repository Overview

This repository contains all coursework, implementations, and research from the Supervised Learning course, including:

Weekly Assignments: Hands-on exercises covering fundamental ML concepts
Lecture Notes: Detailed notes and code examples from class sessions
Final Project: TinyNet - A custom CNN for food classification with <1M parameters

🎯 Final Project: TinyNet

Food Classification with Constrained CNN Architecture

A complete deep learning project tackling a challenging 251-class food image classification task with strict architectural constraints.

Key Achievements:

✅ Custom CNN architecture with exactly 999,675 parameters (< 1M constraint)
✅ 45.33% validation accuracy on 251 food categories
✅ Self-supervised pre-training for improved convergence
✅ Automated hyperparameter optimization with Optuna
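As a rough illustration of how a parameter budget like the one above can be checked by hand, the sketch below counts weights and biases for convolutional and linear layers. The layer stack is hypothetical and much smaller than TinyNet; only the counting formulas and the 1M budget are meant to carry over.

```python
def conv2d_params(c_in, c_out, k, bias=True):
    """Parameters of a k x k convolution: one k*k*c_in filter (+ bias) per output channel."""
    return (k * k * c_in + (1 if bias else 0)) * c_out


def linear_params(n_in, n_out, bias=True):
    """Parameters of a fully connected layer: n_in weights (+ bias) per output unit."""
    return (n_in + (1 if bias else 0)) * n_out


# Hypothetical layer stack, NOT the actual TinyNet architecture.
layers = [
    ("conv1", conv2d_params(3, 32, 3)),
    ("conv2", conv2d_params(32, 64, 3)),
    ("conv3", conv2d_params(64, 128, 3)),
    ("fc", linear_params(128, 251)),  # 251 output classes, as in the project
]

BUDGET = 1_000_000
total = sum(count for _, count in layers)

for name, count in layers:
    print(f"{name:>6}: {count:>8,}")
print(f" total: {total:>8,} (within budget: {total < BUDGET})")
```

For an actual PyTorch model, the same total is usually obtained with `sum(p.numel() for p in model.parameters() if p.requires_grad)`.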

Techniques Implemented:

Convolutional Neural Networks (CNNs) with GELU activations
Self-Supervised Learning (SSL) via image reconstruction
Hyperparameter tuning with pruning strategies
Advanced data augmentation preserving food characteristics
Transfer learning from the self-supervised pre-trained encoder
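The combination of reconstruction-based pre-training and encoder transfer listed above can be sketched roughly as follows. This is a minimal two-stage example under assumed layer sizes: the `Encoder` and `Decoder` classes, the 32x32 random input, and the single optimization step are illustrative placeholders, not the project's actual code.

```python
import torch
from torch import nn


class Encoder(nn.Module):
    """Small convolutional encoder (hypothetical sizes)."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.GELU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.GELU(),
        )

    def forward(self, x):
        return self.net(x)


class Decoder(nn.Module):
    """Mirror of the encoder, used only during pre-training."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.GELU(),
            nn.ConvTranspose2d(16, 3, 4, stride=2, padding=1), nn.Sigmoid(),
        )

    def forward(self, x):
        return self.net(x)


torch.manual_seed(0)
images = torch.rand(8, 3, 32, 32)  # stand-in for a batch of unlabeled images

# Stage 1: self-supervised pre-training by reconstructing the input.
encoder, decoder = Encoder(), Decoder()
params = list(encoder.parameters()) + list(decoder.parameters())
optimizer = torch.optim.Adam(params, lr=1e-3)

recon = decoder(encoder(images))
loss = nn.functional.mse_loss(recon, images)
optimizer.zero_grad()
loss.backward()
optimizer.step()

# Stage 2: drop the decoder and reuse the encoder weights in a classifier.
classifier = nn.Sequential(
    encoder,
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(32, 251),  # 251 food categories
)
logits = classifier(images)
print(recon.shape, logits.shape)
```

The decoder exists only to provide a training signal. Once pre-training ends, the classifier starts from encoder weights that already capture low-level image structure.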

📂 Project Location: Final_Project/

📖 Documentation:

Comprehensive README - Setup, usage, and results
Architecture Details - In-depth technical breakdown
Project Report (PDF) - Full academic paper

📝 Course Assignments

The Assignments/ directory contains weekly exercises covering:

Topics Covered

Linear Regression - Least squares, regularization (Ridge, Lasso)
Logistic Regression - Binary and multi-class classification
Support Vector Machines - Kernel methods, margin optimization
Decision Trees - CART, pruning, ensemble methods
Neural Networks - Backpropagation, activation functions
Deep Learning - CNNs, batch normalization, dropout
Model Selection - Cross-validation, hyperparameter tuning
Ensemble Methods - Bagging, boosting, random forests
Dimensionality Reduction - PCA, feature selection
Evaluation Metrics - Confusion matrix, ROC curves, F1-score

Each assignment includes:

Problem statements
Implementation in Python/PyTorch
Analysis and results
Visualizations

📖 Lecture Notes

The Lessons_notes/ directory contains organized notes from each lecture:

Lessons_notes/
├── L01/ - Introduction to Supervised Learning
├── L02/ - Linear Models
├── L03/ - Regularization Techniques
├── L04/ - Classification Fundamentals
├── L05/ - Support Vector Machines
├── L06/ - Kernel Methods
├── L07/ - Decision Trees
├── L08/ - Ensemble Methods
├── L09/ - Neural Networks Basics
├── L10/ - Deep Learning
├── L11/ - Convolutional Networks
├── L12/ - Advanced CNN Architectures
└── L13/ - Self-Supervised Learning

Notes include:

Theoretical concepts with mathematical derivations
Code implementations and examples
Visualizations and diagrams
References to key papers

🛠️ Technologies Used

Core Libraries

PyTorch: Deep learning framework for neural network implementation
scikit-learn: Classical ML algorithms and utilities
NumPy: Numerical computing
Pandas: Data manipulation and analysis
Matplotlib/Seaborn: Data visualization

Specialized Tools

Optuna: Hyperparameter optimization framework
TorchMetrics: Evaluation metrics for PyTorch
OpenCV: Image processing for computer vision
TensorBoard: Training visualization and monitoring

📊 Repository Structure

MSc_Supervised_Learning/
│
├── Final_Project/                    # Main project - TinyNet
│   ├── README.md                     # Comprehensive documentation
│   ├── ARCHITECTURE.md               # Technical architecture details
│   ├── main.py                       # Main training script
│   ├── htuning.py                    # Hyperparameter tuning
│   ├── pickles/                      # Training metrics and results
│   └── Supervised_Learning__Final_project_.pdf
│
├── Assignments/                      # Weekly coursework
│   ├── Assignment_01/
│   ├── Assignment_02/
│   └── ...
│
├── Lessons_notes/                    # Lecture materials
│   ├── L01/ through L13/
│   └── Additional resources
│
├── .gitignore
└── README.md                         # This file

🚀 Getting Started

Prerequisites

Python 3.8+
CUDA-capable GPU (recommended for Final Project)

Installation

Clone the repository

git clone <repository-url>
cd MSc_Supervised_Learning

Set up virtual environment

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies

For the Final Project:

cd Final_Project
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
pip install pandas numpy matplotlib seaborn pillow scikit-learn
pip install tqdm optuna torchsummary torchmetrics tensorboard opencv-python

For assignments:

pip install numpy pandas scikit-learn matplotlib seaborn jupyter

Running the Final Project

cd Final_Project  # from the repository root; skip if already inside

# Train TinyNet from scratch
python main.py

# Run hyperparameter optimization
python htuning.py

# View results in TensorBoard
tensorboard --logdir=runs/

Detailed instructions are available in Final_Project/README.md.

📈 Key Results

Final Project Performance

Metric	Value	Context
Validation Accuracy	45.33%	251 food categories
F1-Score (micro)	0.4533	Equals accuracy for single-label classification
Model Parameters	999,675	< 1M constraint ✓
Training Time	~3 hours	RTX 3080, 150 epochs
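The micro-averaged F1 score matches the accuracy in the table because, in single-label multi-class classification, every misclassified sample counts once as a false positive (for the predicted class) and once as a false negative (for the true class). A small sketch with made-up labels shows the identity:

```python
def micro_f1(y_true, y_pred):
    """Micro-averaged F1: pool TP, FP, FN over all classes, then compute F1."""
    labels = set(y_true) | set(y_pred)
    tp = fp = fn = 0
    for c in labels:
        for t, p in zip(y_true, y_pred):
            tp += (t == c and p == c)
            fp += (t != c and p == c)
            fn += (t == c and p != c)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up labels for three classes, for illustration only.
y_true = [0, 1, 2, 2, 1, 0, 2, 1]
y_pred = [0, 2, 2, 2, 1, 1, 2, 0]

print(micro_f1(y_true, y_pred), accuracy(y_true, y_pred))
```

A macro-averaged F1, which weights every class equally, would be the more informative companion metric when class frequencies differ.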

Model Variants

Configuration	Accuracy	Notes
TinyNet + SSL	45.33%	Best overall
TinyNet Baseline	45.31%	Strong baseline
Tuned + SSL	43.93%	Faster convergence
Tuned Only	43.83%	Different optimum

🎓 Learning Outcomes

Through this course and project, I developed expertise in:

Classical ML: Strong foundation in traditional supervised learning algorithms
Deep Learning: Hands-on experience with CNN architectures and training
Model Optimization: Hyperparameter tuning, regularization, and convergence strategies
Research Skills: Literature review, experimentation, and technical writing
Software Engineering: Clean code, version control, and reproducible research
Problem Solving: Working within constraints, debugging, and iterative improvement

📚 References

Course Materials

Lecture slides and notes (included in Lessons_notes/)
Recommended textbooks:
- Pattern Recognition and Machine Learning - Bishop
- Deep Learning - Goodfellow, Bengio, Courville
- Hands-On Machine Learning - Géron

Final Project References

Krizhevsky et al. (2012) - AlexNet
Simonyan & Zisserman (2015) - VGG
Ronneberger et al. (2015) - U-Net
Hendrycks & Gimpel (2023) - GELU
Akiba et al. (2019) - Optuna

See Final_Project/README.md for complete bibliography.

👥 Authors

Students: Mirko Morello (920601), Andrea Borghesi (916202)
Institution: University of Milano-Bicocca
Program: MSc in Data Science
Course: Supervised Learning
Academic Year: 2024-2025

📜 License

This repository contains academic coursework and is intended for educational purposes. Please respect academic integrity policies if referencing this work.

🙏 Acknowledgments

Instructors: For comprehensive course materials and guidance
Teaching Assistants: For support during assignments
PyTorch Community: For excellent documentation and examples
Optuna Team: For powerful hyperparameter optimization tools

⭐ If you found this repository helpful, please consider giving it a star! ⭐

For questions about the Final Project, see the project README
