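The [`test_loss` computation at L459](https://github.com/aidecentralized/sonar/blob/main/src/utils/model_utils.py#L459) is not scaled correctly. Ideally, the per-batch loss should be multiplied by the batch size on this line, and the accumulated total should then be divided by the size of the dataset at the end, so that the reported value is a per-sample average even when the last batch is smaller than the others.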
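
A minimal sketch of the proposed scaling, written in plain Python rather than against the repository's actual code. It assumes the criterion returns the mean loss over each batch (as PyTorch losses do with the default `reduction='mean'`); `dataset_mean_loss` and its arguments are hypothetical names used only for illustration.

```python
def dataset_mean_loss(per_sample_losses, batch_size):
    """Per-sample average loss, accumulated from batch-mean losses.

    Hypothetical helper: `per_sample_losses` stands in for the losses a
    model would produce on a test set, and `batch_size` for the loader's
    batch size.
    """
    n = len(per_sample_losses)
    if n == 0:
        raise ValueError("dataset is empty")

    total = 0.0
    for start in range(0, n, batch_size):
        batch = per_sample_losses[start:start + batch_size]
        # What a mean-reducing criterion would return for this batch.
        batch_mean = sum(batch) / len(batch)
        # Multiply by the actual batch size to recover the batch sum.
        total += batch_mean * len(batch)

    # Divide by the dataset size once, at the end.
    return total / n


losses = [1.0, 2.0, 3.0, 4.0, 5.0]
print(dataset_mean_loss(losses, batch_size=2))
```

Weighting by `len(batch)` rather than the nominal `batch_size` matters when the final batch is short: simply averaging the three batch means here would overweight the last, single-sample batch.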