We missed a requirement: each metrics file should contain the data sample count for its respective operation. We can use any key names we like, so long as they're known at configuration time.
For example: "Training samples", "Evaluation samples", "Validation samples", etc.
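A rough sketch of how this could look, assuming the metrics files are JSON and that each operation has a short identifier. The `SAMPLE_COUNT_KEYS` mapping and the `add_sample_count` and `write_metrics` helpers are hypothetical names for illustration, not existing code:

```python
import json
from pathlib import Path

# Hypothetical mapping from operation identifier to the key written into
# that operation's metrics file. Assumption: the key names are fixed here,
# at configuration time, and not derived at run time.
SAMPLE_COUNT_KEYS = {
    "train": "Training samples",
    "eval": "Evaluation samples",
    "validate": "Validation samples",
}


def add_sample_count(metrics, operation, num_samples):
    """Return a copy of `metrics` with the sample count for `operation` added."""
    # Raises KeyError if the operation has no configured key.
    key = SAMPLE_COUNT_KEYS[operation]
    updated = dict(metrics)
    updated[key] = num_samples
    return updated


def write_metrics(path, metrics):
    """Write the metrics dict to `path` as JSON (assumed file format)."""
    Path(path).write_text(json.dumps(metrics, indent=2))
```

Each operation would call `add_sample_count` with its own identifier just before writing its metrics file, so the count always lands under the configured key.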