-
Notifications
You must be signed in to change notification settings - Fork 4
Open
Description
I can't post in https://groups.google.com/g/perkeep/c/8hNJ1_tmcmE/m/OHB0v46CCAAJ?pli=1, but there is an interesting discussion to have on the slicing parameters related to the wished chunk size.
Everybody understands that if you stop when the n most significant bits of the checksum of your rolling window are all the same, you will cut on average every 2^n bytes.
What is however rarely discussed is: is there a point where you should slice even though you have not met such a window? Imagine somebody creating input data that is specially designed to break your checksum, for example.
If anyone is interested, I discussed this in https://github.com/chmduquesne/opi/blob/master/doc/maxchunksize.md. Feel free to have a look!
Metadata
Metadata
Assignees
Labels
No labels