Skip to content

Requirements on window size and block size #27

@chmduquesne

Description

@chmduquesne

I can't post in https://groups.google.com/g/perkeep/c/8hNJ1_tmcmE/m/OHB0v46CCAAJ?pli=1, but there is an interesting discussion to have on the slicing parameters related to the wished chunk size.

Everybody understands that if you stop when the n most significant bits of the checksum of your rolling window are all the same, you will cut on average every 2^n bytes.

What is however rarely discussed is: is there a point where you should slice even though you have not met such a window? Imagine somebody creating input data that is specially designed to break your checksum, for example.

If anyone is interested, I discussed this in https://github.com/chmduquesne/opi/blob/master/doc/maxchunksize.md. Feel free to have a look!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions