You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It would be nice to use markov chains or similar to produce data of different patterns / "similarity" for use with benchmarking compression and deduplication.
Candidates include:
file/directory names
data blocks / "segments"
directory depth / structure
The text was updated successfully, but these errors were encountered:
Speaking of which, there should be test cases for weird path elements like long names, funny charset encodings, broken charset encodings, etc. I suspect some of the rsync-based tools might have a hard time with files and folders containing special characters.
It would be nice to use markov chains or similar to produce data of different patterns / "similarity" for use with benchmarking compression and deduplication.
Candidates include:
The text was updated successfully, but these errors were encountered: