Feature request: Generate data using markov chains #1

cfcs · 2016-03-07T21:16:03Z

It would be nice to use markov chains or similar to produce data of different patterns / "similarity" for use with benchmarking compression and deduplication.

Candidates include:

file/directory names
data blocks / "segments"
directory depth / structure

cfcs · 2016-03-07T21:22:30Z

Speaking of which, there should be test cases for weird path elements like long names, funny charset encodings, broken charset encodings, etc. I suspect some of the rsync-based tools might have a hard time with files and folders containing special characters.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: Generate data using markov chains #1

Feature request: Generate data using markov chains #1

cfcs commented Mar 7, 2016

cfcs commented Mar 7, 2016

Feature request: Generate data using markov chains #1

Feature request: Generate data using markov chains #1

Comments

cfcs commented Mar 7, 2016

cfcs commented Mar 7, 2016