
feat(tools): Add Zipfian cache testing tool #640

Merged · 6 commits · Jan 7, 2023

Conversation

devangjhabakh (Contributor):

Signed-off-by: Ubuntu [email protected]

Adds a cache testing tool that produces values according to a Zipfian distribution, as per issue #253.

parser.add_argument('-c', '--count', type=int, default=100000, help='total number of incrby operations')
parser.add_argument('-u', '--uri', type=str, default='redis://localhost:6379', help='Redis server URI')
parser.add_argument('-a', '--alpha', type=float, default=1.0, help='alpha value used for the Zipf distribution')
parser.add_argument('-n', '--number', type=int, default=30, help='the number of values to be used in the distribution')
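
For readers following along, here is a minimal sketch (not part of the diff) of how the parsed arguments might feed a client; it continues the argparse snippet above, and redis.Redis.from_url is the standard redis-py way to build a client from a URI:

    import redis

    args = parser.parse_args()
    # Build the client from the -u/--uri argument, e.g. redis://localhost:6379.
    r = redis.Redis.from_url(args.uri)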
Contributor Author:

It's worth noting here that if we really want to see the number of misses go up, we'll have to fill our cache, which would mean that we'll need a far higher number.

distMap = [x / zeta[-1] for x in zeta]

# Generate an array of uniform 0-1 pseudo-random values:
u = np.random.random(numSamples)
Collaborator:

I understand that you took the implementation from here:
https://stackoverflow.com/questions/31027739/python-custom-zipf-number-generator-performing-poorly

It's fine, but I am worried about the memory usage of this program - we will need to generate traces of hundreds of millions/billions of records to cause evictions. To solve this, we should convert rand_zipf into a generator that yields batches of np.random.random(numSamples), so we generate one batch at a time without repeating the preprocessing step on every call to rand_zipf. If you know what I am talking about, please make the adjustments; otherwise leave it as is.

Contributor Author:

I have fixed the numSamples bottleneck by rewriting the code to look something like this:

    # Calculate Zeta values from 1 to n:
    tmp = np.power( np.arange(1, n+1), -alpha )
    zeta = np.r_[0.0, np.cumsum(tmp)]

    # Store the translation map:
    distMap = [x / zeta[-1] for x in zeta]

    # Generate values from the distribution based on 0-1 pseudo-random values:

    for i in range(numSamples):
        yield np.searchsorted(distMap, np.random.random())

However, it's worth noting that there exists another significant bottleneck -- the distMap. It's likely going to be a huge data structure as well (if we really want to start seeing cache misses, that is). I think it might be a bit more challenging to remove that bottleneck, but it may be possible (I still have to spend some more time thinking about how, if at all, it can be done). Do you think I should spend time on it? Another workaround for forcing misses out of the cache is to simply limit the amount of memory the cache is allowed.
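
A minimal sketch of that memory-limit workaround, assuming redis-py and a Redis-compatible server that accepts CONFIG SET (the 256mb figure is only an example):

    import redis

    r = redis.Redis.from_url('redis://localhost:6379')

    # Cap server memory so evictions start well before the trace is exhausted.
    r.config_set('maxmemory', '256mb')
    # Evict least-recently-used keys once the cap is reached.
    r.config_set('maxmemory-policy', 'allkeys-lru')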

romange (Collaborator) commented Jan 4, 2023:

I actually thought to leave the numSamples batch as is, but instead to have:

while True:
    u = np.random.random(numSamples)
    v = np.searchsorted(distMap, u)
    samples = [t-1 for t in v]
    yield samples

and pass the pipeline length as numSamples.
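
For clarity, here is a hedged sketch of the batched generator this comment describes; rand_zipf_batched is a name chosen here for illustration:

    import numpy as np

    def rand_zipf_batched(n, alpha, num_samples):
        """Yield batches of num_samples Zipf(alpha) variates over 0..n-1, forever."""
        # The preprocessing runs once, not once per batch.
        tmp = np.power(np.arange(1, n + 1), -alpha)
        zeta = np.r_[0.0, np.cumsum(tmp)]
        dist_map = zeta / zeta[-1]

        while True:
            u = np.random.random(num_samples)
            # searchsorted maps each uniform draw to its bucket in the CDF.
            v = np.searchsorted(dist_map, u)
            yield [t - 1 for t in v]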

Collaborator:

I would not worry about distMap right now. Yes, it's big, but I think it's smaller than the total number of writes we are going to send to the server.

Contributor Author:

Ah, I understand. Shall fix accordingly!


distribution_values = rand_zipf(args.number, args.alpha, args.count)
for idx, val in enumerate(distribution_values):
    r.incrby(str(val), 1)
Collaborator:

Following my previous comment, we should use the pipelining capabilities of Redis/DF and send data in batches. Otherwise, sending a billion items will take a long time. Can you please add a pipeline length parameter p?

Finally, I see you read statistics from info(), which is fine, but now that I think of it, using incr does not bring additional value because you do not read its response to learn whether it was a hit or a miss. In that case, we should change it to set <key> <val> NX, which will allow us to increase the value length and put pressure on the server to evict items. val can be a constant string of a specified length (-d argument).
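
A sketch of what that could look like with redis-py; the key batch and payload here are illustrative, and set(..., nx=True) maps to the SET <key> <val> NX command:

    import redis

    r = redis.Redis.from_url('redis://localhost:6379')

    val = 'x' * 1024            # constant payload; the length would come from -d
    batch = [3, 1, 1, 7, 2]     # one batch of keys from the Zipf generator

    pipe = r.pipeline()
    for key in batch:
        # SET key val NX writes only when the key is absent, so the reply
        # (True on write, None otherwise) doubles as a miss/hit signal.
        pipe.set(str(key), val, nx=True)
    replies = pipe.execute()
    misses = sum(1 for reply in replies if reply)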

Contributor Author:

Can add pipelining!

I was also thinking about how incr doesn't tell us whether the item was a hit or a miss - there is no hit/miss response from redis-py, at least. I think the value-length increase sounds like a great idea; I'll change it to that!

romange (Collaborator) commented Jan 4, 2023:

Just a small correction - the value length should be constant. I did not mean to append to values, but just to use a constant val = 'x' * args.d for the "set key val" command. The thought is that with larger values it will be easier to put memory pressure on the server, and we will need fewer keys and fewer samples.

Contributor Author:

Yep!

Contributor Author:

Do we also want to support multiprocessing/multiple workers? Shall I add a -w flag for that?

Collaborator:

I think we can avoid multi-processing for now.

romange (Collaborator) commented Jan 4, 2023:

Thanks! Overall looks good! There are some comments that should allow us to make the tool more usable.

romange (Collaborator) commented Jan 4, 2023 via email:

Signed-off-by: Ubuntu <[email protected]>
devangjhabakh force-pushed the djj/cache_testing_tool branch from fcf852e to 3b50043 on January 5, 2023 at 12:57.
devangjhabakh (Contributor Author):

Hey @romange, I seem to have implemented pipelining, and it works on my end (albeit, for some reason, on my machine the pipelined version seems to run slower than the non-pipelined one). Can you take a look and see if the changes seem okay? This is my first time working with the redis pipeline, so I may have made a mistake.

total_count = 0
for idx, values in enumerate(distribution_values_generator):
    total_count += len(values)
    p = r.pipeline()
Collaborator:

add transaction=False

Contributor Author:

Oh wow, that sped things up significantly. Thanks!
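
For reference, a sketch of the adjusted loop; it continues the snippet above, with payload standing in for the constant value string:

    total_count = 0
    for idx, values in enumerate(distribution_values_generator):
        total_count += len(values)
        # transaction=False sends a plain pipelined batch instead of wrapping
        # it in MULTI/EXEC, which avoids the transaction overhead.
        p = r.pipeline(transaction=False)
        for val in values:
            p.set(str(val), payload, nx=True)
        p.execute()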

tools/cache_testing.py (outdated comment, resolved)
romange (Collaborator) left a comment:

🙏

romange merged commit f4457be into dragonflydb:main on Jan 7, 2023.