Skip to content
This repository has been archived by the owner on Sep 18, 2024. It is now read-only.

Proposal: a more typing-friendly experiment id pattern #3102

Open
ultmaster opened this issue Nov 19, 2020 · 0 comments
Open

Proposal: a more typing-friendly experiment id pattern #3102

ultmaster opened this issue Nov 19, 2020 · 0 comments
Labels
Milestone

Comments

@ultmaster
Copy link
Contributor

ultmaster commented Nov 19, 2020

As proposed in the last meeting, I'm hoping to have a more typing-friendly experiment id pattern instead of the current "random-uppercase-lowercase-digits". We can use the concatenation of two short English words. Here are a few examples.

joint-drake        2020-11-18 01:17  (a day ago)
worthy-burro       2020-11-18 02:07  (a day ago)
safe-kitten        2020-11-18 02:15  (a day ago)
mint-lamb          2020-11-18 02:25  (a day ago)
fresh-mullet       2020-11-18 02:28  (a day ago)
amused-louse       2020-11-18 02:32  (a day ago)
cool-gannet        2020-11-18 02:34  (a day ago)
funny-tomcat       2020-11-18 09:42  (a day ago)
prime-rhino        2020-11-18 09:52  (a day ago)
eager-newt         2020-11-18 10:00  (a day ago)
crack-dragon       2020-11-18 10:47  (a day ago)
nearby-jennet      2020-11-18 11:08  (a day ago)

How to generate random English words

We can use faker.

Pros

  • It's much easier for human to read and type an English words rather than something random.
  • It saves the effort to use monospace font to display experiment IDs.

Cons

  • The collision likelyhood might be higher than before. Might not work on trials (because we can easily have thousands of trials). We can combine with some other patterns, such as digits or collision detection and retrying.

Collision likelyhood analysis

According to the reddit post, there are over 20,000 English words with length shorter than 6:

image

So basically we have more than 400,000,000 combinations. According to birthday paradox, we need ~2836 experiments for a user to be 1% likely to have a collision. ~895 to be a 0.1% likely. It is almost impossible to have thousands of experiments on one single machine. Besides, we can just crash the experiments and retry if that actually happens.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

2 participants