[Core] Simplify duplicate feature detection #1602

mpkorstanje · 2019-04-07T10:15:28Z

Summary

Between #165 and #259 Cucumber started ignoring duplicate features by
taking the MD5 sum of each newly parsed feature and comparing it against
the sums already seen. Replacing the set of MD5 sums with a map of source
to features simplifies the code and allows duplicates to be logged as
warnings.
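
To illustrate the idea, here is a minimal sketch of duplicate detection with a map keyed by feature source. It is not the code from this PR: `Feature`, `DuplicateFeatureSketch`, `add` and the warning text are hypothetical names made up for the example, and warnings are collected in a list rather than sent to a logger.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class DuplicateFeatureSketch {

    // Hypothetical stand-in for a parsed feature; not the real CucumberFeature.
    static final class Feature {
        final String uri;
        final String source;

        Feature(String uri, String source) {
            this.uri = uri;
            this.source = source;
        }
    }

    // Keyed by the feature source, so String.hashCode and String.equals
    // decide whether two features are duplicates.
    private final Map<String, Feature> featuresBySource = new LinkedHashMap<>();
    private final List<String> warnings = new ArrayList<>();

    void add(Feature feature) {
        // putIfAbsent returns the feature already stored for this source, or null.
        Feature existing = featuresBySource.putIfAbsent(feature.source, feature);
        if (existing != null) {
            warnings.add("Duplicate feature ignored: " + feature.uri
                    + " has the same source as " + existing.uri);
        }
    }

    List<Feature> features() {
        return new ArrayList<>(featuresBySource.values());
    }

    List<String> warnings() {
        return warnings;
    }

    public static void main(String[] args) {
        DuplicateFeatureSketch sketch = new DuplicateFeatureSketch();
        sketch.add(new Feature("a.feature", "Feature: A"));
        sketch.add(new Feature("copy-of-a.feature", "Feature: A"));
        sketch.warnings().forEach(System.out::println);
    }
}
```

Because the first feature seen for a given source is kept in the map, the warning can name both the ignored file and the one it duplicates, which a bare set of MD5 sums could not do.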

I couldn't discover any good reason to use an MD5 hash over Java's
hashCode and equals.

1) Memory consumption doesn't seem to be a problem. CucumberFeature
already keeps a reference to the original source.
2) Collision doesn't appear to be a problem. hashCode produces a 32-bit
hash. So by the birthday paradox math we'd need approximately 9000
feature files for a 1% chance of collision.
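
The estimate can be checked with the usual birthday approximation, p ≈ 1 - exp(-n(n-1) / 2N) with N = 2^32. The sketch below assumes hash values are uniformly distributed, which String.hashCode does not guarantee; `BirthdayBound` and its methods are hypothetical names used only for this calculation.

```java
public class BirthdayBound {

    // Approximate probability that at least two of n items share a hash
    // value when hashes are uniform over 2^bits possible values.
    static double collisionProbability(long n, int bits) {
        double space = Math.pow(2, bits);
        double exponent = (double) n * (n - 1) / (2 * space);
        return -Math.expm1(-exponent); // 1 - exp(-exponent)
    }

    // Approximate number of items needed to reach collision probability p,
    // obtained by inverting the formula above with n(n-1) taken as n^2.
    static long itemsForProbability(double p, int bits) {
        double space = Math.pow(2, bits);
        return (long) Math.ceil(Math.sqrt(-2 * space * Math.log1p(-p)));
    }

    public static void main(String[] args) {
        System.out.println(collisionProbability(9000, 32));
        System.out.println(itemsForProbability(0.01, 32));
    }
}
```

Under these assumptions 9000 feature files give a collision probability just under 1%, and the 1% mark is reached at roughly 9300 files, which agrees with the figure above.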

Types of changes

Bug fix (non-breaking change which fixes an issue).
New feature (non-breaking change which adds functionality).
Breaking change (fix or feature that would cause existing functionality to not work as expected).

Checklist:

I've added tests for my code.
My change requires a change to the documentation.
I have updated the documentation accordingly.

coveralls · 2019-04-07T10:28:53Z

Coverage increased (+0.02%) to 86.207% when pulling 350d99d on simplify-duplicate-feature-detection into 0caf37c on master.

mpkorstanje merged commit b6a58ea into master Apr 8, 2019

mpkorstanje deleted the simplify-duplicate-feature-detection branch August 2, 2019 13:19
