Best fit matching for commodity names #15

Lknechtli · 2015-01-11T21:41:51Z

A best fit match to commodity names when OCRing the markets would make it much more accurate - for instance, I've seen numbers show up in commodity names. Restricting it to letters for the names, and numbers for prices etc would reduce the number of errors.

stringandstickytape · 2015-01-11T22:05:01Z

It does do Levenshtein similarity tests between commodity names. Maybe that list needs expanding or the algorithm needs to be more tolerant of differences on longer names. Do you have any specific examples?

Lknechtli · 2015-01-11T22:11:20Z

stringandstickytape · 2015-01-11T22:56:48Z

Weird. These are all in the hard-coded list of commodity names, and I've just run them through MRmP's tweaked levenshtein algorithm - they all got autocorrected properly.

You can either:
a) try deleting your autosave.csv and retrying the screenshots - i wonder if it is related to existing commodity names in your autosave.csv, or
b) post a link to a zip file containing your autosave.csv, calibration.txt and (say) the HYDROGEN FUEL Curie Gateway screenshot, so I can try on this side.

It seems to be working fine for me - but I regularly delete my autosave.csv so that could be relevant :/

Lknechtli · 2015-01-11T23:02:02Z

uploaded them to onedrive:
http://1drv.ms/1FILRi9

stringandstickytape · 2015-01-11T23:59:33Z

Awesome. This has pointed me to a HUGE HOLE in the Levenshtein algorithm, namely: it doesn't work. I have swapped in one that does and the problem is fixed. V1.73 includes the fix.

Note that for two of the commodity names, "FRUIT AND VEGETABLES" and one other, it will still ask you to correct it. This is because you have (something like) "FRUTT AND VEGETABLES" and it can't decide if the correct answer is that or "FRUIT AND VEGETABLES" because they both look like good candidates.

The fix is to remove the bad commodity names from your AutoSave.csv - then this problem will go away. A commodity rename feature, like the station rename feature, is issue #17...

Lknechtli · 2015-01-12T01:17:56Z

Awesome. Seems to be working much better now.

stringandstickytape closed this as completed Jan 11, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Best fit matching for commodity names #15

Best fit matching for commodity names #15

Lknechtli commented Jan 11, 2015

stringandstickytape commented Jan 11, 2015

Lknechtli commented Jan 11, 2015

stringandstickytape commented Jan 11, 2015

Lknechtli commented Jan 11, 2015

stringandstickytape commented Jan 11, 2015

Lknechtli commented Jan 12, 2015

Best fit matching for commodity names #15

Best fit matching for commodity names #15

Comments

Lknechtli commented Jan 11, 2015

stringandstickytape commented Jan 11, 2015

Lknechtli commented Jan 11, 2015

stringandstickytape commented Jan 11, 2015

Lknechtli commented Jan 11, 2015

stringandstickytape commented Jan 11, 2015

Lknechtli commented Jan 12, 2015