This repository has been archived by the owner on Jan 26, 2025. It is now read-only.

Unfair to skip questions in test dataset #56

Open
xhuang31 opened this issue Jul 11, 2018 · 5 comments

@xhuang31

In augment_process_dataset.py, some questions in the test dataset are skipped. It is unfair to use the resulting numbers for comparison with the state of the art; nothing in the test data should be changed.

@Impavidity
Member

@xhuang31 Thank you so much for your comments! I rechecked the code and found that we indeed skipped some questions for the whole dataset, regardless of their source. I recalculated and found that we ignore 54 (0.2%) of the questions in the test data. We will change the denominator when calculating the final results.
Thank you again for spotting this!
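The denominator correction described above can be sketched as follows. This is a hedged illustration, not code from the repository: the test-set size of 21,687 assumes the standard SimpleQuestions split, and the correct-answer count is a made-up number for demonstration; only the 54 skipped questions come from this thread.

```python
def accuracy(num_correct, num_evaluated, num_skipped):
    """Count skipped test questions as wrong by including them in the denominator."""
    return num_correct / (num_evaluated + num_skipped)

# Hypothetical example (16000 correct answers is an assumed figure):
NUM_TEST = 21687          # assumed SimpleQuestions test-set size
NUM_SKIPPED = 54          # skipped questions reported in this thread
num_evaluated = NUM_TEST - NUM_SKIPPED

biased = 16000 / num_evaluated              # denominator excludes skipped questions
fair = accuracy(16000, num_evaluated, NUM_SKIPPED)  # denominator is the full test set
```

The fair score is always slightly lower than the biased one, since the skipped questions now count against the system.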

@fwzlaughing

Do you know of any method that could prevent this problem (i.e., keep these dropped questions)? Does it require a larger KB? I am a beginner in this area and have little related knowledge. Looking forward to your reply, thanks!

@Impavidity
Member

@fwzlaughing Hi, I just pushed a quick fix here: 73b5a42.

@fwzlaughing

Why can't some entities' names be found in FB5M.name.txt? Is there a complete dataset that contains all the entity names in FB2M?

@Impavidity
Member

@fwzlaughing FB5M is a subset of Freebase. If you are interested in the full Freebase, you can download the dump from https://developers.google.com/freebase/
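One way to avoid dropping questions whose entities are missing from FB5M.name.txt is to fall back to the raw MID instead of skipping. The sketch below is an assumption, not the repository's code: the "one tab-separated `mid<TAB>name` pair per line" file format and the function names are hypothetical.

```python
def load_names(path):
    """Load an entity-name table, assuming one tab-separated mid<TAB>name pair per line."""
    names = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip("\n").split("\t")
            if len(parts) >= 2:
                names[parts[0]] = parts[1]
    return names

def get_name(names, mid):
    # Fall back to the raw MID when the name is absent,
    # so the question can still be kept in the dataset.
    return names.get(mid, mid)
```

With a fallback like this the missing names degrade gracefully rather than forcing the question to be skipped.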
