LM Similarity returns all zero scores #15397

keikha · 2015-12-11T16:26:29Z

When I use the LMDirichlet similarity I get back are all zero scores. Here is the steps to reproduce the problem:

Created the index:

{ "settings": { "similarity": { "LMSimilarity": { "type": "LMDirichlet", "mu": 2500 } } }, "mappings": { "item": { "properties": { "title": { "type": "string", "similarity": "LMSimilarity" } } } } }

Indexed two documents:

{"title":"This is a test for search similarity when we search by other search options."}
{"title”:”Search looks weird when use other search possibilities. Numbers are not clear. Just adding new stuff to make the document longer. Document norm looks weird."}

Run a simple query:

{ "explain": "true", "query": { "match": { "title": "search" } } }

If you look at the returned scores, there are multiple weird numbers:

The score for all documents is zero
The collection probability, a term property that is independent from individual documents, is different for each document. I expect this number to be the same for all documents for a given term.
Document norm has a negative value, probably it's the log of another number, but I can't match these numbers to the LM formula.

clintongormley · 2016-02-14T19:12:23Z

Closing in favour of #15345

clintongormley added discuss :Search/Search Search-related issues that do not fall into other categories labels Dec 14, 2015

clintongormley closed this as completed Feb 14, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LM Similarity returns all zero scores #15397

LM Similarity returns all zero scores #15397

keikha commented Dec 11, 2015

clintongormley commented Feb 14, 2016

LM Similarity returns all zero scores #15397

LM Similarity returns all zero scores #15397

Comments

keikha commented Dec 11, 2015

clintongormley commented Feb 14, 2016