[BUG] Hybrid query returns error when `request_cache` flag is set in search request #601

martin-gaievski · 2024-02-09T19:16:53Z

What is the bug?

hybrid search query returns below error message for any search request with flag request_cache

"type": "illegal_state_exception",
"reason": "Score normalization processor cannot produce final query result"

How can one reproduce the bug?

Detailed steps added at the end of this issue, most important pieces are:

set exactly 1 shard in the index mapping
add request_cache flag to the search query and run exact same query at least twice:

POST example-index/_search?search_pipeline=nlp-search-pipeline&request_cache=true&preference=_local

What is the expected behavior?

Query should return actual result instead of error

What is your host/environment?

2.11 and latest main both have this issue

Do you have any additional context?

Issue can be avoided by setting 2+ shards for index. It's specific to 1 shard scenario

As per analysis query phase searcher is skipped in case of a second + query when request_cache is set, results are taken from local shard cache and passed to normalization processor. Probably logic for merging fetch and query results should be changed to deal with that cached data properly.

detailed steps to repro:

DELETE example-index

PUT example-index
{
  "settings": {
    "index": {
      "knn": true,
      "number_of_shards": 1,
      "number_of_replicas": 0
    }
  },
  "mappings": {
    "properties": {
      "text": {
        "type": "text"
      },
      "my_vector": {
        "type": "knn_vector",
        "dimension": 1,
        "method": {
          "name": "hnsw",
          "space_type": "innerproduct",
          "engine": "faiss"
        }
      },
      "integer": {
        "type": "integer"
      }
    }
  }
}


PUT example-index/_bulk?refresh
{"index":{"_id":"1"}}
{"text": "neural","my_vector": [1], "integer": 1 }
{"index":{"_id":"2"}}
{"text": "neural neural","my_vector": [2], "integer": 2 }
{"index":{"_id":"3"}}
{"text": "neural neural neural","my_vector": [3], "integer": 3 }
{"index":{"_id":"4"}}
{"text": "neural neural neural neural", "integer": 4 }
{"index":{"_id":"5"}}
{"my_vector": [0], "integer": 5 }


PUT /_search/pipeline/nlp-search-pipeline
{
  "description": "Post processor for hybrid search",
  "phase_results_processors": [
    {
      "normalization-processor": {
        "normalization": {
          "technique": "min_max"
        }
      }
    }
  ]
}



POST example-index/_search?search_pipeline=nlp-search-pipeline&request_cache=true&preference=_local
{
  "query": {
    "hybrid": {
      "queries": [
        {
          "term": {
            "text": "neural"
          }
        },
        {
          "knn": {
            "my_vector": {
              "vector": [
                3
              ],
              "k": 3
            }
          }
        }
      ]
    }
  },
  "size": 3
}

actual response:

{
  "error": {
    "root_cause": [],
    "type": "search_phase_execution_exception",
    "reason": "The phase has failed",
    "phase": "query",
    "grouped": true,
    "failed_shards": [],
    "caused_by": {
      "type": "illegal_state_exception",
      "reason": "Score normalization processor cannot produce final query result"
    }
  },
  "status": 500
}

The text was updated successfully, but these errors were encountered:

jmazanec15 · 2024-04-30T14:48:42Z

@martin-gaievski Im moving this one to 2.15 because I dont think a PR is up yet

martin-gaievski · 2024-08-08T18:12:55Z

it's fixed in #663, closing this issue

martin-gaievski added bug Something isn't working untriaged and removed untriaged labels Feb 9, 2024

github-actions bot added the untriaged label Feb 9, 2024

martin-gaievski removed the untriaged label Feb 9, 2024

vamshin added the v2.14.0 label Apr 4, 2024

vamshin assigned martin-gaievski Apr 4, 2024

jmazanec15 added v2.15.0 and removed v2.14.0 labels Apr 30, 2024

martin-gaievski closed this as completed Aug 8, 2024

martin-gaievski added v2.14.0 and removed v2.15.0 labels Aug 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Hybrid query returns error when `request_cache` flag is set in search request #601

[BUG] Hybrid query returns error when `request_cache` flag is set in search request #601

martin-gaievski commented Feb 9, 2024

jmazanec15 commented Apr 30, 2024

martin-gaievski commented Aug 8, 2024

[BUG] Hybrid query returns error when request_cache flag is set in search request #601

[BUG] Hybrid query returns error when request_cache flag is set in search request #601

Comments

martin-gaievski commented Feb 9, 2024

What is the bug?

How can one reproduce the bug?

What is the expected behavior?

What is your host/environment?

Do you have any additional context?

jmazanec15 commented Apr 30, 2024

martin-gaievski commented Aug 8, 2024

[BUG] Hybrid query returns error when `request_cache` flag is set in search request #601

[BUG] Hybrid query returns error when `request_cache` flag is set in search request #601