Support one_to_one in ML Inference Search Response Processor #2801
Conversation
We need to bump the version in the build.gradle file to let the CI run.
```diff
} catch (Exception e) {
    if (ignoreFailure) {
        responseListener.onResponse(response);
    } else {
-       responseListener.onFailure(e);
+       responseListener.onFailure(new RuntimeException(e.getMessage()));
    }
}
```
Why wrap the original exception in a RuntimeException?
To avoid a 5xx error.
Why are we getting a 5xx error in the first place?
I suggest using OpenSearchStatusException instead of RuntimeException.
I don't see a 5xx exception; this is a precaution. Changed to OpenSearchStatusException, similar to:

```java
throw new OpenSearchStatusException("Failed to search index " + indexName, RestStatus.BAD_REQUEST);
```

Added in commit: use OpenSearchStatusException in error handling.
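To make the reviewers' point concrete, here is a minimal, self-contained sketch of the pattern being discussed: converting an arbitrary caught exception into an exception that carries an explicit client-error status, so it is not reported as a generic 500. `StatusException` and the `400` status below are illustrative stand-ins for OpenSearch's `OpenSearchStatusException` and `RestStatus.BAD_REQUEST`, which require the OpenSearch dependency.

```java
// Stand-in for OpenSearchStatusException: a RuntimeException that
// carries an explicit HTTP status code for the error response.
class StatusException extends RuntimeException {
    final int status;

    StatusException(String message, int status) {
        super(message);
        this.status = status;
    }
}

class ErrorMappingSketch {
    // Mirrors the review suggestion: instead of wrapping the cause in a
    // bare RuntimeException (surfaced as a 500), attach a client-error
    // status so the caller sees a 4xx with the original message.
    static StatusException toStatusException(Exception cause) {
        return new StatusException("Failed to process response: " + cause.getMessage(), 400);
    }
}
```

In the real processor the failure is passed to `responseListener.onFailure(...)` rather than thrown, but the status-mapping idea is the same.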
Thanks for the explanation. IMO,
So we already released this one_to_one inference in 2.16? Where do we save this setting?
Ah, it should be 2.17; yes, this change is going into 2.17. I just updated the description.
In the ML search response processor, the setting has existed since 2.16, but turning it on (setting it to true) was not yet supported.
Can we change the setting's name, or do we need to stick with it?
The name has been there since 2.16; if we change it in 2.17, it might cause confusion.
All tests passed.
* add one document to one prediction support
* rephrase javadoc
* use OpenSearchStatusException in error handling
* fix message
* add more tests
* handle different exceptions properly

Signed-off-by: Mingshi Liu <[email protected]>
(cherry picked from commit 2a33c65)
Description
Many-documents-to-one-prediction is the default configuration in the ML Inference Search Response Processor. This PR adds one-document-to-one-prediction support, following up on #2688.

In 2.16, the ML inference search response processor collects documents into a single list of model inputs and makes one prediction call.

For example, suppose the search response returns two documents and we want to send the `text` field of each document as model input. With `one_to_one` set to `false`, the processor combines the two documents containing the `text` field into `["this is document 1", "this is document 2"]`, and this is one round of prediction.

In 2.17, we support setting `one_to_one` to `true`, which makes two rounds of prediction: the first sends `"this is document 1"` and the second sends `"this is document 2"`.

When users want to use a model that accepts a list of inputs (for example, an OpenAI embedding model), set `one_to_one` to `false`. When users want to use a model that accepts a single string input (for example, a Bedrock embedding model), set `one_to_one` to `true`. The most common use case for one-to-one inference is reranking with XGBoost, which typically takes a single document, compares it with the search string, and returns a single score.
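As a rough illustration of how the setting described above would appear in a search pipeline definition, here is a sketch of an `ml_inference` response processor configuration. The model ID and the field names in `input_map`/`output_map` are placeholders for illustration, not values from this PR.

```json
{
  "response_processors": [
    {
      "ml_inference": {
        "model_id": "<model_id>",
        "input_map": [{ "inputs": "text" }],
        "output_map": [{ "embedding": "data" }],
        "one_to_one": true
      }
    }
  ]
}
```

With `"one_to_one": true`, each returned document's `text` field is sent as its own prediction call; with `false` (the default), all matching fields are batched into one list and a single call is made.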
Related Issues
#2173
#2444
#2839
#2879
Check List
- Commits are signed per the DCO using `--signoff`.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following the Developer Certificate of Origin and signing off your commits, please check here.