-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[META] [FEATURE] Add a WorkflowStep for calling external REST APIs #522
Comments
This looks great 👍 |
One more question - can we parameterize the workflow so that it could be invoked with different parameters (similar to the saved search API) ? For example a step can accept parameters such as the following
|
Each step uses an ActionFuture which it completes when it's complete. If there are cases where we may need to retry, we can use a conditional to define whether that happens. Would be helpful if you gave a specific example of an API call I can consider for that. |
This looks like something @amitgalitz proposed in #213. If so I'll address that request first. |
Adding arbitrary external RESTful calls into Search Pipeline sounds quite risky. It provides better flexibility but may cause performance, security, and stability uncertainty. Is this really a good tradeoff for a core feature like Search Pipeline? or similar capability could be provided through other mechanism, like Agent Framework? |
As I've been looking into this, I have been trying to envision how to address the security concerns. One of the main use cases to use this was to support opensearch-project/observability#1805 but I think we can still address that with individual workflow steps (update index, reindex, and a rollup step). I'll work together with @YANG-DB to try to implement the specific requirements. |
@xinlamzn the general idea is to allow assembly of core API calls that will allow workflow use case to emerge. |
Yes, @YANG-DB that was what I implied with this comment: #522 (comment) We have a create index step added in 2.13 and plan to have a reindex step in 2.14. What other core APIs do you need? Can you contribute them (we can help, there's plenty of examples and even a tutorial to follow)? |
Seems something overlaps with connector framework. A connector is something owns the connections to any external APIs. It has predict action. We have plan to enhance it to support more action types. |
Flow Framework allows sequencing of API calls, but these are presently limited to a fixed set of implemented steps. We can empower customers to perform automation and configuration using any OpenSearch or plugin API, or even calling external REST APIs.
Is your feature request related to a problem?
A workflow structure nearly identical to this plugin was proposed in this RFC for the Observability plugin: opensearch-project/observability#1805
An example workflow step for an API call is included in the linked RFC:
What solution would you like?
Add Workflow Steps which enable calling REST APIs.
The code already exists in our integration test classes, which could be refactored into appropriate steps in the main source tree:
flow-framework/src/test/java/org/opensearch/flowframework/TestHelpers.java
Lines 81 to 90 in 13f672e
This is called with parameters like this:
flow-framework/src/test/java/org/opensearch/flowframework/FlowFrameworkRestTestCase.java
Lines 135 to 143 in 13f672e
I think the best approach here is to write new steps:
InitHttpClientStep
which creates or configures an appropriateclient()
to make the call. This would be set up very similar to how theToolStep
is created, using the workflow parameters to configure aClient
object. Add an HttpHost Workflow Step #530RestApiStep
which would reference the client in its previous node inputs, and take other params for the actual REST call.One other consideration is the status API. We keep track of provisioned resources using step name, step id, resource type, and resource id. The linked RFC suggests intermediate status output with "complete" where we'd really want to identify what was done (hard to do generically so maybe we just put "complete") and for in-process steps, running tasks. We could consider that updated provisioning detail in a separate feature request.
We also need supporting code:
What alternatives have you considered?
We could write workflow steps for every OpenSearch API (or at least the most common ones) and use the associated Transport Actions.
Do you have any additional context?
While making REST calls is less efficient than the transport calls, they enable calls outside of OpenSearch as well.
REST client calls may also be needed for future migration to serverless.
While relatively easy to do this with an HTTP Client, supporting HTTPS and preserving appropriate headers may introduce some security complexities.
The text was updated successfully, but these errors were encountered: