-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[meta] Pipeline Java Execution important issues #11175
Comments
I have looked into the "Java execution startup is potentially slower for some configurations #11105" issue and as I commented in that issue, it is not potentially slower, the slowdown is in fact a factor of the number of workers, see #11105 (comment). I will continue investigating this. |
I have created 2 PRs with alternate implementations to solve the class compilation caching problem leading to recompiling everything per worker; the first #11479 uses the existing global cache but fixes the cache key to correctly reuse compiled classes. There are some potential problems with pipeline reloading and cache invalidation with this implementation and another PR #11482 uses a different strategy by moving the class caching at the pipeline level which solves both the potential invalidation and pipeline reloading problems. |
Good news on the inputs starting before the workers are initialized: it is easier than I thought to fix, now just need to see if we make that configurable. Will push PR soon. |
#11482 was merged and should considerably improve java execution pipeline compilation time as it will not be multiplied by the number of workers anymore. Fix will be included in 7.6.0 and 7.5.2. |
PR to fix worker initialization sequence in #11492. Should we make that behaviour optional? |
New solution proposal for the event ordering #11524 |
About
I am wondering where in the docs we should introduce/document this subject. It does not feel right IMO to simply add a snipet un the settings file. /cc @karenzone @jsvd |
We're building out two general sections in Troubleshooting and Tips and Best Practices. It seems like this new info is similar in nature, even though it doesn't fit neatly under either of those current categories. If we renamed Tips and Best Practices or created a new section describing how things work, I'll bet we'd find other stuff to add. |
Yeah, I like the idea of having a more general "concepts" section? we could then reference that in the settings docs for example. |
I like the idea of keeping the more general info (Troubleshooting, Tips and Best Practices, and a new concepts section) together. On the other hand, it seems like a new conceptual section makes more sense in "How Logstash Works." If I expanded that topic, I'd expect to see information such as what you want to add. Under "How Logstash Works" (Pros and Cons): A new section grouped with Troubleshooting and Tips and Best Practices (Pros and Cons): WDYT? UPDATED: |
The rebatching after filters #11710 which improves the |
Closing this, all important work was done, one remaining testing issue has been created #12476 |
This is a meta issue about some important Pipeline Java Execution issues.
Priority Issues
(https://gist.github.com/yaauie/82e199687c490a2e04fe1f0c9d9a7fee))
Followup Issues
pipeline.ordered
setting to rebatch before outputs improvepipeline.ordered
setting to rebatch before outputs #11550 - Fixed in Rebatch after filters when using pipeline.ordered #11710Other issues
The text was updated successfully, but these errors were encountered: