I used the GitHub search to find a similar issue and didn't find it.
I searched the Prefect documentation for this issue.
I checked that this issue is related to Prefect and not one of its dependencies.
Bug summary
If the k8s pod gets evicted during a flow run, the flow run stays in the Running state forever. We had similar issues a few months ago, but they seemed resolved (the flow run would get marked as Crashed). For the past 2-3 weeks we have been seeing stuck flow runs again, and we have to manually cancel them to get them out of the Running state.
This is a major blocker for us: our flows run on a tight schedule and no two flows can run simultaneously, so a single stuck flow run blocks our entire process.
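To make the failure mode concrete, here is a minimal sketch of the check we currently do by hand: find runs that are still marked as Running but have shown no sign of life for a while. It deliberately does not use the Prefect API. `RunRecord`, its `last_heartbeat` field, and `find_stale_runs` are hypothetical names made up for illustration, and the 30-minute threshold is an arbitrary assumption.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass
class RunRecord:
    """Hypothetical, simplified view of a flow run (not a Prefect class)."""
    run_id: str
    state: str                 # e.g. "RUNNING", "COMPLETED", "CRASHED"
    last_heartbeat: datetime   # hypothetical "last sign of life" timestamp


def find_stale_runs(runs, now, max_silence):
    """Return runs still marked RUNNING whose last heartbeat is older than max_silence."""
    return [
        run for run in runs
        if run.state == "RUNNING" and now - run.last_heartbeat > max_silence
    ]


# Illustrative data only: one healthy run, one whose pod was evicted, one finished.
now = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
runs = [
    RunRecord("healthy-run", "RUNNING", now - timedelta(minutes=5)),
    RunRecord("evicted-pod-run", "RUNNING", now - timedelta(hours=2)),
    RunRecord("finished-run", "COMPLETED", now - timedelta(hours=3)),
]

stale = find_stale_runs(runs, now, max_silence=timedelta(minutes=30))
print([run.run_id for run in stale])
```

Only the run that is both in the Running state and silent for longer than the threshold is flagged. These are the runs we then have to cancel manually, and the ones we would expect to be marked as Crashed automatically.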
Reproduction
# Start a flow run
# Ensure the pod gets evicted
# Flow will stay in Running state
Error
No response
Versions
Additional context
No response