Skip to content
This repository has been archived by the owner on Jun 6, 2024. It is now read-only.

Job doesn't stop when all tasks is over #1015

Closed
chy-crypto opened this issue Jul 31, 2018 · 5 comments
Closed

Job doesn't stop when all tasks is over #1015

chy-crypto opened this issue Jul 31, 2018 · 5 comments

Comments

@chy-crypto
Copy link

I submit a job with some tasks and when all tasks' command are executed over, the job are still running
How about it?

@yqwang-ms
Copy link
Member

After all tasks completed, the whole job should complete soon.

Please provide below whole logs for us to debug:
image

@chy-crypto
Copy link
Author

@yqwang-ms
Copy link
Member

Thanks. But I see all tasks' container are still running.
So, I guess the task should be hang when exit.
@Gerhut to take a look.

http://10.151.40.179/yarn/10.151.40.169:8042/node/containerlogs/container_e9881_1532509944573_0029_01_000003/yuqian/stdout/?start=-4096

image

@chy-crypto
Copy link
Author

When the job is running when all task is over, all resource it held are about 100M memories.

@Gerhut
Copy link
Member

Gerhut commented Jul 31, 2018

Caused by same $PAI_TASK_INDEX in all containers, fixed in #879

@Gerhut Gerhut closed this as completed Aug 6, 2018
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

4 participants