Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bulk Executor and HTCondor Bulk Operations #224
Bulk Executor and HTCondor Bulk Operations #224
Changes from 44 commits
3491a86
0db8645
973bdbb
6e47c3b
9f8e5cc
78404b0
8f3a07b
85cdab2
8fb9287
414b60d
676cfa0
2fd2923
e7468b5
9157397
76b5ea5
d95e0ed
1c4014e
071b1db
a0b5dd8
ecc71e4
d4f4713
d33a880
c21507f
f6185e7
03d3b2e
65b9893
f3e18a2
7993bba
2c55fcb
e7449a6
64a0be5
724e751
d11872d
5dbe599
4c274b6
5bd66e1
340e541
86f105f
b2dbc4a
31de4b7
ef537af
4561e52
b10e04e
d909bd1
be36258
f7c286f
74ffd75
ce859f7
ba5ff71
612fb7d
42f2aab
4eeea62
d07f59c
0079046
94d0613
cea7d68
f673f2d
f8b9c70
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does a
not found
needs to be treated here as well? At least I would like to havelogger.debug
output here.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure, I can add a debug log. Would you be okay if it's next to the
raise TardisResourceStatusUpdateFailed
in the adapter methods? It seems we don't get any sensible debug output from condor anyway.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That would be okay!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can this potentially have any impact?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good question – not as far as I can tell, but @giffels better take a look at this as well. ;)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This already attracted my attention. ;)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It seems to have a negative effect on already present drones in the system started with an older version of tardis.
tardis reports the following:
while at the same time the job is still running in HTCondor
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the problem is in
tardis/tardis/adapters/sites/htcondor.py
Lines 247 to 248 in ce859f7
_job_id
onremote_resource_id
it should be fine.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the heads up. Changed as suggested.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Potentially it could be possible to get several
job_id
s after executing this. Can this be handled already or is this on purpose for one job only at the moment?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We cannot actually get more than one
job_id
here, since the bulk execution framework forbids it. But indeed the JDL template could queue more than one job or even none.@giffels would you object to deprecating the use of
queue
statements in the JDL template? It is simple enough to inject aqueue 1
automatically – I don't really see any advantage in addingqueue
manually but many ways it can go wrong.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am just afraid that a removal will break backwards compatibilty. Officially, we should deprecate it first and remove it later on in
0.8.0
. So for example checking it is in there and print a warning like we did intardis/tardis/adapters/sites/slurm.py
Lines 58 to 61 in 9d45bbc
queue 1
.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've added a helper that checks for
queue
statements and inserts one otherwise.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great! Thanks!