👌 IMPROVE: Notebook Execution #236

chrisjsewell · 2020-08-20T02:45:44Z

Standardise auto/cache execution

Both now call the same underlying function (from jupyter-cache) and act the same.
This improves auto, by making it output error reports and not raising an exception on an error.

Additional config has also been added: execution_allow_errors and execution_in_temp.

Like for timeout, allow_errors can also be set in the notebook metadata.execution.allow_errors

This presents one breaking change, in that auto will now by default execute in a temporary folder as the cwd. (we could set temp to False by default, but I think this is safer?)

For both methods, executions data is captured into:

env.nb_execution_data[env.docname] = {
        "mtime": datetime.datetime.utcnow().isoformat(),
        "runtime": runtime,
        "method": execution_method,
        "succeeded": succeeded,
    }

This (almost) closes executablebooks/jupyter-cache#56

Both now call the same underlying function (from jupyter-cache) and act the same. This improves auto, by making it output error reports and not raising an exception on an error. Additional config has also been added: `execution_allow_errors` and `execution_in_temp`. Like for timeout, `allow_errors` can also be set in the notebook metadata.execution.allow_errors This presents one breaking change, in that `auto` will now by default execute in a temporary folder as the cwd.

codecov · 2020-08-20T02:50:47Z

Codecov Report

Merging #236 into master will increase coverage by 0.61%.
The diff coverage is 93.00%.

@@            Coverage Diff             @@
##           master     #236      +/-   ##
==========================================
+ Coverage   85.37%   85.99%   +0.61%     
==========================================
  Files           9       10       +1     
  Lines         841      921      +80     
==========================================
+ Hits          718      792      +74     
- Misses        123      129       +6

Flag	Coverage Δ
#pytests	`85.99% <93.00%> (+0.61%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
myst_nb/execution.py	`79.25% <84.44%> (ø)`
myst_nb/__init__.py	`90.21% <100.00%> (+0.68%)`	⬆️
myst_nb/exec_table.py	`100.00% <100.00%> (ø)`
myst_nb/parser.py	`95.97% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f98fa54...d186389. Read the comment docs.

This import is to mitigate errors on CI VMs, where you can get the message: "Matplotlib is building the font cache"

chrisjsewell · 2020-08-20T03:49:14Z

Just got to document it also...

mmcky

@chrisjsewell some initial general point for discussion.

Option Names:

With option names do you think we should adopt a convention such as {ext}_{option} where {ext} is an indication of which extension the option belongs to. My only concern is option names could get long with this approach but it may help to avoid name conflicts in the option namespace in the future?

Report:

Looks like execution stats include mtime, runtime, method, and succeeded added to env.nb_execution_data[env.docname]. If succeeded is False could we also save a path/reference to the generated error log file? This might be useful for opening traceback information based on context.

chrisjsewell · 2020-08-20T04:29:54Z

With option names do you think we should adopt a convention such as {ext}_{option}

Yeh in myst-parser all the config start with myst_: https://myst-parser.readthedocs.io/en/latest/using/intro.html#myst-configuration-options, which is literally programmed in as you suggest: https://github.com/executablebooks/MyST-Parser/blob/8206f2a6eb891737b0b7916b761d6a5311a08de4/myst_parser/__init__.py#L41-L43, and all configs are actually available in the one MdParserConfig object, which is stored on app.env.myst_config

Could do something similar here, but I was not sure if it was a good idea to break back compatibility.

If we did want to change, probably we should keep both name versions for a time, and somehow log warnings about one being deprecated

My only concern is option names could get long

I'd say better to be long and specific, than have conflicts.

If succeeded is False could we also save a path/reference to the generated error log file?

Sounds good to me 👍

chrisjsewell · 2020-08-20T04:53:27Z

If succeeded is False could we also save a path/reference to the generated error log file?

Added in 8534c2c

AakashGfude · 2020-08-20T08:05:03Z

Awesome @chrisjsewell . A few pointers from my side.

Some of the statistics mentioned it seems are not captured like num_of_errors, language, extension ? Also, it seems like a particular notebooks execution stops after it encounters the first cell with error. Then it jumps to the next notebook. We will need to have all the errors in a notebook captured for statistics?
The functionality of auto which differs it from cache now is generating coverage reports? should we have a different name for it then specifying its purpose?

chrisjsewell · 2020-08-20T08:26:39Z

Ok @mmcky @AakashGfude check this out! https://myst-nb--236.org.readthedocs.build/en/236/use/execute.html#execution-statistics

Some of the statistics mentioned it seems are not captured like num_of_errors, language, extension

I think maybe it would be better to have a customizable function, for extracting additional data from the notebook and adding it to nb_execution_data, to cater for different use cases

We will need to have all the errors in a notebook captured for statistics?

and that is what the execution_allow_errors configuration is for 😄

The functionality of auto which differs it from cache now is generating coverage reports?

Not sure what you mean by this? coverage reporting how?

should we have a different name for it then specifying its purpose?

as with changing the names of the current configuration values (see above), I'm hesitant to change them, since it means having to implement some kind of deprecation process, updating the documentation both here and in jupyter-book, and dealing with lots of people raising issues about why their books no longer build 😬

chrisjsewell · 2020-08-20T08:33:49Z

The difference between auto and cache, is that cache does all its execution "at once" before any files have been parsed, whereas auto executes each notebook during the parsing phase.
Also auto will re-execute on any changes to the document (not just code changes)

Personally I was never really on-board with having two methods. But I guess it was done since jupyter-cache was/is less developed. This PR though converges them a bit more, and maybe eventually we can just have the one execution method.

chrisjsewell · 2020-08-20T09:29:39Z

Note I'm now going to set execution_in_temp to False by default, as I think thats more intuitive.

chrisjsewell · 2020-08-20T10:04:20Z

Ok, I've finished updating the documentation.
I think I'm happy with the basic functionality so am going to merge.
Then I have moved @mmcky's issue here to #237 and we can discuss/iterate further there, about some of the points you have made here

chrisjsewell changed the title ~~👌 IMPROVE: Standardise auto/cache execution~~ 👌 IMPROVE: Notebook Execution Aug 20, 2020

🧪 TEST: Add auto matplotlib install

e8dad54

This import is to mitigate errors on CI VMs, where you can get the message: "Matplotlib is building the font cache"

chrisjsewell force-pushed the execution branch from 31485f8 to e8dad54 Compare August 20, 2020 03:08

✨ NEW: Capture execution data in sphinx env

f7260f4

chrisjsewell marked this pull request as ready for review August 20, 2020 03:43

chrisjsewell requested review from AakashGfude and mmcky August 20, 2020 03:43

mmcky reviewed Aug 20, 2020

View reviewed changes

👌 IMPROVE: Store error log path in env.nb_execution_data

8534c2c

chrisjsewell force-pushed the execution branch from 9288695 to 8534c2c Compare August 20, 2020 04:57

✨ NEW: Add nb-exec-table directive

126909d

🧪 TEST: add test for nb-exec-table

983d23f

📚 DOCS: Document new execution features

d186389

mmcky mentioned this pull request Aug 20, 2020

Improve stored execution statistics #237

Open

chrisjsewell merged commit 2bc0c11 into master Aug 20, 2020

chrisjsewell deleted the execution branch August 20, 2020 10:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

👌 IMPROVE: Notebook Execution #236

👌 IMPROVE: Notebook Execution #236

chrisjsewell commented Aug 20, 2020 •

edited

Loading

codecov bot commented Aug 20, 2020 •

edited

Loading

chrisjsewell commented Aug 20, 2020

mmcky left a comment •

edited

Loading

chrisjsewell commented Aug 20, 2020

chrisjsewell commented Aug 20, 2020 •

edited

Loading

AakashGfude commented Aug 20, 2020

chrisjsewell commented Aug 20, 2020 •

edited

Loading

chrisjsewell commented Aug 20, 2020

chrisjsewell commented Aug 20, 2020

chrisjsewell commented Aug 20, 2020

👌 IMPROVE: Notebook Execution #236

👌 IMPROVE: Notebook Execution #236

Conversation

chrisjsewell commented Aug 20, 2020 • edited Loading

codecov bot commented Aug 20, 2020 • edited Loading

Codecov Report

chrisjsewell commented Aug 20, 2020

mmcky left a comment • edited Loading

Choose a reason for hiding this comment

chrisjsewell commented Aug 20, 2020

chrisjsewell commented Aug 20, 2020 • edited Loading

AakashGfude commented Aug 20, 2020

chrisjsewell commented Aug 20, 2020 • edited Loading

chrisjsewell commented Aug 20, 2020

chrisjsewell commented Aug 20, 2020

chrisjsewell commented Aug 20, 2020

chrisjsewell commented Aug 20, 2020 •

edited

Loading

codecov bot commented Aug 20, 2020 •

edited

Loading

mmcky left a comment •

edited

Loading

chrisjsewell commented Aug 20, 2020 •

edited

Loading

chrisjsewell commented Aug 20, 2020 •

edited

Loading