
TF model cards #14720

Merged (18 commits) on Dec 15, 2021

Conversation

Rocketknight1
Member

  • Creating model cards for models trained with Keras!
  • Model cards will automatically be created by the PushToHubCallback - the callback will peek at the metrics and accumulate information during the training run to enable this.
  • Calling model.push_to_hub() will not, by default, create a model card. This method is cross-platform (it comes from the PushToHubMixin), and changing its behaviour would negatively affect Trainer users.
  • Instead, Keras models now have a create_model_card() method if users want to create a model card outside of the PushToHubCallback. Because this method can't peek at the training history like the callback can, you need to pass the History object returned by model.fit() to create the card.
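The metric-accumulation behaviour described above can be sketched with a minimal, framework-free stand-in. This is illustrative only — the class name, hook signatures, and card format below are simplified stand-ins, not the actual `PushToHubCallback` implementation:

```python
class ModelCardCallback:
    """Minimal Keras-style callback: accumulates per-epoch metrics during
    training and renders a tiny model card at the end."""

    def __init__(self, model_name):
        self.model_name = model_name
        self.history = []  # one dict of metrics per epoch

    def on_epoch_end(self, epoch, logs):
        # Keras calls this hook after each epoch with a dict of metrics,
        # e.g. {"loss": 0.5}; we just accumulate them.
        self.history.append({"epoch": epoch, **logs})

    def on_train_end(self):
        # Render the accumulated history as a small markdown card.
        lines = [f"# {self.model_name}", "", "## Training results", ""]
        for entry in self.history:
            metrics = ", ".join(
                f"{key}: {value:.4f}"
                for key, value in entry.items()
                if key != "epoch"
            )
            lines.append(f"- Epoch {entry['epoch']}: {metrics}")
        return "\n".join(lines)


callback = ModelCardCallback("my-test-model")
callback.on_epoch_end(0, {"loss": 0.9012})
callback.on_epoch_end(1, {"loss": 0.4567})
card = callback.on_train_end()
```

This is why the real callback can build a card automatically while `create_model_card()` cannot: only code that observes every epoch sees the full metric history, so the standalone method has to be handed the `History` object instead.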

Collaborator

@sgugger sgugger left a comment


Thanks for adding this! Let's avoid storing a token that users may then share by mistake if we can :-)

Resolved review threads:
  • src/transformers/keras_callbacks.py (×2)
  • src/transformers/modelcard.py
Comment on lines 74 to 75
# TODO Is it okay to store the hub token as a callback attribute like this?
self.hub_token = hub_token
Collaborator


If you store it, make sure it doesn't appear in the repr (we had that surprise in the TrainingArguments) or something saved automatically.

Member Author

@Rocketknight1 Rocketknight1 Dec 13, 2021


Is there any way to make this work without storing the hub_token? I could move the repo creation to __init__, which would avoid the need to store the token as a property, but that might surprise users, since the repo would then be created when the callback is constructed rather than when training begins.

Collaborator


I think it's better to do it at init and have a potential error sooner rather than later.

Member Author


I set it to clear the hub_token attribute as soon as the repo is created, and made sure it wasn't being printed in the __repr__, so I think this is fairly safe.
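The safety pattern described here can be sketched as follows. This is a hedged, self-contained illustration (class and method names are ours, and `_create_repo` is a stand-in for the real clone/create call), not the actual transformers code:

```python
class PushCallbackSketch:
    """Illustrates the token-hygiene pattern: consume the token once at
    construction, clear the attribute, and keep it out of __repr__."""

    def __init__(self, repo_name, hub_token):
        self.repo_name = repo_name
        # The token is used exactly once, to create/clone the repo...
        self.repo = self._create_repo(repo_name, hub_token)
        # ...and then cleared so it cannot leak via pickling, logging,
        # or users printing the callback object.
        self.hub_token = None

    def _create_repo(self, name, token):
        # Stand-in for the real repo creation call that consumes the token.
        return f"<repo {name}>"

    def __repr__(self):
        # Deliberately omit anything token-related from the repr.
        return f"PushCallbackSketch(repo_name={self.repo_name!r})"


cb = PushCallbackSketch("my-model", hub_token="hf_secret")
```

With this shape, `repr(cb)` never contains the secret, and `cb.hub_token` is `None` as soon as construction finishes — the same two guarantees discussed in the thread.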

Collaborator


But why not create the repo at init? Is there some info we only get later?

Member Author


The main reason is that it's surprising for Keras users - Callbacks usually don't do anything visible at init, because they exist to trigger actions at certain points during training. If you think the problems with moving initialization to on_train_begin are too annoying, though, I can create the repo there, I'm sure it's fine!

Collaborator


In this case I find it weird not to do it at init (the Trainer does it there, FYI): this method clones the repo we will work in, which is something you would normally do before executing your script. If there is an error because the repo already exists and doesn't match the clone_from, it will be more obvious if it's raised at callback creation rather than when launching the training.

Member Author


That makes sense, yes. I'll move it!

Member Author


Done!

@Rocketknight1
Member Author

@sgugger all comments should be addressed and testing looks good. My main remaining concern is that model.push_to_hub does not generate a model card by default, because that method comes from the cross-platform PushToHubMixin. If you want, I can edit that method so that it creates a model card if you pass it a Keras model history? This would leave the behaviour for Trainer unchanged.

@sgugger
Collaborator

sgugger commented Dec 13, 2021

I'm fine with both options, so it's really up to what you think is best @Rocketknight1
Also, if you could rebase on master to fix the CI that would be great :-)

@Rocketknight1
Member Author

Will do both!

Collaborator

@sgugger sgugger left a comment


Perfect, thanks a lot for all your work on this!

Member

@LysandreJik LysandreJik left a comment


This is great! Do you have an example of what a training run would look like, that we could link in the release and in the docs?

@@ -377,6 +383,7 @@ class TrainingSummary:
eval_results: Optional[Dict[str, float]] = None
eval_lines: Optional[List[str]] = None
hyperparameters: Optional[Dict[str, Any]] = None
source: Optional[str] = "trainer"
Member


Cool addition!

Comment on lines +667 to +672
if tags is None:
tags = ["generated_from_keras_callback"]
elif isinstance(tags, str) and tags != "generated_from_keras_callback":
tags = [tags, "generated_from_keras_callback"]
elif "generated_from_keras_callback" not in tags:
tags.append("generated_from_keras_callback")
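The intent of the branch logic quoted above — every path ends with the marker tag present — can be exercised as a standalone function. A sketch, with a function name of our own choosing (in transformers this logic lives inline, not in a named helper):

```python
def add_keras_callback_tag(tags):
    """Normalise a user-supplied `tags` value (None, str, or list) so that
    the generated_from_keras_callback marker is always included."""
    marker = "generated_from_keras_callback"
    if tags is None:
        # No tags supplied: the marker becomes the only tag.
        tags = [marker]
    elif isinstance(tags, str) and tags != marker:
        # A single string tag: wrap it together with the marker.
        tags = [tags, marker]
    elif marker not in tags:
        # A list of tags: append the marker if it's missing (a bare
        # marker string also falls through here unchanged).
        tags.append(marker)
    return tags
```

This is how Hub-side tracking works: whatever the user passes, cards generated via the Keras callback always carry the marker tag, so they can be filtered on the Hub.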
Member


Nice! This will be super useful to track!

@Rocketknight1
Member Author

@LysandreJik I'm hoping to post an example with it when it's ready, but right now I'm having some issues with the method generating malformed YAML and they're a pain to track down!

Collaborator

@sgugger sgugger left a comment


Nice new test!

@Rocketknight1
Member Author

@sgugger @LysandreJik This should be ready to go now - some tests are failing even after rebasing but they have nothing to do with this PR. Okay if I merge?

@sgugger
Collaborator

sgugger commented Dec 15, 2021

Yes you can!

@Rocketknight1 Rocketknight1 merged commit 48d4827 into master Dec 15, 2021
@Rocketknight1 Rocketknight1 deleted the tf_model_card branch December 15, 2021 14:57
3 participants