DEV: Initial AVA implementation review #699

Open
marisbasha wants to merge 150 commits into main

Conversation

marisbasha

@NickleDave Here's the pull request. I've changed some things in the network to ensure it's the same as AVA. Also, I've made it so that you can use any input shape to train it, although we'd need to use 128x128 inputs to use the AVA weights.

@NickleDave
Collaborator

🙌 awesome, thank you @marisbasha! This looks great. I had a quick look and can see you made the changes we discussed.

I will review in the next couple of days.

I sent you an invite for a meeting two weeks from today. Before we meet, I will add a recipe in a notebook using your implementation as we discussed.

You got this most of the way there; I'm happy to take a first pass at stuff like docstrings and higher-level functions for training.

I will make edits to this branch directly. In theory you should be able to git pull --rebase in your local clone of your fork so that you get the history after I rewrite it. You can also clone this repo and check out the PR using the GitHub CLI if you don't want to deal with git weirdness: gh pr checkout 699

It will be good to get your feedback on the docstrings, train/eval/predict functions, and tests too so we have more than one set of eyes on them.

Thanks again! I'm excited about getting this first implementation added ASAP so we can test it out on some real data.

@marisbasha
Author

marisbasha commented Sep 18, 2023

@NickleDave I have GitHub Desktop, so it's automatically synced; I can see it from the app.

Let's meet on Monday 2 October then!

Let me know when I will have something to test (docstring / training loop)

@NickleDave
Collaborator

> @NickleDave I have GitHub Desktop, so it's automatically synced; I can see it from the app.

Excellent 🤔 maybe I should be using GitHub Desktop

> Let me know when I will have something to test (docstring / training loop)

Will do, thank you @marisbasha

@NickleDave force-pushed the feature branch 4 times, most recently from 6009b0d to 4fc5d80 on September 29, 2023 at 12:32
Collaborator

@NickleDave left a comment

Hi @marisbasha, thanks again for all your work here, and sorry I haven't gotten to this sooner; I've been working on stuff for Yarden and dealing with other life things.

There are two quick changes I need you to make before I continue:

  • use torchmetrics.KLDivergence as one of the metrics in the vak.models.AVA definition (see my comment)
  • (this is very minor, just for consistency, but) rename vak.nets.Ava -> vak.nets.AVA

Please do that and test that you can make an instance of the model in a notebook without getting any crashes, and let me know how it goes, thanks!

Also, please note that I rewrote the commit history. I promise I'm not trying to take credit for your work; I just needed to remove some changes and break up some of the commits so I could better follow them. I can give you more detail when we meet, but for now please know you will want to do git pull --rebase in your local clone of vak so that you get the rewritten history. If you do git pull (without the rebase) you'll get a bunch of weird conflicts. Just let me know if that's not clear.

Let's keep working async like this for now; I think we are close to getting a toy example in a notebook. I'll check in early next week, but I feel like we can keep going this way for a bit before we need to meet (since I'm sure you're busy too).

def forward(self, x):
    return self.layer(x)

class Ava(nn.Module):
Collaborator

Let's please rename this to AVA since it's an acronym and to be consistent with the model name. We'll rely on the namespace to differentiate them (nets.AVA vs models.AVA) as you have done already in the model class

optimizer = torch.optim.Adam
metrics = {
    "loss": VaeElboLoss,
    "kl": torch.nn.functional.kl_div
Collaborator

@NickleDave Sep 29, 2023

When I try to instantiate the model in a notebook I get an error that I'll paste below.
It's not your fault though, it's because of something we haven't made precise in the code yet.
I'll raise an issue about that (so I don't dump a bunch of detail here).
Can you please try changing to torchmetrics.KLDivergence and we'll see if that fixes the bug?
https://torchmetrics.readthedocs.io/en/stable/regression/kl_divergence.html
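
Concretely, something like this minimal sketch of the metrics dict -- mapping "kl" to the torchmetrics class (which, like VaeElboLoss, is a class we can instantiate and call) instead of the bare function:

import torch
import torchmetrics

from vak.nn.loss import VaeElboLoss  # import path taken from the traceback below

optimizer = torch.optim.Adam
metrics = {
    "loss": VaeElboLoss,
    "kl": torchmetrics.KLDivergence,  # a class with a __call__ method, unlike torch.nn.functional.kl_div
}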

Below is the traceback from the error I'm getting, basically because vak.models.base.Model expects every metric to implement a __call__ method (I think we might want to require that every metric be a subclass of the torchmetrics.Metric class instead, since that's more explicit and helps us ensure consistent behavior)

TypeError                                 Traceback (most recent call last)
File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/models/decorator.py:73, in model.<locals>._model(definition)
     72 try:
---> 73     validate_definition(definition)
     74 except ValueError as err:

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/models/definition.py:262, in validate(definition)
    259     if not (
    260         inspect.isclass(metrics_dict_val) and callable(metrics_dict_val)
    261     ):
--> 262         raise TypeError(
    263             "A model definition's 'metrics' variable must be a dict mapping "
    264             "string names to classes that define __call__ methods, "
    265             f"but the key '{metrics_dict_key}' maps to a value with type {type(metrics_dict_val)}, "
    266             f"not recognized as callable."
    267         )
    269 # ---- validate default config

TypeError: A model definition's 'metrics' variable must be a dict mapping string names to classes that define __call__ methods, but the key 'kl' maps to a value with type <class 'function'>, not recognized as callable.

The above exception was the direct cause of the following exception:

ModelDefinitionValidationError            Traceback (most recent call last)
Cell In[1], line 3
      1 import torch
----> 3 from vak.nets.ava import Ava

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/__init__.py:1
----> 1 from . import (
      2     __main__,
      3     cli,
      4     common,
      5     config,
      6     datasets,
      7     eval,
      8     learncurve,
      9     metrics,
     10     models,
     11     nets,
     12     nn,
     13     plot,
     14     predict,
     15     prep,
     16     train,
     17     transforms,
     18 )
     19 from .__about__ import (
     20     __author__,
     21     __commit__,
   (...)
     28     __version__,
     29 )
     31 __all__ = [
     32     "__main__",
     33     "__author__",
   (...)
     56     "transforms",
     57 ]

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/__main__.py:8
      5 import argparse
      6 from pathlib import Path
----> 8 from .cli import cli
     11 def get_parser():
     12     """returns ArgumentParser instance used by main()"""

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/cli/__init__.py:4
      1 """command-line interface functions for training,
      2 creating learning curves, etc."""
----> 4 from . import cli, eval, learncurve, predict, prep, train
      6 __all__ = [
      7     "cli",
      8     "eval",
   (...)
     12     "train",
     13 ]

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/cli/eval.py:4
      1 import logging
      2 from pathlib import Path
----> 4 from .. import config
      5 from .. import eval as eval_module
      6 from ..common.logging import config_logging_for_cli, log_version

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/config/__init__.py:2
      1 """sub-package that parses config.toml files and returns config object"""
----> 2 from . import (
      3     config,
      4     eval,
      5     learncurve,
      6     model,
      7     parse,
      8     predict,
      9     prep,
     10     spect_params,
     11     train,
     12     validators,
     13 )
     16 __all__ = [
     17     "config",
     18     "eval",
   (...)
     26     "validators",
     27 ]

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/config/config.py:4
      1 import attr
      2 from attr.validators import instance_of, optional
----> 4 from .eval import EvalConfig
      5 from .learncurve import LearncurveConfig
      6 from .predict import PredictConfig

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/config/eval.py:8
      6 from ..common import device
      7 from ..common.converters import expanded_user_path
----> 8 from .validators import is_valid_model_name
     11 def convert_post_tfm_kwargs(post_tfm_kwargs: dict) -> dict:
     12     post_tfm_kwargs = dict(post_tfm_kwargs)

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/config/validators.py:6
      2 from pathlib import Path
      4 import toml
----> 6 from .. import models
      7 from ..common import constants
     10 def is_a_directory(instance, attribute, value):

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/models/__init__.py:12
     10 from .tweetynet import TweetyNet
     11 from .vae_model import VAEModel
---> 12 from .ava import AVA
     14 __all__ = [
     15     "base",
     16     "ConvEncoderUMAP",
   (...)
     30 
     31 ]

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/models/ava.py:11
      7 from .vae_model import VAEModel
      8 from ..nn.loss import VaeElboLoss
     10 @model(family=VAEModel)
---> 11 class AVA:
     12     """
     13     """
     14     network = nets.Ava

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/src/vak/models/decorator.py:79, in model.<locals>._model(definition)
     75     raise ModelDefinitionValidationError(
     76         f"Validation failed for the following model definition:\n{definition}"
     77     ) from err
     78 except TypeError as err:
---> 79     raise ModelDefinitionValidationError(
     80         f"Validation failed for the following model definition:\n{definition}"
     81     ) from err
     83 attributes = dict(family.__dict__)
     84 attributes.update({"definition": definition})

ModelDefinitionValidationError: Validation failed for the following model definition:
<class 'vak.models.ava.AVA'>

@NickleDave
Collaborator

@all-contributors add @marisbasha for code

@allcontributors
Contributor

@NickleDave

I've put up a pull request to add @marisbasha! 🎉

@NickleDave
Collaborator

@marisbasha following up on our discussion today: let's use the torchmetrics.KLDivergence class.

But just for future reference, as far as I can tell it would be equivalent to use torch.nn.KLDivLoss (the class) and automatically get the validation loss we want--aggregated over batches and then averaged.

I'm basing that on this discussion: https://lightning.ai/forums/t/understanding-logging-and-validation-step-validation-epoch-end/291/2
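
To make the aggregation behavior concrete, here's a minimal standalone sketch (outside of Lightning) of how a torchmetrics.KLDivergence instance accumulates over batches and then computes a single value:

import torch
import torchmetrics

kl = torchmetrics.KLDivergence()

for _ in range(3):  # stand-in for validation batches
    p = torch.softmax(torch.rand(8, 10), dim=-1)  # predicted distributions
    q = torch.softmax(torch.rand(8, 10), dim=-1)  # target distributions
    kl.update(p, q)  # accumulates per-batch statistics

print(kl.compute())  # one value aggregated over all batches seen so far
kl.reset()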

@marisbasha
Author

@NickleDave added the requested changes. Also, as we discussed on the last call, I made the commit message very expressive about everything that changed in that commit.

@NickleDave
Collaborator

🚀 great, thank you @marisbasha -- I will get back to this this week. (I need to get this branch merged in so I can re-start experiments and deal with some life stuff)

@NickleDave
Collaborator

Hi @marisbasha, I made some progress on this but I get an error when I start training -- if you get a chance to test with the notebook I added before I do, and you have a guess about what's going on, please let me know.

Looks like for some reason we end up with 5D input going into the batchnorm?

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/.venv/lib/python3.10/site-packages/torch/nn/modules/batchnorm.py:138, in _BatchNorm.forward(self, input)
    137 def forward(self, input: Tensor) -> Tensor:
--> 138     self._check_input_dim(input)
    140     # exponential_average_factor is set to self.momentum
    141     # (when it is available) only so that it gets updated
    142     # in ONNX graph when this node is exported to ONNX.
    143     if self.momentum is None:

File ~/Documents/repos/coding/vocalpy/vak-vocalpy/.venv/lib/python3.10/site-packages/torch/nn/modules/batchnorm.py:416, in BatchNorm2d._check_input_dim(self, input)
    414 def _check_input_dim(self, input):
    415     if input.dim() != 4:
--> 416         raise ValueError(f"expected 4D input (got {input.dim()}D input)")

ValueError: expected 4D input (got 5D input)

@marisbasha
Author

@NickleDave sure, I'll have a look during the weekend and I'll let you know!

@marisbasha
Author

Hello @NickleDave, as far as I was able to infer, the problem is related to batching.

In line 98 of nets/ava.py we have:
x = self.encoder(x.unsqueeze(self.in_channels)).view(-1, self.in_fc)

The error is caused by unsqueeze, which transforms the already-4D input (Batch, Channel, Width, Height) into a 5D tensor.
We could split this operation across multiple lines, check the shape, and based on that decide whether to unsqueeze or not.

Another error that emerges, which we need to talk about, is the view operation after the encoder in the same line, .view(-1, self.in_fc). It causes problems because of the shape of the input, (1, 257, 133), in the test case. While in theory we could calculate these on the fly, I'd advise setting these parameters from input_shape. So for the moment, for the network to work, you could first set input_shape=(1, 257, 133) when initializing the model and then use:
x = self.encoder(x).view(-1, self.in_fc) instead of x = self.encoder(x.unsqueeze(self.in_channels)).view(-1, self.in_fc) in line 98, until we discuss how we want to proceed with the batching.
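
To make the issue concrete, here is a tiny sketch with made-up shapes showing why the unsqueeze breaks BatchNorm2d, plus one possible guard:

import torch

x = torch.rand(16, 1, 128, 128)  # already batched: (Batch, Channel, Height, Width)
print(x.unsqueeze(1).shape)      # torch.Size([16, 1, 1, 128, 128]) -- 5D, so BatchNorm2d raises

# one possible guard: only add the channel dimension when the input doesn't have it yet
if x.dim() == 3:  # (Batch, Height, Width) without a channel dimension
    x = x.unsqueeze(1)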

Let me know if you want me to proceed somehow with any change.

@NickleDave
Collaborator

Thank you @marisbasha for figuring out where the error is coming from.
Sorry for not replying sooner; I was under a deadline, and in the middle of changing jobs.
I need to think more about what the expected input and output shapes should be.
I'll send you an invite for a meeting--let's talk through it as you suggest.

@NickleDave
Collaborator

Hi @marisbasha, sorry I wasn't able to look at this sooner.

Now that I have, I think you are right -- we should give the network an input_shape parameter and use that throughout. This is what other networks in vak do right now. I prefer it, since we make the expected input shape a little more explicit, instead of implicitly adding a channel. I'm guessing reasons like that are why you suggested it.

I added a commit making that change.

We should also make those assumptions explicit with a default dataloader we use for the model, but we don't need to figure that out now.

Before we meet tomorrow I will

  • read through one more time to make sure I follow the logic of building the network, and
  • test one more time with a notebook

But I think we are closer and I can start doing stuff like adding docstrings, functions to train/test the model, etc.

@NickleDave
Collaborator

NickleDave commented Nov 22, 2023

Hi @marisbasha, getting closer on this.
I'm going to leave two comments.
The first summarizes changes I made; please let me know what you think, especially if you spot any mistakes I made.
The second will explain what I think we need to do next. I'm putting that in a separate comment just so it doesn't get lost in a wall of text.

Changes I made:

  • So far I've mainly changed the network itself, in the vak.nets.ava module. One very minor exception was changing the default learning rate for the module to match what the original implementation uses: 1e-3 instead of 0.003
  • The changes I made to the network were mainly just extending the improvements in readability that @marisbasha made, minimizing the number of things that someone new to the code would need to hold in their head while reading, and in some cases rephrasing to look more like "idiomatic" pytorch / the original implementation / related projects like pytorch-vae
    • I changed AVA.__init__ to dynamically determine shapes, like the fully-connected input size, using a dummy tensor (rough sketch after this list)
    • I changed encoder_channels default to end with 32 instead of always matching z_dim -- I agree that the intent of the original model was to match z_dim but AFAICT there's no reason to match z_dim since the usual thing to do is take whatever the number of channels is out of the encoder, flatten it, and then project down to z_dim with the fully connected layers; so I don't think we should hard-code this and instead should let someone specify whatever series of channels they want for the encoder
    • I renamed BottleneckLayer -> FullyConnectedLayer to avoid confusion with e.g. a ResNet Bottleneck, and likewise renamed AVA attributes to use fc to match what the original implementation does and what pytorch-vae does. Of course these are in fact acting like bottlenecks but referring to them as fc/fully-connected seemed a little clearer
    • There was one case where we applied the same view to the decoder twice; I removed that -- an easy mistake to make, since pytorch-vae does things one way and ava does it another (since it's trying to provide a lightweight "model" class)
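
For reference, here's a minimal sketch of the dummy-tensor trick mentioned above, with a made-up encoder (not the actual vak.nets code):

import torch
import torch.nn as nn

input_shape = (1, 128, 128)  # (channels, height, width); made-up example
encoder = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, stride=2, padding=1),
    nn.ReLU(),
    nn.Conv2d(8, 16, kernel_size=3, stride=2, padding=1),
    nn.ReLU(),
)

# run a dummy tensor through the encoder to find the flattened size dynamically
with torch.no_grad():
    dummy = torch.zeros(1, *input_shape)
    out = encoder(dummy)
in_fc = out.view(1, -1).shape[1]  # size the first fully-connected layer must accept
fc = nn.Linear(in_fc, 32)  # e.g., project down to z_dim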

@NickleDave
Collaborator

NickleDave commented Nov 22, 2023

Okay, comment two: what we need to change.

I have a branch here with a couple of notebooks in the project root:
https://github.com/vocalpy/vak/tree/test-ava-in-notebook

  • In test-vae I just run a torch.rand tensor with the same fixed shape used in the original implementation, 128 x 128, through an instance of AVA. This works with no problem.
  • In train-ava I have the guts of what would become a train_vae function. It crashes because we can't reshape the output of the fully-connected part of the decoder to feed it into the convolutional part of the decoder. The reason we can't reshape is that our input shape is not 128 x 128; it is (1, 257, 133): a "channel", the number of frequency bins in the spectrogram, and the number of time bins for each spectrogram after padding so they are all the same shape.

So what we need to do is have a series of transforms that further pads the spectrograms to give us an input shape that we'll be able to reshape to when we decode. Basically we'll add a torchvision.transforms.Pad at the end of the transforms.Compose pipeline.
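
Roughly what I have in mind -- a sketch only, since the exact padding amounts are still to be worked out:

import torchvision.transforms as T

transform = T.Compose([
    # ... the existing spectrogram transforms ...
    T.Pad(padding=(0, 0, 3, 7)),  # (left, top, right, bottom); placeholder values for now
])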

I'm not actually sure what valid shapes are. I think we have to work backwards using the equations that determine output sizes of convolutional layers. I'm pretty sure any power of 2 will work but this would involve adding a lot of padding in some cases, e.g. if we go to the next power of 2 from 256 (for 257 frequency bins) we'd be padding to (512 x 512), which is a lot of wasted computation.
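
This is the formula I mean for working backwards -- a quick sketch of how size shrinks through a stack of strided convs (using hypothetical 3x3, stride-2, padding-1 layers, not necessarily AVA's exact ones):

import math

def conv2d_out(size, kernel_size, stride, padding):
    # standard output-size formula for a conv (or pooling) layer along one dimension
    return math.floor((size + 2 * padding - kernel_size) / stride) + 1

size = 128
for _ in range(5):
    size = conv2d_out(size, kernel_size=3, stride=2, padding=1)
    print(size)  # 64, 32, 16, 8, 4 -- powers of 2 halve cleanly, so they reshape cleanly on the way back up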

@NickleDave
Collaborator

NickleDave commented Nov 29, 2023

Hi @marisbasha I thought about this some more and I feel like it's best to replicate the AVA pre-processing pipeline.
I'm happy to do that.
It doesn't have to be a complete perfect replication but I think we'll at least need to re-implement this function to get the 128x128 shape expected by the model.
https://github.com/pearsonlab/autoencoded-vocal-analysis/blob/11c95990065ec6bfd3b860b9a9232db3a2f9b89c/ava/preprocessing/utils.py#L18
Our dataset prep step should produce spectrograms of the correct size using a similar function.
We should keep the model written as it is, to be more general, but document that the default dataset prep will give us the same window size as the original.
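
Very rough sketch of the idea (not a replication of the linked function): resample each spectrogram segment onto the fixed 128x128 grid the model expects.

import torch
import torch.nn.functional as F

spect = torch.rand(257, 140)  # (freq bins, time bins) for one segment; made-up shape
spect_128 = F.interpolate(
    spect[None, None],  # add batch and channel dims, since interpolate expects 4D input here
    size=(128, 128),
    mode="bilinear",
    align_corners=False,
)[0, 0]
print(spect_128.shape)  # torch.Size([128, 128])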

I'm working on something else with a deadline, but I expect to be able to get back to this next week, after Dec 6.

@NickleDave
Collaborator

NickleDave commented Mar 1, 2024

Hey @marisbasha thanks for reaching back out and picking this up again.

Just writing notes on next steps from our call today:

  • run train and understand why loss is high
    • something about data prep? How much do we need to replicate how they prep? Could it be normalization / scaling?
    • I just added a commit with the config files in tests/data_for_tests/configs -- please start with those and confirm you can run them with vak prep --> vak train
  • write "minimum viable" version of eval
    • for now, we just want to log loss on test set
  • write initial version of predict
    • for now, we just save each embedding as an .npy file in output_dir; we will want some schema for how we name those files, something like f"{audio_file_name}-segment-{segment_number}-AVA-embedding.npy" (rough sketch after this list)
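
Rough sketch of that naming scheme (all names below are placeholders):

import numpy as np

audio_file_name = "bird0_2023-10-01"   # placeholder
segment_number = 7                     # placeholder
embedding = np.random.rand(32)         # stand-in for one AVA latent embedding

out_path = f"{audio_file_name}-segment-{segment_number}-AVA-embedding.npy"
np.save(out_path, embedding)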

Just let me know if you have any questions! Like I said, happy to help however--we can discuss here or jump on a video call, whatever works best for you. Thanks so much for working on this.
