Fix quality issues #33

tibuch · 2022-10-04T13:53:50Z

I might have found two issues related to the quality drop described in #30.

It looks like the training data was sampled from a much smaller region than the validation data.
I added the rotation augmentation back.

thorstenwagner

I've some doubts regarding this change. See my comment :-)

thorstenwagner · 2022-10-05T08:37:22Z

cryocare/internals/CryoCAREDataModule.py

@@ -211,7 +211,7 @@ def __compute_extraction_shapes__(self, even_path, odd_path, tilt_axis_index, sa
        assert even.data.shape[1] > 2 * sample_shape[1]
        assert even.data.shape[2] > 2 * sample_shape[2]

-        val_cut_off = int(even.data.shape[tilt_axis_index] * validation_fraction)
+        val_cut_off = int(even.data.shape[tilt_axis_index] * (1 - validation_fraction))


I don't think that this is correct.

In cryoCARE_extract_train_data.py the validation fraction is already defined as:

validation_fraction=(1.0 - config['split'])

See: https://github.com/juglab/cryoCARE_pip/blob/master/cryocare/scripts/cryoCARE_extract_train_data.py#L28

Given a split of 0.9, wouldn't 90% of the data become validation data with your change?

Here (cryoCARE_pip/cryocare/internals/CryoCAREDataModule.py, line 239 in 847d9bf):

extraction_shape_train[tilt_axis_index] = [0, val_cut_off]

the training data is then taken from 0 up to val_cut_off. Maybe I am confusing it :/
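To make the arithmetic in this exchange concrete, here is a minimal sketch of the two cut-off formulas from the diff. It is not the module's code: the function name `split_along_axis`, the `fixed` flag, and the axis length of 1000 are hypothetical, and the validation range `[val_cut_off, axis_len]` is an assumption (the thread only shows that the training range is `[0, val_cut_off]`).

```python
def split_along_axis(axis_len, validation_fraction, fixed=True):
    """Return (train_range, val_range) along the split axis.

    fixed=True  uses the formula proposed in this PR.
    fixed=False uses the formula that was replaced.
    """
    if fixed:
        val_cut_off = int(axis_len * (1 - validation_fraction))
    else:
        val_cut_off = int(axis_len * validation_fraction)

    # Confirmed by the thread: training data runs from 0 up to val_cut_off.
    train_range = [0, val_cut_off]
    # Assumption: validation data takes the remainder of the axis.
    val_range = [val_cut_off, axis_len]
    return train_range, val_range


split = 0.9                        # config['split']
validation_fraction = 1.0 - split  # as in cryoCARE_extract_train_data.py
axis_len = 1000                    # hypothetical extent along the split axis

for fixed in (False, True):
    train_range, val_range = split_along_axis(axis_len, validation_fraction, fixed)
    label = "new" if fixed else "old"
    print(label, "train:", train_range, "val:", val_range)
```

With the replaced formula the training range covers only about a tenth of the axis, which matches the observation that the training data came from a much smaller region than the validation data. With the new formula the training range covers the first 90%.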

You are totally right :-)

Nice catch! Please merge it.

I didn't manage to run it yet. Secretly hoping that @rdrighetto finds the time 😇

OK, I tested it and now I get this error:

2022-10-05 15:28:10.372969: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.11.0
  0%| | 0/500 [00:04<?, ?it/s]
Traceback (most recent call last):
  File "/scicore/home/engel0006/GROUP/pool-engel/soft/cryo-care/cryoCARE_pip/cryocare_11/bin/cryoCARE_extract_train_data.py", line 45, in <module>
    main()
  File "/scicore/home/engel0006/GROUP/pool-engel/soft/cryo-care/cryoCARE_pip/cryocare_11/bin/cryoCARE_extract_train_data.py", line 27, in main
    dm.setup(config['odd'], config['even'], n_samples_per_tomo=config['num_slices'],
  File "/scicore/home/engel0006/GROUP/pool-engel/soft/cryo-care/cryoCARE_pip/cryocare_11/lib/python3.8/site-packages/cryocare/internals/CryoCAREDataModule.py", line 194, in setup
    self.train_dataset = CryoCARE_Dataset(tomo_paths_odd=tomo_paths_odd,
  File "/scicore/home/engel0006/GROUP/pool-engel/soft/cryo-care/cryoCARE_pip/cryocare_11/lib/python3.8/site-packages/cryocare/internals/CryoCAREDataModule.py", line 46, in __init__
    self.compute_mean_std(n_samples=n_normalization_samples)
  File "/scicore/home/engel0006/GROUP/pool-engel/soft/cryo-care/cryoCARE_pip/cryocare_11/lib/python3.8/site-packages/cryocare/internals/CryoCAREDataModule.py", line 91, in compute_mean_std
    x, _ = self.__getitem__(i)
  File "/scicore/home/engel0006/GROUP/pool-engel/soft/cryo-care/cryoCARE_pip/cryocare_11/lib/python3.8/site-packages/cryocare/internals/CryoCAREDataModule.py", line 161, in __getitem__
    return self.augment(np.array(even_subvolume)[..., np.newaxis], np.array(odd_subvolume)[..., np.newaxis])
  File "/scicore/home/engel0006/GROUP/pool-engel/soft/cryo-care/cryoCARE_pip/cryocare_11/lib/python3.8/site-packages/cryocare/internals/CryoCAREDataModule.py", line 137, in augment
    x[i] = np.rot90(x[i], k=rot_k[i], axes=self.rot_axes)
ValueError: could not broadcast input array from shape (1,72,72) into shape (72,72,1)

I tried to fix it by myself without success ☹️. What I did notice is the following:

The problem occurs because x and y have shape (72, 72, 72, 1) when augment() is called.

Therefore, when k=rot_k[i] is 1 or 3 in np.rot90 (i.e. a rotation of 90 or 270 degrees), the resulting array has shape (1,72,72), but it is assigned to x[i], whose shape is (72,72,1).

As I said, I tried to fix this by removing the 4th dimension inside augment() and adding it back before returning, but then the code broke somewhere else, so I decided it's better to stop and call the experts 😅
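The shape mismatch described here can be reproduced in isolation. The following is a minimal sketch, not the hot-fix that was pushed: it assumes a rotation plane of `(0, 2)` (the actual value of `self.rot_axes` is not shown in this thread), and the helpers `rotated_shape` and `rotate_spatial` are hypothetical names. It shows why an odd `k` fails on a `(72, 72, 1)` slice, and one possible way to keep the shape, namely rotating the full volume in a plane spanned by two spatial axes of equal length.

```python
import numpy as np

# Hypothetical rotation plane; the real self.rot_axes is not shown in the thread.
ROT_AXES = (0, 2)


def rotated_shape(arr, k, axes=ROT_AXES):
    """Return the shape np.rot90 produces for the given array and k."""
    return np.rot90(arr, k=k, axes=axes).shape


def rotate_spatial(volume, k, axes=ROT_AXES):
    """Rotate a (D, H, W, 1) volume in a plane spanned by two spatial axes.

    Assumes both axes are spatial (not the trailing channel axis) and have
    equal length, so the output shape equals the input shape for every k.
    """
    channel_axis = volume.ndim - 1
    if channel_axis in axes:
        raise ValueError("rotation plane must not include the channel axis")
    if volume.shape[axes[0]] != volume.shape[axes[1]]:
        raise ValueError("rotation plane must be square to preserve the shape")
    return np.rot90(volume, k=k, axes=axes)


x = np.zeros((72, 72, 72, 1), dtype=np.float32)

# x[0] has shape (72, 72, 1), so axis 2 is now the singleton channel axis.
# For odd k, np.rot90 swaps the two axes of the rotation plane.
for k in range(4):
    print("k =", k, "->", rotated_shape(x[0], k))

# Writing the rotated slice back fails for odd k, as in the traceback.
try:
    x[0] = np.rot90(x[0], k=1, axes=ROT_AXES)
except ValueError as err:
    print("ValueError:", err)

# Rotating the full 4D volume in a spatial plane keeps the shape for every k.
for k in range(4):
    assert rotate_spatial(x, k).shape == x.shape
```

The design choice in the sketch is to leave the channel axis in place and restrict the rotation to spatial axes, rather than squeezing and restoring the fourth dimension.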

Thanks again!

Thanks! I pushed a hot-fix.

I promise, if it does not work this time I will set up everything on my end and stop coding in the GitHub IDE!

rdrighetto · 2022-10-05T14:54:30Z

Thanks @tibuch! With some additional tweaks to the code, cryoCARE_extract_train_data.py now apparently runs flawlessly (see #34).
I'm doing a full run right now to check the quality of results, will report back when done.

tibuch · 2022-10-05T15:46:05Z

Okay, as promised I checked it out and took it for a functional test-run on my side. Now it should work.

@rdrighetto could you restart with the current version? I would be interested to see if the results look better now.

Thanks!

tibuch added 2 commits October 4, 2022 15:31

Fix extraction_shape_train.

c757dc8

Add rotation augmentation.

243f8bd

tibuch requested a review from thorstenwagner October 4, 2022 13:53

tibuch added 2 commits October 5, 2022 09:13

Save and load tilt-axis.

92b011d

Fix rot_axes computation.

847d9bf

thorstenwagner reviewed Oct 5, 2022

thorstenwagner mentioned this pull request Oct 5, 2022

Update sample jsons and README #28

Merged

Fix rotation.

92d2c5f

Fix augmentation.

414bd5d

tibuch merged commit 30a0b4b into master Oct 6, 2022

tibuch deleted the fix-quality-issues branch October 6, 2022 06:36
