
[RFC] Feature facilitate customization of net #593

Closed

Conversation

@BenjaminBossan (Collaborator) commented Feb 16, 2020

The motivation for this PR is to make skorch easier to customize so that things like DCGAN can be implemented with less effort. See the attached notebook for an illustration of this.

To achieve this, I introduced two major changes:

PassthroughScoring

Currently, when adding new losses on the batch level, it's very fiddly to log them on the epoch level. You have to define a custom function that picks the right value from history and passes it through unmodified, add a BatchScoring callback that uses this mock scoring function, define the name of the score again, and pass noop as the target extractor.

Here is the code before and after:

# somewhere in the net code
    ...
    self.history.record_batch('my_loss', my_loss)
    ...

# before
from skorch.callbacks import BatchScoring
from skorch.utils import noop

# mock scoring function: just reads the value already logged in history
def my_loss_score(net, X=None, y=None):
    return net.history[-1, 'batches', -1, 'my_loss']

callbacks = [
    BatchScoring(
        my_loss_score,
        name='my_loss',
        target_extractor=noop,  # targets are not needed, skip extraction
    ),
]

# after
from skorch.callbacks import PassthroughScoring

callbacks = [
    PassthroughScoring(name='my_loss'),  # picks up 'my_loss' from history
]

I replaced BatchScoring with PassthroughScoring in the existing code for train and valid loss.
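
For context, here is a minimal end-to-end sketch of the new pattern (the toy module, the extra loss, and the current train_step signature are illustrative assumptions, not part of this PR):

import numpy as np
from torch import nn
from skorch import NeuralNet
from skorch.callbacks import PassthroughScoring

class MyModule(nn.Module):  # toy module, just for illustration
    def __init__(self):
        super().__init__()
        self.lin = nn.Linear(20, 2)

    def forward(self, X):
        return self.lin(X)

class MyNet(NeuralNet):
    def train_step(self, Xi, yi, **fit_params):
        step = super().train_step(Xi, yi, **fit_params)
        # record an extra batch-level value; here just a scaled copy of
        # the train loss, purely to have something to log
        self.history.record_batch('my_loss', 0.5 * step['loss'].item())
        return step

net = MyNet(
    MyModule,
    criterion=nn.CrossEntropyLoss,
    max_epochs=2,
    # aggregates the batch-level 'my_loss' into an epoch-level score
    callbacks=[PassthroughScoring(name='my_loss')],
)

X = np.random.randn(64, 20).astype('float32')
y = np.random.randint(0, 2, size=64).astype('int64')
net.fit(X, y)  # 'my_loss' now shows up once per epoch in net.history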

Facilitate adding custom torch modules and optimizers

Right now, if a user wants to add their own modules or optimizers to NeuralNet, there are a few non-obvious steps they need to take for everything to work well:

  1. Make sure to extend the prefixes_ attribute so that set_params works (but extend a copy; mutating the class attribute in place affects other nets!).
  2. Make sure to extend cuda_dependent_attributes_ to avoid subtle bugs (again, don't mutate in place!).
  3. Use "private" methods like self._get_params_for or self._get_params_for_optimizer (a sketch of these steps follows this list).

The changes introduced in this PR automatically take care of 1. and 2. by using some __setattr__ magic. This is of course a dangerous thing to do, so please think about cases where this could go wrong.

For 3., I have not found a better solution than to make the mentioned methods public. I was experimenting with automating this step completely, with some magic happening behind the scenes, but this was a) a bit hacky and b) we cannot know which module parameters should be passed to which optimizer, i.e. we cannot fully automate this. A sketch of the resulting usage follows below.

If anyone has a suggestion to make this easier for the user, please share it.
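
With the changes in this PR, the same customization shrinks to roughly the following (a sketch, reusing MyGenerator from above and assuming the method names proposed here):

import torch
from skorch import NeuralNet

class GANNet(NeuralNet):
    def initialize_module(self):
        super().initialize_module()
        # prefixes_ and cuda_dependent_attributes_ are extended
        # automatically via __setattr__ when generator_ is assigned
        params = self.get_params_for('generator')
        self.generator_ = MyGenerator(**params)
        return self

    def initialize_optimizer(self):
        super().initialize_optimizer()
        # the helper is public now; we still state explicitly which
        # module's parameters this optimizer gets (step 3 cannot be
        # automated away)
        args, kwargs = self.get_params_for_optimizer(
            'generator_optimizer', self.generator_.named_parameters())
        self.generator_optimizer_ = torch.optim.Adam(*args, **kwargs)
        return self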

Conclusion

To illustrate how all these changes help, please have a look at the attached notebook. It replaces the notebook shown in #587 and sticks much closer to the original PyTorch implementation of DCGAN.

If we find this solution okay, I would add more documentation, docstrings, and tests.

ping @zachbellay @cgarciae @YannDubs

BenjaminBossan added 4 commits February 16, 2020 12:17
A scorer that just passes through a score that is already calculated
and logged during the batches (as, e.g., train_loss). Replaces use of
BatchScoring for most use cases.

This makes it a lot easier to add a custom loss during a batch. Just
log it in the history at each batch and add PassthroughScoring with
the name of the key of that loss. No need to add a fake scoring
function that just returns the value.

Facilitate adding custom modules and optimizers. This is achieved by:

* automatically registering modules/optimizers in the prefixes and
  cuda_dependent_attributes when they are set
* promoting get_params_for{,_optimizer} to public methods, since they
  need to be used for set_params to work

Fixup commits:

* fix wrong formatting
* add missing information
In this notebook, a DCGAN is trained. It makes use of 2 separate
optimizers and adds custom batch-level losses. This is now easier with
the changes introduced in this PR.
@BenjaminBossan self-assigned this Feb 16, 2020

@BenjaminBossan (Collaborator, Author) commented

After discussion with @ottonemo, it might be better to split the introduction of PassthroughScoring from the changes relating to __setattr__. I'll leave this here for now, but keep this option in mind.

@ottonemo (Member) commented

> The changes introduced in this PR automatically take care of 1. and 2. by using some __setattr__ magic. This is of course a dangerous thing to do, so please think about cases where this could go wrong.

I think using __setattr__ is fine and actually in line with how PyTorch modules register their subordinate parameters. We probably should handle __delattr__ as well: even if there are no references added (yet), there might be in the future, and then this will bite us in unexpected ways.
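
For illustration, this is the kind of __delattr__ handling that could mirror the PR's __setattr__ registration (a sketch only; the exact bookkeeping is an assumption):

from skorch import NeuralNet

class MyNet(NeuralNet):
    def __delattr__(self, name):
        # keep the bookkeeping in sync when a registered attribute is
        # removed; rebind fresh lists on the instance instead of
        # mutating the (possibly class-level) lists in place
        self.prefixes_ = [p for p in self.prefixes_ if p != name]
        self.cuda_dependent_attributes_ = [
            a for a in self.cuda_dependent_attributes_ if a != name]
        super().__delattr__(name)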

> For 3., I have not found a better solution than to make the mentioned methods public. I was experimenting with automating this step completely, with some magic happening behind the scenes, but this was a) a bit hacky and b) we cannot know which module parameters should be passed to which optimizer, i.e. we cannot fully automate this.

I'm for making this step explicit and just exposing the methods publicly. They are side-effect free, so there is no harm in doing so.

@BenjaminBossan (Collaborator, Author) commented

> We probably should handle __delattr__ as well

Good point, I'll look into it. As you said, for now we'd probably be fine.
