This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

Unlikelihood Agent #3507

Closed
mvh57 opened this issue Mar 10, 2021 · 8 comments


mvh57 commented Mar 10, 2021

Hi @hadasah and @stephenroller, following up on this issue from September (#2966): I am wondering whether you've had a chance to update the unlikelihood model so that the reward parameter can be non-binary (i.e., take on an integer or float value, including 0). Thanks for your help with this. I would also be happy to help with the change, but I would need some guidance on where to begin.

stephenroller (Contributor) commented

#3517

mvh57 (Author) commented Mar 15, 2021

@stephenroller, thank you for making these changes! One remaining question we have: will these changes carry through to the loss function for the unlikelihood agent? For example, will a reward of -0.9 be down-weighted more than a reward of -0.5? Also, will a reward of 0 be parsed as such and effectively ignored by the agent?

stephenroller (Contributor) commented

No, it will not.

You'll need to add some multipliers for the value here:

* `mle_notnull.float()`
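A minimal plain-Python sketch of what such a multiplier could look like (the actual ParlAI code operates on tensors and uses different variable names; `per_example_loss` and the sign convention here are assumptions for illustration only):

```python
# Hypothetical sketch: scale each example's loss by the magnitude of its
# reward instead of using a binary 0/1 mask. Names are illustrative,
# not ParlAI's.
def weighted_loss_terms(per_example_loss, rewards):
    # positive rewards feed the likelihood (MLE) term, scaled by the reward;
    # negative rewards feed the unlikelihood term, scaled by |reward|;
    # a reward of exactly 0 contributes to neither term.
    mle = sum(l * r for l, r in zip(per_example_loss, rewards) if r > 0)
    ul = sum(l * -r for l, r in zip(per_example_loss, rewards) if r < 0)
    return mle, ul
```

With this weighting, a reward of -0.9 penalizes its example more strongly than -0.5, and a reward of 0 drops out entirely.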

mvh57 (Author) commented Mar 16, 2021

@stephenroller thank you!


jakeane commented Mar 24, 2021

Hi @stephenroller. I am working with @mvh57, and I am currently writing the additions to allow the weights to be non-binary. I have some questions before I go through the process detailed in CONTRIBUTING.md:

  • Currently, these additions would entail copying the RewardUnlikelihoodAgentTrait class and modifying only the compute_loss method, specifically `mle_loss` and `ul_loss`. An agent would then take in this modified trait. Is this the approach you would prefer in the codebase?
  • As for naming the new trait and agent: are names like RewardWeightedUnlikelihoodAgentTrait and TransformerWeightedUnlikelihoodAgent solid, or too verbose?
  • I noticed this comment here:
    # note it's >= because convai2 and other teachers all provide a 0 reward
    I was wondering whether my modified trait class should handle instances where the teacher only provides rewards of 0. My current implementation would make mle_loss = 0 in such cases. Is this fine, given that this trait should not be used with such teachers?
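The subclass-and-override approach in the first bullet, plus the zero-reward edge case in the last one, can be sketched in isolation (class and method names here are illustrative stand-ins, not ParlAI's actual API):

```python
class BinaryRewardTraitSketch:
    """Stand-in for the existing trait: the sign of the reward decides
    whether an example feeds the MLE term or the unlikelihood term."""
    def compute_loss(self, losses, rewards):
        mle = sum(l for l, r in zip(losses, rewards) if r >= 0)
        ul = sum(l for l, r in zip(losses, rewards) if r < 0)
        return mle, ul

class WeightedRewardTraitSketch(BinaryRewardTraitSketch):
    """Only compute_loss is overridden: each term is scaled by |reward|,
    so a teacher that only ever sends reward 0 yields mle_loss == 0."""
    def compute_loss(self, losses, rewards):
        mle = sum(l * r for l, r in zip(losses, rewards) if r > 0)
        ul = sum(l * -r for l, r in zip(losses, rewards) if r < 0)
        return mle, ul
```

Under the binary trait a reward of 0 still counts toward the MLE term (because of the `>=`), while under the weighted version it contributes nothing, which is the edge case asked about above.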


jakeane commented Mar 24, 2021

I suppose I should also ping @hadasah on this, as it seems that she also contributed to this unlikelihood trait.

I also have one more question about this line:

`loss = mle_loss + self.opt['alpha'] * ul_loss`

Why is ul_loss multiplied by alpha but mle_loss is not?

This is more of a curiosity question, as I am still learning about this stuff: why does the calculation of mle_loss use nll_loss, while the calculation of ul_loss just uses log?

stephenroller (Contributor) commented

If you look at the definition of NLL loss, it's similar.
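To see the similarity concretely, here is a toy scalar version (pure Python; the real code operates on log-probability tensors):

```python
import math

# toy distribution over three tokens; the model assigns p = 0.7 to the target
probs = [0.1, 0.7, 0.2]
target = 1

# nll_loss boils down to the negative log-probability of the target token...
nll = -math.log(probs[target])

# ...while the unlikelihood term applies log to (1 - p) of a negative
# candidate, pushing its probability down instead of up
ul = -math.log(1 - probs[target])
```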

As for alpha, it doesn't really matter. We could have made the combination convex (one term weighted by 1 - alpha and the other by alpha); we just chose to implement it with a single scalar and tune it the same way.
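The equivalence can be checked in a few lines (toy numbers; `alpha` here is just an example value): the convex form is the single-scalar form rescaled, with the coefficient retuned to alpha / (1 - alpha).

```python
alpha = 0.25
mle_loss, ul_loss = 1.0, 2.0

single = mle_loss + alpha * ul_loss                  # single-scalar form
convex = (1 - alpha) * mle_loss + alpha * ul_loss    # convex combination

# the convex form equals (1 - alpha) times the single-scalar form with a
# retuned coefficient alpha / (1 - alpha), so tuning alpha makes the two
# parameterizations equivalent up to an overall scale
rescaled = (1 - alpha) * (mle_loss + alpha / (1 - alpha) * ul_loss)
```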

github-actions (bot) commented

This issue has not had activity in 30 days. Please feel free to reopen if you have more issues. You may apply the "never-stale" tag to prevent this from happening.
