[RAG] Fix ReGReT #3934

klshuster · 2021-08-12T19:16:14Z

Patch description
Turns out the test I wrote for ReGReT was broken, so I didn't catch the bugs here. ReGReT should work now.

CC #3927

Testing steps
Fixed the test and confirmed everything works.

$ pytest -k TestRegret
====== test session starts ======
platform linux -- Python 3.7.9, pytest-6.2.1, py-1.10.0, pluggy-1.0.0.dev0
rootdir: /private/home/kshuster/ParlAI, configfile: pytest.ini
plugins: hydra-core-1.0.7, requests-mock-1.8.0, regressions-2.1.1, datadir-1.3.1
collected 666 items / 664 deselected / 2 selected

nightly/gpu/test_rag.py ..                                                                                                                                                     [100%]

====== slowest 10 durations ======
56.77s call     tests/nightly/gpu/test_rag.py::TestRegret::test_rag_regret_sep
11.50s call     tests/nightly/gpu/test_rag.py::TestRegret::test_rag_regret_same

(4 durations < 0.005s hidden.  Use -vv to show these durations.)
===== 2 passed, 664 deselected, 2 warnings in 76.86s (0:01:16) =====

mojtaba-komeili

LGTM.

spencerp

Nice! How broken was this? Are there any examples in docs that we need to update with this new flag for them to work?

also, plz teach me how to fix regret more generally thx

klshuster · 2021-08-16T19:24:10Z

> How broken was this?

So broken it literally didn't work.

> Are there any examples in docs that we need to update with this new flag for them to work?

I haven't included any examples in the docs for this, as it was not a highlighted model within the project; however, if more people are interested in the future this is something I can revisit.

> plz teach me how to fix regret more generally thx

Some code pointers, if you are indeed interested:

1. ReGReT has only three (well, now four) flags. --regret turns ReGReT on; --regret-model-file optionally points to a separate retrieve+generate model; and --regret-intermediate-maxlen sets the maximum length of the intermediate generation, which is used in the second round of retrieval. The fourth is the new flag added in this PR, covered in the next point.
2. We build the ReGReT model in build_regret_model. If no model file is specified, this is essentially a no-op, since the regret model is just the normal, full model. Otherwise, we load the specified model file into a new RagModel. A couple of optimizations make this faster and more RAM-efficient: first, if the separate regret model has the same options for its index and passages, the two models share them so we don't load them twice; second, with the new flag I'm adding, you can manually specify that the index and passages be shared.
3. During training and eval, ReGReT relies on two functions: _regret_generate and _regret_rebatchify. The former simply swaps around agent attributes and runs a full generation phase to get the outputs from the first round of ReGReT. The latter then creates a new batch in which the query vector is no longer the human response but the generation from the first round. Note that the final generation still depends on the input text; only the retrieval component receives the augmented input.
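
To make the pointers above concrete, here is a minimal, self-contained sketch of the two-round flow. It is a toy, not the ParlAI code: ToyBatch, ToyRagModel, regret_forward, and the word-overlap retriever are all hypothetical stand-ins, and the functions only borrow the names build_regret_model, _regret_generate, and _regret_rebatchify to show roughly what each step is responsible for.

```python
from dataclasses import dataclass, replace
from typing import List, Optional


@dataclass
class ToyBatch:
    # Hypothetical stand-in for a batch; not a ParlAI class.
    input_text: str  # dialogue context; the toy generator always echoes it
    query_text: str  # text handed to the retriever


@dataclass
class ToyRagModel:
    # Hypothetical stand-in for a retrieve+generate model.
    passages: List[str]

    def retrieve(self, query: str) -> str:
        # Toy retriever: return the passage sharing the most words with the query.
        q = set(query.lower().split())
        return max(self.passages, key=lambda p: len(q & set(p.lower().split())))

    def generate(self, batch: ToyBatch, maxlen: int) -> str:
        # Toy generator: input text plus at most `maxlen` words of the retrieved passage.
        doc = self.retrieve(batch.query_text)
        snippet = " ".join(doc.split()[:maxlen])
        return f"{batch.input_text} => {snippet}"


def build_regret_model(
    full: ToyRagModel, regret_passages: Optional[List[str]] = None
) -> ToyRagModel:
    if regret_passages is None:
        return full  # no separate model given: the regret model is the full model
    if regret_passages == full.passages:
        regret_passages = full.passages  # identical contents: share one copy
    return ToyRagModel(passages=regret_passages)


def regret_generate(model: ToyRagModel, batch: ToyBatch, intermediate_maxlen: int) -> str:
    # Round one: a full retrieve+generate pass with a capped length.
    return model.generate(batch, maxlen=intermediate_maxlen)


def regret_rebatchify(batch: ToyBatch, first_round: str) -> ToyBatch:
    # Only the retrieval query changes; the input text is left untouched.
    return replace(batch, query_text=first_round)


def regret_forward(
    full: ToyRagModel,
    regret_model: ToyRagModel,
    batch: ToyBatch,
    intermediate_maxlen: int = 4,
    maxlen: int = 16,
) -> str:
    first_round = regret_generate(regret_model, batch, intermediate_maxlen)
    new_batch = regret_rebatchify(batch, first_round)
    return full.generate(new_batch, maxlen=maxlen)


passages = [
    "the eiffel tower is in paris",
    "gustave eiffel designed the tower",
    "berlin is in germany",
]
full_model = ToyRagModel(passages)
batch = ToyBatch("who designed the tower", "who designed the tower")
print(regret_forward(full_model, build_regret_model(full_model), batch))
```

In this sketch the second retrieval is queried with the first-round generation, while the generator still sees the original input text, which mirrors the split described in point 3. The sharing check in the toy build_regret_model compares passage lists directly; the real code compares options, so treat that part as an analogy only.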

fix regret

240c317

klshuster requested review from spencerp and mojtaba-komeili August 12, 2021 19:16

facebook-github-bot added the CLA Signed label Aug 12, 2021

klshuster mentioned this pull request Aug 12, 2021

AttributeError: 'function' object has no attribute 'retriever' - Training ReGReT model #3927

Closed

mojtaba-komeili approved these changes Aug 13, 2021

spencerp approved these changes Aug 13, 2021

klshuster added 6 commits August 16, 2021 15:24

fix docstring

f070097

Merge branch 'master' into fix_regret

7e95f47

change test to work

f9c2323

force fp16

d3c9634

logging

6841177

set dict file correctly

19511c6

klshuster merged commit 71f1e93 into master Aug 19, 2021

klshuster deleted the fix_regret branch August 19, 2021 20:25
