Support multiple MASK tokens in LM inspector #63

jowagner · 2021-03-31T13:02:56Z

inspect_lm_huggingface.py now has an option to repeat [MASK] tokens but this doesn't work due to huggingface/transformers#3609

We could implement our own solution using AutoModelWithLMHead, following suggestions in my comment in the above transformer issue, or implement a solution inside the transformer library and make a PR.

Also look at FitBERT, SpanBERT and other tools that may already have implemented this.

Meng et al. 2022 Rewire-then-Probe: A Contrastive Recipe for Probing Biomedical Knowledge of Pre-trained Language Models propose a workaround for obtaining multi-token answers from BERT.

Edit:

There is a PR the change from single mask to multi mask support for pytorch huggingface/transformers#10222
More concrete idea
How do others address the problem, e.g. https://github.com/marcotcr/checklist ?

The text was updated successfully, but these errors were encountered:

jowagner added the enhancement New feature or request label Mar 31, 2021

jowagner mentioned this issue May 29, 2021

Investigate effect of ## glue on prefixes #80

Open

jowagner added the project Suitable for a student or intern project label May 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support multiple MASK tokens in LM inspector #63

Support multiple MASK tokens in LM inspector #63

jowagner commented Mar 31, 2021 •

edited

Loading

Support multiple MASK tokens in LM inspector #63

Support multiple MASK tokens in LM inspector #63

Comments

jowagner commented Mar 31, 2021 • edited Loading

jowagner commented Mar 31, 2021 •

edited

Loading