-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tag_test and escaped xfst tags #71
Comments
The tag test is too fragile in two different ways:
Fixing any of these would solve your issue, and fixing both would make the tag test much more robust. @flammie could you have a look? |
Yeah The problem that undeclared and typoed multichars compile into one arc per byte kind of paths cannot really be figured out from binary fst. It's a design failure in lexc that can ultimately only be fixed by rethinking the alphabet handling over all tools. The best that can be done finding misspelt and undeclared tags from lexc entries is by guessing that +anything is a tag by convention. |
Yes, and that is essentially what we already do. We can still improve the non-guessing part of the tag test, and that is what I suggest we do. |
tag_test is failing for me with the message
I found the offending tag
+Use%/GC
inshared-smi
where it appears in embedded
xfst
inlexc
, meaning the actual tag is+Use/GC
. Couldtag_test
be adapted to handle this?The text was updated successfully, but these errors were encountered: