Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

English Interrogative-WHInfo rules #5

Open
nschneid opened this issue Mar 20, 2024 · 7 comments
Open

English Interrogative-WHInfo rules #5

nschneid opened this issue Mar 20, 2024 · 7 comments

Comments

@nschneid
Copy link
Collaborator

These are 46 WH items found by Amir's old rules, but not the current rules: https://universal.grew.fr/?custom=65fb5e4b3fe93

Examples:

  • What?
  • What about brain damage?
  • What's their personality like?
  • What is it they say in yoga class?
  • What are your thoughts about Rick Santorum’s views on gay rights?
@nschneid
Copy link
Collaborator Author

@WesScivetti

@nschneid
Copy link
Collaborator Author

I made some adjustments to the rules for WH-interrogatives, in particular adding rules to match WH words as predicates. Let's see if that fixes the discrepancy.

@amir-zeldes
Copy link
Collaborator

The data in UD was scripted using depedit, rather than grew, so there may be subtle differences somehow. I'm still not sure how to bake the grew rules into the GUM build bot easily, I wish there was a python library implementing the grew transformations...

@s-herrera
Copy link
Collaborator

There is a Python binding, that requires to install OCamL. If you want to use it, I recommend using it as it is done in the script of this project. The .grs files we use are not really valid for grew. I have tried to make this difference by adding the extension .ucxn.grs.

@amir-zeldes
Copy link
Collaborator

Thanks for pointing that out - I will have to think about how this could work, since it would add an entire additional programming language as a dependency for building GUM...

@nschneid
Copy link
Collaborator Author

If we're keeping this repo alive as a hub, maybe GUM can ship its pre-Cxn-annotated data to this repo, and the script calling Grew can automatically add the UCxn layer? I.e. treat UCxn as a service called by the GUM build bot (or other treebanks).

@amir-zeldes
Copy link
Collaborator

If it was implemented as a REST API or similar that would be an option (we need the build to be runnable on other people's machines just by cloning the git repo, but demanding an Internet connection for the build should be fine)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants