-
Notifications
You must be signed in to change notification settings - Fork 250
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Noun+Number constructions #654
Comments
I have a similar question for Scottish Gaelic. In the sports commentary subcorpus the commentator says things like "Alba neoni Yugoslavia neoni" 'Scotland nil Yugoslavia nil'. This isn't nummod, though, is it? Tentatively going for |
@colinbatchelor in this case neoni is the predicate/focus/rhema. I will put it as the root. And Alba as |
@sylvainkahane That makes sense. I think I prefer |
I think the current practice is to use |
A similar older issue: #466 Perhaps we really need to put something about this in the guidelines. |
What about subtyping, e.g. |
I'm not sure the number is actually the modifier here, despite the pluralization of days... My expectation is that if only one of two things can be omitted then the one we can't omit is the head: Example question "Which one do you want?". Possible answers:
Couldn't 'number' be a little like a title or modifier, expanding what kind of entity '2' refers to? Intuitively I like |
One way to interpret this is that "number" turns a cardinal number into an ordinal number. Are there languages where ordinality is regularly expressed compositionally, with two syntactic words, one being a cardinal number? |
Or rather, turning a cardinal into an ordinal is one function of this construction. Another is to express a conventional label for something: "room number 45B". |
I think that in such constructions, numbers behave like nouns (if not as nouns). The analysis might be the same as in N N constructions, such as President Trump (cf. #503). Which also means that the solutions depend on the language: if in English the head seems to be the number (number 3 can commute with 3 but not with number), in French, a determiner is obligatory and I would choose the noun as the head (le nombre 3 commutes with le nombre and almost all noun modifiers are on the right of the noun). |
@nschneid This is an interesting idea, but I have a problem with it
I presume this is analogous to your question above:
I see a contrast in (a) new station number 3 and old station number 3, In English the matter seems to be semantically scaled.
In a two- or three-dimensional setting, however, there is no intuitive starting point.
Building or streetcar identities are in no logical/intuitive relationship to ordinals:
|
If it is a new station officially designated with the number 3, I would say "I work at the new station number 3". Without the determiner, I read it as equivalent to "the 3rd new station". The bracketing is presumably different: [[new station] [number 3]] being the 3rd of the new stations, vs. new [station number 3] for the officially designated Station 3. |
I don't have a strong opinion on this, and I agree with a lot of the issues pointed out in the comments here. However, since the current convention is to use |
Is |
@parapluirevel Yes, I agree that depending on context, "number X" can act more like an ordinal or more like a way to identify something by a proper name (that happens to involve a numeric designator). In the latter case "number" may be optional. But I'm not sure whether this ambiguity should or should not be taken into account in terms of the dependency relation. I think it's important that the designator in the name construction be a label of some sort and not something with its own independent referent: "Room GHC 5713" is fine, but I would not say "*Room Gates Hillman" or but rather "the Gates-Hillman Room" or just "Gates-Hillman" for short. The article is mandatory with "Room" as the head: "*I went to Gates-Hillman Room". Same with "Building", but "Hall" (as on a university campus) prefers not to have an article: "I went to Gates-Hillman Hall". In any case, UD is far from providing a complete account of the internal structure of and constraints on proper names and other constructions involving dates and values. |
In the EWT corpus I found cases where a year is nummod of a month in a date.
Does it make sense? According to https://universaldependencies.org/u/dep/nummod.html:
the year is not modifying the meaning of the month with a quantity, is it? |
I think I've posted about this in the past (not sure about the issue number(s)), in my opinion dates in English should have the day as the head, and the month and year as modifiers of the day. In UD_English-GUM we do:
With the idea that "July 17" is a subtype of days of the kind "17th day of the month", and this is "the July one" out of those (so treat it like a |
@arademaker See #455 and linked issues. |
Hi, in |
I think generally we treat words 'as they are pronounced', so we would treat "first" and "1st" the same. Therefore if a language has an explicit ordinal marker (e.g. in German we can say "der 1. Versuch" - 'the first try', and the period means it is pronounced like 'first'), then I would also tag it ADJ and attach as amod. |
In #455 nothing was said about the |
The decision for number one and similar is that they are treated as |
Actually I believe it should be |
"UD analyzes the number as a noun"—you're saying this is a statement about the deprel and not the UPOS? |
Hmm, now that I see the wording again, it certainly doesn't sound so—although that was how I understood the consensus when we were writing the paper! Too bad that we already had such a huge number of annotated examples that we didn't bother to add one here, too. I was convinced we were talking about the deprels; more specifically, that |
I think this is exactly the construction we discussed in the mischievous nominals paper, no? As of now I think it's nummod in EWT and dep in GUM, which just means 'awaiting decision'. The way English uses nmod it would have to be nmod:npmod, but Nathan and I were intending to put this under the proposed nmod:desc subtype. I'm also fine with dep TBH, which as I you know I take to mean "we know what's the head but none of our other labels fit at the moment" |
In constructions like the following, should the number or the preceding noun be the head, and if the latter, should
nummod
apply?nummod(number, one)
?nummod(number, one)
?appos(station, number)
?nummod(day, 2)
?(encountered in investigating UniversalDependencies/UD_English-EWT#79)
The text was updated successfully, but these errors were encountered: