NER corpus: add special treatment for first sentence of article #4

Open
ogrisel opened this issue Apr 11, 2011 · 0 comments

ogrisel (Owner) commented Apr 11, 2011

In the vast majority of Wikipedia articles, the noun phrase at the beginning of the article is the name of the entity described by the article itself, even though that phrase carries no link pointing back to the article. As a result, the NER corpus scripts miss many potentially informative entity mentions, which might hurt the performance of the trained OpenNLP models.
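
One possible treatment, sketched below in Python as an illustration only and not as the project's actual corpus scripts: if the first sentence starts with the article title (minus any trailing disambiguation suffix such as "(planet)"), wrap that span in an OpenNLP name finder annotation. The function names `normalize_title` and `annotate_first_sentence` are hypothetical, the sentence is assumed to be already tokenized with whitespace, and the entity type is assumed to be supplied by the caller.

```python
import re


def normalize_title(title: str) -> str:
    """Drop a trailing disambiguation suffix such as ' (planet)'."""
    return re.sub(r"\s*\([^)]*\)\s*$", "", title).strip()


def annotate_first_sentence(sentence: str, title: str, entity_type: str) -> str:
    """Mark the article title at the start of the first sentence.

    Hypothetical helper: returns the sentence with the title wrapped in
    '<START:type> ... <END>' markers, or the sentence unchanged when the
    title does not occur at the very beginning.
    """
    name = normalize_title(title)
    if not name:
        return sentence
    # Allow an optional leading article ("The", "An", "A") before the name,
    # and require that the name is not followed by another word character.
    pattern = re.compile(
        r"^\s*(?:(?:The|An|A)\s+)?(" + re.escape(name) + r")(?!\w)"
    )
    match = pattern.match(sentence)
    if match is None:
        return sentence
    start, end = match.span(1)
    return (
        f"{sentence[:start]}<START:{entity_type}> "
        f"{sentence[start:end]} <END>{sentence[end:]}"
    )


if __name__ == "__main__":
    print(annotate_first_sentence(
        "Albert Einstein was a German-born theoretical physicist .",
        "Albert Einstein",
        "person",
    ))
```

This exact-match heuristic is deliberately conservative: it ignores first sentences where the name appears in a variant form (full legal name, transliteration, bold alias), so it would recover only part of the missed mentions.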
