-
Notifications
You must be signed in to change notification settings - Fork 25k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support ignore_keywords flag for word delimiter graph token filter #59563
Conversation
…lter Support ignore_keywords flag for word delimiter graph token filter Lucene's WordDelimiterGraphFilter allows to skip processing of tokens tagged as keyword. However the Elasticsearch word delimiter graph token filter does not support this yet. I would like to update the Elasticsearch implementation to incorporate the ignore_keywords flag to enable better customization of token filters Fix for elastic#59491
Pinging @elastic/es-search (:Search/Analysis) |
@elasticmachine ok to test |
thanks for running this through the tests, the failures look unrelated
|
Yes, we've had an internal version bump so all the BWC tests are expecting different versions. Can you merge in the master branch and push again? This looks good, I think we also need to update the relevant docs (see word-delimiter-graph-filter.asciidoc for where to make changes). |
I have added docs (reused wording from lucene docs) and also merged in the latest upstream/master |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
docs/reference/analysis/tokenfilters/word-delimiter-graph-tokenfilter.asciidoc
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Made a small change to the docs.
Thanks @malpani ! |
thanks @romseygeek for reviewing and merging this in! |
This introduces a new property for the word delimiter graph token filter to configure ignoring of keywords. It relates to this change elastic/elasticsearch#59563
This introduces a new property for the word delimiter graph token filter to configure ignoring of keywords. It relates to this change elastic/elasticsearch#59563
This introduces a new property for the word delimiter graph token filter to configure ignoring of keywords. It relates to this change elastic/elasticsearch#59563
This introduces a new property for the word delimiter graph token filter to configure ignoring of keywords. It relates to this change elastic/elasticsearch#59563
This introduces a new property for the word delimiter graph token filter to configure ignoring of keywords. It relates to this change elastic/elasticsearch#59563 Co-authored-by: Steve Gordon <[email protected]>
This introduces a new property for the word delimiter graph token filter to configure ignoring of keywords. It relates to this change elastic/elasticsearch#59563 Co-authored-by: Steve Gordon <[email protected]>
This commit allows customizing the word delimiter token filters to skip processing tokens tagged as keyword through the
ignore_keywords
flag Lucene's WordDelimiterGraphFilter already exposes.Fix for #59491