You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
any chance for RE2 support? (go and DuckDB for instance).
I'm using pomsky to generate regexes (w/o the features RE2 lacks) so testing with PCRE is equivalent but having this checked off during tests would be nice.
RE2 is pretty similar to Rust (in that it doesn't support advanced features like lookaround assertions or backreferences, which can have exponential runtime performance).
The counting forms x{n,m}, x{n,}, and x{n} reject forms that create a minimum or maximum repetition count above 1000. Unlimited repetitions are not subject to this restriction.
\w,\d, \s and \b are not Unicode aware (like in JavaScript)
Supported Unicode properties:
General categories (but not LC)
Scripts
So there will be some restrictions:
To use [word] or %, you need to disable unicode;
< and > are unsupported
No binary Unicode properties, such as Alphabetic or Emoji
No Unicode script extensions
No lookaround, recursion, backreferences, or atomic groups
No repetition with a bound bigger than 1000
No Grapheme
There might be more, which I will find out once I test RE2 more thoroughly.
Would be great if you could include explicit RE2 support for generating regexes.
I'm already using pomsky with RE2, using the subset/avoiding the limitations you indentified, so no issue there.
Originally posted by @fundef1 in #112 (comment)
The text was updated successfully, but these errors were encountered: