[gelbooru] regex for extractor not allowing alternate parameter order in url. #2821
The regex currently implemented doesn't support URLs where the `s`, `page`, and `id` parameters appear in a different order. Placing `page=post` and `s=view` into look-aheads from the `?` forward allows the parameters to appear in any order while still checking that both are present.
URLs tested
Match 2 was obtained by searching for 'cat' and then clicking on an image. It seems Gelbooru embeds the search parameters in the post URL.
Old regex
Results:
1: Match:
https://gelbooru.com/index.php?page=post&s=view&id=7586232
, Group post:7586232
2: Match:
https://gelbooru.com/index.php?page=post&s=view&id=7586232
, Group post:7586232
3: No match
New regex
Results:
1: Match:
https://gelbooru.com/index.php?page=post&s=view&id=7586232
, Group post:7586232
2: Match:
https://gelbooru.com/index.php?page=post&s=view&id=7582526&tags=cat
, Group post:7586232
3: Match:
https://gelbooru.com/index.php?id=7586232&page=post&s=view
, Group post:7586232
I was unable to determine whether the old regex was written with the intention of sanitizing additional arguments (such as removing `&tags=cat`) or whether that was just a byproduct, or a 'bonus', of the regex.
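For illustration, the look-ahead idea described at the top can be sketched roughly as follows. This is a minimal, hypothetical pattern written for this example, not the regex from this pull request or from the extractor itself; the names `PATTERN` and `post_id` are assumptions. In this sketch the match stops right after the `id` value, so trailing parameters such as `&tags=cat` are not part of the match, and the `post` group is the same regardless of parameter order.

```python
import re

# Hypothetical pattern for illustration only (not the extractor's actual regex).
# Each look-ahead starts right after the "?" and scans the query string, so
# "page=post" and "s=view" are required but may appear in any position.
PATTERN = re.compile(
    r"(?:https?://)?(?:www\.)?gelbooru\.com/(?:index\.php)?\?"
    r"(?=(?:[^#]*&)?page=post(?:&|#|$))"  # require page=post somewhere in the query
    r"(?=(?:[^#]*&)?s=view(?:&|#|$))"     # require s=view somewhere in the query
    r"(?:[^#]*&)?id=(?P<post>\d+)"        # capture the numeric post id
)


def post_id(url):
    """Return the post id as a string, or None if the URL is not a post view."""
    match = PATTERN.match(url)
    return match.group("post") if match else None


if __name__ == "__main__":
    for url in (
        "https://gelbooru.com/index.php?page=post&s=view&id=7586232",
        "https://gelbooru.com/index.php?page=post&s=view&id=7582526&tags=cat",
        "https://gelbooru.com/index.php?id=7586232&page=post&s=view",
    ):
        print(url, "->", post_id(url))
```

Because the look-aheads consume no input, they only assert that the required parameters exist; the final part of the pattern is what actually locates and captures `id`.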