Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

improve URLTextSearcher to take an array of re_patterns to run against the same url #6

Open
jgstew opened this issue Aug 18, 2021 · 1 comment
Labels
processor AutoPkg Processor

Comments

@jgstew
Copy link
Owner

jgstew commented Aug 18, 2021

The idea is to get the contents of a URL once, but then run multiple RegEx parses against the content to extract different things. Currently URLTextSearcher has to be used multiple times to achieve this and will download the web page multiple times to do so.

Example: https://github.com/jgstew/jgstew-recipes/blob/main/Microsoft/SQLServerManagementStudio-Win.download.recipe.yaml

@jgstew jgstew added the processor AutoPkg Processor label Sep 21, 2021
@jgstew
Copy link
Owner Author

jgstew commented Aug 31, 2023

One workaround for this is to just download the webpage itself to the filesystem, then use URLTextSearcher multiple times on the cached copy.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
processor AutoPkg Processor
Projects
None yet
Development

No branches or pull requests

1 participant