Replies: 2 comments 5 replies
-
Either |
Beta Was this translation helpful? Give feedback.
1 reply
-
One way to detect |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Somewhat of a vague and broad question, but still would be interested in knowing even where to begin looking for this sort of thing (#34 inspired this question).
Suppose we're using
got-scraping
not even for scraping, but to reliably connect to arbitrary websites (my specific use case is getting the canonical URL).From my testing, got-scraping does pretty well against real-world websites (even with well-known "cloud IP"). However, I have to assume that at certain point, it will get blocked by some anti-bot software (e.g. cloudflare), and I'd like to know whether I got blocked vs. whether the resource simply failed to load.
How would one go about detecting that you've been blocked using got-scraping (or just got in general)?
Thanks and a happy new year
Beta Was this translation helpful? Give feedback.
All reactions