-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add more Proxy Providers #36
Comments
I may add a couple of these later if you provide some info on how to do it, what needs to be implemented etc. |
Hello @la55u Every proxy parser currently extends the base UrlParser class which represents any URL containing Proxy information. If you want to support a new provider, for instance coolProxy you would create a new class extending UrlParser. Then by inspecting the html fields needs and using BeautifulSoup you could retrive the proexy information. You might also need to decode hidden information: for example the IP of the specific provider is encoded. You will need to do something like:
PS: Some of the existing provides have updated their websites, adding extra javascript or encodings to hide proxy information (thats why some existing providers currently fail). However this does not mean there is not a way around it :) Let me know if this makes sense - I would be happy to help! |
I don't really get this encoding. For example the IPs that are listed on the coolProxy website are not the actual proxy IPs that we need? |
Hey @la55u When you view the source code of the provider (Ctrl+U with Google Chrome) you will realise that every proxy is a row in an html table. In that table most of the information can be traversed directly but the IPs for example are 'text/javascript' elements - meaning that you need to do a bit more to decode them :) In the provider above for example, the first tag(td) in the first table row I found looks like: <script type="text/javascript">document.write(Base64.decode(str_rot13("BGZhBGxhAv4kAGt=")))</script> Now if we do encode in rot13 the stirng above: codecs.getencoder( "rot-13" )("BGZhBGxhAv4kAGt=") we will get something like: "OTMuOTkuNi4xNTg=" Does it make sense? |
Possible Proxy Lists
Providers that require an API key:
Could also parse related forum thread - blackhatworld:
The text was updated successfully, but these errors were encountered: