Comments (3)
I tried adding some logic to detect an HTTP 503 error and apply backoff throttling, but the underlying request is made in the googlesearch module (https://github.com/MarioVilas/googlesearch/blob/master/googlesearch/__init__.py#L124), not in pagodo itself. You may be able to spread the load across a couple of VPS servers, or I could add an option to round-robin through a list of HTTP proxies, but that runs into the same limitation.
from pagodo.
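Since the request itself happens inside the googlesearch module, one way to get backoff without patching that module is to wrap the call from the outside. The following is only a minimal sketch under assumptions: `search_with_backoff`, its parameters, and the delay values are hypothetical names and numbers, not part of pagodo or googlesearch, and it assumes the failure surfaces as `urllib.error.HTTPError`. Because the library's search function is a generator, the wrapped callable should consume the results fully (for example by building a list) so that the error is raised inside the `try` block.

```python
import random
import time
import urllib.error


def search_with_backoff(search, query, max_retries=5, base_delay=60.0,
                        max_delay=3600.0, sleep=time.sleep):
    """Call search(query), retrying with exponential backoff on HTTP 429/503.

    `search` is any callable that performs the query and returns the
    finished results (hypothetical wrapper; not an API of pagodo).
    """
    for attempt in range(max_retries + 1):
        try:
            return search(query)
        except urllib.error.HTTPError as err:
            # Only throttling-style errors are retried; anything else,
            # or running out of retries, is re-raised to the caller.
            if err.code not in (429, 503) or attempt == max_retries:
                raise
            # Double the wait on each attempt, capped at max_delay,
            # plus up to 10% random jitter.
            delay = min(max_delay, base_delay * 2 ** attempt)
            delay += random.uniform(0, delay * 0.1)
            sleep(delay)
```

A caller could pass something like `lambda q: list(googlesearch.search(q))` as `search`; the exact arguments of that call depend on the installed version and are not shown here.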
I tried to bypass Google's bot detection by using country-specific domains such as google.cz, google.pl, and google.com.br, along with randomized proxies, random user agents, and random sleep times between requests, but I still got banned. Some dorks seem to trigger Google's detection, such as inurl:".php" or "site:xxx.com", whereas simple queries like "?id=foo" (without a file type) do not appear to be treated as bot traffic. I did not test this much, so I can't confirm it. I heard that the v3n0m project has a captcha solver, but I guess it would be hard to implement in pagodo :d
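The pacing ideas discussed in this thread (random sleep times between requests and round-robin use of a proxy list) can be sketched without touching the network. This is an illustrative sketch only: `plan_requests`, its parameters, and the delay range are hypothetical and not taken from pagodo, and as the comment above reports, this kind of pacing did not prevent a ban in practice.

```python
import itertools
import random


def plan_requests(dorks, proxies, min_delay=30.0, max_delay=60.0, rng=random):
    """Yield (dork, proxy, delay) tuples.

    Proxies are handed out in round-robin order; delay is drawn uniformly
    from [min_delay, max_delay] seconds. If no proxies are given, the
    proxy slot is None (meaning a direct connection).
    """
    proxy_cycle = itertools.cycle(proxies) if proxies else itertools.repeat(None)
    for dork in dorks:
        yield dork, next(proxy_cycle), rng.uniform(min_delay, max_delay)
```

The caller would sleep for `delay` seconds before issuing each query through the given proxy; how the proxy is actually applied depends on the search library in use.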
Just pushed some updates to master. I've had success running this lately with the new default values, though it may take around 4 days to complete. I'm going to close this for now unless you have some new data I can work with.