A search-engine-cli-scraper for more than 15 search engines, including Google. duckduckgo, Bing, Ask, etc...
NOTE: For educationnal purpose, am not responsible of the bad use of this tool !
- Python (3.x)
- Docker-CE (Not required for all search-engine, just few of them)
- You need to install all requirements :
pip3 install -r requirements.txt
- Install geckodriver :
# For linux users
# cd /home/your-user-name
wget https://github.com/mozilla/geckodriver/releases/download/v0.24.0/geckodriver-v0.24.0-linux64.tar.gz
# Unzip the file
tar -xvzf geckodriver*
chmod +x geckodriver
# Add it to PATH
export PATH=$PATH:/path-to-extracted-file/.
# For other OS's users, please check releases on https://github.com/mozilla/geckodriver/releases
- Pull and run the splash-scrap module from docker-hub (Some of search engine need this):
docker run -p 8050:8050 scrapinghub/splash
- Replace
example.config.txt
byconfig.txt
and provide the running IP for the splash-scrap
How to use Falla:
usage: main.py [-h] [-e ENGINE] [-q QUERY]
optional arguments:
-h, --help show this help message and exit
-e ENGINE, --engine ENGINE
The search engine
-q QUERY, --query QUERY
The query text
- To list all search-engine:
$ python3 -m app.main
# output
[+] Falla [the search-engine-scraper]
[+] Listing search-Engines
[+] > google
[+] > bing
[+] > aol
[+] > dogpile
[+] > falla
[+] > ask
[+] > qwant
[+] > duckduckgo
[+] > mojeek
[+] > gibiru
[+] > yandex
[+] > yahoo
[+] > searchencrypt
[+] > iem
[+] > kallasearch
[+] > wosx
- To search something:
$ python3 -m app.main -e google -q "sanix darker"
# output
- Sanix-darker