GithubHelp home page GithubHelp logo

johnalvarez713 / scrapein Goto Github PK

View Code? Open in Web Editor NEW

This project forked from landoncrabtree/scrapein

0.0 0.0 0.0 12 KB

A python script useful for reconnaissance and OSINT that can be used to harvest employee email addresses from a specific company through LinkedIn.

Python 100.00%

scrapein's Introduction

ScrapeIn

OSINT gathering tool that allows you to compile a list of employee email addresses from a specific company via LinkedIn. Useful during the reconnaissance phase of a penetration test and can be used further to try and get initial access (ie: targeted phishing campaigns)

This tool is for EDUCATIONAL purposes only.

This tool is based off @Morganc3's https://github.com/morganc3/LinkedInHarvester. I cleaned up the code and added some vital features that were missing.

Features

  • Collect employee email addresses en-masse from a specific company
  • Custom address formating, such as {first}.{last} or {l}{first}
  • Output email addresses to a text file
  • Narrow down employee search results by keywords, such as "human resources" or "sales"

Getting Started

  1. git clone https://github.com/landoncrabtree/ScrapeIn.git
  2. cd ScrapeIn
  3. pip install -r requirements.txt
  4. python3 scrapein.py

Usage (basic)

Finding Company ID

search searchbar And then where it says currentCompany=["<ID>"]

Authorizing LinkedIn

In order to use LinkedIn search to its full capabilities, you must authenticate with a CSRF token and a session ID. These are stored as cookies. cookies You are looking for JSESSIONID and li_at. These values will need to be pasted in the config.yml.

Running the tool

$ scrapein.py <company_id> <domain>
$ python3 scrapein.py 12345 google.com

Usage (advanced)

  • -o|--output : Filename or path to output
  • -k|--keyword : Keyword to narrow results down by
  • -f|--format : Custom email address formatting string
$python3 scrapein.py <company_id> <domain> -o|--output -k|--keyword -f|--format
$ python3 scrapein.py 12345 google.com -o emails.txt -k 'human resources' -f '{l}-{first}'

Email Address Formatting

One of the features this tool is the ability to format emails to virtually any format with some simple placeholders. It is recommended to use Hunter.io to figure out the company's email address formatting scheme. Then, you can use the following placeholders in the -f argument:

{first} = first name
{last} = last name
{f} = first initial
{l} = last initial
{fx} = first x characters in first name (ie: {f3} )
{lx} = first x characters in last name (ie: {l3} )

google

-f {first}{last}

meta

-f {last}

microsoft

-f {last}{f}

scrapein's People

Contributors

landoncrabtree avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.