Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/0xZDH/BridgeKeeper
Scrape, Hunt, and Transform names and usernames
https://github.com/0xZDH/BridgeKeeper
linkedin-scraper name-generation osint python3 username username-generator usernames web-scraping
Last synced: 3 months ago
JSON representation
Scrape, Hunt, and Transform names and usernames
- Host: GitHub
- URL: https://github.com/0xZDH/BridgeKeeper
- Owner: 0xZDH
- License: mit
- Created: 2019-09-18T16:57:55.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2022-11-19T10:09:30.000Z (about 2 years ago)
- Last Synced: 2024-06-18T06:35:12.629Z (7 months ago)
- Topics: linkedin-scraper, name-generation, osint, python3, username, username-generator, usernames, web-scraping
- Language: Python
- Homepage:
- Size: 48.8 KB
- Stars: 101
- Watchers: 7
- Forks: 10
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project
- cybersources - BridgeKeeper
- awesome-starz - 0xZDH/BridgeKeeper - Scrape, Hunt, and Transform names and usernames (Python)
README
# BridgeKeeper
Scrape employee names from search engine LinkedIn profiles
Hunt for emails and username formats via Hunter.io
Transform names to given username format(s)## Usage
```
usage: bridgekeeper.py [flags]BridgeKeeper - v1.0.0
options:
-h, --help show this help message and exitTarget(s):
-c COMPANY, --company COMPANY
target company to search for LinkedIn profiles
(e.g. 'Example Ltd.')-n NAMES, --names NAMES
string (comma delimited) or file containing names
to be converted to usernames (format: 'First (M) Last')Username Formatting:
-f FORMAT, --format FORMAT
username format (format identifiers:
{first}, {middle}, {last}, {f}, {m}, {l}, [#])-a API, --api API hunter.io API key for email format identification
and email scraping-d DOMAIN, --domain DOMAIN
domain name of target company for hunter.io email
format identification and email scraping--lower force usernames to all lower case
--upper force usernames to all upper case
Search Engine Configuration:
--depth DEPTH number of pages deep to search each search engine
(Default: 5)--bing-cookies BING_COOKIES
string or cookie file for Bing search engine
(disabled)--duckduckgo-cookies DUCKDUCKGO_COOKIES
string or cookie file for DuckDuckGo search engine--google-cookies GOOGLE_COOKIES
string or cookie file for Google search engine--yahoo-cookies YAHOO_COOKIES
string or cookie file for Yahoo search engineHTTP Configuration:
--timeout TIMEOUT HTTP request timeout in seconds
(Default: 25 seconds)--proxy PROXY proxy to pass HTTP traffic through: `host:port`
Output Configuration:
-o OUTPUT, --output OUTPUT
directory to write output files to
(Default: output)Debug:
--version print the tool version and exit--debug enable debug output
```Gather employee names for a company, Example Ltd., and convert each name into an 'flast' username formatted email:
`bridgekeeper.py --company "Example, Ltd." --format {f}{last}@example.com --depth 10 --output example-employees`Gather employee names and email addresses from search engines and Hunter.io:
`bridgekeeper.py --company "Example, Ltd." --domain example.com --api {API_KEY} --depth 10 --output example-employees`Convert an already generated list of names to usernames:
`bridgekeeper.py --names names.txt --format {f}{last}@example.com --output example-employees`Username format examples (BridgeKeeper supports middle names as well as character limited usernames - e.g. only 4 characters of a last name is used):
```
Name: John Adams Smith
{f}{last} > jsmith
{f}{m}.{last} > ja.smith
{f}{last}[4]@example.com > [email protected]
```## Features
* Support scraping against four major search engines: Bing, DuckDuckGo, Google, and Yahoo
* **Note**: Bing search engine has been disabled for the time being due to inconsistent results
* Name parsing to strip LinkedIn titles, certs, prefixes, etc.
* Search engine blacklist evasion via cookie files
* Username formatting
* Name trimming
* e.g. If a username format has only the first 4 characters of the last name
* Hyphenated last name handling
* Duplicate username handling
* Incrementing numbers appended to duplicate usernames
* Support Hunter.io scraping:
* Identification of email format for a specified domain
* Retrieval of known emails for a specified domain### Acknowledgements
* **[m8r0wn](https://github.com/m8r0wn)**: [CrossLinked](https://github.com/m8r0wn/CrossLinked)
* **[initstring](https://github.com/initstring)**: [linkedin2username](https://github.com/initstring/linkedin2username)
* **[nullg0re](https://github.com/nullg0re)**: Code to gather username format and emails via Hunter.io