https://github.com/shadawck/recon-archy

Linkedin Tools (and maybe later other source) to reconstruct a company hierarchy from scraping relations and jobs title
https://github.com/shadawck/recon-archy

automation company-data crawler cybersecurity geckodriver golang linkedin organisational-analysis osint osinttool reconnaissance scraper selenium

Last synced: 3 months ago
JSON representation

Linkedin Tools (and maybe later other source) to reconstruct a company hierarchy from scraping relations and jobs title

Host: GitHub
URL: https://github.com/shadawck/recon-archy
Owner: shadawck
License: gpl-3.0
Created: 2020-06-25T14:38:51.000Z (about 5 years ago)
Default Branch: master
Last Pushed: 2020-08-04T11:26:42.000Z (almost 5 years ago)
Last Synced: 2024-06-19T06:54:25.130Z (about 1 year ago)
Topics: automation, company-data, crawler, cybersecurity, geckodriver, golang, linkedin, organisational-analysis, osint, osinttool, reconnaissance, scraper, selenium
Language: Go
Homepage:
Size: 36.8 MB
Stars: 16
Watchers: 2
Forks: 1
Open Issues: 9
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# recon-archy
Linkedin Tools (and maybe later other source) to reconstruct a company hierarchy from scraping relations and jobs title.

### Status

* [x] crawl : Crawl employees specific to a company
* [ ] analyse : Perform analysis on collected data.
* [ ] build : Build organisational chart of the company.

## Dependencies
First, make sure you have xdfb and openjdk-11-jre installed :
```
sudo apt-get install xvfb openjdk-11-jre
```

If you don't have ``openjdk-11-jre`` for your system with a package manager, just download it manually from [AdoptOpenJDK](https://adoptopenjdk.net/releases.html):
```
wget https://github.com/AdoptOpenJDK/openjdk11-binaries/releases/download/jdk-11.0.7%2B10/OpenJDK11U-jdk_x64_linux_hotspot_11.0.7_10.tar.gz
sudo tar -xvf OpenJDK11U-jdk_x64_linux_hotspot_11.0.7_10.tar.gz -C /usr/
# and add java to your path
echo "export PATH=$PATH:/usr/jdk:/usr/jdk/bin:/usr/jdk/lib/:/usr/jdk/jre:/usr/jdk/jre/bin/:/usr/jdk/jre/lib/" >> ~/.bashrc
source ~/.bashrc
```
Then if you don't have it, download firefox
```
sudo apt install firefox
```

## Requirements
### For an Installation **with** golang

ReconArchy need golang if you want to install it with go. If go is not install on your system refer to [golang documentation](https://golang.org/doc/install) to install it. Then go to [Installation with golang](#golang)

### For an installation **without** Golang

Golang need Geckodriver (the WebDriver for firefox) and a selenium server.
So download the last version of [Selenium Server (Grid)](https://www.selenium.dev/downloads/)
```sh
wget https://selenium-release.storage.googleapis.com/3.141/selenium-server-standalone-3.141.59.jar
mv selenium-server-standalone-3.141.59.jar selenium-server-standalone
```
And the last version of [Geckodriver](https://github.com/mozilla/geckodriver/releases) for your architecture.

## Installation
## Golang

To install ``recon-archy`` just run :
```
go get github.com/remiflavien1/recon-archy
```
Next we need to install the dependencies :
- The Selenium server
- And the Geckodriver.

Fortunatly the [tebeka/selenium](https://github.com/tebeka/selenium) (which is a internal dependencies of ``recon-archy``) provide everything for us :
```sh
cd $GOPATH/pkg/mod/github.com/tebeka/[email protected]/vendor/
go run init.go --alsologtostderr --download_browsers --download_latest
```
That's it, you're good to go to [usage](#usage)

### Binaries
Precompiled binaries will be available with the first stable version in the [release](https://github.com/remiflavien1/recon-archy/releases) section.

### From source
Assuming your environnement is well configured (GOPATH, GOROOT...):
```sh
git clone https://github.com/remiflavien1/recon-archy
cd recon-archy
go build
go install
```

## Usage

First you need to start as many server as worker you want.
*Notes: The number of workers is limited to 4 for the time being. So at max launch 4 standalone server if you want to work with 4 worker. **This part will be automated in the future***
```sh
# launch 4 standalone servers
./init-server 4
```

Next add your linkedin credential in `.creds` (*interactive mode will be added in the future*)

And then launch ReconArchy (With the previous example, here you will use 4 Workers/Threads)
```sh
recon-archy crawl -t -c
```
For example :
```sh
recon-archy crawl -t 4 -c redhat
```

You can use help menu on command and subcommand for more information. But for now there is not much to cover.
```sh
$ recon-archy --help
NAME:
ReconArchy - Crawl 1000 employees of a choosen company and build their organizational chart

USAGE:
recon-archy [global options] command [command options] [arguments...]

COMMANDS:
crawl crawl employees specific to a company
analyse Perform analysis on collected data.
build Build organisational chart of the company
help, h Shows a list of commands or help for one command

GLOBAL OPTIONS:
--help, -h show help (default: false)
```

```sh
$ recon-archy crawl --help
NAME:
recon-archy crawl - crawl employees specific to a company

USAGE:
recon-archy crawl [command options] [arguments...]

OPTIONS:
--threads value, -t value Adjust number of crawling worker (default: "1")
--company value, -c value Name of the target company
--help, -h show help (default: false)
```

Crawl result can be retrieve in `/data/`

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/shadawck/recon-archy

Awesome Lists containing this project

README