{"id":20448497,"url":"https://github.com/shadawck/recon-archy","last_synced_at":"2025-04-13T01:27:31.849Z","repository":{"id":57532529,"uuid":"274939983","full_name":"shadawck/recon-archy","owner":"shadawck","description":"Linkedin Tools (and maybe later other source) to reconstruct a company hierarchy from scraping relations and jobs title","archived":false,"fork":false,"pushed_at":"2020-08-04T11:26:42.000Z","size":38594,"stargazers_count":16,"open_issues_count":9,"forks_count":1,"subscribers_count":2,"default_branch":"master","last_synced_at":"2024-06-19T06:54:25.130Z","etag":null,"topics":["automation","company-data","crawler","cybersecurity","geckodriver","golang","linkedin","organisational-analysis","osint","osinttool","reconnaissance","scraper","selenium"],"latest_commit_sha":null,"homepage":"","language":"Go","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/shadawck.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2020-06-25T14:38:51.000Z","updated_at":"2024-05-17T02:41:13.000Z","dependencies_parsed_at":"2022-09-26T18:22:08.492Z","dependency_job_id":null,"html_url":"https://github.com/shadawck/recon-archy","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shadawck%2Frecon-archy","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shadawck%2Frecon-archy/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shadawck%2Frecon-archy/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shadawck%2Frecon-archy/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/shadawck","download_url":"https://codeload.github.com/shadawck/recon-archy/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":224766823,"owners_count":17366395,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["automation","company-data","crawler","cybersecurity","geckodriver","golang","linkedin","organisational-analysis","osint","osinttool","reconnaissance","scraper","selenium"],"created_at":"2024-11-15T10:35:22.720Z","updated_at":"2024-11-15T10:35:23.458Z","avatar_url":"https://github.com/shadawck.png","language":"Go","funding_links":[],"categories":[],"sub_categories":[],"readme":"# recon-archy\nLinkedin Tools (and maybe later other source) to reconstruct a company hierarchy from scraping relations and jobs title.\n\n### Status\n\n* [x] crawl   : Crawl employees specific to a company\n* [ ] analyse : Perform analysis on collected data.\n* [ ] build   : Build organisational chart of the company.\n\n## Dependencies \nFirst, make sure you have xdfb and openjdk-11-jre installed : \n```\nsudo apt-get install xvfb openjdk-11-jre\n```\n\nIf you don't have ``openjdk-11-jre`` for your system with a package manager, just download it manually from [AdoptOpenJDK](https://adoptopenjdk.net/releases.html):\n```\nwget https://github.com/AdoptOpenJDK/openjdk11-binaries/releases/download/jdk-11.0.7%2B10/OpenJDK11U-jdk_x64_linux_hotspot_11.0.7_10.tar.gz\nsudo tar -xvf OpenJDK11U-jdk_x64_linux_hotspot_11.0.7_10.tar.gz -C /usr/\n# and add java to your path \necho \"export PATH=$PATH:/usr/jdk:/usr/jdk/bin:/usr/jdk/lib/:/usr/jdk/jre:/usr/jdk/jre/bin/:/usr/jdk/jre/lib/\" \u003e\u003e ~/.bashrc\nsource ~/.bashrc\n```\nThen if you don't have it, download firefox\n```\nsudo apt install firefox\n```\n\n## Requirements\n### For an Installation **with** golang\n\nReconArchy need golang if you want to install it with go. If go is not install on your system refer to [golang documentation](https://golang.org/doc/install) to install it. Then go to [Installation with golang](#golang)\n\n### For an installation **without** Golang\n\nGolang need Geckodriver (the WebDriver for firefox) and a selenium server.   \nSo download the last version of [Selenium Server (Grid)](https://www.selenium.dev/downloads/)\n```sh\nwget https://selenium-release.storage.googleapis.com/3.141/selenium-server-standalone-3.141.59.jar\nmv selenium-server-standalone-3.141.59.jar selenium-server-standalone\n```\nAnd the last version of [Geckodriver](https://github.com/mozilla/geckodriver/releases) for your architecture.\n\n## Installation\n## Golang \n\nTo install ``recon-archy`` just run :\n```\ngo get github.com/remiflavien1/recon-archy\n```\nNext we need to install the dependencies :\n- The Selenium server \n- And the Geckodriver. \n\nFortunatly the [tebeka/selenium](https://github.com/tebeka/selenium) (which is a internal dependencies of ``recon-archy``) provide everything for us :\n```sh\ncd $GOPATH/pkg/mod/github.com/tebeka/selenium@v0.9.9/vendor/\ngo run init.go --alsologtostderr  --download_browsers --download_latest\n```\nThat's it, you're good to go to [usage](#usage)\n\n### Binaries\nPrecompiled binaries will be available with the first stable version in the [release](https://github.com/remiflavien1/recon-archy/releases) section.\n\n\n### From source\nAssuming your environnement is well configured (GOPATH, GOROOT...): \n```sh\ngit clone https://github.com/remiflavien1/recon-archy\ncd recon-archy\ngo build\ngo install\n```\n\n## Usage\n\nFirst you need to start as many server as worker you want. \n*Notes: The number of workers is limited to 4 for the time being. So at max launch 4 standalone server if you want to work with 4 worker. **This part will be automated in the future***\n```sh\n# launch 4 standalone servers\n./init-server 4\n```\n\nNext add your linkedin credential in `.creds` (*interactive mode will be added in the future*)\n\nAnd then launch ReconArchy (With the previous example, here you will use 4 Workers/Threads)\n```sh\nrecon-archy crawl -t \u003cWORKERS\u003e -c \u003cCOMPANY\u003e\n```\nFor example : \n```sh\nrecon-archy crawl -t 4 -c redhat\n```\n\nYou can use help menu on command and subcommand for more information. But for now there is not much to cover.\n```sh \n$ recon-archy --help\nNAME:\n   ReconArchy - Crawl 1000 employees of a choosen company and build their organizational chart\n\nUSAGE:\n   recon-archy [global options] command [command options] [arguments...]\n\nCOMMANDS:\n   crawl    crawl employees specific to a company\n   analyse  Perform analysis on collected data.\n   build    Build organisational chart of the company\n   help, h  Shows a list of commands or help for one command\n\nGLOBAL OPTIONS:\n   --help, -h  show help (default: false)\n```\n\n```sh\n$ recon-archy crawl --help\nNAME:\n   recon-archy crawl - crawl employees specific to a company\n\nUSAGE:\n   recon-archy crawl [command options] [arguments...]\n\nOPTIONS:\n   --threads value, -t value  Adjust number of crawling worker (default: \"1\")\n   --company value, -c value  Name of the target company\n   --help, -h                 show help (default: false)\n```\n\nCrawl result can be retrieve in `/data/`","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshadawck%2Frecon-archy","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fshadawck%2Frecon-archy","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshadawck%2Frecon-archy/lists"}