{"id":19899100,"url":"https://github.com/samrb-dev/autoseekout","last_synced_at":"2025-10-11T00:06:18.829Z","repository":{"id":132414632,"uuid":"543887980","full_name":"SamRB-dev/AutoSeekOut","owner":"SamRB-dev","description":"A simple web scraping bot for scraping information from seekout.com written in Python and Selenium","archived":false,"fork":false,"pushed_at":"2024-06-13T19:58:20.000Z","size":12298,"stargazers_count":7,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"AutoSeekOut-1.2.1","last_synced_at":"2025-09-19T00:01:23.352Z","etag":null,"topics":["bot","dataextraction","python3","scraping-python","seekout","selenium","selenium-python","webscraping"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/SamRB-dev.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-10-01T04:28:25.000Z","updated_at":"2024-11-25T20:37:47.000Z","dependencies_parsed_at":"2024-02-21T12:25:31.376Z","dependency_job_id":"1d5d9212-fd07-4cd9-9049-01ab15198b8d","html_url":"https://github.com/SamRB-dev/AutoSeekOut","commit_stats":null,"previous_names":[],"tags_count":2,"template":false,"template_full_name":null,"purl":"pkg:github/SamRB-dev/AutoSeekOut","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SamRB-dev%2FAutoSeekOut","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SamRB-dev%2FAutoSeekOut/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SamRB-dev%2FAutoSeekOut/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SamRB-dev%2FAutoSeekOut/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/SamRB-dev","download_url":"https://codeload.github.com/SamRB-dev/AutoSeekOut/tar.gz/refs/heads/AutoSeekOut-1.2.1","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SamRB-dev%2FAutoSeekOut/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279005647,"owners_count":26083940,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-10T02:00:06.843Z","response_time":62,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bot","dataextraction","python3","scraping-python","seekout","selenium","selenium-python","webscraping"],"created_at":"2024-11-12T20:07:10.812Z","updated_at":"2025-10-11T00:06:18.811Z","avatar_url":"https://github.com/SamRB-dev.png","language":"Python","readme":"# AutoSeekOut\nA simple, cross-platform web scraping *tool/bot* to scrape data from [Seekout](https://seekout.com/).\nPurely written in **Python** and **Selenium**. It will generate generate a CSV file \nwith all data scraped from *Seekout*. \n\n## Tested On\n#### Linux\n1. Debian\n  * Parrot OS Release 5.1 (Electro Ara) 64-bit\n\n#### Windows\n1. Windows 10 Pro\n  * Version: 21H2\n  * Build: 19044.1889\n\n\n## Installation\n#### Browser Driver\nNo matter which operating system you are using, you need to download the web driver for your prefered browser for Selenium to control your browser. Here's a list of web drivers you can download based on your operating system. \n\n**[Note]: Always make sure you are downloading the driver based on the version of your browser**\n* [Chrome](https://chromedriver.chromium.org/downloads)\n* [Firefox](https://github.com/mozilla/geckodriver)\n* [Safari](https://developer.apple.com/documentation/webkit/testing_with_webdriver_in_safari)\n\n#### Creating a Virtual Environment\nIt's always a good idea to run code in a virtual environment. In order to create a python environment install the following package -\n* For Python 3.7 or above\n  * pip3 install virtualenv\n\nNow to create an environment using virtualenv \n* python3 -m venv env_name \nor,\n* python -m venv env_name\n\nTo activate the environment:\n* Linux/MacOS\n  - source path/env_name/bin/activate\n* Windows\n  - path\\to\\your\\env\\Scripts\\activate\n\nTo deactivate the environment, simply type deactivate\n\n#### Installing Necessary Modules\nAfter you have activated your virtual environment, it's time to install the necessary packages.\nTo install those packages, just type\n* pip3 install -r requirements.txt\n\n## Final Steps\nLastly, to get the script up and running, you need to make few changes in the script itself. On my future updates I'll make sure to reduce these steps to make your life easier. But for subsequent time, make these following changes. \n\n1. Open the scripts in your preferred IDE or text editor.\n\n2. On line 15, DPATH variable, set the path of your browser driver as string. i.e.\n  * DPATH = \"path/to/browser/drive.exe\"\n\n3. On line 27,28 (EMAIL,PASSWD) variables, set your login credetials as string. i.e.\n  * EMAIL = \"example@mail.com\"\n  * PASSWD = \"password123\"\n\n4. On line 31,34 (STARTFROM,LIMIT) variables, set the starting page number(STARTFROM) and last page number (LIMIT) of the project you want the bot to scrape. i.e.\n  * STARTFROM = 1\n  * LIMIT = 100\n\n5. On line 37 (TITLE) variable, set the title to the title of the project data page so that the bot can identify the project. i.e If the project title is \"Projects/Database - Intuit\" then you will set the variable as \n  * TITLE = \"Intuit\"\n\n6. Lastly, on line 40 (FILE) variable, set the file name as your desired file name. i.e.\n  * FILE = \"Intuit.csv\"\n\n### Run the script in CMD/terminal\n```\npython3 AutoSeekOut.py\n```\nOR,\n```\npython AutoSeekOut.py\n```\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsamrb-dev%2Fautoseekout","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsamrb-dev%2Fautoseekout","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsamrb-dev%2Fautoseekout/lists"}