{"id":20074039,"url":"https://github.com/matheussc017/scrapecontacts","last_synced_at":"2026-05-13T03:02:55.285Z","repository":{"id":256477140,"uuid":"830497113","full_name":"MatheusSC017/ScrapeContacts","owner":"MatheusSC017","description":"This project aims to be a simple ETL that searches websites using a specific term and collects contact information such as email and phone number. Its functionality was designed to be a tool to easily identify potential customers/leads and extract their contact information for future use.","archived":false,"fork":false,"pushed_at":"2025-01-02T21:33:45.000Z","size":37,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-13T00:33:24.961Z","etag":null,"topics":["bs4","cli","google","python3","selenium"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/MatheusSC017.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-07-18T11:45:16.000Z","updated_at":"2025-01-02T21:33:49.000Z","dependencies_parsed_at":"2024-11-13T14:50:12.090Z","dependency_job_id":"68cd7975-a1d1-4c17-9d31-a00c10bd3ef4","html_url":"https://github.com/MatheusSC017/ScrapeContacts","commit_stats":null,"previous_names":["matheussc017/scrapecontacts"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MatheusSC017%2FScrapeContacts","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MatheusSC017%2FScrapeContacts/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MatheusSC017%2FScrapeContacts/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MatheusSC017%2FScrapeContacts/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/MatheusSC017","download_url":"https://codeload.github.com/MatheusSC017/ScrapeContacts/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":241504321,"owners_count":19973219,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bs4","cli","google","python3","selenium"],"created_at":"2024-11-13T14:49:11.115Z","updated_at":"2026-05-13T03:02:50.238Z","avatar_url":"https://github.com/MatheusSC017.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Scrape Contacts\n\nThis project aims to be a simple ETL that searches websites using a specific term and collects contact information such as email and phone number. Its functionality was designed to be a tool to easily identify potential customers/leads and extract their contact information for future use.\n\n## Required enviroment variables\n\nAfter setting the variables you will need to start the [Search API](https://console.cloud.google.com/apis/api/customsearch.googleapis.com/)\n\n### API_KEY\nThis variable represents the API key of the [GCP credentials](https://console.cloud.google.com/apis/credentials).\n\n### SEARCH_ENGINE_ID\nYou will need to set up a [Google search engine](https://programmablesearchengine.google.com/) with business search related settings.\n\n### OPENAI_KEY\nCreate an API key to use the GPT model through the OPENAI [OpenAI docs](https://platform.openai.com/docs/api-reference/introduction)\n\n## CLI\n\n\u003e python cli.py \"Termo de Busca\"\n\n## Usage\n\nUse the command below to run the API in the port 5000\n\n\u003e flask --app src run\n\nYou can also run this application through gunicorn using below command, it is configured for port 8000\n\n\u003e gunicorn -b :8080 'src:create_app()'\n\n## Docker\n\nRun the commands below to build and run the container image\n\n\u003e docker build -t scrap_contacts .\n\n\u003e docker run scrap_contacts\n\n## Endpoints:\n\n### /\nThis endpoint accepts POST and GET methods, you can use these methods to request a contact search based on a specific term and retrieve the last storage search for those terms respectively.\n\n#### POST json parameters\n\n- search_term: Required parameter, represent the term used during the search.\n- number: Optional parameter, that delimit the number of results, the pattern value is 10.\n- exclude: Optional parameter, you can set to exclude specific links from the search.\n\n#### GET json parameters\n\n- cached_search: Optional parameter, represent the term used during the search and the name used to save tha cached results. If this parameter is not provided, the endpoint will return a list of all cached results.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmatheussc017%2Fscrapecontacts","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmatheussc017%2Fscrapecontacts","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmatheussc017%2Fscrapecontacts/lists"}