{"id":28209644,"url":"https://github.com/juletopi/webscraping_com_python","last_synced_at":"2025-08-19T20:14:42.256Z","repository":{"id":266401424,"uuid":"898233989","full_name":"juletopi/Webscraping_com_Python","owner":"juletopi","description":"🐍• Repositório back-end sobre os conceitos de \"webscraping\" utilizando a linguagem Python.","archived":false,"fork":false,"pushed_at":"2024-12-04T15:20:34.000Z","size":31,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-05-17T16:14:57.686Z","etag":null,"topics":["beautifulsoup","playwright","python","requests","venv","venv-python","virtual-environment","webscraping","webscrapping"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/juletopi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-12-04T03:03:52.000Z","updated_at":"2025-02-13T13:21:04.000Z","dependencies_parsed_at":"2024-12-04T05:29:39.553Z","dependency_job_id":null,"html_url":"https://github.com/juletopi/Webscraping_com_Python","commit_stats":null,"previous_names":["juletopi/webscraping_com_python"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/juletopi%2FWebscraping_com_Python","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/juletopi%2FWebscraping_com_Python/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/juletopi%2FWebscraping_com_Python/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/juletopi%2FWebscraping_com_Python/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/juletopi","download_url":"https://codeload.github.com/juletopi/Webscraping_com_Python/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/juletopi%2FWebscraping_com_Python/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":259154431,"owners_count":22813599,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["beautifulsoup","playwright","python","requests","venv","venv-python","virtual-environment","webscraping","webscrapping"],"created_at":"2025-05-17T16:12:57.097Z","updated_at":"2025-08-19T20:14:42.244Z","avatar_url":"https://github.com/juletopi.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003c!--\n❗ ➤ References used in this Repository:\n🔗 • https://github.com/piyushsuthar/github-readme-quotes\n🔗 • https://github.com/kyechan99/capsule-render\n🔗 • https://github.com/DenverCoder1/custom-icon-badges\n🔗 • https://shields.io\n🔗 • https://emoji.gg\n🔗 • https://getemoji.com\n--\u003e\n\n\u003c!-- PRESENTATION --\u003e\n\n\u003cdiv align=\"center\"\u003e\n  \u003cimg src=\"https://github.com/user-attachments/assets/e459ca7b-7f36-43aa-b2a4-d43a233ca14e\" alt=\"PythonWebscraping-pic\" width=\"530px\" title=\"Repositório de Webscraping com Python\"\u003e\n\u003c/div\u003e\n\u003cbr\u003e\n\n\u003c!-- SUMMARY --\u003e\n\n\u003ch2 align=\"center\"\u003eSumário 🧾\u003c/h2\u003e\n\n\u003cdiv align=\"center\"\u003e\n  \u003cp align=\"center\"\u003e\n    \u003ca href=\"#-apresentação\"\u003eApresentação\u003c/a\u003e \u0026#xa0; | \u0026#xa0;\n    \u003ca href=\"#-configurando-o-ambiente-virtual\"\u003eConfigurando o Ambiente Virtual\u003c/a\u003e \u0026#xa0; | \u0026#xa0;\n    \u003ca href=\"#-utilizando-a-biblioteca-requests\"\u003eUtilizando a Biblioteca Requests\u003c/a\u003e\n  \u003c/p\u003e\n  \u003cp align=\"center\"\u003e\n    \u003ca href=\"#-implementando-o-webscraping\"\u003eImplementando o Webscraping\u003c/a\u003e \u0026#xa0; | \u0026#xa0;\n    \u003ca href=\"#️-tecnologias-utilizadas\"\u003eTecnologias Utilizadas\u003c/a\u003e \u0026#xa0; | \u0026#xa0;\n    \u003ca href=\"#-contribuições\"\u003eContribuições\u003c/a\u003e \u0026#xa0; | \u0026#xa0;\n    \u003ca href=\"#-autor\"\u003eAutor\u003c/a\u003e\n  \u003c/p\u003e\n\u003c/div\u003e\n\u003cbr\u003e\n\n\u003c!-- ABOUT THE PAGE --\u003e\n\n## 📖 Apresentação\nEste projeto tem como objetivo demonstrar como o processo de **webscraping** utilizando a linguagem **Python**, abordando desde a configuração inicial até a extração de dados de páginas web utilizando bibliotecas populares como `requests` e `BeautifulSoup`. \n\nFora idealizado durante uma das oficinas da **21ª Semana Nacional de Ciência e Tecnologia** realizada em dezembro de 2024 no **IFRO Campus Ji-Paraná**.\n\n\u003cdiv align=\"left\"\u003e\n  \u003ch6\u003e\u003ca href=\"#sumário-\"\u003e Voltar para o início ↺\u003c/a\u003e\u003c/h6\u003e\n\u003c/div\u003e\n\n## 🔧 Configurando o Ambiente Virtual \n1. **Crie o ambiente virtual**:\n   ```bash\n   python -m venv venv\n   ```  \n2. **Ative o ambiente virtual**:  \n   - **Windows**:  \n     ```bash\n     venv\\Scripts\\activate\n     ```  \n   - **Linux/Mac**:  \n     ```bash\n     source venv/bin/activate\n     ```  \n3. **Instale as dependências**:  \n   ```bash\n   pip install -r requirements.txt\n   ```\n\n\u003cdiv align=\"left\"\u003e\n  \u003ch6\u003e\u003ca href=\"#sumário-\"\u003e Voltar para o início ↺\u003c/a\u003e\u003c/h6\u003e\n\u003c/div\u003e\n\n## 📥 Utilizando a Biblioteca Requests \nA biblioteca `requests` permite enviar requisições HTTP de forma simples. É ideal para interagir com APIs ou capturar o HTML de páginas web.  \n\n### Exemplo: Requisição GET\n\n```python\nimport requests\nimport json\n\nresponse = requests.get(\"https://httpbin.org/get\")\nprint(f\"Status Code: {response.status_code}\")\nprint(f\"Dados da resposta: {json.dumps(response.json(), indent=2)}\")\n```\n\n### Exemplo: Requisição POST\n\n```python\nimport requests\nimport json\n\ndata = {\n    \"name\": \"Julio\",\n    \"age\": 21,\n    \"city\": \"Ji-Parana\",\n    \"state\": \"RO\"\n}\n\nresponse = requests.post(\"https://httpbin.org/post\", data=data)\nprint(f\"\\nStatus Code: {response.status_code} - {json.dumps(response.json(), indent=2)}\")\n```\n\n\u003cdiv align=\"left\"\u003e\n  \u003ch6\u003e\u003ca href=\"#sumário-\"\u003e Voltar para o início ↺\u003c/a\u003e\u003c/h6\u003e\n\u003c/div\u003e\n\n## 🌐 Implementando o Webscraping\n\n### Exemplo: Obter Clima e Coordenadas\n\n```python\nimport requests\n\nNOMINATIM_URL = \"https://nominatim.openstreetmap.org/search.php\"\nOPEN_METEO_URL = \"https://api.open-meteo.com/v1/forecast\"\nHEADERS = {\"referer\": \"https://nominatim.openstreetmap.org/search.html\"}\n\ndef get_coordinates(city_name):\n    params = {\"q\": city_name.strip(), \"polygon_geojson\": 1, \"format\": \"jsonv2\"}\n    response = requests.get(NOMINATIM_URL, params=params, headers=HEADERS)\n    if response.status_code == 200:\n        location = response.json()[0]\n        return float(location[\"lat\"]), float(location[\"lon\"])\n    else:\n        print(f\"Erro ao obter coordenadas: {response.status_code}\")\n\ndef get_weather(lat, lon):\n    params = {\"latitude\": lat, \"longitude\": lon, \"current_weather\": True}\n    response = requests.get(OPEN_METEO_URL, params=params)\n    if response.status_code == 200:\n        weather = response.json()[\"current_weather\"]\n        return weather[\"temperature\"], weather[\"weathercode\"]\n    else:\n        print(f\"Erro ao obter clima: {response.status_code}\")\n\ncity = input(\"\\nDigite o nome da cidade: \")\nlat, lon = get_coordinates(city)\nif lat and lon:\n    temp, weather_code = get_weather(lat, lon)\n    print(f\"\\nClima em {city}:\\nTemperatura: {temp}ºC\\nCódigo do clima: {weather_code}\")\n```\n\n\u003cdiv align=\"left\"\u003e\n  \u003ch6\u003e\u003ca href=\"#sumário-\"\u003e Voltar para o início ↺\u003c/a\u003e\u003c/h6\u003e\n\u003c/div\u003e\n\n## 🛠️ Tecnologias Utilizadas\n- **Python**\n- **Requests**\n\n\u003cdiv align=\"left\"\u003e\n  \u003ch6\u003e\u003ca href=\"#sumário-\"\u003e Voltar para o início ↺\u003c/a\u003e\u003c/h6\u003e\n\u003c/div\u003e\n\n\u003c!-- CONTRIBUTIONS --\u003e\n\n## 🤝 Contribuições\n\n\u003cp\u003eTodas as contribuições ao projeto são bem vindas!\u003cbr\u003eSe você deseja contribuir para este projeto, há várias maneiras de fazer isso. Você pode:\u003c/p\u003e\n\u003cul\u003e\n  \u003cli\u003eReportar bugs ou problemas;\u003c/li\u003e\n  \u003cli\u003ePropor novos recursos ou melhorias;\u003c/li\u003e\n  \u003cli\u003eCorrigir problemas através de pull requests;\u003c/li\u003e\n  \u003cli\u003eAjudar a melhorar a documentação;\u003c/li\u003e\n  \u003cli\u003eCompartilhar o projeto com outras pessoas.\u003c/li\u003e\n\u003c/ul\u003e\n\u003cp\u003ePara saber mais sobre como contribuir, consulte o guia de contribuição \u003ca href=\"https://github.com/juletopi/Webscraping_com_Python/blob/main/CONTRIBUTING.md\"\u003eCONTRIBUTING.md\u003c/a\u003e.\u003c/p\u003e\n\n\u003cdiv align=\"left\"\u003e\n  \u003ch6\u003e\u003ca href=\"#sumário-\"\u003e Voltar para o início ↺\u003c/a\u003e\u003c/h6\u003e\n\u003c/div\u003e\n\u003cbr\u003e\n\n\u003c!-- AUTHOR --\u003e\n\n## 👤 Autor\n\n\u003ctable\u003e\n  \u003ctr\u003e\n    \u003ctd valign=\"middle\" width=\"25%\"\u003e\n      \u003cdiv align=\"center\"\u003e  \n        \u003ca href=\"https://github.com/juletopi\" title=\"Perfil no GitHub\" aria-label=\"GitHub - Juletopi\"\u003e\n          \u003cimg src=\"https://user-images.githubusercontent.com/76459155/220271784-9f930c36-c370-4518-9b56-604627c6e2b5.png\" width=\"150\" alt=\"Profile Pic - Juletopi\"/\u003e\n          \u003cbr\u003e\n          \u003csub\u003e\u003cstrong\u003eJúlio Cézar | Juletopi\u003c/strong\u003e\u003c/sub\u003e\n          \u003cbr\u003e\n        \u003c/a\u003e\n      \u003c/div\u003e\n    \u003c/td\u003e\n    \u003ctd valign=\"middle\" width=\"75%\"\u003e\n      \u003cul style=\"list-style: none; padding-left: 0; margin: 0;\"\u003e\n        \u003cli\u003e\n          \u003cimg src=\"https://cdn.jsdelivr.net/gh/devicons/devicon/icons/linkedin/linkedin-original.svg\" width=\"15\" alt=\"LinkedIn\" style=\"vertical-align:middle;\"\u003e\n          LinkedIn — \n          \u003ca href=\"https://www.linkedin.com/in/julio-cezar-pereira-camargo/\" target=\"_blank\" rel=\"noopener noreferrer\" aria-label=\"LinkedIn - Júlio Cézar P. Camargo\"\u003e\n            Júlio Cézar P. Camargo\n          \u003c/a\u003e\n        \u003c/li\u003e\n        \u003cli\u003e\n          \u003cimg src=\"https://pngimg.com/uploads/email/email_PNG100738.png\" width=\"15\" alt=\"Email\" style=\"vertical-align:middle;\"\u003e\n          Email — \n          \u003ca href=\"mailto:juliocezarpvh@hotmail.com\" aria-label=\"Send email - juliocezarpvh@hotmail.com\"\u003e\n            juliocezarpvh@hotmail.com\n          \u003c/a\u003e\n        \u003c/li\u003e\n        \u003cli\u003e\n          \u003cimg src=\"https://cdn3.emoji.gg/emojis/2116-facebook.png\" width=\"15\" alt=\"Facebook\" style=\"vertical-align:middle;\"\u003e\n          Facebook — \n          \u003ca href=\"https://www.facebook.com/juhletopi\" target=\"_blank\" rel=\"noopener noreferrer\" aria-label=\"Facebook - Juhletopi\"\u003e\n            facebook.com/juhletopi\n          \u003c/a\u003e\n        \u003c/li\u003e\n        \u003cli\u003e\n          \u003cimg src=\"https://cdn3.emoji.gg/emojis/6333-instagram.png\" width=\"15\" alt=\"Instagram\" style=\"vertical-align:middle;\"\u003e\n          Instagram — \n          \u003ca href=\"https://www.instagram.com/juletopi/\" target=\"_blank\" rel=\"noopener noreferrer\" aria-label=\"Instagram - Juletopi\"\u003e\n            @juletopi\n          \u003c/a\u003e\n        \u003c/li\u003e\n      \u003c/ul\u003e\n    \u003c/td\u003e\n  \u003c/tr\u003e\n  \u003ctr\u003e\n    \u003ctd colspan=\"2\" align=\"center\"\u003e\n      \u003cimg src=\"https://juletopi.github.io/JCPC_Portfolio/src/images/initialsLogo.png\" width=\"25\" alt=\"Portfolio\" align=\"center\"/\u003e\n      Portfolio —\n      \u003ca href=\"https://juletopi.github.io/JCPC_Portfolio/\" target=\"_blank\" rel=\"noopener noreferrer\" aria-label=\"Portfolio - Juletopi\"\u003e\n        juletopi.github.io/JCPC_Portfolio\n      \u003c/a\u003e\n    \u003c/td\u003e\n  \u003c/tr\u003e\n\u003c/table\u003e\n\n\u003cdiv align=\"left\"\u003e\n  \u003ch6\u003e\u003ca href=\"#sumário-\"\u003e Voltar para o início ↺\u003c/a\u003e\u003c/h6\u003e\n\u003c/div\u003e\n\n\u003cbr\u003e\n\n\u003c!-- THANK YOU, GOODBYE --\u003e\n\n----\n\n\u003cdiv align=\"center\"\u003e\n  Feito com 🤍 e ☕ por \u003ca href=\"https://github.com/juletopi\"\u003e Juletopi\u003c/a\u003e.\n\u003c/div\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjuletopi%2Fwebscraping_com_python","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjuletopi%2Fwebscraping_com_python","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjuletopi%2Fwebscraping_com_python/lists"}