{"id":25441049,"url":"https://github.com/codeonthespectrum/web-scrap","last_synced_at":"2026-02-16T14:05:22.970Z","repository":{"id":273951535,"uuid":"921406366","full_name":"codeonthespectrum/web-scrap","owner":"codeonthespectrum","description":"Este projeto realiza o web scraping da Wikipédia para obter dados sobre os municípios mais populosos do estado do Rio de Janeiro. ","archived":false,"fork":false,"pushed_at":"2025-01-24T00:22:06.000Z","size":11,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-18T07:47:51.338Z","etag":null,"topics":["data-analysis","data-visualization","webscraping"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/codeonthespectrum.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-01-23T22:19:37.000Z","updated_at":"2025-01-24T11:33:36.000Z","dependencies_parsed_at":"2025-01-24T00:22:26.705Z","dependency_job_id":"3d76ca95-c581-4f1e-8a0a-fc8255818ae4","html_url":"https://github.com/codeonthespectrum/web-scrap","commit_stats":null,"previous_names":["barbiedeti/web-scrap","codeonthespectrum/web-scrap"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/codeonthespectrum/web-scrap","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/codeonthespectrum%2Fweb-scrap","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/codeonthespectrum%2Fweb-scrap/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/codeonthespectrum%2Fweb-scrap/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/codeonthespectrum%2Fweb-scrap/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/codeonthespectrum","download_url":"https://codeload.github.com/codeonthespectrum/web-scrap/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/codeonthespectrum%2Fweb-scrap/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29509288,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-16T09:05:14.864Z","status":"ssl_error","status_checked_at":"2026-02-16T08:55:59.364Z","response_time":115,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-analysis","data-visualization","webscraping"],"created_at":"2025-02-17T12:19:32.295Z","updated_at":"2026-02-16T14:05:22.955Z","avatar_url":"https://github.com/codeonthespectrum.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003ch1\u003eWeb Scraping \u0026 Municípios mais populosos do Estado do Rio\u003c/h1\u003e \n\n\u003cp align=\"center\"\u003e\n\u003cimg src=\"https://img.shields.io/badge/python-3670A0?style=for-the-badge\u0026logo=python\u0026logoColor=ffdd54\"/\u003e\n\u003cimg src=\"https://img.shields.io/badge/Pandas-2C2D72?style=for-the-badge\u0026logo=pandas\u0026logoColor=white\"/\u003e\n\u003cimg src=\"https://img.shields.io/badge/Numpy-777BB4?style=for-the-badge\u0026logo=numpy\u0026logoColor=white\"/\u003e\n\u003cimg src=\"https://img.shields.io/badge/Matplotlib-%23ffffff.svg?style=for-the-badge\u0026logo=Matplotlib\u0026logoColor=black\"/\u003e\n\u003cimg src=\"https://img.shields.io/badge/Visual%20Studio%20Code-0078d7.svg?style=for-the-badge\u0026logo=visual-studio-code\u0026logoColor=white\"/\u003e\n\n\n\u003e \u003cimg src=\"http://img.shields.io/static/v1?label=STATUS\u0026message=CONCLUIDO\u0026color=GREEN\u0026style=for-the-badge\"/\u003e\n\n\u003c/p\u003e\n\n\n### Tópicos \n\n:small_blue_diamond: [Descrição do projeto](#descrição-do-projeto)\n\n:small_blue_diamond: [Funcionalidades](#funcionalidades)\n\n:small_blue_diamond: [Deploy da Aplicação](#deploy-da-aplicação-dash)\n\n:small_blue_diamond: [Pré-requisitos](#pré-requisitos)\n\n:small_blue_diamond: [Como rodar a aplicação](#como-rodar-a-aplicação-arrow_forward)\n\n\n## Descrição do projeto \n\n\u003cp align=\"justify\"\u003e\n  ste projeto realiza o web scraping da Wikipédia para obter dados sobre os municípios mais populosos do estado do Rio de Janeiro. O objetivo é demonstrar as etapas de coleta, transformação e disponibilização desses dados aplicando Web Scraping Ético para uso em análise e visualização de dados.\n\u003c/p\u003e\n\n## Funcionalidades\n\n:heavy_check_mark: Extração de Dados  \n\n:heavy_check_mark: Armazenamento de Dados \n\n:heavy_check_mark: Visualização de Dados \n\n:heavy_check_mark: Análise de Dados\n\n## Deploy da Aplicação :dash:\n\n\u003e Visualização em formato de gráfico após coleta e limpeza dos dados\n![WhatsApp Image 2025-01-23 at 8 19 41 PM](https://github.com/user-attachments/assets/64485d9e-ff68-43dc-b36b-3526f868a706)\n\n\n## Como rodar a aplicação :arrow_forward:\n\nNo terminal, clone o projeto: \n\n```\ngit clone https://github.com/barbiedeti/web-scrap.git\n```\n\n## Casos de Uso\n**Pesquisas demográficas:** O projeto pode ser usado por pesquisadores para obter dados atualizados sobre a população dos municípios.\n\n### Arquitetura de Dados\n```\nWikipédia -\u003e Scraping (BeautifulSoup) -\u003e Transformação (pandas) -\u003e Armazenamento (CSV/SQLite) -\u003e Visualização (Matplotlib)\n```\n\n\n## Diagrama\n\n![Captura de Tela 2025-01-23 às 21 10 20](https://github.com/user-attachments/assets/8715858c-37e1-456c-900f-7a8c41187622)\n\n\n## Linguagens, dependencias e libs utilizadas :books:\n\n- Python\n- BeautifulSoup\n- Pandas\n- NumPy\n- MatplotLib\n- Requests\n\n\n## Desenvolvedora :octocat:\n\n| [\u003cimg src=\"https://avatars.githubusercontent.com/u/142019936?v=4\" width=115\u003e\u003cbr\u003e\u003csub\u003eKim Gomes\u003c/sub\u003e](https://github.com/barbiedeti) |   \n| :---: |\n\n## Licença \n\n\u003cimg src=\"http://img.shields.io/static/v1?label=License\u0026message=MIT\u0026color=green\u0026style=for-the-badge\"/\u003e\n\nCopyright :copyright: 2025 - Web Scraping \u0026 Municípios mais populosos do Estado do Rio\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcodeonthespectrum%2Fweb-scrap","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcodeonthespectrum%2Fweb-scrap","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcodeonthespectrum%2Fweb-scrap/lists"}