{"id":20414727,"url":"https://github.com/walterowisk/dio_labproject-pipeline-etl-python","last_synced_at":"2025-04-12T16:53:28.399Z","repository":{"id":191694222,"uuid":"685182684","full_name":"walterowisk/DIO_LabProject-Pipeline-ETL-Python","owner":"walterowisk","description":"Desafio de projeto proposto pela DIO dentro do Santander Bootcamp 2023 - Ciência de Dados com Python","archived":false,"fork":false,"pushed_at":"2023-08-31T13:40:17.000Z","size":141,"stargazers_count":4,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-04-10T19:17:17.310Z","etag":null,"topics":["colab-notebook","data-science","dio-bootcamp","etl","etl-pipeline","google-colab","python"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/walterowisk.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-08-30T17:25:32.000Z","updated_at":"2024-04-02T17:40:47.000Z","dependencies_parsed_at":"2024-11-15T06:12:18.673Z","dependency_job_id":"60d35f96-e759-4462-88a1-f6b9a373d843","html_url":"https://github.com/walterowisk/DIO_LabProject-Pipeline-ETL-Python","commit_stats":null,"previous_names":["walterowisk/dio_labproject-pipeline-etl-python"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/walterowisk%2FDIO_LabProject-Pipeline-ETL-Python","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/walterowisk%2FDIO_LabProject-Pipeline-ETL-Python/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/walterowisk%2FDIO_LabProject-Pipeline-ETL-Python/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/walterowisk%2FDIO_LabProject-Pipeline-ETL-Python/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/walterowisk","download_url":"https://codeload.github.com/walterowisk/DIO_LabProject-Pipeline-ETL-Python/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248601381,"owners_count":21131609,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["colab-notebook","data-science","dio-bootcamp","etl","etl-pipeline","google-colab","python"],"created_at":"2024-11-15T06:12:13.850Z","updated_at":"2025-04-12T16:53:28.377Z","avatar_url":"https://github.com/walterowisk.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cdiv align=\"center\"\u003e\n\u003cimg src=\"https://hermes.digitalinnovation.one/assets/diome/logo-full.svg\" alt=\"Logo Bootcamp\" width=\"80\"\u003e\n\u003ch1\u003eSantander Bootcamp 2023 \u003cbr\u003e Ciência de Dados com Python\u003c/h1\u003e\n\u003cimg src=\"https://hermes.dio.me/tracks/03253ff0-95b9-4904-84e7-2063e9d6cb26.png\" alt=\"Logo Bootcamp\" width=\"220\"\u003e\n\u003c/div\u003e\n\n##  :brain: Desafio Original DIO: Explorando IA Generativa em um Pipeline de ETL com Python\nNotebook do desafio original resolvido pelo Venilton da DIO:\n\u003ca target=\"_blank\" href=\"https://colab.research.google.com/drive/1SF_Q3AybFPozCcoFBptDSFbMk-6IVGF-?usp=sharing#scrollTo=k5fA5OrXt1a3\"\u003e\n  \u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\n\u003c/a\u003e\n\n## :rocket: Entendendo o desafio\nInspirado pelo projeto modelo o aluno deveria replicar ou reimaginar uma pipeline ETL utilizando Python.\n\n## :bar_chart: Meu projeto 🤽‍♂️🚴‍♀️🏄⚽🏈\nImaginando uma loja de produtos esportivos meu desafio foi criar um pipeline ETL para extrair dados de vendas de um arquivo CSV, realizar algumas transformações simples como cálculo de total de vendas por produto e por período e por fim realizar carregamento dos dados transformados em um novo arquivo CSV além de criar uma visualização em tela para mostrar o resultados por meio de gráficos.\n\n## :technologist: Etapas do Pipeline de ETL\n### :white_check_mark: Extract\nNesta etapa vamos extrair os dados de vendas do arquivo `dados-venda.csv`. Este arquivo traz informações referentes ao ano de 2023 considerando o período de janeiro a agosto. As colunas contidas no arquivo são as seguintes: `Produto`, `Data`, `Quantidade` e `Valor`.\n\n### :white_check_mark: Transform\nAgora vamos calcular o total de vendas por produto e por mês.\n\n### :white_check_mark: Load\n Salvando os dados transformados em um novo arquivo CSV e gerando gráfico de barras e de linha usando a biblioteca `Matplotlib`\n\n## :battery: Stack utilizada\n![VSCODE](https://img.shields.io/badge/Visual%20Studio%20Code-007ACC.svg?style=for-the-badge\u0026logo=Visual-Studio-Code\u0026logoColor=white)\n![PYTHON](https://img.shields.io/badge/Python-3776AB.svg?style=for-the-badge\u0026logo=Python\u0026logoColor=white)\n![GIT](https://img.shields.io/badge/Git-F05032.svg?style=for-the-badge\u0026logo=Git\u0026logoColor=white)\n![GOOGLE COLAB](https://img.shields.io/badge/Google%20Colab-F9AB00.svg?style=for-the-badge\u0026logo=Google-Colab\u0026logoColor=white)\n\n## :notebook_with_decorative_cover:\t Notebook do meu projeto no Google Colab\n\u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/walterowisk/DIO_LabProject-Pipeline-ETL-Python/blob/main/DIO_LabProject_Pipeline_ETL_Analisando_Dados_de_Venda.ipynb\"\u003e\n  \u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\n\u003c/a\u003e","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwalterowisk%2Fdio_labproject-pipeline-etl-python","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fwalterowisk%2Fdio_labproject-pipeline-etl-python","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwalterowisk%2Fdio_labproject-pipeline-etl-python/lists"}