Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by cleberzumba

A curated list of projects in awesome lists by cleberzumba.

https://github.com/cleberzumba/load-databricks

Importing unstructured data into Spark on Databricks with PySpark

Last synced: 31 Dec 2024

https://github.com/cleberzumba/data-import-hadoop

Importing data from an Oracle database into HDFS

Last synced: 31 Dec 2024

https://github.com/cleberzumba/terraform-code

Terraform code

Last synced: 31 Dec 2024

https://github.com/cleberzumba/kafka-cluster

Configuring and starting a Kafka cluster with ZooKeeper

Last synced: 31 Dec 2024

https://github.com/cleberzumba/pipeline-etl-python

ETL pipeline in Python for data loading and analytics in Apache Cassandra

Last synced: 31 Dec 2024

https://github.com/cleberzumba/data-analysis-with-apache-spark-and-databricks

San Francisco Fire Calls. A Spark application on Databricks that uses PySpark and SQL for common data analytics patterns and operations on a San Francisco Fire Department Calls dataset.

databricks pyspark spark sql

Last synced: 16 Nov 2024

https://github.com/cleberzumba/spark-dataframe-reader

Reading CSV, JSON, Parquet, and Avro files

Last synced: 31 Dec 2024

https://github.com/cleberzumba/sql-detecting-duplicates-and-removing

Data analysis: detecting and removing duplicate data with SQL

Last synced: 31 Dec 2024

https://github.com/cleberzumba/data-ingestion-and-analysis-with-azure-databricks

Data ingestion and analysis project using AWS and Azure technologies, integrated with the powerful Databricks platform.

Last synced: 31 Dec 2024

https://github.com/cleberzumba/serverless-data-engineering-project-aws

Climate Analytics and Processing Pipeline on AWS

Last synced: 21 Dec 2024

https://github.com/cleberzumba/exploratory-analysis-techniques-in-python

Exploratory analysis techniques and interpretation of statistical charts

Last synced: 31 Dec 2024

https://github.com/cleberzumba/data-engineering-in-python-language

Building a pipeline for data extraction, cleaning, transformation, and enrichment

Last synced: 31 Dec 2024

https://github.com/cleberzumba/sparksql-udf

Calculate the average speed by age group and speed range

Last synced: 31 Dec 2024

https://github.com/cleberzumba/sql-analytics-and-data-science

Analytical skills with SQL to extract statistics, perform calculations and aggregations, summarize data, generate reports and perform various types of analysis.

Last synced: 31 Dec 2024

https://github.com/cleberzumba/pipeline-etl-python-airflow

ETL pipeline in Python on Apache Airflow for loading data into MySQL

Last synced: 31 Dec 2024

https://github.com/cleberzumba/hadoop-in-pseudodistributed-mode

Installation and Configuration of the Big Data Environment with Hadoop and Spark

hadoop spark

Last synced: 31 Dec 2024

https://github.com/cleberzumba/python-data-engineer

Data analysis showing specific metrics in some chart views.

Last synced: 31 Dec 2024