Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by cleberzumba
A curated list of projects in awesome lists by cleberzumba.
https://github.com/cleberzumba/load-databricks
Importing unstructured data into Spark on Databricks with PySpark
Last synced: 31 Dec 2024
https://github.com/cleberzumba/data-import-hadoop
Importing data from an Oracle database into HDFS
Last synced: 31 Dec 2024
https://github.com/cleberzumba/kafka-cluster
Configuring and starting a Kafka cluster with ZooKeeper
Last synced: 31 Dec 2024
https://github.com/cleberzumba/pipeline-etl-python
ETL pipeline in Python for data loading and analytics on Apache Cassandra
Last synced: 31 Dec 2024
https://github.com/cleberzumba/data-analysis-with-apache-spark-and-databricks
San Francisco Fire Calls. Creating a Spark application on Databricks using PySpark and SQL for common data analytics patterns and operations on a San Francisco Fire Department Calls dataset.
Last synced: 16 Nov 2024
https://github.com/cleberzumba/spark-dataframe-reader
Reading CSV, JSON, Parquet, and Avro files
Last synced: 31 Dec 2024
https://github.com/cleberzumba/sql-detecting-duplicates-and-removing
Data analysis: detecting and removing duplicate records with SQL
Last synced: 31 Dec 2024
https://github.com/cleberzumba/data-ingestion-and-analysis-with-azure-databricks
Data ingestion and analysis project using AWS and Azure technologies, integrated with the powerful Databricks platform.
Last synced: 31 Dec 2024
https://github.com/cleberzumba/serverless-data-engineering-project-aws
Climate Analytics and Processing Pipeline on AWS
Last synced: 21 Dec 2024
https://github.com/cleberzumba/exploratory-analysis-techniques-in-python
Exploratory analysis techniques and interpretation of statistical charts
Last synced: 31 Dec 2024
https://github.com/cleberzumba/data-engineering-in-python-language
Building a pipeline for data extraction, cleaning, transformation, and enrichment
Last synced: 31 Dec 2024
https://github.com/cleberzumba/sparksql-udf
Calculate the average speed by age group and speed range
Last synced: 31 Dec 2024
https://github.com/cleberzumba/sql-analytics-and-data-science
Analytical skills with SQL to extract statistics, perform calculations and aggregations, summarize data, generate reports and perform various types of analysis.
Last synced: 31 Dec 2024
https://github.com/cleberzumba/pipeline-etl-python-airflow
ETL pipeline in Python on Apache Airflow for loading data into MySQL
Last synced: 31 Dec 2024
https://github.com/cleberzumba/hadoop-in-pseudodistributed-mode
Installation and Configuration of the Big Data Environment with Hadoop and Spark
Last synced: 31 Dec 2024
https://github.com/cleberzumba/python-data-engineer
Data analysis showing specific metrics in some chart views.
Last synced: 31 Dec 2024